Disentangling Visual Concepts Across Space and Time: From Image Hierarchies to Video Dynamics

dc.contributor.advisorDerpanis, Konstantinos
dc.contributor.authorKowal, Matthew Paul
dc.date.accessioned2026-03-10T16:09:46Z
dc.date.available2026-03-10T16:09:46Z
dc.date.copyright2025-08-06
dc.date.issued2026-03-10
dc.date.updated2026-03-10T16:09:45Z
dc.degree.disciplineElectrical Engineering & Computer Science
dc.degree.levelDoctoral
dc.degree.namePhD - Doctor of Philosophy
dc.description.abstractThis dissertation advances the interpretability of deep vision models, with a particular focus on disentangling representations across space, layers, and time. As deep learning systems increasingly underpin critical applications, understanding their internal representations and decision-making processes is essential. The dissertation is structured into three parts that address this challenge from complementary perspectives. The first part introduces a framework for quantifying static and dynamic information in spatiotemporal models, offering a principled measure of how such models encode temporal dependencies compared with static counterparts. The second part presents a novel methodology for discovering and localizing semantically meaningful concepts within these spatiotemporal models, facilitating a deeper understanding of the internal features that drive predictions. The third part extends this analysis by identifying interlayer concept circuits, i.e., structured pathways through which concepts propagate across layers, revealing how information flows and transforms within deep image models. Together, these contributions provide a toolkit for interpreting complex neural architectures and lay the groundwork for more transparent and accountable artificial intelligence systems in dynamic visual domains. Overall, this dissertation provides new tools and insights for understanding the multilayered and spatiotemporal characteristics of deep vision models.
dc.identifier.urihttps://hdl.handle.net/10315/43575
dc.languageen
dc.rightsAuthor owns copyright, except where explicitly noted. Please contact the author directly with licensing requests.
dc.subjectComputer science
dc.subjectArtificial intelligence
dc.subject.keywordsConcept-based interpretability
dc.subject.keywordsDeep vision models
dc.subject.keywordsSpatiotemporal models
dc.subject.keywordsStatic and dynamic information
dc.subject.keywordsConcept discovery
dc.subject.keywordsInterlayer concept circuits
dc.subject.keywordsComputer vision
dc.subject.keywordsInternal representations
dc.subject.keywordsExplainable artificial intelligence
dc.subject.keywordsVideo Dynamics
dc.titleDisentangling Visual Concepts Across Space and Time: From Image Hierarchies to Video Dynamics
dc.typeElectronic Thesis or Dissertation

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Kowal_Matthew_Paul_2025_PhD.pdf
Size:
88.79 MB
Format:
Adobe Portable Document Format

License bundle

Now showing 1 - 2 of 2
Loading...
Thumbnail Image
Name:
license.txt
Size:
1.87 KB
Format:
Plain Text
Description:
Loading...
Thumbnail Image
Name:
YorkU_ETDlicense.txt
Size:
3.39 KB
Format:
Plain Text
Description: