Disentangling Visual Concepts Across Space and Time: From Image Hierarchies to Video Dynamics

Kowal, Matthew Paul

Disentangling Visual Concepts Across Space and Time: From Image Hierarchies to Video Dynamics

dc.contributor.advisor	Derpanis, Konstantinos
dc.contributor.author	Kowal, Matthew Paul
dc.date.accessioned	2026-03-10T16:09:46Z
dc.date.available	2026-03-10T16:09:46Z
dc.date.copyright	2025-08-06
dc.date.issued	2026-03-10
dc.date.updated	2026-03-10T16:09:45Z
dc.degree.discipline	Electrical Engineering & Computer Science
dc.degree.level	Doctoral
dc.degree.name	PhD - Doctor of Philosophy
dc.description.abstract	This dissertation advances the interpretability of deep vision models, with a particular focus on disentangling representations across space, layers, and time. As deep learning systems increasingly underpin critical applications, understanding their internal representations and decision-making processes is essential. The dissertation is structured into three parts that address this challenge from complementary perspectives. The first part introduces a framework for quantifying static and dynamic information in spatiotemporal models, offering a principled measure of how such models encode temporal dependencies compared with static counterparts. The second part presents a novel methodology for discovering and localizing semantically meaningful concepts within these spatiotemporal models, facilitating a deeper understanding of the internal features that drive predictions. The third part extends this analysis by identifying interlayer concept circuits, i.e., structured pathways through which concepts propagate across layers, revealing how information flows and transforms within deep image models. Together, these contributions provide a toolkit for interpreting complex neural architectures and lay the groundwork for more transparent and accountable artificial intelligence systems in dynamic visual domains. Overall, this dissertation provides new tools and insights for understanding the multilayered and spatiotemporal characteristics of deep vision models.
dc.identifier.uri	https://hdl.handle.net/10315/43575
dc.language	en
dc.rights	Author owns copyright, except where explicitly noted. Please contact the author directly with licensing requests.
dc.subject	Computer science
dc.subject	Artificial intelligence
dc.subject.keywords	Concept-based interpretability
dc.subject.keywords	Deep vision models
dc.subject.keywords	Spatiotemporal models
dc.subject.keywords	Static and dynamic information
dc.subject.keywords	Concept discovery
dc.subject.keywords	Interlayer concept circuits
dc.subject.keywords	Computer vision
dc.subject.keywords	Internal representations
dc.subject.keywords	Explainable artificial intelligence
dc.subject.keywords	Video Dynamics
dc.title	Disentangling Visual Concepts Across Space and Time: From Image Hierarchies to Video Dynamics
dc.type	Electronic Thesis or Dissertation

Files

Original bundle

Now showing 1 - 1 of 1

Name:: Kowal_Matthew_Paul_2025_PhD.pdf
Size:: 88.79 MB
Format:: Adobe Portable Document Format

Download

License bundle

Now showing 1 - 2 of 2

Name:: license.txt
Size:: 1.87 KB
Format:: Plain Text
Description:

Download

Name:: YorkU_ETDlicense.txt
Size:: 3.39 KB
Format:: Plain Text
Description:

Download

Collections

Electrical Engineering and Computer Science