Disentangling Visual Concepts Across Space and Time: From Image Hierarchies to Video Dynamics
| dc.contributor.advisor | Derpanis, Konstantinos | |
| dc.contributor.author | Kowal, Matthew Paul | |
| dc.date.accessioned | 2026-03-10T16:09:46Z | |
| dc.date.available | 2026-03-10T16:09:46Z | |
| dc.date.copyright | 2025-08-06 | |
| dc.date.issued | 2026-03-10 | |
| dc.date.updated | 2026-03-10T16:09:45Z | |
| dc.degree.discipline | Electrical Engineering & Computer Science | |
| dc.degree.level | Doctoral | |
| dc.degree.name | PhD - Doctor of Philosophy | |
| dc.description.abstract | This dissertation advances the interpretability of deep vision models, with a particular focus on disentangling representations across space, layers, and time. As deep learning systems increasingly underpin critical applications, understanding their internal representations and decision-making processes is essential. The dissertation is structured into three parts that address this challenge from complementary perspectives. The first part introduces a framework for quantifying static and dynamic information in spatiotemporal models, offering a principled measure of how such models encode temporal dependencies compared with static counterparts. The second part presents a novel methodology for discovering and localizing semantically meaningful concepts within these spatiotemporal models, facilitating a deeper understanding of the internal features that drive predictions. The third part extends this analysis by identifying interlayer concept circuits, i.e., structured pathways through which concepts propagate across layers, revealing how information flows and transforms within deep image models. Together, these contributions provide a toolkit for interpreting complex neural architectures and lay the groundwork for more transparent and accountable artificial intelligence systems in dynamic visual domains. Overall, this dissertation provides new tools and insights for understanding the multilayered and spatiotemporal characteristics of deep vision models. | |
| dc.identifier.uri | https://hdl.handle.net/10315/43575 | |
| dc.language | en | |
| dc.rights | Author owns copyright, except where explicitly noted. Please contact the author directly with licensing requests. | |
| dc.subject | Computer science | |
| dc.subject | Artificial intelligence | |
| dc.subject.keywords | Concept-based interpretability | |
| dc.subject.keywords | Deep vision models | |
| dc.subject.keywords | Spatiotemporal models | |
| dc.subject.keywords | Static and dynamic information | |
| dc.subject.keywords | Concept discovery | |
| dc.subject.keywords | Interlayer concept circuits | |
| dc.subject.keywords | Computer vision | |
| dc.subject.keywords | Internal representations | |
| dc.subject.keywords | Explainable artificial intelligence | |
| dc.subject.keywords | Video Dynamics | |
| dc.title | Disentangling Visual Concepts Across Space and Time: From Image Hierarchies to Video Dynamics | |
| dc.type | Electronic Thesis or Dissertation |
Files
Original bundle
1 - 1 of 1
Loading...
- Name:
- Kowal_Matthew_Paul_2025_PhD.pdf
- Size:
- 88.79 MB
- Format:
- Adobe Portable Document Format