EAGLE-APT: Edge-Aware Provenance Graph Learning with Node Encoding for Advanced Persistent Threat Detection and Attribution from System Audit Log

dc.contributor.advisorHabibi Lashkari, Arash
dc.contributor.authorAbbaszadeh Darban, Reza
dc.date.accessioned2026-03-10T16:07:32Z
dc.date.available2026-03-10T16:07:32Z
dc.date.copyright2025-10-27
dc.date.issued2026-03-10
dc.date.updated2026-03-10T16:07:32Z
dc.degree.disciplineComputer Science
dc.degree.levelMaster's
dc.degree.nameMSc - Master of Science
dc.description.abstractAdvanced Persistent Threats (APTs) represent some of the most challenging forms of cyberattacks, characterized by stealth, persistence, and multi-stage operations that evade traditional defenses. Detecting and attributing such campaigns to a known APT group requires methods that can capture long-term coordinated malicious activity within complex system interactions. This research introduces EAGLE-APT, an Edge-Aware Provenance Graph Learning framework with Node Encoding for APT detection and attribution from system audit logs. The proposed architecture comprises five core components: a provenance graph generator, a node feature extractor, a type-specific feature encoder, a malicious node detector, and an attribution module. The process begins with the provenance graph generator, which converts raw audit logs into heterogeneous provenance graphs that capture system entities and their causal relationships. These graphs are then enriched by the node feature extractor, which incorporates both semantic and structural information to represent the behavior of each entity more effectively. Next, the type-specific feature encoder transforms heterogeneous node features into a unified embedding space, ensuring that diverse data types contribute meaningfully to the representation. Building on this foundation, the malicious node detector utilizes an edge-aware graph neural network to identify suspicious nodes, taking into account both the contextual importance of neighbors and the nature of their connections. Finally, the attribution module analyzes the detected malicious subgraphs and classifies them into known APT groups, offering a foundation for informed response and defense strategies. To support evaluation, a comprehensive dataset of simulated APT campaigns was generated in a controlled enterprise environment, capturing realistic multi-stage attack behaviors. Together, these contributions provide both a novel framework for end-to-end detection and attribution and a reproducible dataset that can serve as a basis for advancing future research in APT defense.
dc.identifier.urihttps://hdl.handle.net/10315/43557
dc.languageen
dc.rightsAuthor owns copyright, except where explicitly noted. Please contact the author directly with licensing requests.
dc.subjectComputer science
dc.subjectComputer engineering
dc.subject.keywordsAdvanced persistent threats
dc.subject.keywordsAttack detection
dc.subject.keywordsAttack attribution
dc.subject.keywordsCybersecurity forensics
dc.subject.keywordsProvenance graph learning
dc.subject.keywordsGraph neural networks
dc.subject.keywordsExplainable GNNs
dc.subject.keywordsSystem audit logs
dc.subject.keywordsBCCC-APT-Log2025 dataset
dc.titleEAGLE-APT: Edge-Aware Provenance Graph Learning with Node Encoding for Advanced Persistent Threat Detection and Attribution from System Audit Log
dc.typeElectronic Thesis or Dissertation

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Abbaszadeh_Darban_Reza_2025_MSc.pdf
Size:
21.2 MB
Format:
Adobe Portable Document Format

License bundle

Now showing 1 - 2 of 2
Loading...
Thumbnail Image
Name:
license.txt
Size:
1.87 KB
Format:
Plain Text
Description:
Loading...
Thumbnail Image
Name:
YorkU_ETDlicense.txt
Size:
3.39 KB
Format:
Plain Text
Description:

Collections