XAI-Driven Malicious Encrypted Traffic Detection and Characterization to Enhance Information Security

dc.contributor.advisorHabibi Lashkari, Arash
dc.contributor.authorSharma, Adit
dc.date.accessioned2025-07-23T15:11:48Z
dc.date.available2025-07-23T15:11:48Z
dc.date.copyright2025-03-20
dc.date.issued2025-07-23
dc.date.updated2025-07-23T15:11:47Z
dc.degree.disciplineInformation Systems and Technology
dc.degree.levelMaster's
dc.degree.nameMA - Master of Arts
dc.description.abstractSecuring information through encryption is essential in data communication, but to effectively detect malicious activities, it is crucial to distinguish between encrypted and non-encrypted traffic. Traditional encrypted traffic classification methods, including rule-based systems and conventional machine learning approaches, often struggle with scalability, generalization, and class imbalance, leading to suboptimal classification performance. This study introduces a novel hybrid model for encrypted traffic classification, integrating Multi-Head Attention mechanisms for feature enhancement and LightGBM as the final classifier. The proposed model follows a two-step classification process: first, performing binary classification to separate encrypted and non-encrypted traffic, and second, applying multi-class classification to categorize encrypted traffic into TOR, VPN, I2P, Zeronet, and Freenet. To improve model interpretability, SHAP is employed to validate the importance of attention-based features, while LIME provides insights into misclassified instances, enabling adjustments such as weight threshold tuning and handling class imbalances. Furthermore, this study incorporates a refined dataset preprocessing pipeline, leveraging NTL Flowlyzer—an advanced traffic analyzer that extracts over 400 features, including entropy-based attributes. To address class imbalance issues, strategic adjustments such as SMOTE augmentation for Freenet and class-specific threshold tuning were applied based on SHAP and LIME insights, resulting in improved classification performance. The experimental evaluation demonstrates that the proposed hybrid model outperforms existing approaches in accuracy, precision ,and recall while maintaining efficiency in both time and computational complexity. By integrating explainable AI techniques and adaptive optimization strategies, our approach enhances classification performance and improves the transparency and interpretability of encrypted traffic detection. These findings contribute to advancing cybersecurity by enabling more robust and interpretable encrypted traffic classification models.
dc.identifier.urihttps://hdl.handle.net/10315/42974
dc.languageen
dc.rightsAuthor owns copyright, except where explicitly noted. Please contact the author directly with licensing requests.
dc.subjectInformation technology
dc.subjectComputer science
dc.subject.keywordsEncrypted traffic classification
dc.subject.keywordsNetwork traffic analysis
dc.subject.keywordsMachine learning
dc.subject.keywordsDeep learning
dc.subject.keywordsHybrid models
dc.subject.keywordsExplainable AI
dc.subject.keywordsSHAP
dc.subject.keywordsLIME
dc.subject.keywordsCybersecurity datasets
dc.subject.keywordsTraffic anomaly detection
dc.titleXAI-Driven Malicious Encrypted Traffic Detection and Characterization to Enhance Information Security
dc.typeElectronic Thesis or Dissertation

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Sharma_Adit_2025_MA.pdf
Size:
15.01 MB
Format:
Adobe Portable Document Format

License bundle

Now showing 1 - 2 of 2
Loading...
Thumbnail Image
Name:
license.txt
Size:
1.87 KB
Format:
Plain Text
Description:
Loading...
Thumbnail Image
Name:
YorkU_ETDlicense.txt
Size:
3.39 KB
Format:
Plain Text
Description: