Machine Learning-Based Defences Against Advanced 'Session-Replay' Web Bots

Sadeghpour, Shadi

dc.contributor.advisor	Vlajic, Natalija
dc.contributor.author	Sadeghpour, Shadi
dc.date.accessioned	2024-03-18T18:01:41Z
dc.date.available	2024-03-18T18:01:41Z
dc.date.issued	2024-03-16
dc.date.updated	2024-03-16T10:45:15Z
dc.degree.discipline	Electrical Engineering & Computer Science
dc.degree.level	Doctoral
dc.degree.name	PhD - Doctor of Philosophy
dc.description.abstract	The widespread adoption of the Internet has brought significant benefits to modern society, but it has also led to an increase in malicious activity, particularly through the use of web bots. While some bots serve useful purposes, the proliferation of malicious web bots poses a significant threat to Internet security, affecting individuals, businesses, governments, and society as a whole. The emergence of AI-powered web bots capable of mimicking human behavior and evading detection has further exacerbated this problem. This dissertation aims to deepen our understanding of advanced web bots and the web bot attacks that often signal fraudulent online activities. In particular, we focus on session-replay web bots, the latest and most advanced type of web bot, which present an especially difficult challenge in online domains where multiple genuine human users frequently exhibit similar behavioral patterns, such as news, banking, or gaming sites. To achieve our research objectives, we have meticulously curated an extensive dataset encompassing both human-generated and bot-generated data. Additionally, we have developed our own prototype of an advanced session-replay bot (the so-called ReBot), which has enabled us to accurately simulate the attacks conducted by this particular category of web bots. Moreover, by infusing randomness into the design of ReBot, we have been able to achieve varying degrees of bot and attack evasiveness. From the defender's perspective, and by leveraging state-of-the-art deep learning algorithms, we have proposed several effective strategies for the detection of advanced session-replay bot attacks. One of our proposed techniques deploys the concept of moving-target defence in the form of webpage randomization, which is particularly challenging for the attacker to overcome. This thesis also explores the use of generative machine learning models to generate synthetic bot sessions. The ability to synthesize advanced session-replay bots, as opposed to looking for real-world instances of these bots or evidence of their activity in real-world logs, is of critical importance if we are to make timely and effective advances in the field of web bot detection and defence.
dc.identifier.uri	https://hdl.handle.net/10315/41896
dc.language	en
dc.rights	Author owns copyright, except where explicitly noted. Please contact the author directly with licensing requests.
dc.subject	Computer science
dc.subject.keywords	Machine learning
dc.subject.keywords	Session-replay attacks
dc.subject.keywords	Web bots
dc.subject.keywords	Cybersecurity
dc.subject.keywords	Defense mechanisms
dc.subject.keywords	Deep learning
dc.subject.keywords	Bot detection
dc.subject.keywords	Web security
dc.subject.keywords	Attack Mitigation
dc.title	Machine Learning-Based Defences Against Advanced 'Session-Replay' Web Bots
dc.type	Electronic Thesis or Dissertation

Files

Original bundle

Name: Shadi_Sadeghpour_Thesis_Dissertation_PhD_December-2023.pdf
Size: 7.89 MB
Format: Adobe Portable Document Format

License bundle

Name: license.txt
Size: 1.87 KB
Format: Plain Text

Name: YorkU_ETDlicense.txt
Size: 3.39 KB
Format: Plain Text

Collections

Electrical Engineering and Computer Science