Jurnal Teknologi dan Sistem Komputer | |
Malicious URLs detection using data streaming algorithms | |
Muyideen Abdulraheem1  Idowu Dauda Oladipo1  Abdullateef Oluwagbemiga Balogun1  Kayode Sakariyah Adewole1  Muiz Olalekan Raheem1  Omotola Fatimah Baker1  | |
[1] Department of Computer Science, Faculty of Communication and Information Sciences, University of Ilorin. PMB 1515 Ilorin, Kwara State, Nigeria; | |
关键词: data streaming; phishing; naïve bayes; machine learning; hoeffding tree.; | |
DOI : 10.14710/jtsiskom.2021.13965 | |
来源: DOAJ |
【 摘 要 】
As a result of advancements in technology and technological devices, data is now spawned at an infinite rate, emanating from a vast array of networks, devices, and daily operations like credit card transactions and mobile phones. Datastream entails sequential and real-time continuous data in the inform of evolving stream. However, the traditional machine learning approach is characterized by a batch learning model. Labeled training data are given apriori to train a model based on some machine learning algorithms. This technique necessitates the entire training sample to be readily accessible before the learning process. The training procedure is mainly done offline in this setting due to the high training cost. Consequently, the traditional batch learning technique suffers severe drawbacks, such as poor scalability for real-time phishing websites detection. The model mostly requires re-training from scratch using new training samples. This paper presents the application of streaming algorithms for detecting malicious URLs based on selected online learners: Hoeffding Tree (HT), Naïve Bayes (NB), and Ozabag. Ozabag produced promising results in terms of accuracy, Kappa and Kappa Temp on the dataset with large samples while HT and NB have the least prediction time with comparable accuracy and Kappa with Ozabag algorithm for the real-time detection of phishing websites.
【 授权许可】
Unknown