会议论文详细信息
International Research and Innovation Summit 2017
Analysis of Na?ve Bayes Algorithm for Email Spam Filtering across Multiple Datasets
Rusland, Nurul Fitriah^1 ; Wahid, Norfaradilla^1 ; Kasim, Shahreen^1 ; Hafit, Hanayanti^1
Department of Web Technology, Faculty of Computer Science and Information Technology, Universiti Tun Hussein Onn Malaysia, Malaysia^1
关键词: Bayes algorithms;    E-mail spam;    F measure;    Filtering technique;    Multiple data sets;    Neutral network;    Random forests;    Weka tool;   
Others  :  https://iopscience.iop.org/article/10.1088/1757-899X/226/1/012091/pdf
DOI  :  10.1088/1757-899X/226/1/012091
来源: IOP
PDF
【 摘 要 】

E-mail spam continues to become a problem on the Internet. Spammed e-mail may contain many copies of the same message, commercial advertisement or other irrelevant posts like pornographic content. In previous research, different filtering techniques are used to detect these e-mails such as using Random Forest, Naïve Bayesian, Support Vector Machine (SVM) and Neutral Network. In this research, we test Naïve Bayes algorithm for e-mail spam filtering on two datasets and test its performance, i.e., Spam Data and SPAMBASE datasets [8]. The performance of the datasets is evaluated based on their accuracy, recall, precision and F-measure. Our research use WEKA tool for the evaluation of Naïve Bayes algorithm for e-mail spam filtering on both datasets. The result shows that the type of email and the number of instances of the dataset has an influence towards the performance of Naïve Bayes.

【 预 览 】
附件列表
Files Size Format View
Analysis of Na?ve Bayes Algorithm for Email Spam Filtering across Multiple Datasets 365KB PDF download
  文献评价指标  
  下载次数:2次 浏览次数:18次