Journal article

【Abstract】

Text representation is a significant task in Natural Language Processing (NLP), and in recent years Deep Learning (DL) and Machine Learning (ML) have been widely used in NLP tasks such as topic classification, sentiment analysis and language translation. Until very recently, little work had been devoted to semantic analysis in phishing detection or phishing email detection. The novelty of this study lies in using deep semantic analysis to capture inherent characteristics of the text body. One-hot encoding was used with DL and ML techniques to classify emails as phishing or non-phishing. Various parameters and hyperparameters were compared for the DL models. Results are presented for the ML models (Naïve Bayes, SVM and Decision Tree) and for the DL models (Convolutional Neural Networks (CNN) and Long Short Term Memory (LSTM)). The DL models outperformed the ML models in accuracy, while the ML models outperformed the DL models in computation time. CNN with Word Embedding achieved the highest accuracy (96.34%), demonstrating the effectiveness of semantic analysis in phishing email detection.

【License】

CC BY


Journal of computer sciences
Machine Learning and Deep Learning for Phishing Email Classification using One-Hot Encoding
article
Sikha Bagui¹ Debarghya Nandi² Subhash Bagui¹ Robert Jamie White¹
[1] The University of West Florida, United States; [2] University of Illinois at Chicago, United States
Keywords: One-Hot Encoding; Phishing Email Classification; Deep Learning; Machine Learning; Convolutional Neural Networks; Long Short Term Memory
DOI : 10.3844/jcssp.2021.610.623
Subject classification: Computer Science (General)
Source: Science Publications