期刊论文详细信息
Cybersecurity
A DGA domain names detection modeling method based on integrating an attention mechanism and deep neural network
article
Ren, Fangli1  Jiang, Zhengwei1  Wang, Xuren2  Liu, Jian1 
[1] Institute of Information Engineering, Chinese Academy of Sciences;University of Chinese Academy of Sciences;College of Information Engineering, Capital Normal University
关键词: Domain generation algorithm;    Malware;    Attention mechanism;    Deep learning;   
DOI  :  10.1186/s42400-020-00046-6
学科分类:社会科学、人文和艺术(综合)
来源: Springer
PDF
【 摘 要 】

Command and control (C2) servers are used by attackers to operate communications. To perform attacks, attackers usually employee the Domain Generation Algorithm (DGA), with which to confirm rendezvous points to their C2 servers by generating various network locations. The detection of DGA domain names is one of the important technologies for command and control communication detection. Considering the randomness of the DGA domain names, recent research in DGA detection applyed machine learning methods based on features extracting and deep learning architectures to classify domain names. However, these methods are insufficient to handle wordlist-based DGA threats, which generate domain names by randomly concatenating dictionary words according to a special set of rules. In this paper, we proposed a a deep learning framework ATT-CNN-BiLSTM for identifying and detecting DGA domains to alleviate the threat. Firstly, the Convolutional Neural Network (CNN) and bidirectional Long Short-Term Memory (BiLSTM) neural network layer was used to extract the features of the domain sequences information; secondly, the attention layer was used to allocate the corresponding weight of the extracted deep information from the domain names. Finally, the different weights of features in domain names were put into the output layer to complete the tasks of detection and classification. Our extensive experimental results demonstrate the effectiveness of the proposed model, both on regular DGA domains and DGA that hard to detect such as wordlist-based and part-wordlist-based ones. To be precise,we got a F1 score of 98.79% for the detection and macro average precision and recall of 83% for the classification task of DGA domain names.

【 授权许可】

CC BY   

【 预 览 】
附件列表
Files Size Format View
RO202108110000129ZK.pdf 1105KB PDF download
  文献评价指标  
  下载次数:5次 浏览次数:0次