2nd International Symposium on Application of Materials Science and Energy Materials | |
Research on Automatic Proofreading Method of Sensitive Information in Content Security | |
材料科学;能源学 | |
Gong, Yonggang^1 ; Li, Yuying^1 ; Lian, Xiaoqin^1 ; Fu, Junying^1 | |
School of Computer and Information Engineering, Beijing Technology and Business University, Beijing Key Laboratory of Food Safety Big Data Technology, Beijing, China^1 | |
关键词: Accuracy rate; Automatic processing; Content security; False alarm rate; Practical engineering applications; Rule processing; Sensitive informations; SVM(support vector machine); | |
Others : https://iopscience.iop.org/article/10.1088/1757-899X/490/6/062060/pdf DOI : 10.1088/1757-899X/490/6/062060 |
|
学科分类:材料科学(综合) | |
来源: IOP | |
【 摘 要 】
Aiming at the problem of automatic proofreading of sensitive information in mass text content, an automatic proofreading method based on the combination of rule and SVM (Support Vector Machine) is proposed. To classify sensitive information based on important sensitive information provided in the "Newly Prohibited Texts and Cautions in Xinhua News Reports"(newest revision) and related central and online texts.According to the different categories, the paper constructs the classification processing rule base, designs the corresponding rules automatic Processing algorithm, and realizes the sensitive information automatic proofreading, At the same time, using the SVM model to analyze the result of the rule processing with emotion, which greatly reduces the false alarm rate. The test result shows that the recall rate of method is 89.98%, the accuracy rate is 98.31%, and 100, 000 + text content is processed per second, which solves the key difficult problems in the practical engineering application.
【 预 览 】
Files | Size | Format | View |
---|---|---|---|
Research on Automatic Proofreading Method of Sensitive Information in Content Security | 334KB | download |