IEEE Access | |
A Multimodel Fusion Engine for Filtering Webpages | |
Zehong Cao1  Ziyun Deng2  Tingqin He3  Weiping Ding4  | |
[1] Centre for Artificial Intelligence, Faculty of Engineering and Information Technologies, University of Technology Sydney, Ultimo, NSW, Australia;College of Economics and Trade, Changsha Commerce and Tourism College, Changsha, China;National Supercomputing Center in Changsha, Hunan University, Changsha, China;School of Computer Science and Technology, Nantong University, Nantong, China; | |
关键词: Multimodel; fusion; engine design; webpage filtering; | |
DOI : 10.1109/ACCESS.2018.2878897 | |
来源: DOAJ |
【 摘 要 】
Fusing multiple existing models for filtering webpages can mitigate the shortcomings of individual filtering models. To provide an engine for such fusion, we propose a multimodel fusion engine for filtering webpages for the extraction of target webpages. This engine can handle large datasets of webpages crawled from websites and supports five individual filtering models and the fusion of any two of them. There are two possible fusion methods: one is to simultaneously satisfy the conditions of both individual models, and the other is to satisfy the conditions of one of the two individual models. We present the functions, architecture, and software design of the proposed engine. We use recall ratio (RR) and precision ratio (PR) as the evaluation indices of the filtering models and propose rules describing how PR and RR change when individual models are fused. We use 200 000 webpages collected by crawling the popular online shopping website “
【 授权许可】
Unknown