学位论文详细信息
Entity finder: A system for entity web page retrieval using pseudo-relevance feedback
Information Retrieval;Web Search;Relevance Feedback
Wang, Rui ; Zhai ; ChengXiang
关键词: Information Retrieval;    Web Search;    Relevance Feedback;   
Others  :  https://www.ideals.illinois.edu/bitstream/handle/2142/49645/Rui_Wang.pdf?sequence=1&isAllowed=y
美国|英语
来源: The Illinois Digital Environment for Access to Learning and Scholarship
PDF
【 摘 要 】

Collecting all the online information about a particular entity (e.g., a person or a product) is a task commonly needed in many applications. In many cases, there often already exists a database with limited information about interesting entities, but those databases usually suffer from incompleteness and out-of-date problems. But with the increasing amount of information available on the World Wide Web, crawling and searching the web may be an attractive technological approach that can help update a database to make it more complete and up to date. In this thesis, we propose a retrieval system that crawls and searches the web in order to complete and update the information about an entity already existent in a database maintained by an organization.Taking the information stored in the database as input, this system can crawl the web and retrieve the web pages mentioning the entities in the database. We study several approaches to solving this special retrieval problem, and propose a novel pseudo-relevance feedback approach to improve the retrieval accuracy. We evaluate our system over a dataset containing 112 alumni in the College of Engineering of the University of Illinois, and show that our system can effectively retrieve relevant pages of alumni on the web and that the novel pseudo-relevance feedback method outperforms a simple baseline approach.

【 预 览 】
附件列表
Files Size Format View
Entity finder: A system for entity web page retrieval using pseudo-relevance feedback 1681KB PDF download
  文献评价指标  
  下载次数:4次 浏览次数:10次