Implementing a Metasearch Framework with Content-directed Result Merging
Search;Content-direction;Metasearch
Selvadurai, Santthosh Babu ; Dr. Munindar P. Singh, Committee Member,Dr.Gregory T. Byrd, Committee Chair,Dr. Yan Solihin, Committee Member,Selvadurai, Santthosh Babu ; Dr. Munindar P. Singh ; Committee Member ; Dr.Gregory T. Byrd ; Committee Chair ; Dr. Yan Solihin ; Committee Member
A metasearch engine is a system that provides integrated access to multiple existing search engine mechanisms. Once a query is executed on a metasearch engine, the system passes the query to its participating component search engines, collects the individual results and merges them into a single ranked list. Metasearch engines increase the search coverage of the Web, help solve the scalability issues in searching the internet, and improve the retrieval effectiveness, and consequently the relevance, of results.Result merging is a key constituent of metasearch engines. When results from several search engines are collected, the metasearch system has to merge them into a unified list. The effectiveness of the metasearch mechanism and the relevance of the result set are closely related to the result merging algorithm used. The purpose of this research is to build a flexible, general purpose metasearch framework and explore a content-directed result merging approach to rank results. Here the content-direction is provided to the framework by the user in the form of documents or text artifacts. A modular metasearch application programming interface (API) based on Java has been implemented. The API framework provides interfaces and utilities to develop components of a metasearch system like segregators, scheduler, aggregators, and search service providers. A prototype metasearch engine has been built based on this framework to study the content-directed result merging algorithm.
【 预 览 】
附件列表
Files
Size
Format
View
Implementing a Metasearch Framework with Content-directed Result Merging