Thesis details
Index design for information retrieval applications using database concepts
Mohit Vyas; Kevin Chang
Keywords: Information Retrieval; Databases; Index Design; Entity Search
Others: https://www.ideals.illinois.edu/bitstream/handle/2142/108068/MOHITVYAS-THESIS-2020.pdf?sequence=1&isAllowed=y
United States | English
Source: The Illinois Digital Environment for Access to Learning and Scholarship
【 Abstract 】
Integrating database (DB) and information retrieval (IR) technologies has long been an important problem. Recent growth in tagged textual data, which is a combination of structured knowledge bases and unstructured text data, has made principled index design for IR applications even more desirable. In this work, we propose a framework for designing IR applications using DB concepts. We model an IR application as a specialized DB system that is required to support only a fixed, predefined query workload rather than general SQL queries. The physical design of an information retrieval system is optimized for a prespecified target query workload. We then identify IR indexes as the basic building blocks of the design of an IR application. An IR index is formalized as a look-up function built over a materialized view. The overall "index design problem" is then to find the index design with the lowest cost in a space of candidate designs. Since the space of candidate designs can be doubly exponential in the size of the query workload, we give a polynomial-time heuristic algorithm with provable guarantees for a weighted-set-cover-based relaxation of the original problem. This is then combined with an overarching branch-and-bound-based, optimality-preserving state-space search algorithm that efficiently prunes the state space by using the above heuristic algorithm. Finally, we show via experiments how the proposed framework can be used to obtain index designs for different kinds of IR applications in practice.
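To make the weighted-set-cover relaxation mentioned in the abstract concrete, the following is a minimal sketch of the classic greedy heuristic for weighted set cover, which runs in polynomial time and has a known logarithmic approximation guarantee. It is an illustration under stated assumptions, not the thesis's algorithm: the abstract does not say which heuristic is used, and the function name, query names, index names, and costs below are all hypothetical.

```python
from typing import Dict, FrozenSet, List, Set


def greedy_weighted_set_cover(
    queries: Set[str],
    candidates: Dict[str, FrozenSet[str]],
    costs: Dict[str, float],
) -> List[str]:
    """Pick candidate indexes until every query in the workload is covered.

    Each round selects the candidate with the lowest cost per newly
    covered query (the classic greedy rule for weighted set cover).
    """
    uncovered = set(queries)
    chosen: List[str] = []
    while uncovered:
        best_name = None
        best_ratio = float("inf")
        for name, covered in candidates.items():
            gain = len(covered & uncovered)
            if gain == 0:
                continue  # this candidate covers nothing new
            ratio = costs[name] / gain
            if ratio < best_ratio:
                best_name, best_ratio = name, ratio
        if best_name is None:
            raise ValueError(f"no candidate covers: {sorted(uncovered)}")
        chosen.append(best_name)
        uncovered -= candidates[best_name]
    return chosen


# Hypothetical workload: three queries and four candidate indexes,
# each mapped to the set of queries it can answer.
workload = {"q1", "q2", "q3"}
candidate_indexes = {
    "idx_a": frozenset({"q1", "q2"}),
    "idx_b": frozenset({"q2", "q3"}),
    "idx_c": frozenset({"q3"}),
    "idx_all": frozenset({"q1", "q2", "q3"}),
}
index_costs = {"idx_a": 2.0, "idx_b": 3.0, "idx_c": 1.5, "idx_all": 5.0}

design = greedy_weighted_set_cover(workload, candidate_indexes, index_costs)
total_cost = sum(index_costs[name] for name in design)
print(design, total_cost)
```

In this toy workload the greedy rule first picks `idx_a` (cost 1.0 per covered query) and then `idx_c`, for a total cost of 3.5, which is cheaper than the single index `idx_all` at 5.0. A branch-and-bound search of the kind the abstract describes could use such a heuristic to estimate costs and prune states, though how the thesis does this is not specified here.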