学位论文详细信息
Structured entity querying over unstructured text
structured query;unstructured dataset;rank
Jiang, Jiahui ; Chang ; Kevin Chen-chuan
关键词: structured query;    unstructured dataset;    rank;   
Others  :  https://www.ideals.illinois.edu/bitstream/handle/2142/88060/JIANG-THESIS-2015.pdf?sequence=1&isAllowed=y
美国|英语
来源: The Illinois Digital Environment for Access to Learning and Scholarship
PDF
【 摘 要 】

The web is a collection of unstructured webpages. This characteristic makes it very hard for users to search complex queries -- in most of the times, queries are just plain text and it is very difficult for users to describe the internal relationships that they contain.The EntityRank system allows users to specify the target entities for which they are interested within the query, which brings us one step closer to being able to describe the entity relationships. But this expressiveness is still limited because the queries are still in a flat format.On the other hand, SQL queries for relational databases are very expressive. Users can easily specify multiple entities and the relationships between them. But in order to use SQL queries, we need to extract all the entities and relationships existing in the domain beforehand. The cost of maintaining the tables is very large. Also it is very hard if we want to add or modify the schema after the initial domain design.Thus we want to build a system that can combine the advantages of the flexibility and informativeness of unstructured webpages and the expressiveness of SQL queries. In this work we design a system which allows users to use structured SQL queries on unstructured webpages using the help of the EntityRank system. We design the conceptual framework to map the concepts between two systems and also propose a ranking algorithm for the final results.

【 预 览 】
附件列表
Files Size Format View
Structured entity querying over unstructured text 290KB PDF download
  文献评价指标  
  下载次数:11次 浏览次数:13次