科技报告详细信息
Using Bitmap Indexing Technology for Combined Numerical and TextQueries
Stockinger, Kurt ; Cieslewicz, John ; Wu, Kesheng ; Rotem, Doron ; Shoshani, Arie
Lawrence Berkeley National Laboratory
关键词: Compression;    99;    Bitmap Index Full-Text Index Inverted File Performanceevaluation Database And Information Retrieval;    Algorithms;    Numerical Data;   
DOI  :  10.2172/918636
RP-ID  :  LBNL--61768
RP-ID  :  DE-AC02-05CH11231
RP-ID  :  918636
美国|英语
来源: UNT Digital Library
PDF
【 摘 要 】

In this paper, we describe a strategy of using compressedbitmap indices to speed up queries on both numerical data and textdocuments. By using an efficient compression algorithm, these compressedbitmap indices are compact even for indices with millions of distinctterms. Moreover, bitmap indices can be used very efficiently to answerBoolean queries over text documents involving multiple query terms.Existing inverted indices for text searches are usually inefficient forcorpora with a very large number of terms as well as for queriesinvolving a large number of hits. We demonstrate that our compressedbitmap index technology overcomes both of those short-comings. In aperformance comparison against a commonly used database system, ourindices answer queries 30 times faster on average. To provide full SQLsupport, we integrated our indexing software, called FastBit, withMonetDB. The integrated system MonetDB/FastBit provides not onlyefficient searches on a single table as FastBit does, but also answersjoin queries efficiently. Furthermore, MonetDB/FastBit also provides avery efficient retrieval mechanism of result records.

【 预 览 】
附件列表
Files Size Format View
918636.pdf 382KB PDF download
  文献评价指标  
  下载次数:16次 浏览次数:24次