学位论文详细信息
Template B+ trees: an index scheme for fast data streams with distributed append-only stores
B+ trees;Template B+ Trees;distributed;append-only datastore
Mazumdar, Parijat ; Winslett ; Marianne
关键词: B+ trees;    Template B+ Trees;    distributed;    append-only datastore;   
Others  :  https://www.ideals.illinois.edu/bitstream/handle/2142/90645/MAZUMDAR-THESIS-2016.pdf?sequence=1&isAllowed=y
美国|英语
来源: The Illinois Digital Environment for Access to Learning and Scholarship
PDF
【 摘 要 】

Distributed systems are now commonly used to manage massive data flooding from the physical world, such as user-generated content from online social media and communication records from mobile phones. The new generation of distributed data management systems, such as HBase, Cassandra and Riak, are designed to perform queries and tuple insertions only. Other database operations such as deletions and updates are simulated by appending the keys associated with the target tuples to operation logs. Such an append-only store architecture maximizes the processing throughput on incoming data, but potentially incurs higher costs during query processing, because additional computation is needed to generate consistent snapshots of the database. Indexing is the key to enable efficient query processing by fast data retrieval and aggregation under such a system architecture.This thesis presents a new in-memory indexing scheme for distributed append-only stores. Our new scheme utilizes traditional index structures based on B+ trees and their variants to create an efficient in-memory template-based tree without the overhead of expensive node splits. We also propose the use of optimized domain partitioning and multi-thread insertion techniques to exploit the advantages of the template B+ tree structure. Our empirical evaluations show that insertion throughput is five times higher with template B+ trees than with HBase, on a variety of real and synthetic workloads.

【 预 览 】
附件列表
Files Size Format View
Template B+ trees: an index scheme for fast data streams with distributed append-only stores 947KB PDF download
  文献评价指标  
  下载次数:12次 浏览次数:12次