IEEE Access | |
SFM: Mitigating Read/Write Amplification Problem of LSM-Tree-Based Key-Value Stores | |
Hoyoung Lee1  Young Ik Eom2  Minho Lee3  | |
[1] Department of Computer Science and Engineering, Sungkyunkwan University, Suwon, South Korea;Department of Electrical and Computer Engineering, College of Computing and Informatics, Sungkyunkwan University, Suwon, South Korea;Department of Electrical and Computer Engineering, Sungkyunkwan University, Suwon, South Korea; | |
关键词: Persistent key-value stores; LSM-tree; storage management; database management systems (DBMS); compaction; merge policy; | |
DOI : 10.1109/ACCESS.2021.3098736 | |
来源: DOAJ |
【 摘 要 】
Persistent key-value stores have been widely adopted as storage engines for modern IT infrastructures because they provide high performance with simple design principles. Moreover, many key-value stores commonly employ LSM-tree as their index structure due to its attractive features such as high write throughput and storage space efficiency. Unfortunately, LSM-tree has critical drawbacks in that it leads to write/read amplification problem. One of the prevalent solutions for remedying the write amplification problem is the tiering merge policy that reduces the number of rewrites by delaying merge operations. However, in spite of this advantage, the tiering merge policy may lead to a side-effect that induces high read amplification, increasing search/scan cost for upcoming read operations. In this paper, we concentrate on mitigating the high read amplification problem of the tiering merge policy, while maintaining its low write amplification. To achieve this, we propose a novel LSM-tree scheme, called Spatially Fragmented LSM-tree (SFM), which delays merge operations only for the non-read-intensive key-spaces. For this, SFM identifies the read intensity of each key-spaces by dynamically estimating their read/write hotness. We have implemented SFM based on PebblesDB and evaluated the performance benefits of our scheme under real-world workloads of Facebook. Experimental results clearly show that our scheme improves throughput by up to
【 授权许可】
Unknown