学位论文详细信息
Efficient programming of massive-memory machines
Key-value store;Address space;Operating system;Big data;Memory;Scalability;Concurrency
Merritt, Alexander Marshall ; Gavrilovska, Ada Computer Science Schwan, Karsten Kim, Taesoo Ramachandran, Umakishore Qureshi, Moinuddin K. Milojicic, Dejan S. ; Gavrilovska, Ada
University:Georgia Institute of Technology
Department:Computer Science
关键词: Key-value store;    Address space;    Operating system;    Big data;    Memory;    Scalability;    Concurrency;   
Others  :  https://smartech.gatech.edu/bitstream/1853/59202/1/MERRITT-DISSERTATION-2017.pdf
美国|英语
来源: SMARTech Repository
PDF
【 摘 要 】

New and emerging memory technologies combined with enormous growths in data collection and mining within industry are giving rise to servers with massive pools of main memory — terabytes of memory, disaggregated bandwidth across tens of sockets, and hundreds of cores. But, these systems are proving difficult to program efficiently, posing scalability challenges for all layers in the software stack, specifically in managing in-memory data sets. Larger and longer-lived data sets managed by key-value stores require minimizing over- commitments of memory, but current designs trade off performance scalability and memory bloat. Furthermore, opaque operating system abstractions like virtual memory and ill-matched, non-portable interfaces used to manipulate them make the expression of semantic relationships between applications and their data difficult: sharing in-memory data sets requires careful control over internal address mappings, but mmap, ASLR, and friends remove this control.To explore and address these challenges, this dissertation is composed of two pieces: (1) We introduce and compare a new design for key-value stores, a multi-head log-structured allocator whose design makes explicit use of a machine’s configuration to support linear scalability of common read- and write-heavy access patterns. Our implementation of this design, called Nibble, is written in 4k lines of Rust. (2) Going beyond key-value stores, the second part of this dissertation introduces new general support within the operating system enabling applications to more explicitly manage and share pointer-based in-memory data: we introduce explicit control over address space allocation and layout by promoting address spaces as an explicit abstraction. Processes may associate with multiple address spaces, and threads may arbitrarily switch between them to access infinite data set sizes without encountering typical bottlenecks from legacy mmap interfaces. Our implementation of this design is in DragonFly BSD.

【 预 览 】
附件列表
Files Size Format View
Efficient programming of massive-memory machines 2220KB PDF download
  文献评价指标  
  下载次数:6次 浏览次数:14次