学位论文详细信息
Scaling synchronization primitives
OS;Concurrency;Mutual exclusion;File system;Scalability;Timestamping;Database
Kashyap, Sanidhya ; Kim, Taesoo Min, Changwoo Computer Science Gavrilovska, Ada Calciu, Irina Arulraj, Joy ; Kim, Taesoo
University:Georgia Institute of Technology
Department:Computer Science
关键词: OS;    Concurrency;    Mutual exclusion;    File system;    Scalability;    Timestamping;    Database;   
Others  :  https://smartech.gatech.edu/bitstream/1853/63677/1/KASHYAP-DISSERTATION-2020.pdf
美国|英语
来源: SMARTech Repository
PDF
【 摘 要 】

Over the past decade, multicore machines have become the norm. A single machine is capable of having thousands of hardware threads or cores. Even cloud providers offer suchlarge multicore machines for data processing engines and databases. Thus, a fundamental question arises is how efficient are existing synchronization primitives— timestamping and locking—that developers use for designing concurrent, scalable, and performant applications. This dissertation focuses on understanding the scalability aspect of these primitives, andpresents new algorithms and approaches, that either leverage the hardware or the applicationdomain knowledge, to scale up to hundreds of cores. First, the thesis presents Ordo , a scalable ordering or timestamping primitive, that formsthe basis of designing scalable timestamp-based concurrency control mechanisms. Ordo relies on invariant hardware clocks and provides a notion of a globally synchronized clockwithin a machine. We use the Ordo primitive to redesign a synchronization mechanism and concurrency control mechanisms in databases and software transactional memory. Later, this thesis focuses on the scalability aspect of locks in both virtualized and non-virtualized scenarios. In a virtualized environment, we identify that these locks suffer fromvarious preemption issues due to a semantic gap between the hypervisor shceduler and a virtual machine scheduler—the double scheduling problem. We address this problemby bridging this gap, in which both the hypervisor and virtual machines share minimal scheduling information to avoid the preemption problems. Finally, we focus on the design of lock algorithms in general. We find that locks in practice have discrepancies from locks in design. For example, popular spinlocks suffer from excessive cache-line bouncing in multicore (NUMA) systems, while state-of-the-art locks exhibit sub-par single-thread performance. We classify several dominating factors that impact the performance of lock algorithms. We then propose a new technique, shuffling, that can dynamically accommodate all these factors, without slowing down the critical path of the lock. The key idea of shuffling is to re-order the queue of threads waiting to acquire the lock with some pre-established policy. Using shuffling, we propose a family of locking algorithms, called SHFLLOCKS that respect all factors, efficiently utilize waiters, and achieve the best performance.

【 预 览 】
附件列表
Files Size Format View
Scaling synchronization primitives 1399KB PDF download
  文献评价指标  
  下载次数:18次 浏览次数:13次