Non-Volatile Memory Tia ianzheng Wang Justin Levandoski - PowerPoint PPT Presentation

Easy Lock-Free Programming in Non-Volatile Memory Tia ianzheng Wang Justin Levandoski Paul Larson

The making of concurrent data structures • With locks: one thread at a time • Lock-free: use atomic instructions directly Critical section Data races • Limited concurrency • More concurrency, faster • Deadlocks • Higher CPU utilization • Extremely difficult • Relatively easy T. Wang, J. Levandoski, P. Larson Easy Lock-Free Programming in Non-Volatile Memory 2

Lock-free data structures • Queues • Hash tables • Trees • Linked lists and skip lists . . . + many more . . . Widely used in performance-critical systems T. Wang, J. Levandoski, P. Larson Easy Lock-Free Programming in Non-Volatile Memory 3

Lock-free in persistent memory: more potential • Fast performance, high CPU utilization • Instant recovery • Fewer layers: simplified persistence model/architecture Persistent Previously: DRAM Now: memory Single-level (or with DRAM) Tree index Sounds great, but not automatic T. Wang, J. Levandoski, P. Larson Easy Lock-Free Programming in Non-Volatile Memory 4

Lock-free programming: even harder in PM • Inherits all the existing challenges in DRAM • Race conditions Actual persisted Thread 1 Thread 2 • Memory reclamation issues state: PM Cache PM Cache PM • New challenges • Volatile CPU caches (new) A A A • Recovery (new) • Permanent memory leaks (new) B B Unreachable Difficult and error-prone to deal with using hardware instructions T. Wang, J. Levandoski, P. Larson Easy Lock-Free Programming in Non-Volatile Memory 5

Compare-and-swap (CAS) Conceptually: CAS(*address, expected, desired) v = *address if v == expected then *address = desired return v Powerful, but limited to single 8-byte words T. Wang, J. Levandoski, P. Larson Easy Lock-Free Programming in Non-Volatile Memory 6

Example: doubly-linked list 1 Insert C between B and D: CAS( B .next, D , C ) B D C T. Wang, J. Levandoski, P. Larson Easy Lock-Free Programming in Non-Volatile Memory 7

Example: doubly-linked list Insert C between B and D: B D C Visible for forward scan Intermediate state exposed to concurrent threads T. Wang, J. Levandoski, P. Larson Easy Lock-Free Programming in Non-Volatile Memory 8

Example: doubly-linked list Inconsistent list if crashes 2 Insert C between B and D: CAS( D .prev, B , C ) B D C May compete with other inserts Many papers on devising lock-free doubly-linked lists T. Wang, J. Levandoski, P. Larson Easy Lock-Free Programming in Non-Volatile Memory 9

Persistent multi-word CAS (PMwCAS)* • Atomically changing multiple 8-byte words with persistence guarantee • Either all specified updates succeed, or none of them • Software-only • Lock-free • Based on a volatile MwCAS design [Harris+Fraser+Pratt 2002] • We made it work on persistent memory • With new necessary features on • Guaranteeing persistence • Recovery • Persistent memory management * Easy Lock-Free Indexing in Non-Volatile Memory, ICDE 2018 T. Wang, J. Levandoski, P. Larson Easy Lock-Free Programming in Non-Volatile Memory 11

The PMwCAS operation • Application specifies words to change atomically, in a descriptor • Following CAS interface for each word • Issue (launch) the operation after adding all words • Final result: either all words changed, or none of them PMwCAS descriptor Address 1 Expected 1 Desired 1 Address 2 Expected 2 Desired 2 Address 3 Expected 3 Desired 3 . . . Status T. Wang, J. Levandoski, P. Larson Easy Lock-Free Programming in Non-Volatile Memory 12

Doubly-linked list with PMwCAS Insert C between B and D: B D C PMwCAS descriptor PMwCAS(desc) &B.next D C &D.prev B C One step , C becomes atomically visible in both directions T. Wang, J. Levandoski, P. Larson Easy Lock-Free Programming in Non-Volatile Memory 13

So how does it work exactly? • PMwCAS algorithm • Guaranteeing persistence • Flush-upon-read – no logging needed • Recovery • Memory Management • Preventing persistent memory leaks • Integration with persistent memory allocator • Epoch-based memory reclamation T. Wang, J. Levandoski, P. Larson Easy Lock-Free Programming in Non-Volatile Memory 14

So how does it work exactly? • PMwCAS algorithm • Guaranteeing persistence • Flush-upon-read – no logging needed • Recovery • Memory Management • Preventing persistent memory leaks • Integration with persistent memory allocator • Epoch-based memory reclamation See paper for more details T. Wang, J. Levandoski, P. Larson Easy Lock-Free Programming in Non-Volatile Memory 15

PMwCAS algorithm 1. Persist entire descriptor Conflicting Phase 1 threads will “help” Install a pointer to descriptor on each word (using CAS) each other Change to ‘failed’ status if any CAS failed Otherwise change to ‘succeed’ status. 2. Persist all modified words Phase 2 If Phase 1 succeeded, install new values Otherwise roll back 3. Persist all modified words + set status to ‘finished’ + flush status T. Wang, J. Levandoski, P. Larson Easy Lock-Free Programming in Non-Volatile Memory 16

Recovery • Fixed-size descriptor pool • Doesn’t need to be large, 1000s -10k is good • Recovery = scan descriptor pool • Roll forward ‘succeeded’ PMwCAS operations • Roll back failed ones • Application-transparent recovery • Application transforms data structure from one consistent state to another • No application-specific code for recovery needed! • Volatile and persistent versions use the same code (turn persistence on/off) T. Wang, J. Levandoski, P. Larson Easy Lock-Free Programming in Non-Volatile Memory 17

Case studies and adoptions • Two non-trivial data structures, focusing on database index structures • Bw-Tree • Lock-free B+-tree in Microsoft SQL Server Hekaton • See details in paper • Doubly-linked skip list • Bz-Tree [Arulraj et al. VLDB 2018] • A new B+-tree for persistent memory • By Microsoft Research • Other institutions using PMwCAS now for their own research T. Wang, J. Levandoski, P. Larson Easy Lock-Free Programming in Non-Volatile Memory 18

Evaluation • Quad-socket, 8-core Xeon E5-4620 clocked at 2.2GHz • 32 physical cores, 64 hyperthreads in total • 256KB/2MB/16MBL1/L2/L3 caches • Persistent memory emulation • 512GB DRAM – assuming NVDIMM-N • CLFLUSH (SFENCE + CLFLUSHOPT) • Upper bound overhead • SFENCE + CLWB emulation with injected delays • Calibrated using non-temporal writes • Synthetic workloads • Insert/delete/search/scan on index structures (Bw-tree and doubly-linked skip list) • 20% write + 80% read (80% search + 20% range scan) T. Wang, J. Levandoski, P. Larson Easy Lock-Free Programming in Non-Volatile Memory 19

PMwCAS: easy implementation + fast • Code almost as mechanical as lock-based (check out repo) • < 10% overhead under realistic workloads (80% read + 20% write) Doubly-linked skip list Bw-Tree T. Wang, J. Levandoski, P. Larson Easy Lock-Free Programming in Non-Volatile Memory 20

Summary • Lock-free programming is already very hard in volatile memory • Even harder in persistent memory • Performance • Persistence and recovery • Race conditions • PMwCAS: primitive for easy lock-free programming in persistent memory • Code almost as simple as lock based – everything covered by PMwCAS • Transparent recovery – no application-specific code needed  Use the same code for both persistent and volatile versions Now open source at: Thank you! https://github.com/Microsoft/pmwcas T. Wang, J. Levandoski, P. Larson Easy Lock-Free Programming in Non-Volatile Memory 21

Non-Volatile Memory Tia ianzheng Wang Justin Levandoski - PowerPoint PPT Presentation

Easy Lock-Free Programming in Non-Volatile Memory Tia ianzheng Wang Justin Levandoski Paul Larson The making of concurrent data structures With locks: one thread at a time Lock-free: use atomic instructions directly

Encrypted Non-volatile Main Memory Systems Yu Hua Huazhong University of Science and Technology

Soft Updates Made Simple and Fast on Non-volatile Memory Mingkai Dong , Haibo Chen Institute of

Object-Oriented Recovery for Non-volatile Memory Nachshon Cohen, David Aksun, James Larus EPFL 10

Architectural Support for Atomic Durability in Non-Volatile Memory Arpit Joshi , Vijay Nagarajan,

Memory II. Memory improvement III. Problems with memory 3 systems/stages of Memory: memory

A Persistent Friedman Lock-Free Queue Maurice Herlihy for Non-Volatile Memory Virendra

HOOP: Efficient Hardware-Assisted Out-of-Place Update for Non-Volatile Memory Miao Cai Chance

A Write-friendly Hashing Scheme for Non-volatile Memory Systems Pengfei Zuo and Yu Hua Huazhong

Managing Non-Volatile Memory in Database Systems A review by Apaar Shanker DATA ANALYTICS

Storage Class Memory Towards a disruptively low-cost solid-state non-volatile memory Science

Rethinking Applications in the NVM Era Amitabha Roy ex- Intel Research NVM = Non Volatile

NOVA-Fortis: A Fault-Tolerant Non- Volatile Main Memory File System Jian Andiry Xu, Lu Zhang ,

A Persistent Lock- Free Queue for Maurice Herlihy Non-Volatile Virendra Memory (PPoPP18)

1 Memory SoC Persistent Memory-Driven Memory Memory Processor-Centric Memory SoC SoC

Networks Computer-Computer Comm CPU CPU CPU CPU Memory Device Device Memory Memory

Emerging Non Volatile Memory Resistive Memory Technologies Key concept: replace DRAM cell

Minutes of October 2019 CAS Senate meeting 104 Gore October 21, 2019 Present: J.

Minutes for University of Delaware CAS Faculty Senate meeting March 18, 2019 104 Gore Present:

Strengthening Early Childhood in Kansas in 2019 WEBINAR August 7, 2019 Statewide Needs

A Tool-based Approach to Teaching Parallel and Concurrent Programming Caitlin Sadowski Joint

Relaxed Linear References for Lock-free Data Structures Elias Castegren , Tobias Wrigstad

Benchmarks and Quality Evaluation of CAS ACA 2016 Kassel Germany Albert Heinle Symbolic

A Two-way Path between Formal and Informal Design of Embedded Systems Mingshuai Chen 1 , Anders P.

Next-Generation Secure Public-Key Infrastructures Pawe Szaachowski Network Security Group,

Sambuz

Useful Links

Newsletter

Mail Us