Read-Copy Update Todays Lecture System Calls Kernel (RCU) RCU - PDF document

3/1/20 COMP 790: OS Implementation COMP 790: OS Implementation Logical Diagram Binary Memory Threads Formats Allocators User Read-Copy Update Today’s Lecture System Calls Kernel (RCU) RCU File System Networking Sync Don Porter Memory CPU Device Management Scheduler Drivers Hardware Interrupts Disk Net Consistency 1 2 COMP 790: OS Implementation COMP 790: OS Implementation Motivation RCU in a nutshell (from Paul McKenney’s Thesis) • Think about data structures that are mostly read, occasionally written – Like the Linux dcache • RW locks allow concurrent reads – Still require an atomic decrement of a lock counter Performance of RW – Atomic ops are expensive lock only marginally • Idea: Only require locks for writers; carefully update better than mutex data structure so readers see consistent views of data lock 3 4 COMP 790: OS Implementation COMP 790: OS Implementation Principle (1/2) Principle (2/2) • Locks have an acquire and release cost • Reader/writer locks may allow critical regions to execute in parallel – Substantial, since atomic ops are expensive • For short critical regions, this cost dominates • But they still serialize the increment and decrement performance of the read count with atomic instructions – Atomic instructions performance decreases as more CPUs try to do them at the same time • The read lock itself becomes a scalability bottleneck, even if the data it protects is read 99% of the time 5 6 1

3/1/20 COMP 790: OS Implementation COMP 790: OS Implementation Lock-free data structures RCU: Split the difference • Some concurrent data structures have been • One of the hardest parts of lock-free algorithms is proposed that don’t require locks concurrent changes to pointers – So just use locks and make writers go one-at-a-time • They are difficult to create if one doesn’t already suit your needs; highly error prone • But, make writers be a bit careful so readers see a consistent view of the data structures • Can eliminate these problems • If 99% of accesses are readers, avoid performance- killing read lock in the common case 7 8 COMP 790: OS Implementation COMP 790: OS Implementation Example: Linked lists Example: Linked lists This implementation needs a lock Insert(B) Insert(B) A C E A C E B’s next B B pointer is uninitialized; Reader gets a Reader goes to C or B- Reader goes to B page fault --either is ok 9 10 COMP 790: OS Implementation COMP 790: OS Implementation Example recap Example 2: Linked lists • Notice that we first created node B, and set up all Delete (C) outgoing pointers • Then we overwrite the pointer from A A C E – No atomic instruction or reader lock needed – Either traversal is safe – In some cases, we may need a memory barrier • Key idea: Carefully update the data structure so that a reader can never follow a bad pointer – Writers still serialize using a lock Reader may still be looking at C. When can we delete? 11 12 2

3/1/20 COMP 790: OS Implementation COMP 790: OS Implementation Problem Worst-case scenario • We logically remove a node by making it unreachable • Reader follows pointer to node X (about to be freed) to future readers • Another thread frees X – No pointers to this node in the list • X is reallocated and overwritten with other data • We eventually need to free the node’s memory • Reader interprets bytes in X->next as pointer, – Leaks in a kernel are bad! segmentation fault • When is this safe? – Note that we have to wait for readers to “move on” down the list 13 14 COMP 790: OS Implementation COMP 790: OS Implementation Quiescence Quiescence, cont • Trick: Linux doesn’t allow a process to sleep while • There are some optimizations that keep the per-CPU traversing an RCU-protected data structure counter to just a bit – Includes kernel preemption, I/O waiting, etc. – Intuition: All you really need to know is if each CPU has called schedule() once since this list became non-empty • Idea: If every CPU has called schedule() (quiesced), – Details left to the reader then it is safe to free the node – Each CPU counts the number of times it has called schedule() – Put a to-be-freed item on a list of pending frees – Record timestamp on each CPU – Once each CPU has called schedule, do the free 15 16 COMP 790: OS Implementation COMP 790: OS Implementation Limitations Nonetheless • No doubly-linked lists • Linked lists are the workhorse of the Linux kernel • Can’t immediately reuse embedded list nodes • RCU lists are increasingly used where appropriate – Must wait for quiescence first • Improved performance! – So only useful for lists where an item’s position doesn’t change frequently • Only a few RCU data structures in existence 17 18 3

3/1/20 COMP 790: OS Implementation COMP 790: OS Implementation Big Picture API • Drop in replacement for read_lock: – rcu_read_lock() • Carefully designed data • Wrappers such as rcu_assign_pointer() and structures Hash Pending rcu_dereference_pointer() include memory barriers – Readers always see List Signals consistent view • Rather than immediately free an object, use • Low-level “helper” call_rcu(object, delete_fn) to do a deferred deletion functions encapsulate RCU “library” complex issues – Memory barriers – Quiescence 19 20 COMP 790: OS Implementation COMP 790: OS Implementation Code Example Simplified Code Example From fs/binfmt_elf.c From arch/x86/include/asm/rcupdate.h rcu_read_lock(); #define rcu_dereference(p) ({ \ prstatus->pr_ppid = typeof(p) ______p1 = (*(volatile typeof(p)*) &p);\ task_pid_vnr(rcu_dereference(p->real_parent)); read_barrier_depends(); // defined by arch \ rcu_read_unlock(); ______p1; // “returns” this value \ }) 21 22 COMP 790: OS Implementation COMP 790: OS Implementation Code Example From McKenney and Walpole, Introducing Technology into the Linux Kernel: A Case Study From fs/dcache.c static void d_free(struct dentry *dentry) { /* ... Ommitted code for simplicity */ call_rcu(&dentry->d_rcu, d_callback); } // After quiescence, call_rcu functions are called static void d_callback(struct rcu_head *rcu) { struct dentry *dentry = container_of(head, struct dentry, d_rcu); __d_free(dentry); // Real free } 23 24 4

3/1/20 COMP 790: OS Implementation Summary • Understand intuition of RCU • Understand how to add/delete a list node in RCU • Pros/cons of RCU 25 5

Read-Copy Update Todays Lecture System Calls Kernel (RCU) RCU - PDF document

3/1/20 COMP 790: OS Implementation COMP 790: OS Implementation Logical Diagram Binary Memory Threads Formats Allocators User Read-Copy Update Todays Lecture System Calls Kernel (RCU) RCU File System Networking Sync Don Porter

T Levels/Skills Plan Body Copy Body Copy Body Copy Body Copy Body Copy Body Copy Body Copy Body

Read-Copy-Update for Read-Copy-Update for HelenOS HelenOS http://d3s.mff.cuni.cz Martjn Dck

RCUfs: A Read Copy Update file system An experimental method for handling file system locking

Studying copy-on-read and copy-on-write techniques on block device level to aid in large

AACS AACS Managed Copy Managed Copy Status Update Status Update And Implementation And

Advanced Access Content System (AACS) AACS Managed Copy Overview Presented to BD GPC October

Communication in History: The Key to Understanding Get your digital copy today at

Copying Objects: Reference Copy Copies: Reference vs. Shallow vs. Deep Reference Copy c1 := c2

Copy Constructor 1 Copy Constructor 1. Initialize one object from another of the same type

CS 225 Data Structures Fe February 3 - Li Lifecycle G G Carl Evans Cop Copy Con Constru

CSCI 104 Copy Semantics Mark Redekopp David Kempe 2 Copy constructors and assignment operators

Read Write Inc. Phonics MISS CASBAN About Read Write Inc Phonics

Read Write Inc. Phonics Parents Meeting Who is Read Write Inc. Phonics for? Read Write Inc.

Kindergarten Reading Getting Ready for Kindergarten Oregon Trail School District Read, Read

Read-Copy Update (RCU) Don Porter CSE 506 RCU in a nutshell Think about data structures

Read Write Inc. Phonics Parents Meeting Teach a child to read and keep that child reading

Adversarial Search and Game- Playing C H A P T E R 6 C M P T 3 1 0 : S P R I N G 2 0 1 1 H

Read-Copy Update User Todays Lecture System Calls Kernel (RCU) RCU File System

Thermal States of Transiently Accreting Neutron Stars in Quiescence Sophia Han University of

The Dependence of Quenching upon Inner Galactic Structure Edmond Cheung Sandy Faber David Koo

CS440/ECE448 Lecture 8: Two-Player Games Slides by Svetlana Lazebnik 9/2016 Modified by Mark

A logical characterisation for input output conformance simulation iocos (Work in Progress) Luca

Dynamic Adaptation Dynamic Adaptation Dynamic Adaptation Dynamic Adaptation Minema Minema

TGV Gnration de tests de conformit partir de modles formels Thierry Jron (INRIA /