Parallel Execution for Conflicting Transactions Neha Narula Thesis - PowerPoint PPT Presentation

Parallel Execution for Conflicting Transactions Neha Narula Thesis Advisors: Robert Morris and Eddie Kohler 1 ¡

Database-backed applications require good performance WhatsApp: • 1M messages/sec Facebook: • 1/5 of all page views in the US Twitter: • Millions of messages/sec from mobile devices

Databases are difficult to scale Application servers are stateless; add more for more traffic Database is stateful 3 ¡

Scale up using multi-core databases Context � • Many cores • In-memory database • OLTP workload • Transactions are stored procedures No stalls due to users, disk, or network 4 ¡

Goal Execute transactions in parallel Throughput 0 10 20 30 40 50 60 70 80 cores 5 ¡

Challenge Conflicting data access Conflict: two Throughput transactions access the same data and one is a write 0 10 20 30 40 50 60 70 80 cores 6 ¡

Database transactions should be serializable k=0,j=0 � TXN1( k, j Key) (Value, Value) { To the programmer: � a := GET( k ) b := GET( j ) TXN1 TXN2 return a, b } or � TXN2( k, j Key) { TXN2 TXN1 ADD( k, 1) ADD( j ,1) time } Valid return values for TX1: (0,0) � or (1,1) � 7 ¡

Executing in parallel could produce incorrect interleavings TX1 returns k=0,j=0 GET( k )GET( j ) (1,0) � ADD( k, 1) ADD( j ,1) time Transactions are incorrectly seeing intermediate values 8 ¡

Concurrency control enforces serial execution ADD(x,1) ADD(x,1) ADD(x,1) time Transactions on the same records execute one at a time 9 ¡

Concurrency control enforces serial execution core 0 ADD(x,1) core 1 ADD(x,1) core 2 ADD(x,1) time Serial execution results in a lack of scalability 10 ¡

Idea #1: Split representation for parallel execution core 0 x 0 :3 x 0 :1 x 0 :0 ADD(x,1) ADD(x,1) ADD(x,1) per-core values core 1 ADD(x,1) ADD(x,1) ADD(x,1) x 1 :1 x 1 :3 x 1 :0 for record x x is split across core 2 ADD(x,1) ADD(x,1) x 2 :1 x 2 :2 x 2 :0 cores time x = 8 • Transactions on the same record can proceed in parallel on per-core values • Reconcile per-core values for a correct value 11 ¡

Other types of operations do not work with split data core 0 x 0 :3 GET(x) core 1 ADD(x,1) x 1 :4 x 1 :3 core 2 PUT(x,42) x 2 :42 x 2 :2 x = ?? time • Executing with split data does not work for all types of operations • In a workload with many reads, better to not use per- core values 12 ¡

Idea #2: Reorder transactions core 0 ADD(x,1) ADD(x,1) reconcile � core 1 GET(x) ADD(x,1) GET(x) ADD(x,1) core 2 ADD(x,1) ADD(x,1) GET(x) Can execute in parallel Can execute in parallel time • Key Insight : Reordering transactions reduces – Cost of reconciling – Cost of conflict • Serializable execution 13 ¡

Idea #3: Phase reconciliation core 0 reconcile � Split Joined Split split � core 1 Phase Phase Phase Conventional core 2 concurrency control time • Database automatically detects contention to split a record between cores • Database cycles through phases : split and joined • Doppel: An in-memory key/value database 14 ¡

Challenges Combining split data with general database Combining split data with general database workloads: workloads: 1. How to handle transactions with multiple keys and different operations? 2. Which operations can use split data correctly? 3. How to dynamically adjust to changing workloads? 15 ¡

Contributions • Synchronized phases to support any transaction and reduce reconciliation overhead • Identifying a class of splittable operations • Detecting contention to dynamically split data 16 ¡

Outline • Challenge 1: Phases • Challenge 2: Operations • Challenge 3: Detecting contention • Performance evaluation • Related work and discussion 17 ¡

Split phase split phase core 0 ADD(x 0 ,1) core 1 ADD(x 1 ,1) core 2 ADD(x 2 ,1) • The split phase executes operations on contended records on per-core slices (x 0 , x 1 , x 2 ) 18 ¡

Reordering by stashing transactions split phase core 0 ADD(x 0 ,1) GET(x) core 1 ADD(x 1 ,1) ADD(x 1 ,1) core 2 ADD(x 2 ,1) • Split records have selected operations for a given split phase • Cannot correctly process a read of x in the current state • Stash transaction to execute after reconciliation 19 ¡

split phase core 0 ADD(x 0 ,1) core 1 ADD(x 1 ,1) ADD(x 1 ,1) core 2 ADD(x 2 ,1) GET(x) • All cores hear they should reconcile their per-core state • Stop processing per-core writes 20 ¡

reconciliation joined phase core 0 x = x + x 0 core 1 x = x + x 1 core 2 x = x + x 2 GET(x) • Reconcile state to global store • Wait until all cores have finished reconciliation • Resume stashed read transactions in joined phase 21 ¡

reconciliation joined phase core 0 x = x + x 0 GET(x) core 1 x = x + x 1 core 2 x = x + x 2 • Reconcile state to global store • Wait until all cores have finished reconciliation • Resume stashed read transactions in joined phase 22 ¡

Transitioning between phases joined phase split phase GET(x) core 0 core 1 ADD(x 1 ,1) core 2 GET(x) ADD(x 2 ,1) • Process stashed transactions in joined phase using conventional concurrency control • Joined phase is short; quickly move on to next split phase 23 ¡

Challenge #1 How to handle transactions with multiple keys and different operations? • Split and non-split data • Different operations on a split record • Multiple split records 24 ¡

Transactions on split and non-split data split phase core 0 ADD(x 0 ,1) core 1 ADD(x 1 ,1) PUT(y,2) core 2 ADD(x 3 ,1) PUT(y,2) • Transactions can operate on split and non-split records • Rest of the records (y) use concurrency control • Ensures serializability for the non-split parts of the transaction 25 ¡

Transactions with different operations on a split record split phase core 0 ADD(x 0 ,1) ADD(x,1)GET(x) core 1 ADD(x 1 ,1) PUT(y,2) core 2 ADD(x 3 ,1) PUT(y,2) • A transaction which executes different operations on a split record is also stashed, even if one is a selected operation 26 ¡

All records use concurrency control in joined phase split phase joined phase core 0 ADD(x 0 ,1) ADD(x,1)GET(x) core 1 ADD(x 1 ,1) PUT(y,2) core 2 ADD(x 3 ,1) PUT(y,2) ADD(x,1)GET(x) • In joined phase, no split data, no split operations • ADD also uses concurrency control 27 ¡

Transactions with multiple split records split phase core 0 ADD(x 0 ,1) core 1 ADD(x 1 ,1) core 2 ADD(x 2 ,1)MULT(y 2 ,2) MULT(y 2 ,1) • x and y are split and operations on them use per-core slices (x 0 , x 1 , x 2 ) and (y 0 , y 1 , y 2 ) • Split records all use the same synchronized phases 28 ¡

Reconciliation must be synchronized joined phase reconciliation core 0 x = x + x 0 y = y * y 0 GET(x)GET(y) core 1 x = x + x 1 y = y * y 1 core 2 y = y * y 2 x = x + x 2 • Cores reconcile all of their split records: ADD for x and MULT for y • Parallelize reconciliation • Guaranteed to read values atomically in next joined phase 29 ¡

Delay to reduce overhead of reconciliation joined split phase phase core 0 ADD(x 0 ,1) GET(x) ADD(x 0 ,1) GET(x) GET(x) core 1 ADD(x 1 ,1) ADD(x 1 ,1) GET(x) core 2 ADD(x 2 ,1) ADD(x 2 ,1) ADD(z,1) GET(x) GET(x) • Wait to accumulate stashed transactions, many in joined phase • Reads would have conflicted; now they do not 30 ¡

When does Doppel switch phases? (n s > 0 && t s > 10ms) || n s > 100,000 Joined Split phase phase n s = # stashed t s = time in split phase Completed stashed txns 31 ¡

Outline • Challenge 1: Phases • Challenge 2: Operations • Challenge 3: Detecting contention • Performance evaluation • Related work and discussion 32 ¡

Challenge #2 Define a class of operations that is correct and performs well with split data. 33 ¡

Operations in Doppel Developers write transactions as stored procedures which are composed of operations on database keys and values Operations on numeric void ADD( k , n ) values which modify the void MAX( k , n ) existing value void MULT( k , n ) 34 ¡

Why can ADD(x,1) execute correctly on split data in parallel? • Does not return a value • Commutative ADD(k,n) { v[k] = v[k] + n } 35 ¡

Commutativity Two operations commute if executed on the database s in either order, they produce the same state s’ and the same return values. o p = � s s ’ p o 36 ¡

Hypothetical design: commutativity is sufficient core 0 o 5 T5 T1 o 1 o 1 o 5 log: core 1 o 2 T2 o 4 T4 log: o 2 o 4 core 2 o 3 T3 o 6 T6 log: o 3 o 6 • Not-split operations in transactions execute • Split operations are logged • They have no return values and are on different data , so cannot affect transaction execution 37 ¡

Hypothetical design: apply logged operations later core 0 T5 T1 o 1 o 5 log: core 1 T2 T4 log: o 2 o 4 core 2 T3 T6 log: o 3 o 6 • Logged operations are applied to database state in a different order than their containing transactions 38 ¡

Correct because split operations can be applied in any order o 1 o 5 o 4 o 2 o 3 o 6 s s ’ = � o 4 o 5 o 1 o 2 o 3 o 6 T1 T2 T3 T4 T5 T6 After applying the split operations in any order , same database state 39 ¡

Is commutativity enough? For correctness, yes. For performance, no. Which operations can be summarized ? 40 ¡

Parallel Execution for Conflicting Transactions Neha Narula Thesis - PowerPoint PPT Presentation

Parallel Execution for Conflicting Transactions Neha Narula Thesis Advisors: Robert Morris and Eddie Kohler 1 Database-backed applications require good performance WhatsApp: 1M messages/sec Facebook: 1/5 of all page views in the US

Nested Transactions Nested Transactions Flat transactions The rules for committing of

Non-conflicting and Conflicting Parts of Belief Functions Milan Daniel Institute of Computer

MASTERING STRATEGY EXECUTION 18 BEST PRACTICES FOR STRATEGY EXECUTION STRATEGY EXECUTION AS

13.1 Introduction 13.2 Transactions 13.3 Nested transactions 13.4 Locks 13.5 Optimistic

Module 15: Managing Transactions and Locks Overview Introduction to Transactions and Locks

Parallel Query Execution in POLARDB for MySQL ystein Grvlen Benny Wang Alibaba Cloud Agenda

Detection of Conflicting Functional Requirements in a Use Case-Driven Approach Jan Hendrik

Conflicting Interests and Language Change GURT 2014 Christopher Ahern & Robin Clark

20 0 6 Transactions $1.01 billion in bonds 18 transactions 20 0 6 Transactions By Num

Transactional Recovery Transactional Recovery Transactions: ACID Properties Transactions: ACID

Transactional Recovery Transactional Recovery Transactions: ACID Properties Transactions: ACID

Flat and nested distributed Outline transactions Flat and nested distributed transactions

Database Management Objectives of Lecture 7 Systems Transactions Models Transactions Models

execution states with swapping Processes, Execution, and State 3F. Execution State Model exit

Parallel Numerical Algorithms Chapter 2 Parallel Thinking Section 2.2 Parallel

Introduction Introduction What is Parallel Architecture? Why Parallel Architecture? Evolution

Linear-Programming Decoding of Tanner Codes with Local-Optimality Certificates Nissim Halabi Guy

Design Principles for Scaling Multi-core OLTP Under High Contention Kun Ren, Jose Faleiro ,

Project discussion, 22 May: Mandatory but ungraded. Thanks for doing this June 4, 6pm deadline for

Thinking & Working Politically An in intr troduction to o key id ideas, example les an

Disclosures New Drugs for Osteoporosis Edward Hsiao receives research grant support and Bone

X.25 Slow, Safe and Reliable 2005/03/11 (C) Herbert Haas What is X.25 ? Connection-oriented

Air Force Research Laboratory C/NOFS Thermospheric Research and Reentry EXperiment (T-RREX)

Solar+Storage Virtual Power Plant in the Country February 12, 2020 HOUSEKEEPING Join audio:

Parallel Execution for Conflicting Transactions Neha Narula Thesis - PowerPoint PPT Presentation

Parallel Execution for Conflicting Transactions Neha Narula Thesis Advisors: Robert Morris and Eddie Kohler 1 Database-backed applications require good performance WhatsApp: 1M messages/sec Facebook: 1/5 of all page views in the US

Nested Transactions Nested Transactions Flat transactions The rules for committing of

Non-conflicting and Conflicting Parts of Belief Functions Milan Daniel Institute of Computer

MASTERING STRATEGY EXECUTION 18 BEST PRACTICES FOR STRATEGY EXECUTION STRATEGY EXECUTION AS

13.1 Introduction 13.2 Transactions 13.3 Nested transactions 13.4 Locks 13.5 Optimistic

Module 15: Managing Transactions and Locks Overview Introduction to Transactions and Locks

Parallel Query Execution in POLARDB for MySQL ystein Grvlen Benny Wang Alibaba Cloud Agenda

Detection of Conflicting Functional Requirements in a Use Case-Driven Approach Jan Hendrik

Conflicting Interests and Language Change GURT 2014 Christopher Ahern &amp; Robin Clark

20 0 6 Transactions $1.01 billion in bonds 18 transactions 20 0 6 Transactions By Num

Transactional Recovery Transactional Recovery Transactions: ACID Properties Transactions: ACID

Transactional Recovery Transactional Recovery Transactions: ACID Properties Transactions: ACID

Flat and nested distributed Outline transactions Flat and nested distributed transactions

Database Management Objectives of Lecture 7 Systems Transactions Models Transactions Models

execution states with swapping Processes, Execution, and State 3F. Execution State Model exit

Parallel Numerical Algorithms Chapter 2 Parallel Thinking Section 2.2 Parallel

Introduction Introduction What is Parallel Architecture? Why Parallel Architecture? Evolution

Linear-Programming Decoding of Tanner Codes with Local-Optimality Certificates Nissim Halabi Guy

Design Principles for Scaling Multi-core OLTP Under High Contention Kun Ren, Jose Faleiro ,

Project discussion, 22 May: Mandatory but ungraded. Thanks for doing this June 4, 6pm deadline for

Thinking &amp; Working Politically An in intr troduction to o key id ideas, example les an

Disclosures New Drugs for Osteoporosis Edward Hsiao receives research grant support and Bone

X.25 Slow, Safe and Reliable 2005/03/11 (C) Herbert Haas What is X.25 ? Connection-oriented

Air Force Research Laboratory C/NOFS Thermospheric Research and Reentry EXperiment (T-RREX)

Solar+Storage Virtual Power Plant in the Country February 12, 2020 HOUSEKEEPING Join audio:

Conflicting Interests and Language Change GURT 2014 Christopher Ahern & Robin Clark

Thinking & Working Politically An in intr troduction to o key id ideas, example les an