Lect ure # 13 ADVANCED DATABASE SYSTEMS Checkpoint Protocols @ - PowerPoint PPT Presentation

Lect ure # 13 ADVANCED DATABASE SYSTEMS Checkpoint Protocols @ Andy_Pavlo // 15- 721 // Spring 2018

2 CO URSE AN N O UN CEM EN TS Mid-Term: Wednesday March 7 th @ 3:00pm Project #2: Monday March 12 th @ 11:59pm Project #3 Proposal: Monday March 19 th CMU 15-721 (Spring 2018)

3 In-Memory Checkpoints Shared Memory Restarts CMU 15-721 (Spring 2018)

4 O BSERVATIO N Logging allows the DBMS to recover the database after a crash/restart. But this system will have to replay the entire log each time. Checkpoints allows the systems to ignore large segments of the log to reduce recovery time. CMU 15-721 (Spring 2018)

5 IN- M EM O RY CH ECKPO IN TS There are different approaches for how the DBMS can create a new checkpoint for an in-memory database. The choice of approach in a DBMS is tightly coupled with its concurrency control scheme. The checkpoint thread(s) scans each table and writes out data asynchronously to disk. CMU 15-721 (Spring 2018)

6 IDEAL CH ECKPO IN T PRO PERTIES Do not slow down regular txn processing. Do not introduce unacceptable latency spikes. Do not require excessive memory overhead. LOW- OVERHEAD ASYNCHRONOUS CHECKPOINTING IN MAIN- MEMORY DATABASE SYSTEMS SIGMOD 2016 CMU 15-721 (Spring 2018)

7 CO N SISTEN T VS. FUZZY CH ECKPO IN TS Approach #1: Consistent Checkpoints → Represents a consistent snapshot of the database at some point in time. No uncommitted changes. → No additional processing during recovery. Approach #2: Fuzzy Checkpoints → The snapshot could contain records updated from transactions that have not finished yet. → Must do additional processing to remove those changes. CMU 15-721 (Spring 2018)

8 CH ECKPO IN T CO N TEN TS Approach #1: Complete Checkpoint → Write out every tuple in every table regardless of whether were modified since the last checkpoint. Approach #2: Delta Checkpoint → Write out only the tuples that were modified since the last checkpoint. → Can merge checkpoints together in the background. CMU 15-721 (Spring 2018)

9 FREQ UEN CY Taking checkpoints too often causes the runtime performance to degrade. But waiting a long time between checkpoints is just as bad. Approach #1: Time-based Approach #2: Log File Size Threshold Approach #3: On Shutdown (always!) CMU 15-721 (Spring 2018)

10 CH ECKPO IN T IM PLEM EN TATIO N S Type Contents Frequency MemSQL Consistent Complete Log Size VoltDB Consistent Complete Time-Based Altibase Fuzzy Complete Manual? TimesTen Consistent (Blocking) Complete On Shutdown Fuzzy (Non-Blocking) Complete Time-Based Hekaton Consistent Delta Log Size SAP HANA Fuzzy Complete Time-Based CMU 15-721 (Spring 2018)

11 IN- M EM O RY CH ECKPO IN TS Approach #1: Naïve Snapshots Approach #2: Copy-on-Update Snapshots Approach #3: Wait-Free ZigZag Approach #4: Wait-Free PingPong FAST CHECKPOINT RECOVERY ALGORITHMS FOR FREQUENTLY CONSISTENT APPLICATIONS SIGMOD 2011 CMU 15-721 (Spring 2018)

12 N AÏVE SN APSH OT Create a consistent copy of the entire database in a new location in memory and then write the contents to disk. Two approaches to copying database: → Do it yourself (tuple data only). → Let the OS do it for you (everything). CMU 15-721 (Spring 2018)

13 H YPER FO RK SN APSH OTS Create a snapshot of the database by forking the DBMS process. → Child process contains a consistent checkpoint if there are not active txns. → Otherwise, use the in-memory undo log to roll back txns in the child process. Continue processing txns in the parent process. HYPER: A HYBRID OLTP&OLAP MAIN MEMORY DATABASE SYSTEM BASED ON VIRTUAL MEMORY SNAPSHOTS ICDE 2011 CMU 15-721 (Spring 2018)

14 H- STO RE FO RK SN APSH OTS Workload: TPC-C (8 Warehouses) + OLAP Query CMU 15-721 (Spring 2018)

15 CO PY- O N- UPDATE SN APSH OT During the checkpoint, txns create new copies of data instead of overwriting it. → Copies can be at different granularities (block, tuple) The checkpoint thread then skips anything that was created after it started. → Old data is pruned after it has been written to disk CMU 15-721 (Spring 2018)

16 VO LTDB CO N SISTEN T CH ECKPO IN TS A special txn starts a checkpoint and switches the DBMS into copy-on-write mode. → Changes are no longer made in-place to tables. → The DBMS tracks whether a tuple has been inserted, deleted, or modified since the checkpoint started. A separate thread scans the tables and writes tuples out to the snapshot on disk. → Ignore anything changed after checkpoint. → Clean up old versions as it goes along. CMU 15-721 (Spring 2018)

17 O BSERVATIO N Txns have to wait for the checkpoint thread when using naïve snapshots. Txns may have to wait to acquire latches held by the checkpoint thread under copy-on-update if not using MVCC. CMU 15-721 (Spring 2018)

18 WAIT- FREE ZIGZAG Maintain two copies of the entire database → Each txn write only updates one copy. Use two BitMaps to keep track of what copy a txn should read/write from per tuple. → Avoid the overhead of having to create copies on the fly as in the copy-on-update approach. CMU 15-721 (Spring 2018)

19 WAIT- FREE ZIGZAG Copy #1 Copy #2 Read Write BitMap BitMap Txn Writes Txn Writes 5 3 6 5 1 0 0 0 1 0 1 9 8 9 1 0 1 0 0 7 1 7 1 0 0 1 0 1 2 2 9 1 0 0 1 1 0 4 4 0 1 0 0 3 3 0 1 0 0 Checkpoint Checkpoint Checkpoint Thread Written to Disk Written to Disk CMU 15-721 (Spring 2018)

20 WAIT- FREE PIN GPO N G Trade extra memory + CPU to avoid pauses at the end of the checkpoint. Maintain two copies of the entire database at all times plus a third "base" copy. → Pointer indicates which copy is the current master. → At the end of the checkpoint, swap these pointers. CMU 15-721 (Spring 2018)

37 WAIT- FREE PIN GPO N G Base Copy Copy #1 Copy #2 5 0 - 1 5 9 0 - 1 9 7 0 - 1 7 2 0 - 1 2 4 0 - 1 4 3 0 - 1 3 Copy #1 Master: Checkpoint Thread Shadow: Copy #2 CMU 15-721 (Spring 2018)

37 WAIT- FREE PIN GPO N G Base Copy Copy #1 Copy #2 Txn Writes 5 0 - 1 5 9 0 - 1 9 7 0 - 1 7 2 0 - 1 2 4 0 - 1 4 3 0 - 1 3 Copy #1 Master: Checkpoint Thread Shadow: Copy #2 CMU 15-721 (Spring 2018)

37 WAIT- FREE PIN GPO N G Base Copy Copy #1 Copy #2 Txn Writes 5 6 0 1 6 - 1 5 9 0 - 1 9 7 1 0 1 - 1 1 7 9 2 1 0 9 - 1 2 4 0 - 1 4 3 0 - 1 3 Copy #1 Master: Checkpoint Thread Shadow: Copy #2 CMU 15-721 (Spring 2018)

37 WAIT- FREE PIN GPO N G Base Copy Copy #1 Copy #2 5 6 0 1 6 - 1 5 9 0 - 1 9 1 7 1 0 - 1 1 7 9 2 1 0 9 - 1 2 4 0 - 1 4 3 0 - 1 3 Copy #1 Master: Checkpoint Thread Shadow: Copy #2 CMU 15-721 (Spring 2018)

37 WAIT- FREE PIN GPO N G Base Copy Copy #1 Copy #2 5 6 0 1 6 - 1 0 - 5 9 0 - 0 1 - 9 1 7 0 1 1 - 1 0 - 7 2 9 1 0 9 - 1 0 2 - 4 0 - 0 1 - 4 3 0 - 0 1 3 - Copy #1 Master: Checkpoint Thread Shadow: Copy #2 CMU 15-721 (Spring 2018)

37 WAIT- FREE PIN GPO N G Base Copy Copy #1 Copy #2 5 6 0 1 6 - 1 0 - 5 9 0 - 0 1 - 9 1 7 0 1 1 - 0 1 - 7 9 2 0 1 - 9 0 1 2 - 4 0 - 1 0 - 4 3 0 - 0 1 - 3 Copy #2 Master: Copy #1 Shadow: CMU 15-721 (Spring 2018)

37 WAIT- FREE PIN GPO N G Base Copy Copy #1 Copy #2 5 6 0 1 6 - 0 1 5 - 9 0 - 1 0 - 9 1 7 0 1 1 - 1 0 - 7 9 2 1 0 - 9 0 1 - 2 4 0 - 0 1 - 4 3 0 - 0 1 3 - Copy #2 Master: Checkpoint Thread Copy #1 Shadow: CMU 15-721 (Spring 2018)

30 CH ECKPO IN T IM PLEM EN TATIO N S Bulk State Copying → Pause txn execution to take a snapshot. Locking / Latching → Use latches to isolate the checkpoint thread from the worker threads if they operate on shared regions. Bulk Bit-Map Reset: → If DBMS uses BitMap to track dirty regions, it must perform a bulk reset at the start of a new checkpoint. Memory Usage: → To avoid synchronous writes, the method may need to allocate additional memory for data copies. CMU 15-721 (Spring 2018)

31 IN- M EM O RY CH ECKPO IN TS Bulk Bulk Bit- Memory Copying Locking Map Reset Usage No No Naïve Snapshot Yes 2x No Yes Yes 2x Copy-on-Update No No Wait-Free ZigZag Yes 2x No No No Wait-Free Ping-Pong 3x CMU 15-721 (Spring 2018)

32 O BSERVATIO N Not all DBMS restarts are due to crashes. → Updating OS libraries → Hardware upgrades/fixes → Updating DBMS software Need a way to be able to quickly restart the DBMS without having to re-read the entire database from disk again. CMU 15-721 (Spring 2018)

Lect ure # 13 ADVANCED DATABASE SYSTEMS Checkpoint Protocols @ - PowerPoint PPT Presentation

Lect ure # 13 ADVANCED DATABASE SYSTEMS Checkpoint Protocols @ Andy_Pavlo // 15- 721 // Spring 2018 2 CO URSE AN N O UN CEM EN TS Mid-Term: Wednesday March 7 th @ 3:00pm Project #2: Monday March 12 th @ 11:59pm Project #3 Proposal: Monday

Lect 12a - Delaunay Triangulations Lect 12b - Delaunay Triangulations Lect 12c - Delaunay

Lect ure # 11 ADVANCED DATABASE SYSTEMS System Catalogs and Database Compression @

Lect ure # 01 ADVANCED DATABASE SYSTEMS Course Introduction & History of Database Systems

Lect ure # 03 ADVANCED DATABASE SYSTEMS Query Compilation @ Andy_Pavlo // 15- 721 // Spring

Lect ure # 14 ADVANCED DATABASE SYSTEMS Networking @ Andy_Pavlo // 15- 721 // Spring 2018 2

Lect ure # 06 ADVANCED DATABASE SYSTEMS Multi-Version Concurrency Control (Part II) @

Lect ure # 24 ADVANCED DATABASE SYSTEMS Databases on New Hardware @ Andy_Pavlo // 15- 721 //

Lect ure # 10 ADVANCED DATABASE SYSTEMS Storage Models & Data Layout @ Andy_Pavlo // 15-

Lect ure # 22 ADVANCED DATABASE SYSTEMS Vectorized Execution (Part II) @ Andy_Pavlo // 15-

Lect 14a - Line Arrangements: Definitions and Zone Theorem Lect 14b - Line Arrangements:

ADVANCED DATABASE SYSTEMS Database Compression @ Andy_Pavlo // 15- 721 // Spring 2019 CMU

Database Utilities 10/17/2007 DC/Win Database Utilities Opening Database Utilities From File on

Advanced Database CS 525: Organization? Advanced Database =Database Implementation

ADVANCED DATABASE SYSTEMS Recovery Protocols @ Andy_Pavlo // 15- 721 // Spring 2019 CMU

ADVANCED DATABASE SYSTEMS Self-Driving Database Management Systems @ Andy_Pavlo // 15- 721 //

Being Bayesian About Being Bayesian About Net work St ruct ure Net work St ruct ure A Bayesian

Checkpointing HPC Applications Thomas Ropars thomas.ropars@imag.fr Universit e Grenoble Alpes

Hall A Outlook Thia Keppel DVCS Collaboration Meeting December 2013 Hall A Projected Experiment

Version Control and Subversion Chris Coakley Outline What is Version Control? Why use

How not to be a Git Tips and tricks for a good workflow Who am I? Adam Jimerson

Virtual Machine Checkpointing Brendan Cully University of British Columbia with Andrew Warfield

Checkpoint/Restart in Linux Sukadev Bhattiprolu IBM Linux Technology Center 09/2009 Linux is a

Lecture 22 I/O Performance and Checkpoints EN 600.320/420/620 Instructor: Randal Burns 27 March

Cyber-Physical System Checkpointing and Recovery Fanxin Kong , Meng Xu, James Weimer, Oleg

Lect ure # 13 ADVANCED DATABASE SYSTEMS Checkpoint Protocols @ - PowerPoint PPT Presentation

Lect ure # 13 ADVANCED DATABASE SYSTEMS Checkpoint Protocols @ Andy_Pavlo // 15- 721 // Spring 2018 2 CO URSE AN N O UN CEM EN TS Mid-Term: Wednesday March 7 th @ 3:00pm Project #2: Monday March 12 th @ 11:59pm Project #3 Proposal: Monday

Lect 12a - Delaunay Triangulations Lect 12b - Delaunay Triangulations Lect 12c - Delaunay

Lect ure # 11 ADVANCED DATABASE SYSTEMS System Catalogs and Database Compression @

Lect ure # 01 ADVANCED DATABASE SYSTEMS Course Introduction &amp; History of Database Systems

Lect ure # 03 ADVANCED DATABASE SYSTEMS Query Compilation @ Andy_Pavlo // 15- 721 // Spring

Lect ure # 14 ADVANCED DATABASE SYSTEMS Networking @ Andy_Pavlo // 15- 721 // Spring 2018 2

Lect ure # 06 ADVANCED DATABASE SYSTEMS Multi-Version Concurrency Control (Part II) @

Lect ure # 24 ADVANCED DATABASE SYSTEMS Databases on New Hardware @ Andy_Pavlo // 15- 721 //

Lect ure # 10 ADVANCED DATABASE SYSTEMS Storage Models &amp; Data Layout @ Andy_Pavlo // 15-

Lect ure # 22 ADVANCED DATABASE SYSTEMS Vectorized Execution (Part II) @ Andy_Pavlo // 15-

Lect 14a - Line Arrangements: Definitions and Zone Theorem Lect 14b - Line Arrangements:

ADVANCED DATABASE SYSTEMS Database Compression @ Andy_Pavlo // 15- 721 // Spring 2019 CMU

Database Utilities 10/17/2007 DC/Win Database Utilities Opening Database Utilities From File on

Advanced Database CS 525: Organization? Advanced Database =Database Implementation

ADVANCED DATABASE SYSTEMS Recovery Protocols @ Andy_Pavlo // 15- 721 // Spring 2019 CMU

ADVANCED DATABASE SYSTEMS Self-Driving Database Management Systems @ Andy_Pavlo // 15- 721 //

Being Bayesian About Being Bayesian About Net work St ruct ure Net work St ruct ure A Bayesian

Checkpointing HPC Applications Thomas Ropars thomas.ropars@imag.fr Universit e Grenoble Alpes

Hall A Outlook Thia Keppel DVCS Collaboration Meeting December 2013 Hall A Projected Experiment

Version Control and Subversion Chris Coakley Outline What is Version Control? Why use

How not to be a Git Tips and tricks for a good workflow Who am I? Adam Jimerson

Virtual Machine Checkpointing Brendan Cully University of British Columbia with Andrew Warfield

Checkpoint/Restart in Linux Sukadev Bhattiprolu IBM Linux Technology Center 09/2009 Linux is a

Lecture 22 I/O Performance and Checkpoints EN 600.320/420/620 Instructor: Randal Burns 27 March

Cyber-Physical System Checkpointing and Recovery Fanxin Kong , Meng Xu, James Weimer, Oleg

Lect ure # 01 ADVANCED DATABASE SYSTEMS Course Introduction & History of Database Systems

Lect ure # 10 ADVANCED DATABASE SYSTEMS Storage Models & Data Layout @ Andy_Pavlo // 15-