

SLIDE 1

Lecture #23

ADVANCED DATABASE SYSTEMS

Larger-than-Memory Databases

@Andy_Pavlo // 15-721 // Spring 2020

SLIDE 2

ADMINISTRIVIA

April 22: Final Exam Released
April 29: Guest Speaker (Live)
May 4: Code Review #2 Submission
May 5: Final Presentations (Live)
May 13: Final Exam Due Date

SLIDE 3

OBSERVATION

DRAM is expensive, son.
→ Expensive to buy.
→ Expensive to maintain.

It would be nice if our in-memory DBMS could use cheaper storage without having to bring in the entire baggage of a disk-oriented architecture.

SLIDE 4

Background
Implementation Issues
Real-world Examples

SLIDE 5

LARGER-THAN-MEMORY DATABASES

Allow an in-memory DBMS to store/access data on disk without bringing back all the slow parts of a disk-oriented DBMS.
→ Minimize the changes that we make to the DBMS that are required to deal with disk-resident data.

Need to be aware of hardware access methods:
→ In-memory Storage = Tuple-Oriented
→ Disk Storage = Block-Oriented

SLIDE 6

OLAP

OLAP queries generally access the entire table. Thus, there is not anything about OLAP queries that an in-memory DBMS would handle differently than a disk-oriented DBMS.

[Diagram: a query on attribute A consults an in-memory zone map (MIN, MAX, SUM, COUNT, AVG, STDEV) before reading the column's disk-resident data.]
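To make the zone-map idea concrete, here is a minimal C++ sketch of per-block statistics and predicate pruning. The struct layout and names are illustrative, not taken from any particular system.

```cpp
// Minimal zone-map sketch (illustrative names): the DBMS keeps small
// per-block statistics in memory so that an OLAP scan can skip
// disk-resident blocks that cannot satisfy the predicate.
#include <algorithm>
#include <cstdint>
#include <limits>

struct ZoneMap {
  int64_t min = std::numeric_limits<int64_t>::max();
  int64_t max = std::numeric_limits<int64_t>::min();
  int64_t sum = 0;
  uint64_t count = 0;

  void Add(int64_t v) {  // maintained as tuples are written to the block
    min = std::min(min, v);
    max = std::max(max, v);
    sum += v;
    ++count;
  }
};

// For a predicate like "WHERE a > threshold", the block needs to be read
// from disk only if its max could possibly exceed the threshold.
bool BlockMightMatch(const ZoneMap &zm, int64_t threshold) {
  return zm.count > 0 && zm.max > threshold;
}
```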

SLIDE 7

OLTP

OLTP workloads almost always have hot and cold portions of the database.
→ We can assume txns will almost always access hot tuples.

The DBMS needs a mechanism to move cold data out to disk and then retrieve it if it is ever needed again.

SLIDE 8

LARGER-THAN-MEMORY DATABASES

[Diagram sequence: the DBMS evicts cold tuples (#01, #03, #04) from the in-memory table heap into an evicted tuple block in cold-data storage, leaving hot tuples (#00, #02) in memory. The in-memory index entries for the evicted tuples are updated to reference the block. A query such as SELECT * FROM table WHERE id = <Tuple #01> that touches an evicted tuple must follow the index back to the evicted tuple block on disk.]

SLIDE 13

OLTP ISSUES

Run-time Operations

→ Cold Data Identification

Eviction Policies

→ Timing, Evicted Metadata

Data Retrieval Policies

→ Granularity, Retrieval Mechanism, Merging

LARGER-THAN-MEMORY DATA MANAGEMENT ON MODERN STORAGE HARDWARE FOR IN-MEMORY OLTP DATABASE SYSTEMS
DAMON 2016

SLIDE 14

COLD DATA IDENTIFICATION

Choice #1: On-line
→ The DBMS monitors txn access patterns and tracks how often tuples/pages are used.
→ Embed the tracking meta-data directly in tuples/pages (a sketch follows below).

Choice #2: Off-line
→ Maintain a tuple access log during txn execution.
→ Process in background to compute frequencies.
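As a rough illustration of Choice #1, the tracking meta-data can live directly in each tuple's header as an access counter. The header layout below is a hypothetical sketch, not any real system's format.

```cpp
// Hypothetical tuple header with embedded tracking meta-data (Choice #1):
// every read bumps a counter, and a background sweep treats tuples whose
// counters fall below a cutoff as cold.
#include <atomic>
#include <cstdint>

struct TupleHeader {
  std::atomic<uint32_t> access_count{0};  // on-line tracking meta-data
  uint64_t begin_ts = 0;                  // versioning fields elided
};

inline void OnTupleRead(TupleHeader &hdr) {
  // Relaxed ordering suffices: the counter is advisory and not part of
  // the DBMS's concurrency control.
  hdr.access_count.fetch_add(1, std::memory_order_relaxed);
}

inline bool IsCold(const TupleHeader &hdr, uint32_t cutoff) {
  return hdr.access_count.load(std::memory_order_relaxed) < cutoff;
}
```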

SLIDE 15

EVICTION TIMING

Choice #1: Threshold
→ The DBMS monitors memory usage and begins evicting tuples when it reaches a threshold (see the sketch below).
→ The DBMS must manually move data.

Choice #2: On Demand
→ The DBMS/OS runs a replacement policy to decide when to evict data to free space for new data that is needed.
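A minimal sketch of Choice #1, with CurrentMemoryUsage and EvictColdestBlock as hypothetical stand-ins for the real engine: a background thread evicts until usage drops back under the administrator-defined threshold.

```cpp
// Threshold-based eviction sketch (Choice #1). The "internals" here are
// stubs so the example is self-contained.
#include <algorithm>
#include <atomic>
#include <cstddef>
#include <thread>

std::atomic<size_t> g_heap_bytes{512ull << 20};  // pretend 512 MB in use

size_t CurrentMemoryUsage() { return g_heap_bytes.load(); }

bool EvictColdestBlock() {
  // Real code would pick the block with the lowest access frequency and
  // write it to cold-data storage; here we just shrink the counter.
  size_t cur = g_heap_bytes.load();
  if (cur == 0) return false;
  g_heap_bytes -= std::min<size_t>(cur, 1u << 20);  // evict a 1 MB block
  return true;
}

// The DBMS itself (not the OS) moves data once usage crosses the
// administrator-defined threshold.
void EvictionThread(size_t threshold_bytes) {
  while (CurrentMemoryUsage() > threshold_bytes) {
    if (!EvictColdestBlock()) break;  // nothing evictable is left
  }
}

int main() {
  std::thread t(EvictionThread, size_t{256} << 20);  // 256 MB threshold
  t.join();
}
```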

SLIDE 16

EVICTED TUPLE METADATA

Choice #1: Tuple Tombstones
→ Leave a marker that points to the on-disk tuple.
→ Update indexes to point to the tombstone tuples.

Choice #2: Bloom Filters
→ Use approximate data structure for each index.
→ Check both index + filter for each query (see the sketch below).

Choice #3: DBMS Managed Pages
→ DBMS tracks what data is in memory vs. on disk.

Choice #4: OS Virtual Memory
→ OS tracks what data is in memory vs. on disk.
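A toy sketch of Choice #2: a Bloom filter of evicted keys sits next to each in-memory index, so most point lookups can rule out a disk access without per-tuple tombstones. The two-hash construction below is illustrative only, not a production filter design.

```cpp
// Toy Bloom filter over evicted keys (Choice #2). A "no" answer is exact
// (the key was never evicted); a "yes" may be a false positive, in which
// case the query checks disk and finds nothing.
#include <bitset>
#include <cstddef>
#include <cstdint>
#include <functional>

class EvictedKeyFilter {
  static constexpr size_t kBits = 1 << 20;  // ~128 KB of filter state
  std::bitset<kBits> bits_;

  static size_t H1(uint64_t key) { return std::hash<uint64_t>{}(key) % kBits; }
  static size_t H2(uint64_t key) {
    return std::hash<uint64_t>{}(key ^ 0x9e3779b97f4a7c15ull) % kBits;
  }

 public:
  void MarkEvicted(uint64_t key) { bits_.set(H1(key)); bits_.set(H2(key)); }
  bool MaybeEvicted(uint64_t key) const {
    return bits_.test(H1(key)) && bits_.test(H2(key));
  }
};
```

A query probes the in-memory index first; only if the key is absent there and MaybeEvicted() returns true does it go to cold storage, paying a wasted disk read on false positives.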

SLIDE 17

EVICTED TUPLE METADATA

[Diagram sequence: the DBMS tracks an access frequency for each tuple (#00–#05). The coldest tuples (#01, #03, #04) are evicted from the in-memory table heap into a block in cold-data storage. With tombstones, the in-memory index entries for evicted tuples are replaced with <Block,Offset> pointers into the evicted tuple block. With Bloom filters, a compact in-memory filter answers "Does 'x' exist?" so that lookups check both the in-memory index and the filter before going to disk.]

SLIDE 23

DATA RETRIEVAL GRANULARITY

Choice #1: All Tuples in Block
→ Merge all the tuples retrieved from a block regardless of whether they are needed.
→ More CPU overhead to update indexes.
→ Tuples are likely to be evicted again.

Choice #2: Only Tuples Needed
→ Only merge the tuples that were accessed by a query back into the in-memory table heap.
→ Requires additional bookkeeping to track holes.

SLIDE 24

MERGING THRESHOLD

Choice #1: Always Merge
→ Retrieved tuples are always put into table heap.

Choice #2: Merge Only on Update
→ Retrieved tuples are only merged into table heap if they are used in an UPDATE query.
→ All other tuples are put in a temporary buffer.

Choice #3: Selective Merge
→ Keep track of how often each block is retrieved.
→ If a block's access frequency is above some threshold, merge it back into the table heap (sketched below).
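A small sketch of Choice #3 with assumed names: count how often each evicted block is retrieved, and merge a block back into the heap only once it crosses a threshold.

```cpp
// Selective-merge sketch (Choice #3): per-block retrieval counters decide
// whether a fetched block is merged into the table heap or served from a
// temporary buffer instead.
#include <cstdint>
#include <unordered_map>

class SelectiveMerger {
  std::unordered_map<uint64_t, uint32_t> retrievals_;  // block id -> count
  uint32_t threshold_;

 public:
  explicit SelectiveMerger(uint32_t threshold) : threshold_(threshold) {}

  // Returns true if the block is hot enough to merge back into the heap;
  // otherwise the caller serves the tuples from a temporary buffer.
  bool OnBlockRetrieved(uint64_t block_id) {
    return ++retrievals_[block_id] >= threshold_;
  }
};
```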

SLIDE 25

RETRIEVAL MECHANISM

Choice #1: Abort-and-Restart
→ Abort the txn that accessed the evicted tuple.
→ Retrieve the data from disk and merge it into memory with a separate background thread.
→ Restart the txn when the data is ready (see the schematic below).
→ Requires MVCC to guarantee consistency for large txns that access data that does not fit in memory.

Choice #2: Synchronous Retrieval
→ Stall the txn when it accesses an evicted tuple while the DBMS fetches the data and merges it back into memory.
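A schematic of Choice #1 using stand-in types, with std::async playing the background retrieval thread; ExecuteTxn and MergeBlock are hypothetical stubs, not a real engine API.

```cpp
// Abort-and-restart schematic (Choice #1).
#include <future>

enum class TxnResult { kCommitted, kAbortedEvictedTuple };

TxnResult ExecuteTxn(bool tuple_in_memory) {
  // Real code runs the txn until it either commits or touches a tombstone.
  return tuple_in_memory ? TxnResult::kCommitted
                         : TxnResult::kAbortedEvictedTuple;
}

void MergeBlock() { /* fetch evicted block from disk, install in heap */ }

int main() {
  bool in_memory = false;
  while (ExecuteTxn(in_memory) == TxnResult::kAbortedEvictedTuple) {
    // Abort the txn, retrieve the data with a separate background thread,
    // then restart the txn once the tuple is back in memory.
    std::async(std::launch::async, MergeBlock).wait();
    in_memory = true;
  }
}
```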

SLIDE 26

IMPLEMENTATIONS

Tuple-oriented:
→ H-Store – Anti-Caching
→ Hekaton – Project Siberia
→ EPFL’s VoltDB Prototype
→ Apache Geode – Overflow Tables

Page-oriented:
→ LeanStore – Hierarchical Buffer Pool
→ Umbra – Variable-length Buffer Pool
→ MemSQL – Columnar Tables

SLIDE 27

H-STORE ANTI-CACHING

On-line Identification
Administrator-defined Threshold
Tombstones
Abort-and-restart Retrieval
Block-level Granularity
Always Merge

ANTI-CACHING: A NEW APPROACH TO DATABASE MANAGEMENT SYSTEM ARCHITECTURE
VLDB 2013

SLIDE 28

HEKATON PROJECT SIBERIA

Off-line Identification
Administrator-defined Threshold
Bloom Filters
Synchronous Retrieval
Tuple-level Granularity
Always Merge

TREKKING THROUGH SIBERIA: MANAGING COLD DATA IN A MEMORY-OPTIMIZED DATABASE
VLDB 2014

SLIDE 29

EPFL VOLTDB

Off-line Identification
OS Virtual Memory
Synchronous Retrieval
Page-level Granularity
Always Merge

ENABLING EFFICIENT OS PAGING FOR MAIN-MEMORY OLTP DATABASES
DAMON 2013
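A minimal POSIX sketch of this design: pin the hot region with mlock(2) so the kernel never pages it out, while unpinned cold allocations remain subject to OS paging. The region size is an example value, and mlock may fail if RLIMIT_MEMLOCK is low.

```cpp
// Hot/cold separation via OS virtual memory: mlock(2) wires the hot-tuple
// region into RAM; everything else remains pageable by the kernel.
#include <sys/mman.h>
#include <cstddef>
#include <cstdio>
#include <cstdlib>

int main() {
  const size_t kHotBytes = 64ul << 20;  // 64 MB hot region (example size)
  void *hot = std::malloc(kHotBytes);
  if (hot == nullptr) return 1;

  // Pin the hot tuples; may fail if RLIMIT_MEMLOCK is too low.
  if (mlock(hot, kHotBytes) != 0) {
    std::perror("mlock");
    return 1;
  }
  // ... DBMS stores hot tuples here; cold tuples live in unpinned memory
  // that the OS may swap to cold-data storage and fault back in ...
  munlock(hot, kHotBytes);
  std::free(hot);
  return 0;
}
```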

SLIDE 30

EPFL VOLTDB

[Diagram sequence: the in-memory table heap is split into a hot-tuples region pinned with mlock and an unpinned cold-tuples region. Hot tuples (#00, #02) stay wired in memory; cold tuples (#01, #03) are moved into the cold region, where the OS is free to page them out to cold-data storage and page them back in on access.]
SLIDE 37

APACHE GEODE OVERFLOW TABLES

On-line Identification
Administrator-defined Threshold
Tombstones (?)
Synchronous Retrieval
Tuple-level Granularity
Merge Only on Update (?)

Source: Apache Geode

SLIDE 38

OBSERVATION

The approaches that we have discussed so far are based on tuples.
→ The DBMS must track meta-data about individual tuples.
→ This does not reduce the storage overhead of indexes.

Need a unified way to evict cold data from both tables and indexes with low overhead…

SLIDE 39

LEANSTORE

Prototype in-memory storage manager from TUM that supports larger-than-memory databases.
→ Handles both tuples + indexes.
→ Not part of the HyPer project.

Hierarchical + Randomized Block Eviction
→ Uses pointer swizzling to determine whether a block is evicted or not.

LEANSTORE: IN-MEMORY DATA MANAGEMENT BEYOND MAIN MEMORY
ICDE 2018

SLIDE 40

POINTER SWIZZLING

Switch the contents of pointers based on whether the target object resides in memory or on disk.
→ Use the first bit of the address to tell what kind of address it is.
→ Only works if there is only one pointer to the object.

[Diagram: block B1 holds a 64-bit pointer to block B2. While B2 is evicted, the pointer stores (1)<PageId, Offset>; once B2 is loaded into memory, it is swizzled to (0)<MemoryAddr>.]
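A minimal sketch of the one-bit tagging scheme, assuming the tag lives in the high bit of a 64-bit word (LeanStore calls such a swizzlable pointer a "swip"); the exact encoding here is an illustrative assumption.

```cpp
// One-bit pointer tagging sketch: bit 63 = 0 means the word is a raw
// memory address (swizzled); bit 63 = 1 means it encodes an on-disk page
// id (unswizzled). Encoding details are illustrative assumptions.
#include <cstdint>

constexpr uint64_t kEvictedBit = 1ull << 63;  // x86-64 user pointers leave it 0

struct Swip {
  uint64_t raw = 0;

  bool IsEvicted() const { return (raw & kEvictedBit) != 0; }

  void *AsMemoryAddr() const {  // valid only when !IsEvicted()
    return reinterpret_cast<void *>(raw);
  }
  uint64_t AsPageId() const {   // valid only when IsEvicted()
    return raw & ~kEvictedBit;
  }

  void Swizzle(void *addr) { raw = reinterpret_cast<uint64_t>(addr); }
  void Unswizzle(uint64_t page_id) { raw = page_id | kEvictedBit; }
};
```

Because each block has exactly one incoming pointer, unswizzling only has to rewrite that single word.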

SLIDE 42

REPLACEMENT STRATEGY

Randomly select blocks for eviction.
→ Don't have to maintain meta-data every time a txn accesses a hot block.
→ Only track accesses for cold data, which should be rare if it is cold.

Unswizzle their pointers but leave the blocks in memory.
→ Add to a FIFO queue of blocks staged for eviction.
→ If a page is accessed again, remove it from the queue.
→ Otherwise, evict pages when they reach the front of the queue.
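A condensed sketch of the cooling protocol under assumed names (WriteBlockToDisk is a stub): random victims are unswizzled into a FIFO queue, a touch while cooling rescues them, and survivors are flushed when they reach the head.

```cpp
// Cooling-stage sketch: randomly chosen blocks are unswizzled and staged
// in a FIFO queue; blocks touched while cooling return to the hot stage,
// and blocks that reach the head are written out.
#include <cstdint>
#include <deque>
#include <unordered_set>

void WriteBlockToDisk(uint64_t block_id) { /* flush to cold storage */ }

class CoolingStage {
  std::deque<uint64_t> fifo_;             // eviction order
  std::unordered_set<uint64_t> cooling_;  // membership check ("hash table")

 public:
  void StageForEviction(uint64_t block_id) {
    if (cooling_.insert(block_id).second) fifo_.push_back(block_id);
  }

  // A txn touched a cooling block: cancel its eviction (caller re-swizzles).
  void OnAccess(uint64_t block_id) { cooling_.erase(block_id); }

  // Called when memory is needed: evict the first surviving block.
  void EvictOne() {
    while (!fifo_.empty()) {
      uint64_t id = fifo_.front();
      fifo_.pop_front();
      if (cooling_.erase(id)) {
        WriteBlockToDisk(id);
        return;
      }
      // else: the block was rescued while cooling; skip its stale entry.
    }
  }
};
```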

SLIDE 43

BLOCK HIERARCHY

Blocks are organized in a tree hierarchy.
→ Each page has only one parent, which means that there is only a single pointer.

The DBMS can only evict a block if its children are also evicted.
→ This avoids the problem of evicting blocks that contain swizzled pointers.
→ If a block is selected but it has in-memory children, then it automatically switches to select one of its children.
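A sketch of the descent rule with a hypothetical Block node type: a randomly picked victim that still has swizzled (in-memory) children is skipped in favor of one of those children.

```cpp
// Hierarchy rule sketch: never evict a block whose children are still
// swizzled; descend to an in-memory child instead.
#include <vector>

struct Block {
  std::vector<Block *> children;  // swizzled (in-memory) children only
  bool HasInMemoryChildren() const { return !children.empty(); }
};

// Walk down from a randomly selected candidate until we find a block that
// can legally be evicted (i.e., all of its children are already on disk).
Block *PickEvictableBlock(Block *candidate) {
  while (candidate->HasInMemoryChildren()) {
    candidate = candidate->children.front();  // switch to a child
  }
  return candidate;  // leaf of the in-memory subtree: safe to evict
}
```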

SLIDE 44

BLOCK HIERARCHY

[Diagram: blocks B0–B3 move between three stages. Hot stage: block in memory, referenced by a swizzled pointer. Cooling stage: block still in memory but unswizzled, tracked by a hash table and a FIFO eviction queue. Cold stage: block evicted to disk. Source: Viktor Leis]

SLIDE 48

UMBRA

New DBMS from the German HyPer team at TUM.
→ Low-overhead buffer pool with variable-sized pages.
→ Employs the same hierarchical organization and randomized block eviction algorithm as LeanStore.
→ Uses virtual memory to allocate storage, but the DBMS manages block eviction on its own.

The DBMS stores relations as index-organized tables, so there is no separate machinery needed to handle index blocks.

UMBRA: A DISK-BASED SYSTEM WITH IN-MEMORY PERFORMANCE
CIDR 2020

SLIDE 49

VARIABLE-SIZED BUFFER POOL

[Diagram: the buffer pool reserves virtual memory for buffer frames in power-of-two size classes (Size Class 0 = 64 KB, Size Class 1 = 128 KB, Size Class 2 = 256 KB, Size Class 3 = 512 KB). Active frames are backed by physical memory; inactive frames are only reserved address space. A swizzled pointer stores (0)<MemoryAddr>; an unswizzled pointer stores (1)<BlockId><SizeClass>. Source: Thomas Neumann]
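A POSIX sketch of the reservation trick with example sizes: each size class reserves a large virtual region via mmap(PROT_NONE), which commits no physical memory, and individual frames are activated on demand with mprotect.

```cpp
// Size-class sketch: reserve virtual address space per class without
// committing physical memory, then activate frames on demand. Class sizes
// and frame counts are example values.
#include <sys/mman.h>
#include <cstddef>
#include <cstdio>

int main() {
  const size_t kClassSizes[] = {64u << 10, 128u << 10, 256u << 10, 512u << 10};
  const size_t kFramesPerClass = 8;

  for (size_t frame_size : kClassSizes) {
    // PROT_NONE: pure address-space reservation, no RAM is committed yet.
    void *region = mmap(nullptr, frame_size * kFramesPerClass, PROT_NONE,
                        MAP_PRIVATE | MAP_ANONYMOUS, -1, 0);
    if (region == MAP_FAILED) {
      std::perror("mmap");
      return 1;
    }
    // Activating the first frame makes its pages usable; only pages that
    // are actually touched consume physical memory.
    if (mprotect(region, frame_size, PROT_READ | PROT_WRITE) != 0) {
      std::perror("mprotect");
      return 1;
    }
  }
  return 0;
}
```

Because the reservation costs only address space, the pool can pre-reserve every size class up front without committing any RAM.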

SLIDE 50

MEMSQL COLUMNAR TABLES

Administrator manually declares a table as a disk-resident columnar table with zone maps.
→ Pre-2017: Used mmap, but this was a bad idea.
→ Pre-2019: DBMS splits columns into 1m-tuple segments.
→ Current: Unified single logical table format that combines a delta store with a column store.

No Evicted Metadata
Synchronous Retrieval
Always Merge

Source: MemSQL

SLIDE 51

PARTING THOUGHTS

Today was about working around the block-oriented access and slowness of secondary storage.

Fast and cheap byte-addressable NVM will make this lecture unnecessary.

SLIDE 52

NEXT CLASS

Server-side Application Logic