B -trees CSCI 333 Williams College Logistics Lab 2b Office - PowerPoint PPT Presentation

B ε -trees CSCI 333 Williams College

Logistics • Lab 2b • Office hours Tuesday night, 7-9pm • Final Project Proposals • Due Friday — Come see me!

Last Class • General principles of write optimization • LSM-trees ‣ Operations ‣ Performance • LevelDB - SSTables store key-value pairs at each level • PebblesDB - Fragmented LSM • WiscKey - Separates keys (LSM) from values (log)

This Class • B ε -trees ‣ Operations ‣ Performance • Choosing Parameters • Compare to B-trees and LSM-trees

But first… Tradeoffs What are some of the tradeoffs we’ve discussed   so far in topics we’ve covered?

Big Picture: Write-Optimized Dictionaries • New class of data structures developed in the ’90s • LSM Trees [O’Neil, Cheng Gawlick, & O’Neil ’96] • B ε -trees [Brodal & Fagerberg ’03] • COLAs [Bender, Farach-Colton, Fineman, Fogel, Kuzmaul & Nelson ’07] • xDicts [Brodal, Demaine, Fineman, Iacono, Langerman & Munro ’10] • WOD queries are asymptotically as fast as a B-tree (at least they can be in “good” WODs) • WOD inserts/updates/deletes are orders-of- magnitude faster than a B-tree

B ε -trees [Brodal & Fagerberg ’03] • B ε -trees: an asymptotically optimal key-value store ‣ Fast in best cases, bounds on worst-cases • B ε -tree searches are just as fast as* B-trees • B ε -tree updates are orders-of-magnitude faster* *asymptotically, in the DAM model

B and ε are parameters: • B ➡ how much “stuff” fits in one node • ε ➡ fanout ➡ how tall the tree is B-B ε B B ε . . . O(log N) B ε O(B ε ) children . . . . . . O(N/B) leaves

B ε -trees [Brodal & Fagerberg ’03] • B ε -tree leaf nodes store key-value pairs • Internal B ε -tree node buffers store messages ‣ Messages target a specific key ‣ Messages encode a mutation • Messages are flushed downwards, and eventually applied to key-value pairs in the leaves High-level: messages + LSM/B-tree hybrid

B ε -tree Operations • Implement a dictionary on key-value pairs ▪ insert( k , v ) ▪ v = search( k ) ▪ {(k i ,v i ), … (k j , v j ) } = search( k 1 , k 2 ) ▪ delete( k ) • New operation: Talk about soon! ▪ upsert( k , ƒ, 𝚬 )

B ε -tree Inserts All data is inserted to the root node’s buffer.

B ε -tree Inserts When a buffer fills, contents are flushed to children

B ε -tree Inserts

B ε -tree Inserts Flushes can cascade if not enough room in child nodes

B ε -tree Inserts Flushes can cascade if not enough room in child nodes Invariant: height in the tree preserves update order

B ε -tree Searches Read and search all nodes on root-to-leaf path Newest insert is closest to the root. Search all node buffers   for messages   applicable to target key

Updates • In most systems, updating a value requires: read, modify, write FUSE FAT write? • Problem: B ε -tree inserts are faster than searches ‣ fast updates are impossible if we must search first upsert = update + insert

Upsert messages • Each upsert message contains a: • Target key, k • Callback function, ƒ • Set of function arguments, 𝚬 • Upserts are added into the B ε -tree like any other message • The callback is evaluated whenever the message is applied ‣ Upserts can specify a modification and lazily do the work

B ε -tree Upserts upsert( k ,ƒ, 𝚬 )

B ε -tree Upserts Upserts are stored in the tree like any other operation

B ε -tree Upserts

Searching with Upserts Read all nodes on root-to- leaf search path Apply updates in reverse chronological order Upserts don’t harm searches, but they let us perform blind updates .

Thought Question • What types of operations might naturally be encoded as upserts?

Performance Model • Disk Access Machine (DAM) Model [Aggarwal & Vitter ’88] • Idea: expensive part of an algorithm’s execution is transferring data to/from memory Memory • Parameters: - B : block size B - M : memory size B - N : data size Disk Performance = (# of I/Os)

? Point Query: Range Query: Insert/upsert: B − B ε B ε O ( log B ε N ) … … … … … …

O(log B N) Goal: Compare query performance to a B-tree ➡ B ε -tree fanout: B ε … s e s a b t n ➡ B ε -tree height: O ( log B ε N ) e r e f f i D [ https://www.khanacademy.org ] [ https://www.chilimath.com/lessons/advanced-algebra/logarithm-rules/ ]

O ( log B N ) Point Query: ε ? Range Query: Insert/upsert: B − B ε B ε O ( log B ε N ) … … … … … …

O ( log B N ) Point Query: ε O ( log B N + ` B ) Range Query: " ? Insert/upsert: B − B ε B ε O ( log B ε N ) … … … … … … O ( ` B )

O ( log B N ) Point Query: ε O ( log B N + ` B ) Range Query: " ? Insert/upsert: B − B ε B ε O ( log B ε N ) … … … … … …

Goal: Attribute the cost of flushing across all messages   that benefit from the work. ➡ How many times is an insert flushed? O ( log B ε N ) ➡ How many messages are moved per flush? O ( B − B ε ) B ε B-B ε B B ε ➡ How do we “share the work” among the messages? • Divide by the total cost by the number of messages

O ( log B N ) Point Query: ε O ( log B N + ` B ) Range Query: " O ( log B N ε B 1 − ε ) Insert/upsert: Each flush operation moves items Each insert message is O ( B − B ε ) B ε flushed times O ( log B ε N ) B − B ε B ε O ( log B ε N ) … … Batch size divides the insert cost… Inserts are very fast! … … … …

Recap/Big Picture • Disk seeks are slow ➡ big I/Os improve performance • B ε -trees convert small updates to large I/Os • Inserts: orders-of-magnitude faster • Upserts: let us update data without reading • Point queries: as fast as standard tree indexes • Range queries: near-disk bandwidth (w/ large B) Question: How do we choose B and ε ?

Thought Questions B-B ε • How do we choose ε ? B B ε • Original paper didn’t actually use the term B ε -tree (or spend very long on the idea). Showed there are various points on the trade-off curve between B-trees and Buffered Repository trees • What happens if ε = 1? ε = 1 corresponds to a B-tree • What happens if ε = 0? ε = 0 corresponds to a Buffered Repository tree

Thought Questions B-B ε • How do we choose B ? B B ε • Let’s first think about B-trees • What changes when B is large? • What changes when B is small? • B ε -trees buffer data; batch size divides the insert cost • What changes when B is large? • What changes when B is small? In practice choose B and “fanout”. B ≈ 2-8MiB, fanout ≈ 16

Thought Questions • How does a B ε -tree compare to an LSM-tree? ‣ Compaction vs. flushing ‣ Queries (range and point) ‣ Upserts

Thought Questions • How would you implement copy(old, new) ‣ delete(“large”) :: kv-pair that occupies a whole leaf? ‣ delete(“a*|b*|c*”) :: a contiguous range of kv-pairs? ‣

Next Class • From Be-tree to file system!

B -trees CSCI 333 Williams College Logistics Lab 2b Office - PowerPoint PPT Presentation

B -trees CSCI 333 Williams College Logistics Lab 2b Office hours Tuesday night, 7-9pm Final Project Proposals Due Friday Come see me! Last Class General principles of write optimization LSM-trees Operations

Trees Trees CSE, IIT KGP Trees and Spanning Trees Trees and Spanning Trees A graph having

( ( ) ) ( ) ( ) = = Work = h log t n B- B -Trees Trees B B- -Trees

Trees Chapter 11 Chapter Summary Introduction to Trees Applications of Trees Tree

Trees Eric McCreath Overview In this lecture we will explore: general trees, binary trees,

2-3-4 Trees and Red- Black Trees 204 erm CS 16: Balanced Trees 2-3-4 Trees Revealed Nodes

/ + - * * 5 3 2 6 5 2 Examples Binary Trees BSTs Augmenting BinExpr General Trees

Trees Applied Multivariate Statistics Spring 2012 Overview Intuition for Trees

Splay Trees and B-Trees CSE 373 Data Structures Lecture 9 Readings Reading Sections

Decision Trees Lecture 23 To left or to right 1 Decision Trees 2 Decision Trees A different

Outline Univariate Trees 1 Decision Trees Classification Regression Pruning Steven J Zeil

The number of spanning trees of random 2 -trees Stephan Wagner (joint work with Elmar Teufl)

Binary Trees, Heaps Binary Trees, Heaps Binary trees Binary trees A binary tree (

AVL TREES Height Balance : AVL Trees h 1 h 2 | h - h | 1 AVL AVL 2 1 non-AVL trees

Algorithms and Data Structures Balanced Trees (AVL-Trees, (a,b)-Trees, Red-Black-Trees)

General Trees children that any node may have. Chapter 7 Well, non-binary trees anyway.

Decision Trees Lecture 22 To left or to right 1 Decision Trees 2 Decision Trees A different

CS 10: Problem solving via Object Oriented Programming Prioritizing 2 Agenda 1. Heaps 2. Heap

PiTree: Practical Implementations of ABR Algorithms Using Decision Trees Paper # P5C-04 Zili Meng

Mat 3770 Relationships Week 7 Mary Trees Examples Balanced Prufer Exercises Spring 2014

Trees 5 May 2016 OSU CSE 1 A New Math Type: tree A ubiquitous concept in computing is that

DS504/CS586: Big Data Analytics Data Management Prof. Yanhua Li Time: 6:00pm 8:50pm R

A Datatype for Binary Trees Functions on Trees Amtoft from Hatcliff Binary Trees One of many

CS 225 Data Structures Mar March h 11 11 BT BTrees Wade Fa Wa Fagen-Ul Ulmsch schnei

Storage and Indexing DBS Database Systems Reading: R&G Chapters 8, 9 & 10.1 Implementing

B -trees CSCI 333 Williams College Logistics Lab 2b Office - PowerPoint PPT Presentation

B -trees CSCI 333 Williams College Logistics Lab 2b Office hours Tuesday night, 7-9pm Final Project Proposals Due Friday Come see me! Last Class General principles of write optimization LSM-trees Operations

Trees Trees CSE, IIT KGP Trees and Spanning Trees Trees and Spanning Trees A graph having

( ( ) ) ( ) ( ) = = Work = h log t n B- B -Trees Trees B B- -Trees

Trees Chapter 11 Chapter Summary Introduction to Trees Applications of Trees Tree

Trees Eric McCreath Overview In this lecture we will explore: general trees, binary trees,

2-3-4 Trees and Red- Black Trees 204 erm CS 16: Balanced Trees 2-3-4 Trees Revealed Nodes

/ + - * * 5 3 2 6 5 2 Examples Binary Trees BSTs Augmenting BinExpr General Trees

Trees Applied Multivariate Statistics Spring 2012 Overview Intuition for Trees

Splay Trees and B-Trees CSE 373 Data Structures Lecture 9 Readings Reading Sections

Decision Trees Lecture 23 To left or to right 1 Decision Trees 2 Decision Trees A different

Outline Univariate Trees 1 Decision Trees Classification Regression Pruning Steven J Zeil

The number of spanning trees of random 2 -trees Stephan Wagner (joint work with Elmar Teufl)

Binary Trees, Heaps Binary Trees, Heaps Binary trees Binary trees A binary tree (

AVL TREES Height Balance : AVL Trees h 1 h 2 | h - h | 1 AVL AVL 2 1 non-AVL trees

Algorithms and Data Structures Balanced Trees (AVL-Trees, (a,b)-Trees, Red-Black-Trees)

General Trees children that any node may have. Chapter 7 Well, non-binary trees anyway.

Decision Trees Lecture 22 To left or to right 1 Decision Trees 2 Decision Trees A different

CS 10: Problem solving via Object Oriented Programming Prioritizing 2 Agenda 1. Heaps 2. Heap

PiTree: Practical Implementations of ABR Algorithms Using Decision Trees Paper # P5C-04 Zili Meng

Mat 3770 Relationships Week 7 Mary Trees Examples Balanced Prufer Exercises Spring 2014

Trees 5 May 2016 OSU CSE 1 A New Math Type: tree A ubiquitous concept in computing is that

DS504/CS586: Big Data Analytics Data Management Prof. Yanhua Li Time: 6:00pm 8:50pm R

A Datatype for Binary Trees Functions on Trees Amtoft from Hatcliff Binary Trees One of many

CS 225 Data Structures Mar March h 11 11 BT BTrees Wade Fa Wa Fagen-Ul Ulmsch schnei

Storage and Indexing DBS Database Systems Reading: R&amp;G Chapters 8, 9 &amp; 10.1 Implementing

Storage and Indexing DBS Database Systems Reading: R&G Chapters 8, 9 & 10.1 Implementing