LSM-trie: An LSM-tree-based Ultra- Large Key-Value Store for Small - PowerPoint PPT Presentation

LSM-trie: An LSM-tree-based Ultra- Large Key-Value Store for Small Data Xingbo Wu, Yuehai Xu, Zili Shao, Song Jiang Wayne State University The Hong Kong Polytechnic University ATC 2015 1

Motivation • Very small KV items are widespread. For a store of given capacity, smaller KV items demand more metadata to • locate them. Demand on a KV store’s capacity at individual KV servers keep • increasing. • Many KV stores require high performance for both reads and writes. However, for SILT, its major effort spends on optimizing reads by minimizing • metadata size, write performance is compromised. 2

Question 1: In the meantime, for some KV stores, such as SILT (Small • Index Large Table), major efforts are made to optimize reads by minimizing metadata size, while write performance can be compromised without conducting multi-level incremental compactions. Explain how high write amplifications are produced in SILT? 3

Question 3: Use Figure 1 in this paper to explain the difference between • linear and exponential growth pattern? 4

Question 2: Note that LSM-trie uses hash functions to organize its data • and accordingly does not support range search. Does LevelDB support range search? LevelDB supports range search. This figure from paper: <WiscKey: Sepera tj ng Keys fs om Values in SSD-Conscious S tp rage> 5

Question 4: Among all compactions moving data from Lk to Lk+1, we must • make sure their key range are not overlapped to keep any two SSTables at level Lk+1 from having overlapped key ranges. However, this cannot be achieved with the LevelDB data organization. Explain why levelDB can not achieve it? The key range of an SSTable is variable in levelDB and the range’s distribution can be different in different levels. 6

Question 5: Use Figure 2 and 3 to describe the LSM-trie’s structure and • how compaction is performed in the trie? 7

Question 5: Use Figure 2 and 3 to describe the LSM-trie’s structure and • how compaction is performed in the trie? 8

Question 6: The indices and Bloom filters in a KV store can grow very • large. Use an example to show that these metadata in LevelDB may have to be out of core. Suppose we have 10TB disk, the block size is 4 KB, and size of index for each block is 12B. Then the total size of index will be 30 GB. (10TB/4KB)*12B = 30GB Suppose we have 10 TB disk, the size of each KV item is 100 B, and 10-bit-per-key in Bloom filter. Then the total size of Bloom filter will be 125 GB. (10TB/100B)*1.25B = 125GB The total size of metadata will be 155GB! 9

Question 7: Therefore, the Bloom filter must be beefed up by using more • bits. Use an example to show why the Bloom filter have to be longer. Suppose we have a SSTable-trie which has 7 levels, each level has 8 sub- levels. The false positive rate for Bloom filter employed 10-bit per item will be 0.82%. At the worst case, the probability of searching the levels without the KV item we wanted will increase from 5.74% (7 * 0.82%) to 45.92% (7*8*0.82%). 10

LSM-trie: An LSM-tree-based Ultra- Large Key-Value Store for Small - PowerPoint PPT Presentation

LSM-trie: An LSM-tree-based Ultra- Large Key-Value Store for Small Data Xingbo Wu, Yuehai Xu, Zili Shao, Song Jiang Wayne State University The Hong Kong Polytechnic University ATC 2015 1 Motivation Very small KV items are widespread. For

LSM-trie An LSM-tree-based Ultra-Large Key-Value Store for small Data by: Xingbo Wu, Yuehai Xu,

LSM-trie: An LSM-tree-based Ultra-Large Key-Value Store for Small Data Xingbo Wu , Yuehai Xu ,

Sapporo Sapporo Namba Namba Shinjuku Shinjuku Store Store Store Store West Store West

LSM SM-Tr Trie ie: : An An LSM SM-tre ree-base ased d Ultra-Lar arge ge Ke Key-Va Valu

LOG-STRUCTURED MERGE-TRIE PART 1 Xingbo Wu and Yuehai Xu, Wayne State University; Zili Shao, The

ChemBioDraw Today & Tomorrow Mark L. Olson, PhD Vice-President, Software Development

Are Hybrid Physical Designs Important? 1 B+ tree 2 C O L B+ tree 3 ? C O L C O L B+ tree

Stateful access control using LSM CS547 Thomas Uphill Stateful access cont rol using LSM 11

61A Lecture 21 Announcements Binary Trees Binary Tree Class 4 Binary Tree Class class

Meeting the Challenges of Ultra- -Large Large- - Meeting the Challenges of Ultra Scale Systems

Meeting the Challenges of Ultra- -Large Large- -Scale Scale Meeting the Challenges of Ultra

Meeting the Challenges of Ultra- -Large Large- -Scale Scale Meeting the Challenges of Ultra

Enabling Future Enabling Future Technology Technology Ultra-Large-Scale Systems

Tree-sitter @maxbrunsfeld What is Tree-sitter? Why I wrote Tree-sitter What were

Key-Value Stores Key-value stores are popular. web searching, social networks, e-commerce,

Position: Synergetic Effects of Software and Hardware Parameters on the LSM System Authors:

Time-critical reactive systems (modelling) Jos Proena HASLab - INESC TEC Universidade do

Time-critical reactive systems (modelling) Lus Soares Barbosa HASLab - INESC TEC Universidade

Hardware-Software Codesign 11. Thermal-Aware Design Iuliana Bacivarov & Lothar Thiele Swiss

Concurrent Programming in Harmony: Critical Sections and Locks CS 4410 Operating Systems

Darrell Bethea May 12, 2011 1 Homework 0 due tonight Grades will be posted on Blackboard

CIS 500 Software Foundations Recursion Fall 2005 2 November CIS 500, 2

CS 10: Problem solving via Object Oriented Programming Info Retrieval ADT Overview List

DATABASE SYSTEM IMPLEMENTATION GT 4420/6422 // SPRING 2019 // @JOY_ARULRAJ LECTURE #3: STORAGE

Explore More Topics

Sambuz

Useful Links

Newsletter

Mail Us

LSM-trie: An LSM-tree-based Ultra- Large Key-Value Store for Small - PowerPoint PPT Presentation

LSM-trie: An LSM-tree-based Ultra- Large Key-Value Store for Small Data Xingbo Wu, Yuehai Xu, Zili Shao, Song Jiang Wayne State University The Hong Kong Polytechnic University ATC 2015 1 Motivation Very small KV items are widespread. For

LSM-trie An LSM-tree-based Ultra-Large Key-Value Store for small Data by: Xingbo Wu, Yuehai Xu,

LSM-trie: An LSM-tree-based Ultra-Large Key-Value Store for Small Data Xingbo Wu , Yuehai Xu ,

Sapporo Sapporo Namba Namba Shinjuku Shinjuku Store Store Store Store West Store West

LSM SM-Tr Trie ie: : An An LSM SM-tre ree-base ased d Ultra-Lar arge ge Ke Key-Va Valu

LOG-STRUCTURED MERGE-TRIE PART 1 Xingbo Wu and Yuehai Xu, Wayne State University; Zili Shao, The

ChemBioDraw Today &amp; Tomorrow Mark L. Olson, PhD Vice-President, Software Development

Are Hybrid Physical Designs Important? 1 B+ tree 2 C O L B+ tree 3 ? C O L C O L B+ tree

Stateful access control using LSM CS547 Thomas Uphill Stateful access cont rol using LSM 11

61A Lecture 21 Announcements Binary Trees Binary Tree Class 4 Binary Tree Class class

Meeting the Challenges of Ultra- -Large Large- - Meeting the Challenges of Ultra Scale Systems

Meeting the Challenges of Ultra- -Large Large- -Scale Scale Meeting the Challenges of Ultra

Meeting the Challenges of Ultra- -Large Large- -Scale Scale Meeting the Challenges of Ultra

Enabling Future Enabling Future Technology Technology Ultra-Large-Scale Systems

Tree-sitter @maxbrunsfeld What is Tree-sitter? Why I wrote Tree-sitter What were

Key-Value Stores Key-value stores are popular. web searching, social networks, e-commerce,

Position: Synergetic Effects of Software and Hardware Parameters on the LSM System Authors:

Time-critical reactive systems (modelling) Jos Proena HASLab - INESC TEC Universidade do

Time-critical reactive systems (modelling) Lus Soares Barbosa HASLab - INESC TEC Universidade

Hardware-Software Codesign 11. Thermal-Aware Design Iuliana Bacivarov &amp; Lothar Thiele Swiss

Concurrent Programming in Harmony: Critical Sections and Locks CS 4410 Operating Systems

Darrell Bethea May 12, 2011 1 Homework 0 due tonight Grades will be posted on Blackboard

CIS 500 Software Foundations Recursion Fall 2005 2 November CIS 500, 2

CS 10: Problem solving via Object Oriented Programming Info Retrieval ADT Overview List

DATABASE SYSTEM IMPLEMENTATION GT 4420/6422 // SPRING 2019 // @JOY_ARULRAJ LECTURE #3: STORAGE

Explore More Topics

Sambuz

Useful Links

Newsletter

Mail Us

ChemBioDraw Today & Tomorrow Mark L. Olson, PhD Vice-President, Software Development

Hardware-Software Codesign 11. Thermal-Aware Design Iuliana Bacivarov & Lothar Thiele Swiss