LSM-trie: An LSM-tree-based - PowerPoint PPT Presentation

CSE 6350 File and Storage System Infrastructure in Data Centers Supporting Internet-wide Services
LSM-trie: An LSM-tree-based Ultra-Large Key-Value Store for Small Data (Cont’d)
Presenter: Sandeep S Rangaraju

2.4.2 Removing Indices by Using HTable
• LSM-trie uses the HTable, a hash-based KV organization.
• In an SSTable, items are sorted and an index is needed for locating a block. In an HTable, each block is treated as a bucket that receives the KV items whose keys are hashed into it.
• Each KV item has a SHA-1-generated 160-bit hash key. Its prefix is used to identify an SSTable in SSTable-trie or an HTable in LSM-trie; its suffix is used to determine the bucket within an HTable for the KV item.
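The prefix/suffix split of the hash key can be sketched as follows. This is only an illustration, not the LSM-trie implementation: the number of buckets per HTable, the number of prefix bits consumed per trie level, and the function names `hash_key`, `table_prefix`, and `bucket_index` are all assumptions that do not appear in the slides.

```python
import hashlib

BUCKETS_PER_HTABLE = 8192  # hypothetical value; the slides do not state it


def hash_key(key: bytes) -> bytes:
    """Return the 160-bit (20-byte) SHA-1 hash key of a user key."""
    return hashlib.sha1(key).digest()


def table_prefix(hkey: bytes, level: int, bits_per_level: int = 3) -> int:
    """Leading bits of the hash key that select a table at a trie level.

    bits_per_level is an assumed parameter used only for illustration.
    """
    nbits = level * bits_per_level
    if nbits == 0:
        return 0  # the root level consumes no prefix bits
    return int.from_bytes(hkey, "big") >> (160 - nbits)


def bucket_index(hkey: bytes, num_buckets: int = BUCKETS_PER_HTABLE) -> int:
    """Trailing bits (suffix) of the hash key select a bucket in the HTable."""
    suffix = int.from_bytes(hkey[-4:], "big")
    return suffix % num_buckets


hk = hash_key(b"user:42")
print(table_prefix(hk, 2), bucket_index(hk))
```

Because the table is chosen from the leading bits and the bucket from the trailing bits, the two choices draw on different parts of the same hash key.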

How to eliminate the index?
• To eliminate the index in an HTable, LSM-trie must use buckets of a fixed size.
• A Bloom filter is applied to each individual bucket. The entire bucket is read if its filter indicates that the lookup item may exist in the bucket.
• For access efficiency, buckets should be the same size as disk blocks (4 KB).
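The index-free lookup path described above can be sketched roughly as follows: hash the key to a bucket, consult that bucket's Bloom filter, and read the whole bucket only on a possible hit. The `BloomFilter` and `HTable` classes, their parameters, and the `block_reads` counter are illustrative assumptions; buckets are plain in-memory dicts here rather than fixed-size 4 KB disk blocks.

```python
import hashlib


class BloomFilter:
    """Small Bloom filter using double hashing over a SHA-1 digest."""

    def __init__(self, num_bits: int = 1024, num_hashes: int = 4):
        self.num_bits = num_bits
        self.num_hashes = num_hashes
        self.bits = bytearray((num_bits + 7) // 8)

    def _positions(self, key: bytes):
        digest = hashlib.sha1(key).digest()
        h1 = int.from_bytes(digest[:8], "big")
        h2 = int.from_bytes(digest[8:16], "big") | 1  # keep the step odd
        return [(h1 + i * h2) % self.num_bits for i in range(self.num_hashes)]

    def add(self, key: bytes) -> None:
        for p in self._positions(key):
            self.bits[p // 8] |= 1 << (p % 8)

    def might_contain(self, key: bytes) -> bool:
        return all(self.bits[p // 8] & (1 << (p % 8)) for p in self._positions(key))


class HTable:
    """Hash-addressed buckets with one Bloom filter per bucket, no block index."""

    def __init__(self, num_buckets: int = 16):
        self.buckets = [dict() for _ in range(num_buckets)]
        self.filters = [BloomFilter() for _ in range(num_buckets)]
        self.block_reads = 0  # counts simulated whole-bucket reads

    def _bucket_of(self, key: bytes) -> int:
        digest = hashlib.sha1(key).digest()
        return int.from_bytes(digest[-4:], "big") % len(self.buckets)

    def put(self, key: bytes, value: bytes) -> None:
        b = self._bucket_of(key)
        self.buckets[b][key] = value
        self.filters[b].add(key)

    def get(self, key: bytes):
        b = self._bucket_of(key)
        if not self.filters[b].might_contain(key):
            return None  # filter says "definitely absent": no bucket read
        self.block_reads += 1  # possible hit: read the entire bucket
        return self.buckets[b].get(key)


table = HTable()
table.put(b"apple", b"1")
print(table.get(b"apple"), table.block_reads)
```

The bucket position comes directly from the hash, so no per-block key needs to be stored or searched.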

Question 8
What’s the difference between SSTable in LevelDB and HTable in LSM-trie?
• SSTable in LevelDB: items are sorted and an index is needed for locating a block.
• HTable in LSM-trie: each block is considered as a bucket for receiving KV items whose keys are hashed into it. No index.

Question 9
• “However, a challenging issue is whether the buckets can be load balanced in terms of aggregate size of KV items hashed into them.”
Why may the buckets in an HTable be load unbalanced? How to correct the problem?

Why unbalanced?
• The hash function does not spread items perfectly evenly: several newly received KV items can map to the same bucket, so the aggregate size of the items in some buckets exceeds that of others.

How to correct?
• The buckets are first sorted into a list according to their initial loads.
• A paired migration operation is then performed within the list: a minimal number of KV items are moved out of the most overloaded bucket (the source) into the most under-loaded bucket (the destination) until the remaining items in the source fit in the bucket.
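The paired migration can be sketched as below. This is a simplified illustration under stated assumptions: each bucket is a list of item sizes already in rank order, items are moved from the tail of the source, a source is dropped from the working list once it fits, and the name `balance` and the 4096-byte capacity constant are chosen for the example only.

```python
CAPACITY = 4096  # bucket size in bytes, matching the 4 KB disk block


def balance(buckets, capacity=CAPACITY):
    """Pair the most overloaded bucket with the most under-loaded one.

    buckets: list of lists of item sizes (assumed to be in rank order).
    Returns the list of (source, destination, size) moves performed.
    """
    loads = [sum(b) for b in buckets]
    active = list(range(len(buckets)))
    moves = []
    while len(active) >= 2:
        active.sort(key=lambda i: loads[i])  # keep the list sorted by load
        dst, src = active[0], active[-1]
        if loads[src] <= capacity:
            break  # nothing left that is overloaded
        # Move as few tail items as needed for the source to fit.
        while loads[src] > capacity:
            size = buckets[src].pop()
            buckets[dst].append(size)
            loads[src] -= size
            loads[dst] += size
            moves.append((src, dst, size))
        active.remove(src)  # the source now fits and is done
    return moves


buckets = [[1000] * 6, [1000] * 2, [1000] * 4, [1000] * 3]
print(balance(buckets))
print([sum(b) for b in buckets])
```

In this run, two 1000-byte items leave bucket 0 for bucket 1, after which every bucket holds at most 4096 bytes. If the total load exceeds what the buckets can hold, some bucket stays overloaded in this sketch.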

Handling overflown buckets
• During creation of a new HTable, LSM-trie sets up a special bucket to receive the overflown items, and the items in this special bucket are fully indexed.
• The index is saved in the HTable file and is also cached in memory for efficiently locating the items.

Issues in addressing load balancing
• How can the KV items overflown out of a bucket be identified efficiently?
• A hash function is applied to the keys to rank the KV items, and the items are logically placed into the bucket according to their rankings.
• The bucket capacity (4 KB) is used as the WATERMARK. Any items above the WATERMARK are considered overflown items for migration.
• The hash value of the item at the WATERMARK, named the HASHMARK, is recorded so that future lookups can tell whether an item has been migrated.
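The WATERMARK and HASHMARK idea can be sketched as follows. The ranking hash (`rank_hash`), the choice of the first item that does not fit as the item "at the WATERMARK", and the function names are assumptions made for illustration; the slides do not specify these details.

```python
import hashlib


def rank_hash(key: bytes) -> int:
    """Assumed ranking hash: the SHA-1 digest of the key read as an integer."""
    return int.from_bytes(hashlib.sha1(b"rank:" + key).digest(), "big")


def split_at_watermark(items, capacity=4096):
    """Rank items by hash and cut the bucket at its capacity.

    items: dict mapping key -> item size in bytes.
    Returns (kept_keys, overflown_keys, hashmark); hashmark is None
    when every item fits below the watermark.
    """
    ranked = sorted(items, key=rank_hash)
    used = 0
    for pos, key in enumerate(ranked):
        if used + items[key] > capacity:
            # First item crossing the watermark: its rank is the HASHMARK.
            return ranked[:pos], ranked[pos:], rank_hash(key)
        used += items[key]
    return ranked, [], None


def was_migrated(key: bytes, hashmark) -> bool:
    """A lookup compares the key's rank with the recorded HASHMARK."""
    return hashmark is not None and rank_hash(key) >= hashmark


items = {b"k%d" % i: 1000 for i in range(6)}
kept, overflown, hashmark = split_at_watermark(items)
print(len(kept), len(overflown))
```

Only one value per bucket has to be remembered: any key ranked at or above the HASHMARK is treated as migrated, with no per-item bookkeeping.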

• The metadata for overflown items consists of SRC, DEST, and HASHMARK.
• If every bucket’s metadata were cached, the cost would be comparable to that of the indices in an SSTable, which record one key for each block (bucket).
• Therefore only the metadata for the most overloaded buckets is cached, and lookups of their migrated items are redirected to the respective destination buckets.
• Similar to LevelDB, LSM-trie maintains a Bloom filter for each bucket to quickly determine whether a KV item could be there.
• Migration of KV items doesn’t require updating the bucket’s Bloom filter, as these KV items still logically remain in the bucket and are only physically stored in other buckets.
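A lookup redirect based on cached (SRC, DEST, HASHMARK) metadata might look like the sketch below. The `MigrationMeta` record, the `build_cache` selection by overflow size, and the `physical_bucket` helper are hypothetical names introduced for illustration, and integer ranks stand in for the ranking hash values.

```python
from dataclasses import dataclass
from typing import Dict, List, Tuple


@dataclass(frozen=True)
class MigrationMeta:
    src: int       # bucket the items logically belong to
    dest: int      # bucket that physically stores the migrated items
    hashmark: int  # rank at the source bucket's watermark


def build_cache(candidates: List[Tuple[int, MigrationMeta]],
                limit: int) -> Dict[int, MigrationMeta]:
    """Cache metadata only for the `limit` most overloaded source buckets.

    candidates: (overflow_bytes, metadata) pairs, one per migrated bucket.
    """
    top = sorted(candidates, key=lambda c: c[0], reverse=True)[:limit]
    return {meta.src: meta for _, meta in top}


def physical_bucket(home: int, rank: int,
                    cache: Dict[int, MigrationMeta]) -> int:
    """Return the bucket to read for a key hashed to `home` with rank `rank`."""
    meta = cache.get(home)
    if meta is not None and rank >= meta.hashmark:
        return meta.dest  # item was migrated: read the destination bucket
    return home           # item stayed in its home bucket


cache = build_cache(
    [(900, MigrationMeta(3, 7, 1000)), (100, MigrationMeta(5, 2, 4000))],
    limit=1,
)
print(physical_bucket(3, 1500, cache), physical_bucket(3, 500, cache))
```

The Bloom filter of the home bucket is still the one to consult before any read, since migrated items remain logically in their source bucket.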
