Master
◮ Assigns tablets to tablet servers.
◮ Balances tablet server load.
◮ Garbage-collects unneeded files in GFS.
◮ Handles schema changes, e.g., table and column family creation.
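A minimal sketch in Python of the assignment/balancing idea, assuming a greedy least-loaded policy; the paper does not specify this algorithm, and all names here are illustrative:

```python
import heapq

def assign_tablets(tablets, servers):
    """Greedily hand each unassigned tablet to the least-loaded server."""
    load = [(0, s) for s in servers]          # (tablet count, server id)
    heapq.heapify(load)
    assignment = {s: [] for s in servers}
    for t in tablets:
        count, server = heapq.heappop(load)   # pick the least-loaded server
        assignment[server].append(t)
        heapq.heappush(load, (count + 1, server))
    return assignment

# {'srv-a': ['t1', 't3', 't5'], 'srv-b': ['t2', 't4']}
print(assign_tablets(["t1", "t2", "t3", "t4", "t5"], ["srv-a", "srv-b"]))
```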
Tablet Server
◮ Can be added or removed dynamically.
◮ Each manages a set of tablets (typically 10-1000 tablets per server).
◮ Handles read/write requests to its tablets.
◮ Splits tablets that have grown too large.
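A hedged sketch of the split rule, assuming a size threshold and a split at the middle row key; the threshold and the data layout are illustrative, not BigTable's actual values:

```python
SPLIT_THRESHOLD = 1 << 30  # ~1 GB; an assumed limit, not BigTable's real one

def maybe_split(tablet):
    """tablet: {'start', 'end', 'rows' (sorted keys), 'size' (bytes)}."""
    if tablet["size"] <= SPLIT_THRESHOLD:
        return [tablet]
    mid = tablet["rows"][len(tablet["rows"]) // 2]  # middle row key
    left = {"start": tablet["start"], "end": mid, "size": tablet["size"] // 2,
            "rows": [r for r in tablet["rows"] if r < mid]}
    right = {"start": mid, "end": tablet["end"],
             "size": tablet["size"] - tablet["size"] // 2,
             "rows": [r for r in tablet["rows"] if r >= mid]}
    return [left, right]

big = {"start": "a", "end": "z", "rows": ["a", "g", "m", "s"],
       "size": 2 * SPLIT_THRESHOLD}
print([(t["start"], t["end"]) for t in maybe_split(big)])  # [('a','m'), ('m','z')]
```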
Client Library
◮ Library that is linked into every client.
◮ Client data does not move through the master.
◮ Clients communicate directly with tablet servers for reads and writes.
Building Blocks
◮ The building blocks of BigTable are:
  • Google File System (GFS)
  • Chubby
  • SSTable
Google File System (GFS)
◮ Large-scale distributed file system.
◮ Stores log and data files.
Chubby Lock Service
◮ Ensures there is only one active master.
◮ Stores the bootstrap location of BigTable data.
◮ Discovers tablet servers.
◮ Stores BigTable schema information and access control lists.
SSTable
◮ The SSTable file format is used internally to store BigTable data.
◮ Chunks of data plus a block index.
◮ Immutable, sorted file of key-value pairs.
◮ Each SSTable is stored in a GFS file.
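To make the format concrete, here is a toy SSTable in Python: sorted key-value pairs written in blocks, plus an index of each block's first key. The JSON on-disk layout is purely illustrative; real SSTables add compression, checksums, and more:

```python
import bisect, json

def write_sstable(path, items, block_size=2):
    """Write sorted (key, value) pairs as fixed-size blocks plus a block index."""
    items = sorted(items)                     # SSTables are sorted and immutable
    index = [items[i][0] for i in range(0, len(items), block_size)]
    blocks = [items[i:i + block_size] for i in range(0, len(items), block_size)]
    with open(path, "w") as f:
        json.dump({"index": index, "blocks": blocks}, f)

def lookup(path, key):
    """Binary-search the block index, then scan a single block."""
    with open(path) as f:
        table = json.load(f)
    i = bisect.bisect_right(table["index"], key) - 1
    if i < 0:
        return None
    return dict(map(tuple, table["blocks"][i])).get(key)

write_sstable("/tmp/demo.sst", [("b", "2"), ("a", "1"), ("c", "3")])
print(lookup("/tmp/demo.sst", "c"))  # -> '3'
```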
Tablet Serving
Master Startup
◮ The master executes the following steps at startup:
  • Grabs a unique master lock in Chubby, which prevents concurrent master instantiations.
  • Scans the servers directory in Chubby to find the live servers.
  • Communicates with every live tablet server to discover which tablets are already assigned to each server.
  • Scans the METADATA table to learn the set of tablets.
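The same steps as hedged pseudocode; chubby, metadata_table, and the per-server call are stand-in interfaces for what the paper describes, not real client libraries:

```python
def master_startup(chubby, metadata_table):
    chubby.acquire("/bigtable/master-lock")          # 1. unique master lock
    live_servers = chubby.list("/bigtable/servers")  # 2. scan servers directory
    assigned = set()
    for server in live_servers:                      # 3. ask every live server
        assigned |= set(server.list_assigned_tablets())
    all_tablets = set(metadata_table.all_tablets())  # 4. scan METADATA
    return all_tablets - assigned                    # tablets still unassigned
```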
Tablet Assignment
◮ 1 tablet → 1 tablet server.
◮ The master uses Chubby to keep track of live tablet servers and unassigned tablets.
  • When a tablet server starts, it creates and acquires an exclusive lock in Chubby.
◮ The master detects the status of each tablet server's lock by checking periodically.
◮ The master is responsible for detecting when a tablet server is no longer serving its tablets and reassigning those tablets as soon as possible.
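A sketch of that periodic check, assuming a simple lock-then-probe policy; all methods here are hypothetical stand-ins for Chubby and RPC calls:

```python
def check_tablet_servers(chubby, servers, unassigned):
    """Reclaim tablets from servers that lost their Chubby lock and are down."""
    for server in list(servers):
        if chubby.lock_held(server.lock_path):
            continue                         # lock still held: server is healthy
        if not server.reachable():
            chubby.delete(server.lock_path)  # ensure it can never serve again
            unassigned.extend(server.tablets)
            servers.remove(server)           # its tablets await reassignment
```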
Finding a Tablet
◮ Three-level hierarchy.
◮ The first level is a file stored in Chubby that contains the location of the root tablet.
◮ The root tablet contains the locations of all tablets of a special METADATA table.
◮ The METADATA table contains the location of each user tablet under a row key.
◮ The client library caches tablet locations.
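The lookup path, sketched with a client-side cache; the chubby/root/metadata objects are assumed interfaces, and a real client caches and prefetches more aggressively than this:

```python
_location_cache = {}

def locate_tablet(chubby, table, row_key):
    if (table, row_key) in _location_cache:            # cache hit: no lookups
        return _location_cache[(table, row_key)]
    root = chubby.read("/bigtable/root-tablet")        # level 1: Chubby file
    meta = root.lookup_metadata_tablet(table, row_key) # level 2: root tablet
    location = meta.lookup_tablet(table, row_key)      # level 3: METADATA row
    _location_cache[(table, row_key)] = location
    return location
```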
Tablet Serving (1/2)
◮ Updates are committed to a commit log.
◮ Recently committed updates are stored in memory, in the memtable.
◮ Older updates are stored in a sequence of SSTables.
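A toy version of this write/read path in Python, assuming a tiny memtable limit so that flushes are visible; real compaction and recovery logic is far richer:

```python
MEMTABLE_LIMIT = 4                         # assumed, tiny for demonstration
commit_log, memtable, sstables = [], {}, []

def write(key, value):
    commit_log.append((key, value))        # 1. durable record in the commit log
    memtable[key] = value                  # 2. recent update kept in memory
    if len(memtable) >= MEMTABLE_LIMIT:    # 3. flush: memtable -> new SSTable
        sstables.append(dict(sorted(memtable.items())))
        memtable.clear()

def read(key):
    if key in memtable:                    # newest data wins
        return memtable[key]
    for sst in reversed(sstables):         # then newer-to-older SSTables
        if key in sst:
            return sst[key]
    return None

for i in range(5):
    write(f"k{i}", i)
print(read("k1"), read("k4"))              # k1 from an SSTable, k4 from memtable
```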
Tablet Serving (2/2)
◮ Strong consistency
  • Only one tablet server is responsible for a given piece of data.
  • Replication is handled at the GFS layer.
◮ Trade-off with availability
  • If a tablet server fails, its portion of the data is temporarily unavailable until a new server is assigned.
Loading Tablets
◮ To load a tablet, a tablet server does the following:
◮ Finds the location of the tablet through its METADATA entry.
  • The metadata for a tablet includes the list of SSTables and a set of redo points.
◮ Reads the SSTable index blocks into memory.
◮ Reads the commit log since the redo point and reconstructs the memtable.
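The same recovery idea in miniature, assuming sequence-numbered log records and a metadata dict; the names and structures are illustrative:

```python
def load_tablet(metadata, commit_log):
    """metadata: {'sstables': [paths], 'redo_point': seqno}."""
    sstable_paths = list(metadata["sstables"])  # index blocks would be read here
    memtable = {}
    for seqno, key, value in commit_log:
        if seqno > metadata["redo_point"]:      # older updates already flushed
            memtable[key] = value
    return sstable_paths, memtable

print(load_tablet({"sstables": ["sst-1"], "redo_point": 2},
                  [(1, "a", "old"), (3, "a", "new")]))
# (['sst-1'], {'a': 'new'})
```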
BigTable vs. HBase

  BigTable        HBase
  -------------   -------------
  GFS             HDFS
  Tablet Server   Region Server
  SSTable         StoreFile
  Memtable        MemStore
  Chubby          ZooKeeper
HBase Example

# Create the table "test", with the column family "cf"
create 'test', 'cf'

# Use describe to get the description of the "test" table
describe 'test'

# Put data in the "test" table
put 'test', 'row1', 'cf:a', 'value1'
put 'test', 'row2', 'cf:b', 'value2'
put 'test', 'row3', 'cf:c', 'value3'

# Scan the table for all data at once
scan 'test'

# To get a single row of data at a time, use the get command
get 'test', 'row1'
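The same session can also be driven from a program. Here is a rough equivalent using the third-party happybase client, assuming HBase's Thrift server is running on localhost:

```python
import happybase

connection = happybase.Connection("localhost")
connection.create_table("test", {"cf": dict()})  # table "test", family "cf"
table = connection.table("test")

table.put(b"row1", {b"cf:a": b"value1"})
table.put(b"row2", {b"cf:b": b"value2"})
table.put(b"row3", {b"cf:c": b"value3"})

for row_key, data in table.scan():               # scan the whole table
    print(row_key, data)

print(table.row(b"row1"))                        # get a single row
```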
Cassandra
Cassandra
◮ A column-oriented database.
◮ It was created at Facebook and was later open sourced.
◮ CAP: availability and partition tolerance.
Borrowed From BigTable
◮ Data model: column oriented
  • Keyspaces (similar to the schema in a relational database), tables, and columns.
◮ SSTable disk storage
  • Append-only commit log
  • Memtable (buffering and sorting)
  • Immutable SSTable files
Data Partitioning (1/2)
◮ Key/value model, where values are stored as objects.
◮ If the size of the data exceeds the capacity of a single machine: partitioning.
◮ Consistent hashing for partitioning.
Data Partitioning (2/2)
◮ Consistent hashing.
◮ Hash both the data and the node ids with the same hash function into the same id space.
◮ partition = hash(d) mod n, where d is the data key and n is the size of the id space.

  id space = [0, 15], n = 16
  hash("Fatemeh") = 12
  hash("Ahmad") = 2
  hash("Seif") = 9
  hash("Jim") = 14
  hash("Sverker") = 4
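The example worked in Python. The hash values above are illustrative; this sketch uses MD5 mod 16, so the actual numbers differ, but the mechanics (hash into the id space, walk clockwise to the responsible node) are the same:

```python
import hashlib

N = 16  # size of the id space

def h(s):
    """hash(d) mod n, using MD5 as a stand-in hash function."""
    return int(hashlib.md5(s.encode()).hexdigest(), 16) % N

def responsible_node(key, node_ids):
    """First node at or after hash(key), wrapping around the ring."""
    k = h(key)
    return min(node_ids, key=lambda node: (h(node) - k) % N)

nodes = ["node-1", "node-2", "node-3"]
for name in ["Fatemeh", "Ahmad", "Seif", "Jim", "Sverker"]:
    print(name, "->", responsible_node(name, nodes))
```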
Replication
◮ To achieve high availability and durability, data should be replicated on multiple nodes.
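A hedged sketch of ring replication, similar in spirit to Cassandra's simple placement strategy: each key lives on its responsible node plus the next nodes clockwise. The toy hash function repeats the one from the partitioning sketch:

```python
import hashlib

N = 16  # size of the id space

def h(s):
    return int(hashlib.md5(s.encode()).hexdigest(), 16) % N

def replica_nodes(key, node_ids, replication_factor=3):
    """The responsible node plus the next nodes clockwise on the ring."""
    k = h(key)
    ring = sorted(node_ids, key=lambda node: (h(node) - k) % N)
    return ring[:replication_factor]

# The first node listed is the primary; the rest hold replicas.
print(replica_nodes("Seif", ["node-1", "node-2", "node-3", "node-4"]))
```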
Adding and Removing Nodes
◮ Gossip-based mechanism: periodically, each node contacts another randomly selected node.
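A minimal gossip round in Python: every node bumps its own heartbeat and merges membership views with one random peer. This shows the mechanism only; Cassandra's gossiper exchanges richer state over several message rounds:

```python
import random

def gossip_round(nodes):
    """nodes: {name: {'view': {member: heartbeat}}}."""
    for name, node in nodes.items():
        node["view"][name] = node["view"].get(name, 0) + 1  # bump own heartbeat
        peer = nodes[random.choice([n for n in nodes if n != name])]
        for view_a, view_b in ((node["view"], peer["view"]),
                               (peer["view"], node["view"])):
            for member, hb in view_a.items():               # merge both ways
                view_b[member] = max(view_b.get(member, 0), hb)

cluster = {n: {"view": {}} for n in ["n1", "n2", "n3"]}
for _ in range(5):
    gossip_round(cluster)
print(cluster["n1"]["view"])  # after a few rounds, n1 typically knows n2 and n3
```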