Dynamo: Amazons Highly Available Key-value Store Josh Blum | 6.S897 - PowerPoint PPT Presentation

Dynamo: Amazon’s Highly Available Key-value Store Josh Blum | 6.S897 | 09/28/2015

Introduction - Amazon’s e-commerce platform serves tens of millions customers at peak times using tens of thousands of servers located in many data centers around the world. - Need for a scalable and highly available key-value store - Choose to focus on an eventually consistent store - Sacrifices consistency for availability

System Assumptions and Requirements - Query Model Data is uniquely identified by a key, stored as binary blob - - No need for relational schema - Efficiency - Runs on commodity heterogenous hardware infrastructure - Stringent latency requirements: SLA is 300ms for 99.9th percentile requests - Other Assumptions - Security isn’t an issue

API - get(key) - Returns a single object or a list of objects with conflicting versions along with a context - Conflicts are handled on reads, never reject a write - put(key, context, object) - context refers to various kinds of system metadata

Data Partitioning - Consistent hashing - Output range of a hash is treated as a ‘ring’. - Assign a key to each object (MD5 of 128-bit client supplied key) - MD5(key) -> node (position on the Ring) - Incrementally scalable: adding a single node does not affect the system significantly - “Virtual Nodes” - Each node can be responsible for more than one virtual node. - Work distribution proportional to the capabilities of the individual node

Data Partitioning

Replication Example: N=3 - Node B replicates the key k at nodes C and D in addition to storing it locally. - Node D will store the keys in the ranges (A, B], (B, C], and (C, D].

Data Versioning - System is eventually consistent, thus a get() call may return stale data - An object can have distinct version sub-histories, the system needs reconcile in the future - Uses vector clocks in order to capture causality between different versions of the same object.

Vector Clocks - A vector clock is a list of (node, counter) pairs. - Every version of every object is associated with one vector clock. - When a client wishes to update an object, it must specify which version it is updating. - This is done by passing the “context” it obtained from an earlier read operation, which contains the vector clock information.

Sloppy Quorum - R : minimum number of nodes that must participate in a successful read operation - W : the minimum number of nodes that must participate in a successful write operation - Setting R + W > N yields a quorum-like system. - The latency of a get() (or put() ) operation is dictated by the slowest of the R (or W ) replicas - R and W are usually configured to be less than N , to provide better latency.

Sloppy Quorum: get() - get() : coordinator reads from N nodes; waits for R responses. - If they agree, return value. - If they disagree, but are causally related, return the most recent value - If they are causally unrelated apply reconciliation techniques and write back the corrected version

Sloppy Quorum: put() - put() : the coordinator writes to the first N healthy nodes on the preference list. - Coordinator writes new version vector clock locally and forwards to N highest ranked reachable nodes - If W-1 more writes succeed, the write is considered to be successful

(N, R, W) Configurations - Typical: (3, 2, 2) - Balances performance, durability, and availability - W = 1 - Never reject a write as long as one node is alive - Low values of W and R can increase the risk of inconsistency - Requests are successful before being processed by a majority of the replicas. - Introduces vulnerability window for durability for writes

Failures - Like Google, Amazon has a number of data centers, each with many commodity machines. - Individual machines fail regularly - Sometimes entire data centers fail due to power outages, network partitions, tornados, etc. - To handle failure of entire centers, replicas are spread across multiple data centers. - Hinted handoff for transient failures - Merkle trees for replica synchronization

Questions?

Dynamo: Amazons Highly Available Key-value Store Josh Blum | 6.S897 - PowerPoint PPT Presentation

Dynamo: Amazons Highly Available Key-value Store Josh Blum | 6.S897 | 09/28/2015 Introduction - Amazons e-commerce platform serves tens of millions customers at peak times using tens of thousands of servers located in many data centers

Amazon Dynamo A Highly Available Key-value Store Present by Jian Fang jianf@cmu.edu What is

Sapporo Sapporo Namba Namba Shinjuku Shinjuku Store Store Store Store West Store West

Relational Document Time Series Amazon Aurora Amazon DocumentDB Amazon Timestream Graph

Dynamo & Bigtable CSCI 2270, Spring 2011 Irina Calciu Zikai Wang Dynamo Amazon's highly

Relational Amazon Aurora Amazon RedShi f Amazon RDS AWS Database Migration Service DMS

Amazon Dynamo distributed key-value storage Michal Oniszczuk October 10, 2012 Michal Oniszczuk

Introduction Need for a highly available Distributed Data Store During the holiday shopping

Reliability at Scale A tale of Amazon Dynamo Presented by Yunhe Liu @ CS6410 Fall19 Slides

Dynamo Saurabh Agarwal What have we looked at so far ? Assumptions CAP Theorem SQL and

Dynamo: Amazons Highly Available Key-value Store Giuseppe DeCandia, Deniz Hastorun, Madan

Dynamo Amazons Highly Available Key-value Store SOSP 07 Authors Giuseppe DeCandia, Deniz

Dynamo Amazons Highly-Available Key-value Store 2007 Giuseppe DeCandia, Deniz Hastorun, Madan

DynamO Workshop Introduction to Event-Driven Dynamics and DynamO Dr Marcus N. Bannerman & Dr

Dynamo Dynamo motivation Fast, available writes - Shopping cart: always enable purchases FLP:

Dynamo Dynamo motivation Fast, available writes - Shopping cart: always enable purchases FLP:

Deep Semantic Matching for Amazon Product Search Yi Yiwei ei So Song ng Amazon Product

Fixed Income Investor Presentation FY 2016 Results 24 February 2017 Ewen Stevenson Chief

Unit 4: Performance & Benchmarking CPU Performance Performance Pitfalls

1. X-ray and gamma-ray Astronomy PhD Course, University of Padua Page 1 High Energy and Time

Scalable Machine Learning 3. Data Streams Alex Smola Yahoo! Research and ANU

Material structure elucidation methods X-ray analysis dr. va Mak 1 Major branches of

Theoretical results Ignoring demand dynamics, nave old pricing model works well. Theorem : In

Replica and all that Giorgio Parisi In this talk I will present an history of the replica method.

Scheduling 3 / Threads 1 last time shortest job fjrst/shortest remaining time fjrst response

Dynamo: Amazons Highly Available Key-value Store Josh Blum | 6.S897 - PowerPoint PPT Presentation

Dynamo: Amazons Highly Available Key-value Store Josh Blum | 6.S897 | 09/28/2015 Introduction - Amazons e-commerce platform serves tens of millions customers at peak times using tens of thousands of servers located in many data centers

Amazon Dynamo A Highly Available Key-value Store Present by Jian Fang jianf@cmu.edu What is

Sapporo Sapporo Namba Namba Shinjuku Shinjuku Store Store Store Store West Store West

Relational Document Time Series Amazon Aurora Amazon DocumentDB Amazon Timestream Graph

Dynamo &amp; Bigtable CSCI 2270, Spring 2011 Irina Calciu Zikai Wang Dynamo Amazon's highly

Relational Amazon Aurora Amazon RedShi f Amazon RDS AWS Database Migration Service DMS

Amazon Dynamo distributed key-value storage Michal Oniszczuk October 10, 2012 Michal Oniszczuk

Introduction Need for a highly available Distributed Data Store During the holiday shopping

Reliability at Scale A tale of Amazon Dynamo Presented by Yunhe Liu @ CS6410 Fall19 Slides

Dynamo Saurabh Agarwal What have we looked at so far ? Assumptions CAP Theorem SQL and

Dynamo: Amazons Highly Available Key-value Store Giuseppe DeCandia, Deniz Hastorun, Madan

Dynamo Amazons Highly Available Key-value Store SOSP 07 Authors Giuseppe DeCandia, Deniz

Dynamo Amazons Highly-Available Key-value Store 2007 Giuseppe DeCandia, Deniz Hastorun, Madan

DynamO Workshop Introduction to Event-Driven Dynamics and DynamO Dr Marcus N. Bannerman &amp; Dr

Dynamo Dynamo motivation Fast, available writes - Shopping cart: always enable purchases FLP:

Dynamo Dynamo motivation Fast, available writes - Shopping cart: always enable purchases FLP:

Deep Semantic Matching for Amazon Product Search Yi Yiwei ei So Song ng Amazon Product

Fixed Income Investor Presentation FY 2016 Results 24 February 2017 Ewen Stevenson Chief

Unit 4: Performance &amp; Benchmarking CPU Performance Performance Pitfalls

1. X-ray and gamma-ray Astronomy PhD Course, University of Padua Page 1 High Energy and Time

Scalable Machine Learning 3. Data Streams Alex Smola Yahoo! Research and ANU

Material structure elucidation methods X-ray analysis dr. va Mak 1 Major branches of

Theoretical results Ignoring demand dynamics, nave old pricing model works well. Theorem : In

Replica and all that Giorgio Parisi In this talk I will present an history of the replica method.

Scheduling 3 / Threads 1 last time shortest job fjrst/shortest remaining time fjrst response

Dynamo & Bigtable CSCI 2270, Spring 2011 Irina Calciu Zikai Wang Dynamo Amazon's highly

DynamO Workshop Introduction to Event-Driven Dynamics and DynamO Dr Marcus N. Bannerman & Dr

Unit 4: Performance & Benchmarking CPU Performance Performance Pitfalls