Replication Distilled: Hazelcast Deep Dive Ensar Basri Kahveci Hazelcast
Hazelcast ▪ The leading open source Java IMDG ▪ Distributed Java collections, concurrency primitives, ... ▪ Distributed computations, messaging, ...
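As a quick taste of the API the deck assumes, here is a minimal sketch against the Hazelcast 3.x packages (in 4.x, IMap moved to com.hazelcast.map); the map name and entries are illustrative:

```java
import com.hazelcast.core.Hazelcast;
import com.hazelcast.core.HazelcastInstance;
import com.hazelcast.core.IMap;

public class HelloHazelcast {
    public static void main(String[] args) {
        // Starts a member that discovers and joins other members on the network.
        HazelcastInstance hz = Hazelcast.newHazelcastInstance();

        // A distributed map: entries are partitioned and replicated across members.
        IMap<String, String> cities = hz.getMap("cities");
        cities.put("1", "London");
        System.out.println(cities.get("1"));

        hz.shutdown();
    }
}
```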
In-Memory Data Grids ▪ Distributed caching ▪ Keeping data in local JVM for fast access & processing ▪ Elasticity, availability, high throughput, and low latency ▪ Multiple copies of data to tolerate failures
Replication ▪ Keeping copies of a data set on multiple nodes ▪ Fault tolerance ▪ Lower latency ▪ Higher throughput
Challenges ▪ Where to perform reads & writes? ▪ How to keep replicas in sync? ▪ How to handle concurrent reads & writes? ▪ How to handle failures?
CAP Principle ▪ Under a network partition (P), pick one of consistency (C) and availability (A) ▪ CP versus AP
CP
AP
Consistency/Latency Trade-off
PACELC Principle ▪ If there is a network partition (P) , we have to choose between availability and consistency (AC) . ▪ Else (E) , during normal operation, we can choose between latency and consistency (LC) .
Let’s build the core replication protocol of Hazelcast
Primary Copy ▪ Operations are sent to primary replicas. ▪ Strong consistency when the primary is reachable.
Partitioning (Sharding) ▪ Partitioning helps to scale primaries. ▪ A primary replica is elected for each partition.
Updating Replicas ▪ partition id = hash(serialize(key)) % partition count ▪ The operation is routed to the primary replica of the key's partition, then propagated to the backup replicas.
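A minimal sketch of the partition-mapping formula above, assuming Hazelcast's default partition count of 271. Hazelcast actually serializes the key with its own serialization service and hashes the bytes with MurmurHash3; the UTF-8 conversion and hash below are illustrative stand-ins:

```java
import java.nio.charset.StandardCharsets;
import java.util.Arrays;

public class PartitionIdDemo {
    // Hazelcast's default partition count.
    static final int PARTITION_COUNT = 271;

    // partition id = hash(serialize(key)) % partition count
    static int partitionId(String key) {
        byte[] keyBytes = key.getBytes(StandardCharsets.UTF_8); // stand-in "serialize"
        int hash = Arrays.hashCode(keyBytes);                   // stand-in hash
        return Math.floorMod(hash, PARTITION_COUNT);            // non-negative result
    }

    public static void main(String[] args) {
        // The same key always maps to the same partition, and therefore
        // to the same primary replica.
        System.out.println(partitionId("user-42"));
    }
}
```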
Async Replication ▪ Each replica is updated separately. ▪ High throughput and availability
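Backup counts are configurable per data structure. A sketch against the Hazelcast 3.x Config API (the map name is illustrative): writes are acknowledged by the primary alone, and the single backup replica is updated in the background.

```java
import com.hazelcast.config.Config;
import com.hazelcast.core.Hazelcast;

public class AsyncBackupConfigDemo {
    public static void main(String[] args) {
        Config config = new Config();
        // One asynchronous backup, no synchronous backups for "orders".
        config.getMapConfig("orders")
              .setBackupCount(0)        // sync backups
              .setAsyncBackupCount(1);  // async backups
        Hazelcast.newHazelcastInstance(config);
    }
}
```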
Anti-Entropy ▪ Backup replicas can fall behind the primary. ▪ Out-of-sync backups are repaired with an active anti-entropy mechanism.
Replicas are not in sync ▪ The client reads a key from the current primary replica.
Network Partitioning ▪ The client reads the same key.
Split-Brain ▪ Strong consistency is lost.
Resolving the Divergence ▪ Merge policies: higher hits, latest update / access, … ▪ Merging may cause lost updates.
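A sketch of selecting a merge policy, assuming the Hazelcast 3.x string-based merge-policy API (the map name is illustrative): after the split-brain heals, conflicting entries keep the most recently updated value, so updates applied only on the losing side can still be lost.

```java
import com.hazelcast.config.Config;

public class MergePolicyDemo {
    public static void main(String[] args) {
        Config config = new Config();
        // Keep the most recently updated value when merging divergent entries.
        config.getMapConfig("orders")
              .setMergePolicy("com.hazelcast.map.merge.LatestUpdateMapMergePolicy");
    }
}
```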
Let’s classify this protocol with PACELC
Hazelcast is PA/EC ▪ Consistency is usually traded for availability and latency together. ▪ Hazelcast works in memory and is mostly used within a single computing cluster. ▪ The consistency-latency trade-off is minimal. ▪ PA/EC works fine for distributed caching.
Favoring Latency (PA/EL)
Scaling Reads ▪ Reads can be served locally from near caches and backup replicas.
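A sketch of both local-read options against the Hazelcast 3.x Config API (the map name is illustrative): a near cache keeps recently read entries in the local JVM, and read-backup-data lets a member answer reads from its own, possibly stale, backup replica. Both trade consistency for latency (PA/EL).

```java
import com.hazelcast.config.Config;
import com.hazelcast.config.NearCacheConfig;

public class LocalReadsDemo {
    public static void main(String[] args) {
        Config config = new Config();
        config.getMapConfig("catalog")
              .setNearCacheConfig(new NearCacheConfig("catalog")) // local JVM cache
              .setReadBackupData(true);                           // read local backups
    }
}
```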
Favoring Consistency (PC/EC)
Failure Detectors ▪ Local failure detectors rely on timeouts. ▪ Operations are blocked after the cluster size falls below a threshold.
Failure Detectors ▪ It takes some time to detect an unresponsive node. ▪ Blocking operations in the meantime minimizes divergence and maintains the baseline consistency.
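A sketch of the cluster-size threshold mentioned above, assuming the Hazelcast 3.x quorum API (the quorum name, size, and map name are illustrative):

```java
import com.hazelcast.config.Config;
import com.hazelcast.config.QuorumConfig;

public class QuorumDemo {
    public static void main(String[] args) {
        Config config = new Config();
        // Block operations on "orders" when fewer than 3 members are reachable,
        // bounding divergence while failure detection is still in progress.
        QuorumConfig quorumConfig = new QuorumConfig("atLeast3", true, 3);
        config.addQuorumConfig(quorumConfig);
        config.getMapConfig("orders").setQuorumName("atLeast3");
    }
}
```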
Isolated Failure Detectors ▪ Configure failure detectors independently for data structures ▪ Phi-Accrual Failure Detector
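A sketch of switching to the phi-accrual detector, assuming the Hazelcast 3.9+ failure-detector properties (the threshold value is illustrative): unlike a fixed "deadline" timeout, phi-accrual adapts its suspicion level to the observed heartbeat history.

```java
import com.hazelcast.config.Config;

public class PhiAccrualDemo {
    public static void main(String[] args) {
        Config config = new Config();
        // Replace the default deadline-based detector with phi-accrual.
        config.setProperty("hazelcast.heartbeat.failuredetector.type", "phi-accrual");
        config.setProperty("hazelcast.heartbeat.phiaccrual.failuredetector.threshold", "10");
    }
}
```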
CP Data Structures ▪ IDGenerator ▪ Distributed impls of java.util.concurrent.* ▪ PA/EC is not the perfect fit for CP data structures.
Flake IDs ▪ Local unique id generation ▪ Nodes get a unique node id during join. ▪ K-ordered IDs
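A usage sketch, assuming Hazelcast 3.10+ where Flake ID generators are available (the generator name is illustrative):

```java
import com.hazelcast.core.Hazelcast;
import com.hazelcast.core.HazelcastInstance;
import com.hazelcast.flakeidgen.FlakeIdGenerator;

public class FlakeIdDemo {
    public static void main(String[] args) {
        HazelcastInstance hz = Hazelcast.newHazelcastInstance();
        // IDs combine a timestamp, the member's node id (assigned at join),
        // and a local sequence, so they are unique and roughly time-ordered
        // (k-ordered) without a remote call on the hot path.
        FlakeIdGenerator idGen = hz.getFlakeIdGenerator("order-ids");
        System.out.println(idGen.newId());
        hz.shutdown();
    }
}
```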
CRDTs ▪ CRDTs: Conflict-free Replicated Data Types ▪ Replicas are updated concurrently without coordination. ▪ Strong eventual consistency ▪ Counters, sets, maps, graphs, ...
PN-Counter
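A minimal PN-Counter sketch, not Hazelcast's implementation: each replica keeps two grow-only vectors (increments and decrements) indexed by node id; the value is their difference, and merging takes the pointwise maximum, so concurrently updated replicas converge (strong eventual consistency). In Hazelcast 3.10+, hz.getPNCounter("name") exposes the built-in version.

```java
import java.util.HashMap;
import java.util.Map;

public class PNCounter {
    private final String nodeId;
    private final Map<String, Long> increments = new HashMap<>();
    private final Map<String, Long> decrements = new HashMap<>();

    PNCounter(String nodeId) { this.nodeId = nodeId; }

    // Updates touch only this replica's own slot; no coordination needed.
    void increment() { increments.merge(nodeId, 1L, Long::sum); }
    void decrement() { decrements.merge(nodeId, 1L, Long::sum); }

    long value() {
        long up = increments.values().stream().mapToLong(Long::longValue).sum();
        long down = decrements.values().stream().mapToLong(Long::longValue).sum();
        return up - down;
    }

    // Anti-entropy step: pointwise max of both vectors is commutative,
    // associative, and idempotent, so replicas converge in any merge order.
    void merge(PNCounter other) {
        other.increments.forEach((n, v) -> increments.merge(n, v, Math::max));
        other.decrements.forEach((n, v) -> decrements.merge(n, v, Math::max));
    }
}
```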
Sync Replication ▪ Concurrency primitives require true CP behavior. ▪ Paxos, Raft, ZAB, VR ▪ Re-implementing Hazelcast concurrency primitives with Raft
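This Raft work later shipped as the CP Subsystem in Hazelcast 3.12; a usage sketch assuming that release (the lock name is illustrative): primitives obtained through the CP Subsystem are replicated with Raft and stay consistent (CP) under partitions instead of available.

```java
import com.hazelcast.core.Hazelcast;
import com.hazelcast.core.HazelcastInstance;
import com.hazelcast.cp.lock.FencedLock;

public class CpSubsystemDemo {
    public static void main(String[] args) {
        HazelcastInstance hz = Hazelcast.newHazelcastInstance();
        // A linearizable, Raft-replicated lock.
        FencedLock lock = hz.getCPSubsystem().getLock("my-lock");
        lock.lock();
        try {
            // critical section
        } finally {
            lock.unlock();
        }
        hz.shutdown();
    }
}
```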
Recap ▪ http://bit.ly/hazelcast-replication-consistency ▪ http://bit.ly/hazelcast-network-partitions ▪ http://dbmsmusings.blogspot.com/2017/10/hazelcast-and-mythical-paec-system.html
Thanks! You can find me at ▪ @metanet ▪ ebkahveci@gmail.com