Lightweight Causal Cluster Consistency
Boris Koldehofe, Anders Gidenstam, Marina Papatriantafilou, and Philippas Tsigas
Outline
• Introduction
  • Collaborative environments
  • Problem definition
  • Causal Cluster Consistency
• Protocol implementing Causal Cluster Consistency
  • Framework
  • Cluster Management
  • Dissemination and Causal delivery
  • Recovery
• Results
• Conclusion and Future Work
Collaborative Environments
• Possible applications with physically distributed users:
  • Conferencing, CVEs
  • Simulation, training, entertainment
  • Administration of distributed systems (e.g. telecom, transport)
• Decentralised solution
  • Avoid a single point of failure
  • Share the load evenly
  • Scalability
• Trade-off: overhead vs. consistency
[Figure: mobile users join and leave a shared world of objects, which they create/read/modify/delete]
Defining the Problem
• Goal: support large collaborative environments
  • Provide consistency (the order of updates matters)
  • Scalable communication media
• Focus: group communication
  • Propagate events (updates) to all interested processes
  • Ordered event delivery: causal order
• Opportunities
  • Delivery with high probability is enough
  • Limited per-user domain of interest: nobody is interested in changing everything at once
  • Events have lifetimes/deadlines
  • Often more observers than updaters
Example: Collaborative Environments
• The world consists of clusters; clusters consist of objects
• Clusters represent interest
• Only a few updaters per cluster, forming the core
[Figure: a cluster and its core]
Causal Cluster Consistency
• n: a constant known by all processes
• Given a set of clusters C_1, …, C_m
  • Each cluster corresponds to a region of interest
• Processes can join and leave any cluster C_i
• A process in C_i receives the events disseminated in C_i w.h.p.
  • Events can be observed in optimistic causal order
• A dynamic non-empty subset forms the core of C_i
  • At most n processes inside a core
  • Only those processes create new events
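A minimal sketch of this model may help fix the roles; the names here (Cluster, MAX_CORE, the method names) are illustrative assumptions, not the authors' API:

```python
# Minimal sketch of the Causal Cluster Consistency model above;
# all names are illustrative assumptions, not the authors' API.

MAX_CORE = 8  # the constant n, known by all processes

class Cluster:
    def __init__(self, cluster_id):
        self.cluster_id = cluster_id
        self.readers = set()  # every interested process joins as a reader
        self.core = set()     # at most MAX_CORE processes may create events

    def join_reader(self, pid):
        self.readers.add(pid)

    def join_core(self, pid):
        # Only processes holding one of the n core slots create new events.
        if len(self.core) >= MAX_CORE:
            raise RuntimeError("core full: at most n updaters at a time")
        self.core.add(pid)
        self.readers.add(pid)

    def leave(self, pid):
        self.core.discard(pid)
        self.readers.discard(pid)
```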
Outline (continued)
• Protocol implementing Causal Cluster Consistency
  • Framework
  • Cluster Management
  • Dissemination and Causal delivery
  • Recovery
Overview: A Layered Approach
• Point-to-point communication layer: network transport service (send/receive)
• Dissemination layer: PrCast
  • Gossip protocol (disseminate/receive, recover)
  • Reader membership
• Causal layer: Cluster Consistency
  • Cluster Manager: controls concurrent updates
  • Causal delivery: ordered, predictably reliable
  • Recovery
• Application: join/leave, disseminate/receive, ordered delivery
[Figure: layer stack from the network transport service up to the application]
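Read as interfaces, the stack could look roughly like the sketch below; this is an assumed rendering of the figure, not the authors' framework:

```python
# Assumed rendering of the layer interfaces in the figure above;
# not the authors' actual framework API.
from abc import ABC, abstractmethod

class DisseminationLayer(ABC):
    """PrCast: unordered gossip dissemination, delivery w.h.p."""
    @abstractmethod
    def disseminate(self, event): ...
    @abstractmethod
    def receive(self):
        """Unordered events; some may be missing."""

class CausalLayer(ABC):
    """Cluster manager, causal delivery, and recovery on top of PrCast."""
    @abstractmethod
    def join(self, cluster_id): ...
    @abstractmethod
    def leave(self, cluster_id): ...
    @abstractmethod
    def disseminate(self, event): ...
    @abstractmethod
    def deliver(self):
        """Events in optimistic causal order, predictably reliable."""
```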
Cluster Management
• Each cluster corresponds to a process group
• Interested processes join
  • Readers (everyone): join the process group
  • Updaters (at most n at a time): form the core of the cluster
[Figure: a cluster's process group with its core]
Managing the Core
• Assign a unique identity to each process
  • Ids ∈ {0, …, n-1}
• Two processes never own the same id
  • Even in the presence of failures (stop failures, communication failures)
• Tickets (ids) of departed or failed processes are reclaimed
Cluster Management Algorithm
• Inspired by DHTs: ids form a cycle (at most n entries)
• Each process manages the entries immediately before it
• Joining
  • Contact any coordinator to join
  • Notify the successor if given an entry
  • Notify all about the new coordinator
• Failure detection via heartbeats
  • Send to the 2k + 1 closest successors
  • Receive from the 2k + 1 closest predecessors
  • If fewer than k + 1 are received, stop
[Figure: ring of ids 0 … n-1 with coordinators p1 … p4]
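A sketch of the ring arithmetic and the heartbeat stop rule, under the assumption that each coordinator knows the currently occupied ids; parameter values and helper names are illustrative:

```python
# Sketch of the ticket ring and heartbeat stop rule described above.
# N, K, and all helper names are illustrative assumptions.

N = 16   # ring size: the bound n on concurrent updaters
K = 2    # the failure-detection parameter k

def ring_distance(frm, to):
    """Clockwise distance from id frm to id to on a cycle of size N."""
    return (to - frm) % N

def closest_successors(my_id, live_ids, count):
    """The count occupied ids nearest after my_id on the ring."""
    others = [c for c in live_ids if c != my_id]
    return sorted(others, key=lambda c: ring_distance(my_id, c))[:count]

def closest_predecessors(my_id, live_ids, count):
    """The count occupied ids nearest before my_id on the ring."""
    others = [c for c in live_ids if c != my_id]
    return sorted(others, key=lambda c: ring_distance(c, my_id))[:count]

def should_stop(my_id, live_ids, heard_from):
    """Heartbeats go to the 2k+1 closest successors; if fewer than k+1
    arrive from the 2k+1 closest predecessors, assume we are cut off and
    stop, so that two processes never end up owning the same id."""
    expected = set(closest_predecessors(my_id, live_ids, 2 * K + 1))
    return len(expected & set(heard_from)) < K + 1
```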
PrCast
• Gossip-based protocol: epidemic-style dissemination
  • Good scalability and fault tolerance
  • No ordering of events provided
• Uses a dissemination scheme providing a delivery guarantee w.h.p.
  • W.h.p. = with probability 1 - O(n^(-k)), k > 1
• Only a small number of processes miss an event ⇒ only few messages require recovery
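A much-simplified push-gossip round in the spirit of PrCast; FANOUT and all names are assumptions, and membership management and termination are omitted:

```python
# Much-simplified push-gossip round in the spirit of PrCast; FANOUT and
# all names are assumptions; membership and termination are omitted.
import random

FANOUT = 3  # gossip targets per process per round (assumed)

def gossip_round(event, infected, members, send):
    """Every process that already knows `event` forwards it to FANOUT
    peers chosen at random; O(log n) such rounds reach all members w.h.p."""
    newly = set()
    for p in infected:
        peers = sorted(members - {p})
        for target in random.sample(peers, min(FANOUT, len(peers))):
            if target not in infected:
                send(p, target, event)
                newly.add(target)
    return infected | newly
```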
Causally Ordered Delivery
• Vector timestamps: one per event in the cluster
• The number of simultaneous updaters is limited ⇒ bounded number of entries in the vector timestamps
• The id assigned by the cluster manager corresponds to an entry in the vector clock
• Missing dependencies can be detected
• Deliver in causal order; skip events not recovered in time
[Figure: processes 1-7, each holding a timestamp vector]
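The deliverability test this implies can be sketched as follows, assuming each event carries the sender's core id and a vector timestamp indexed by core ids 0 … n-1 (names are illustrative):

```python
# Deliverability test for the bounded vector timestamps described above.
# Events are assumed to carry `sender` (core id) and a timestamp vector
# indexed by core ids 0..n-1; names are illustrative.

def deliverable(event_ts, sender, local_clock):
    """True when this is the next event from its sender and everything it
    causally depends on has already been delivered locally."""
    if event_ts[sender] != local_clock[sender] + 1:
        return False  # an earlier event from the same sender is missing
    return all(event_ts[j] <= local_clock[j]
               for j in range(len(event_ts)) if j != sender)

def missing_dependencies(event_ts, sender, local_clock):
    """(core id, sequence number) pairs the event depends on but we lack;
    these go into the missing-events queue for recovery."""
    gaps = []
    for j, t in enumerate(event_ts):
        last_needed = t - 1 if j == sender else t
        gaps += [(j, seq) for seq in range(local_clock[j] + 1, last_needed + 1)]
    return gaps
```

On delivery, the receiver advances local_clock[sender] by one, which may in turn make further delayed events deliverable.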
Recovery
• Some events may not be delivered by PrCast
  • These events can be detected with the help of the vector timestamps
• Two queues: delayed events and missing event ids
• A delayed event is delivered at the latest after its lifetime
  • Lifetime ≈ expected time to disseminate + expected time to recover
• A missing event is recovered if the delayed event waiting on it has a lifetime ≥ the expected time to disseminate
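A sketch of the delayed-queue bookkeeping, where is_deliverable would be the causal test from the previous sketch and the deadline encodes the lifetime; the structure is an assumption:

```python
# Sketch of the delayed-event queue and lifetime rule above; the
# structure is an assumption, and is_deliverable would be the causal
# test sketched earlier.
import time
from collections import namedtuple

Delayed = namedtuple("Delayed", ["event", "deadline"])  # deadline = arrival + lifetime

def expire_or_deliver(delayed, is_deliverable, deliver, now=None):
    """Deliver delayed events whose dependencies have arrived; an event
    whose lifetime has expired is delivered anyway, since optimistic
    causal order allows skipping events not recovered in time."""
    now = time.monotonic() if now is None else now
    still_waiting = []
    for d in delayed:
        if is_deliverable(d.event) or now >= d.deadline:
            deliver(d.event)
        else:
            still_waiting.append(d)
    return still_waiting
```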
Recovery Schemes
• Recover from the source
  + Only a small buffer is needed: the sender buffers only its own events
  + Only one message per recovery
  – The source may fail before recovery starts
  – Too many processes may contact the source
• Alternative: recover from k peers chosen at random
  • Avoids the problems above
  • Needs to buffer some of the received events
  • Suitable buffer size and k for high-probability recovery can be evaluated
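The decentralised scheme could be sketched as below; the message format and all names are assumptions:

```python
# Sketch of the decentralised recovery scheme: ask k peers chosen at
# random. Message format and names are assumptions.
import random

def request_recovery(missing_id, peers, k, send_request):
    """With w.h.p. dissemination most peers hold the event, so a small k
    suffices and the load is not concentrated on the source."""
    for peer in random.sample(sorted(peers), min(k, len(peers))):
        send_request(peer, missing_id)

def handle_recovery_request(missing_id, recent_events, reply):
    """Peers buffer some recently received events and answer if they
    still hold the requested one."""
    if missing_id in recent_events:
        reply(recent_events[missing_id])
```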
Experimental Evaluation
• Evaluate
  • Scalability: effect of the limited number of updaters
  • Reliability: effect of the recovery schemes
• "Real network" experiment
  • Self-implemented group communication framework
  • Test application running on up to 125 workstations
  • Configured for maximum throughput and stable performance
Experiments: Scalability [plots, two slides]
Experiments: Reliability [plots]
Overhead [plots]
Results
• Predictably reliable protocols and causal delivery can be combined
• The number of concurrent updaters is important for performance
  • Scalable solutions require a bound on the number of updaters
• Recovery increases the delivery rate when many events are concurrent
• Recovery fails if
  • Only few processes received the event, or
  • The recovered event arrives too late
Conclusions and Future Work
• Causal Cluster Consistency
  • Suitable for preserving optimistic causal order relations
  • Interesting for collaborative environments
  • Good, predictable delivery guarantees
  • Scalability requires a natural clustering of objects
• Recovery
  • Can increase the delivery rate
  • A good match with protocols providing delivery w.h.p.
  • Source recovery (R1) vs. decentralised recovery (R4): no real difference here; for larger systems R4 is expected to perform better
• Future work
  • Recovery for larger systems
  • Different ordering and timestamping schemes (e.g. plausible clocks)
  • Evaluating the effect on dynamic systems
Recovery Success [plot]