Distributed Systems (ICE 601) Replication & Consistency - Part 1 - PDF document

Distributed Systems (ICE 601) Replication & Consistency - Part 1 Dongman Lee ICU Class Overview • Introduction • Replication Model • Request Ordering • Consistency Models • Consistency Protocols • Case study – Lazy replication – ISIS – Transactions with Replicated Data Distributed Systems - Replication&Consistency (Part1)

Why Replication? • Purpose – increase availability, dependability and/or performance without knowledge of replica visibility • Replication transparency – hiding the replication of state in a system � active vs. primary/stand-by replicas � generic functions: active and passive replication mechanisms Logical shared resource A B C D Distributed Systems - Replication&Consistency (Part1) Replication Model • Replication model spectrum – consistency � totally synchronous model � complete synchronization among replicas � asynchronous model � asynchronous update of replicas - that is, allow temporal inconsistency among replicas � most replication models are somewhere between these two models – purpose � performance improvement � reduction of delay by caching or replicating a server near clients � availability � make the service accessible (close to 100%) in the presence of process and network failures (partition and disconnection) � fault tolerance � guarantee strictly correct behavior despite of failures (byzantine and crash) Distributed Systems - Replication&Consistency (Part1)

Replication Model (cont.) • Basic architectural model for replicated data management Requests and replies RM RM C FE Clients Front ends Service Replica C FE RM managers Distributed Systems - Replication&Consistency (Part1) Replication System Model [Weismann] • Replication protocol model – Request phase � active replication � passive replication – Server coordination � message ordering: FIFO, causal, total – Execution – Agreement coordination � necessary in database while ordering guarantee is enough for distributed systems – Client response � synchronous vs. lazy or asynchronous Phase 1: Phase 2: Phase 3: Phase 4: Phase 5: Client Server Execution Agreement Client Coordination contact coordination response client client replica 1 update replica 2 replica 3 update Distributed Systems - Replication&Consistency (Part1)

Replication System Model (cont.) • Replication model – Active replication � deterministic execution � request sent to replicas using atomic totally ordered multicast � no need of agreement – Passive replication � non-deterministic execution � view synchronization � no need of server coordination – Semi-active replication � non-deterministic execution � request sent to replicas using atomic totally ordered multicast � leader informs followers of its choice using view synchronization – Semi-passive replication � same as passive without view synchronization � allow for aggressive time-outs values and suspecting crashed processes without incurring too high cost for incorrect failure suspicions Distributed Systems - Replication&Consistency (Part1) Group System Model • Definition – a set of one or more objects, joined through a common interface, acting as a single unit for purposes of naming and function • Group types – replicate � replicated members for highly availability and/or reliability � primary/stand-by � modular redundant – partition � members executing a common job in a divided manner � e.g. highly parallel array processing – aggregate � non-replicated members sharing the provision of the service defined by group’s interface � e.g. group conferencing Distributed Systems - Replication&Consistency (Part1)

Group Communication Model • Group communication = membership service + multicast with ordering support Group address expansion Leave Group send Object Multicast Group membership group Fail agent communication management object Join Process group Object group interaction model Distributed Systems - Replication&Consistency (Part1) Membership Service • Membership service – interface for group membership changes – failure detection – membership change notification – group address expansion • View delivery – view: a list of the currently active and connected members in a group – basic requirements for view delivery (view notification) � order � If a process p delivers view v ( g ) and the v ’( g ), then no other process q = p delivers v ’( g ) before v( g ) � integrity � If process p delivers view v ( g ) then p v ( g ) � Non-triviality � If process q joins a group and is or becomes indefinitely reachable from process p = q , then eventually q is always in the views that p delivers Distributed Systems - Replication&Consistency (Part1)

Membership Service (cont.) • View-synchronous group communication – Guarantees provided by view-synchronous group communication � agreement � correct processes deliver the same set of messages in any given view � integrity � if a process p delivers message m , then it will not deliver m again � validity � correct processes always deliver the messages that they send Distributed Systems - Replication&Consistency (Part1) Membership Service (cont.) • View-synchronous group communication (cont.) a (allowed). b (allowed). p crashes p crashes p p q q r r view (q, r) view (q, r) view (p, q, r) view (p, q, r) c (disallowed). d (disallowed). p crashes p crashes p p q q r r view (q, r) view (q, r) view (p, q, r) view (p, q, r) Distributed Systems - Replication&Consistency (Part1)

Request Ordering • Why ordering is concerned? – concurrent execution of update requests at replicas may result in inconsistency among replicated data � serial equivalence of update requests is required � expense of ordering should also be considered • Ordering requirements – total ordering � requests are processed in the same order at all replicas – causal ordering � causally related requests are only ordered at all replicas – sync ordering � requests are ordered in sync before or after a certain request at all replicas Distributed Systems - Replication&Consistency (Part1) Request Ordering (cont.) • Request handling at replicas – every request is held-back until ordering constraints can be met – request is defined to be stable at a replica once no request from a client and bearing a lower unique identifier can be subsequently delivered to replica; that is, all prior requests have been processed • Request ordering implementation – group communication � ISIS – exchanging gossip messages among replicas � lazy replication Distributed Systems - Replication&Consistency (Part1)

Request Ordering (cont.) • Properties for request ordering – safety � no message will be delivered out of order from hold-back queue to processing queue � once a message has been in the processing queue, no prior request should not be in there – liveness � no message should wait indefinitely in hold-back queue Request processing Hold-back Processing queue queue Request ordering Incoming request Distributed Systems - Replication&Consistency (Part1) Request Ordering (cont.) • Total ordering implementation – requires a mechanism to uniquely sequence each request, which enables sequential ordering among messages – unique id generation � sequencer approach � request id is generated by a designated process, sequencer � every request is sent to the sequencer which assigns a unique id being incremented monotonically and forwards the request to replicas � sequencer may become performance bottleneck and point of failure � data update protocol approach � token holder sends a request with a temporary id to all replicas � each replica (site i) replies with a new id of max(temp id, id) + 1 + i/N; token holder selects largest id among proposed id from all replicas and uses it as the agreed id � token holder notifies all replicas of the final id; replica readjusts the message’s position at hold-back queue Distributed Systems - Replication&Consistency (Part1)

Request Ordering (cont.) • Causal ordering implementation – requires a mechanism to enable causally related requests to be ordered – vector timestamp approach � all replicas p i initializes VT i (vector time) to zeros � when p i generates a new event, it increments VT i [I] by 1; it attaches the value vt = VT i on outgoing messages � when p j handles a request with timestamp vt , it updates it vector clock such as Vt j = merge (Vt j , vt) Distributed Systems - Replication&Consistency (Part1)

Distributed Systems (ICE 601) Replication & Consistency - Part 1 - PDF document

Distributed Systems (ICE 601) Replication & Consistency - Part 1 Dongman Lee ICU Class Overview Introduction Replication Model Request Ordering Consistency Models Consistency Protocols Case study Lazy

Distributed Systems (ICE 601) Distributed Transactions Dongman Lee ICU Class Overview

Ice streams, shear margins, and glacier stick-slip motion v (m/yr) 4000 1000 300 75 20 5 a

Water, Steam, and Ice the temperatures of the ice and the water compare? The ice is colder than

Ice Streams und Isbr 1 Ice Streams: Causes for Rapid Ice Flow Fast ice flow is controlled by

Distributed Systems (ICE 601) Concurrency Control - Part2 Dongman Lee ICU Class Overview

Distributed Systems (ICE 601) Fault Tolerance Dongman Lee ICU Class Overview Introduction

world of In Inter Ic Ice-Pump JAN 2016 Presentation of Inter Ice-Pump 1 Inter Ice-Pump ApS //

ICE Analysis Training Program Module 2: How to Establish the ICE Analysis Geographical Boundary

Ice and Stride [ a ] Common User Complaints Common User Complaints Difficult to Ice Specific

TRICKLE ICE TRICKLE ICE draft-ietf-mmusic-trickle-ice Emil Ivov, Eric Rescorla, Justin Uberti

Ice sheets with rapid basal sliding Ian Hewitt, University of Oxford Antarctic ice sheet Ice

Distributed Systems (ICE 601) Transactions & Concurrency Control - Part1 Dongman Lee ICU

Distributed Systems (ICE 601) Replication & Consistency - Part 3 Dongman Lee ICU Class

Distributed Systems (ICE 601) Replication & Consistency - Part 2 Dongman Lee ICU Class

ICE Analysis Training Program Module 6: Presentation of ICE Analysis in the Environmental

Greg Miller EVP Market Development & Sales gmiller@ice-energy.com, 970-227-9406 Discover the

Verteilte Systeme (Distributed Systems) Karl M. Gschka Karl.Goeschka@tuwien.ac.at

Solving Atomic Broadcast Eden : a Consensus Based Group Communication System p.1/ ?? Solving

From Distributed Logs to Database Replication Dr. Samuel Benz How to achieve scalability, fault

Computer Networks M Group issues and policies Antonio Corradi Academic year 2015/2016 Groups

Important Lessons Lamport & vector clocks both give a logical timestamps Total

Design and Validation of Cloud Storage Systems using Maude Peter Csaba Olveczky University

G-DUR A Middleware for Assembling, Analyzing, and Improving Transactional Protocols Masoud Saeida

Failures and Consensus Failures and Consensus Coordination Coordination If the solution to

Distributed Systems (ICE 601) Replication & Consistency - Part 1 - PDF document

Distributed Systems (ICE 601) Replication & Consistency - Part 1 Dongman Lee ICU Class Overview Introduction Replication Model Request Ordering Consistency Models Consistency Protocols Case study Lazy

Distributed Systems (ICE 601) Distributed Transactions Dongman Lee ICU Class Overview

Ice streams, shear margins, and glacier stick-slip motion v (m/yr) 4000 1000 300 75 20 5 a

Water, Steam, and Ice the temperatures of the ice and the water compare? The ice is colder than

Ice Streams und Isbr 1 Ice Streams: Causes for Rapid Ice Flow Fast ice flow is controlled by

Distributed Systems (ICE 601) Concurrency Control - Part2 Dongman Lee ICU Class Overview

Distributed Systems (ICE 601) Fault Tolerance Dongman Lee ICU Class Overview Introduction

world of In Inter Ic Ice-Pump JAN 2016 Presentation of Inter Ice-Pump 1 Inter Ice-Pump ApS //

ICE Analysis Training Program Module 2: How to Establish the ICE Analysis Geographical Boundary

Ice and Stride [ a ] Common User Complaints Common User Complaints Difficult to Ice Specific

TRICKLE ICE TRICKLE ICE draft-ietf-mmusic-trickle-ice Emil Ivov, Eric Rescorla, Justin Uberti

Ice sheets with rapid basal sliding Ian Hewitt, University of Oxford Antarctic ice sheet Ice

Distributed Systems (ICE 601) Transactions &amp; Concurrency Control - Part1 Dongman Lee ICU

Distributed Systems (ICE 601) Replication &amp; Consistency - Part 3 Dongman Lee ICU Class

Distributed Systems (ICE 601) Replication &amp; Consistency - Part 2 Dongman Lee ICU Class

ICE Analysis Training Program Module 6: Presentation of ICE Analysis in the Environmental

Greg Miller EVP Market Development &amp; Sales gmiller@ice-energy.com, 970-227-9406 Discover the

Verteilte Systeme (Distributed Systems) Karl M. Gschka Karl.Goeschka@tuwien.ac.at

Solving Atomic Broadcast Eden : a Consensus Based Group Communication System p.1/ ?? Solving

From Distributed Logs to Database Replication Dr. Samuel Benz How to achieve scalability, fault

Computer Networks M Group issues and policies Antonio Corradi Academic year 2015/2016 Groups

Important Lessons Lamport &amp; vector clocks both give a logical timestamps Total

Design and Validation of Cloud Storage Systems using Maude Peter Csaba Olveczky University

G-DUR A Middleware for Assembling, Analyzing, and Improving Transactional Protocols Masoud Saeida

Failures and Consensus Failures and Consensus Coordination Coordination If the solution to

Distributed Systems (ICE 601) Transactions & Concurrency Control - Part1 Dongman Lee ICU

Distributed Systems (ICE 601) Replication & Consistency - Part 3 Dongman Lee ICU Class

Distributed Systems (ICE 601) Replication & Consistency - Part 2 Dongman Lee ICU Class

Greg Miller EVP Market Development & Sales gmiller@ice-energy.com, 970-227-9406 Discover the

Important Lessons Lamport & vector clocks both give a logical timestamps Total