Handling Nondeterminism in Multi-Tiered Distributed Systems Joseph - PowerPoint PPT Presentation

Handling Nondeterminism in Multi-Tiered Distributed Systems Joseph Slember Priya Narasimhan Electrical & Computer Engineering Department Carnegie Mellon University Pittsburgh, PA

Carnegie Mellon Motivation Consistent state-machine replication requires determinism � � Any two deterministic replicas should reach the same final state if � They start from the same initial state and � Execute the same ordered sequence of operations � Even if the replicas run on completely different machines Challenges � � Many primary (first-hand) sources of nondeterminism � System calls, multithreading, …… � Nondeterminism can “propagate” through invocations and responses in a distributed multi-tier, multi-client application Research question � � How do we live with nondeterminism in a multi-client, multi-tier distributed system, without compromising replication? 2 Joe Slember Handling Nondeterminism in Multi-Tier Distributed Systems

Carnegie Mellon The Problem � Multi-tier setting � End-to-end operation spanning all (server) tiers � Client � Server 1 � Server 2 � ………….. � Server n � Forward (downstream) path of invocations � Client � Server 1 � Server 2 � ………….. � Server n � Backward (upstream) path of replies � Client � Server 1 � Server 2 � ………….. � Server n � Nondeterminism in any tier can “contaminate” other tiers � Forward nondeterminism – on the invocation path � Backward nondeterminism – on the reply path � Multiple clients can aggravate this further � Clients’ operations can intermingle and execute concurrently at each tier 3 Joe Slember Handling Nondeterminism in Multi-Tier Distributed Systems

Carnegie Mellon Just How “Ugly” Can It Get? Or the Multi-Tier, Multi-Client Problem Forward nondeterministic state in each tier can diverge in state Replicas in each tier Client 1 Client 2 Replicated Replicated Replicated Tier 4 Tier 3 Tier 2 Backward nondeterministic state in each tier 4 Joe Slember Handling Nondeterminism in Multi-Tier Distributed Systems

Carnegie Mellon Objectives � Consistent server replication in the face of � Any kind of nondeterminism at a server tier � Forward propagation of nondeterminism across tiers � Backward propagation of nondeterminism across tiers � Multiple clients causing concurrency side-effects at server tiers � Failures (loss of a replica) at any of the server tiers � Efficiency in addressing only the nondeterminism that matters � Programmer intent must be respected � Retain the application-level semantics that the programmer desires � Example: Uphold any concurrency programmed into the application 5 Joe Slember Handling Nondeterminism in Multi-Tier Distributed Systems

Carnegie Mellon Our Approach � Midas: Synergistic combination of � Compile-time analysis with runtime compensation � Compile-time static analysis � (Currently) targets application-level nondeterminism � Requires access to application source-code � Flags nondeterminism that will cause replica divergence � Tracks the propagation of nondeterminism � Inserts code to perform compensation � Runtime compensation � Two possible techniques to restore consistency � Transfer of nondeterministic checkpoints � Re-execution of inserted code 6 Joe Slember Handling Nondeterminism in Multi-Tier Distributed Systems

Carnegie Mellon Taxonomy of Nondeterminism – I Pure (or first-hand) nondeterminism � Originating (primary) source of nondeterministic execution � random(), gettimeofday(), …. � Must directly touch the persistent state that matters for replication � Shared state among threads Contaminated (or second-hand) nondeterminism � Persistent state that has any dependency on pure nondeterministic state � Example for (int j = 0; j < 100; j++ ) { foo[ j ] = random(); bar[ j + 100 ] = foo[ j ]; } 7 Joe Slember Handling Nondeterminism in Multi-Tier Distributed Systems

Carnegie Mellon Taxonomy of Nondeterminism – II Superficial nondeterminism � Potentially nondeterministic execution that does not ultimately lead to divergence in persistent state across replicas � Nondeterministic functions that do not touch persistent state � System calls that appear to be nondeterministic but do not affect consistent replicated state, upon further examination � “Shared” state between threads, where each thread only operates on its individual and distinct piece of the state Superficial nondeterminism does not matter for consistent replication! Pure determinism � Persistent state that has neither any dependency on pure nondeterminism nor represents pure nondeterminism in itself for (int j = 0; j < 100; j++ ) bar[ j ] = bar[ j ] + 10; 8 Joe Slember Handling Nondeterminism in Multi-Tier Distributed Systems

Carnegie Mellon Midas’ Static-Analysis Framework – I � Front-end of a compiler � Source-code analyzer and regenerator � Control-flow and data-flow analyses to determine the extent to which nondeterminism has pervaded the application code � Custom-built for analyses of various kinds � Nondeterminism analysis – presence/type/amount of nondeterminism � Concurrency analysis – thread-level interactions and interleaving � Dependency analysis – dependencies across clients/servers � Forward nondeterminism � Backward nondeterminism 9 Joe Slember Handling Nondeterminism in Multi-Tier Distributed Systems

Carnegie Mellon Midas’ Static-Analysis Framework – II � (Currently) works for C, C++ and Java distributed applications � Converts all source-code to annotated intermediate representation � Similar to an AST (abstract syntax tree) � Intermediate representation is amenable to our analyses � “Nondeterminism dictionary” � 262 system calls � read , write , gettimeofday , etc. � 163 library functions within C/C++ standard I/O, memory and machine- dependent OS libraries 10 Joe Slember Handling Nondeterminism in Multi-Tier Distributed Systems

Carnegie Mellon Midas for Multi-Tier Architectures � Midas’ program analysis used to analyze the architecture � To extract dependencies between tiers � To extract effects on state within each tier � Architecture across tiers broken down into compensation-tier pairs � Consider each tier in conjunction with its immediate communicating tiers � Compensation of nondeterminism can then be performed in a scalable way � Architecture at each tier broken down into tier-centric slivers � Consider execution within each tier in terms of blocks (“slivers”) of code � Each sliver encapsulates a basic unit of forward/backward nondeterminism at that tier � Allows for easier compensation 11 Joe Slember Handling Nondeterminism in Multi-Tier Distributed Systems

Carnegie Mellon Tier-Centric Slivers Forward sliver � An incoming request from an upstream tier 1. Some post-request processing that might lead to execution and state 2. changes An outgoing (nested) request to some downstream tier 3. Backward sliver � Incoming replies for requests sent in the previous step 4. Some post-reply processing that might lead to additional execution and 5. state changes An outgoing reply to the upstream tier that issued the request in step 1 6. Possible nested behavior where steps 3, 4 and 5 repeat � � Yields multiple forward slivers and one backward sliver 12 Joe Slember Handling Nondeterminism in Multi-Tier Distributed Systems

Carnegie Mellon Compensation Tier-Pairs � Replicas in each tier need to know which state is actually used by the adjacent tiers with which they communicate � If the replicas of tier A make a downstream request to tier B, which replica’s request was chosen by tier B? � Consider an operation C � T1 � T2 � T3 � T4 � Possible compensation tier-pairs: (C, T1), (T1, T2), (T2, T3) and (T3, T4) � A tier can be in more than one pair, e.g., tier T2 � Group into forward and backward compensation tier-pairs � Forward compensation tier-pairs encapsulate forward slivers’ communication � Backward compensation tier-pairs encapsulate backward slivers’ communication 13 Joe Slember Handling Nondeterminism in Multi-Tier Distributed Systems

Carnegie Mellon Midas’ Compensation Techniques � Technique #1: Checkpoint-to-compensate � Track all first-hand and second-hand nondeterminism � Nondeterministic checkpoint consists of the tracked information � Technique #2: Reexecute-to-compensate � Track only first-hand nondeterminism � Execute inserted code to regenerate second-hand nondeterministic state, given the tracked (first-hand) information as input � Totally ordered, reliable multicast messages between tiers � How does compensation happen at runtime? � Tier T1 issues a request to Tier T2 � T2’s replicas track nondeterminism and piggyback it to reply to T1 � T1 sends an asynchronous callback to T2’s replicas with choice of T2 replica and that replica’s nondeterminism � T2’s replicas copy received nondeterministic information onto their state � Re-execute, if technique #2 is being used; otherwise, nothing to do 14 Joe Slember Handling Nondeterminism in Multi-Tier Distributed Systems

Carnegie Mellon Putting It All Together 1 st 2 nd foo() { bar() { Forward State a = random(); e = random(); b = a + 5; f = a + 5; 1 st 2 nd Backward State bar(); } c = gettimeofday(); Forward Request d = c * 60; Reply } � Fwd Callback T3:R1 T2:R1 Bwd Callback Client � � T3:R2 Tier 1 T2:R2 Tier 3 Tier 2 15 Joe Slember Handling Nondeterminism in Multi-Tier Distributed Systems

Handling Nondeterminism in Multi-Tiered Distributed Systems Joseph - PowerPoint PPT Presentation

Handling Nondeterminism in Multi-Tiered Distributed Systems Joseph Slember Priya Narasimhan Electrical & Computer Engineering Department Carnegie Mellon University Pittsburgh, PA Carnegie Mellon Motivation Consistent state-machine

Nondeterministic Finite Automata Nondeterminism Subset Construction 1 Nondeterminism A

Multi-Tiered System of Support (MTSS) December 4, 2018 What is MTSS? Multi-Tiered System of

Material Handling Chapter 5 Designing material handling systems Overview of material

Type Safe Nondeterminism A Formal Semantics of Java Threads Andreas Lochbihler University of

Revisiting: Algebraic laws for nondeterminism and concurrency Matthew Hennessy Milner-Symposium,

BU CS 332 Theory of Computation Lecture 3: Reading: Nondeterminism Sipser Ch 1.1 1.2

Balancing Fairness and Efficiency in Tiered Storage Systems with Bottleneck-Aware Allocation Hui

Tiered Rates Tiered Rates Cathedral City Cathedral City City City Council Study Session

CPSC 875 CPSC 875 John D McGregor John D. McGregor Ocarina Tiered Tiered Ocarina Ocarina

Frogmoor Concept Design Options 3 Concept Design Options: 1) Traditional tiered fountain 2)

Multi-Tiered Systems of Support The MTSS Framework for Brunswick County Schools Board of

Self-Care for Parents 1 Welcome! LD East Parent and Community LD East Multi-Tiered Systems of

UNIFIED SCHOOL DISTRICT OF MARSHFIELD Multi-Tiered System of Support Update What is MTSS?

Multi-Tiered System of Supports Local Control and Accountability Plan LCAP Meeting November 16,

Multi-Tiered Systems of Support: Interventions and Assessment Strategies Karen Karp Johns

Powerpoint Presentation On Manual Handling Powerpoint Presentation On Manual Handling We proudly

Conceptual Models to Practical Implementations Dr Peter Popov Centre for Software Reliability

File and Metadata Replication in XtreemFS Bjrn Kolbeck Zuse Institute Berlin File and Metadata

OceanSt Stor or S2600 Main Slides www.huawei.com HUAWEI TECHNOLOGIES CO., LTD. Contents

Distributed Systems Principles and Paradigms Chapter 11 (version October 15, 2007 ) Maarten van

DISTRIBUTED SYSTEMS II REPLICATION CNT. II The Quorum consensus method for Replication To

High-speed Checkpointing for High Availability Brendan Cully brendan@cs.ubc.ca Department of

Tim OMahony Technical Support # Previouslyin Global Distributed Perforce Dont do

protoDUNE-SP Data Quality Monitoring Maxim Potekhin (BNL) ProtoDUNE-SP Data Exploitation