An Empirical Study of High Availability in Stream Processing Systems - PowerPoint PPT Presentation

IBM Research An Empirical Study of High Availability in Stream Processing Systems Yu Gu, Zhe Zhang , Fan Ye, Hao Yang, Minkyong Kim, Hui Lei, Zhen Liu 12/3/2009

IBM Research Stream Processing Model software operators ∆ ∞ (PEs) ∩ ∫ Ω ∑ … … subjob … …  Unexpected machine failures … – Loss of data and internal state deployment machines – Disruption to normal processing  Challenge: how to preserve data / state and minimize disruption? 2 An Empirical Study of High Availability in DSPS 12/3/2009

IBM Research Existing approaches: vs. Passive Standby Active Standby Passive Standby Active Standby ∆ ∆ ∆ ∩ ∩ ∩ ∑ ∑ ∑ ∆ ∆ ∆ ∩ ∩ ∩ ∑ ∑ ∑ 3 An Empirical Study of High Availability in DSPS 12/3/2009

IBM Research Basic Tradeoff between AS and PS  Active Standby – Overhead: double processing load; at least double message load – Recovery delay: almost zero  Passive Standby – Overhead: checkpoint messages – Recovery delay: failure detection + deploy new job + recover state 4 12/3/2009

IBM Research Motivation  Tradeoffs of AS & PS not fully understood – Only systematic comparison: [Hwang ICDE05] • Used a variant of PS with high overhead • Evaluated in simulations rather than real systems  Our contributions – A sweeping checkpointing method • Reducing checkpoint overhead by one order of magnitude • Proof of consistency – A real prototype distributed stream processing system – Comprehensive and empirical evaluation of AS and PS 5 An Empirical Study of High Availability in DSPS 12/3/2009

IBM Research Outline  Background and Motivation  Design and Implementation – Sweeping Checkpointing – System Architecture  Performance Evaluation  Related Work  Conclusions 6 An Empirical Study of High Availability in DSPS 12/3/2009

IBM Research Overview of Sweeping Checkpointing  What to include recoverable from – Input queues upstream output queues – Internal states dominating ckpt size – Output queues with high data rates  When to trim  Checkpointing Multiple PEs  Proof of consistency 7 An Empirical Study of High Availability in DSPS 12/3/2009

IBM Research When to Trim In U’s output queue, only removing those packets that have been processed and checkpointed by D upstream downstream node U node D √ ≡ 5 4 3 2 1 1 1 1 8 An Empirical Study of High Availability in DSPS 12/3/2009

IBM Research When to Trim In U’s output queue, only removing those packets that have been processed and checkpointed by D upstream downstream node U node D √ ≡ 5 4 3 2 1 1 1 9 An Empirical Study of High Availability in DSPS 12/3/2009

IBM Research When to Trim In U’s output queue, only removing those packets that have been processed and checkpointed by D upstream downstream node U node D √ ≡ 5 4 3 2 2 1 2 1 10 An Empirical Study of High Availability in DSPS 12/3/2009

IBM Research When to Trim In U’s output queue, only removing those packets that have been processed and checkpointed by D checkpoint upstream downstream node U node D ≡ √ ≡ 5 4 3 2 2 1 2 2 1 1 11 An Empirical Study of High Availability in DSPS 12/3/2009

IBM Research When to Trim In U’s output queue, only removing those packets that have been processed and checkpointed by D ≡ 2 1 checkpoint upstream downstream node U node D √ ≡ 5 4 3 2 2 1 2 1 1 and 2 have been processed and checkpointed 12 An Empirical Study of High Availability in DSPS 12/3/2009

IBM Research Checkpointing Multiple PEs – Synchronous  Freeze all PEs, then snapshot of the checkpoint all state, whole sub job then resume all PEs checkpoint manager CM CM ∆ ∆ ∩ ∩ ≡ ≡ ≡ √ √ √ ∑ ∑ sub job 1 sub job 2 Site 2 Site 1 13 An Empirical Study of High Availability in DSPS 12/3/2009

IBM Research Checkpointing Multiple PEs – Individual  Freeze / checkpoint / resume each PE individually checkpoint manager CM CM ∆ ∆ ∩ ∩ ≡ ≡ √ √ ∑ ∑ sub job 1 sub job 2 Site 2 Site 1 14 An Empirical Study of High Availability in DSPS 12/3/2009

IBM Research Checkpointing Multiple PEs – Sweeping  Checkpoint a PE immediately after receipt of acknowledgement and output queue trimming checkpoint manager CM CM ∆ ∆ ∩ ∩ ≡ ≡ √ √ ∑ ∑ sub job 1 sub job 2 Site 2 Site 1 15 An Empirical Study of High Availability in DSPS 12/3/2009

IBM Research Sketch of Proof for Consistency  Scenario: single node failure (N i ) only trimmed to reflect – Actions for recovery latest checkpoint of N i • Recovering operator state • Recovering input queue from output queues of upstream • Reprocessing affected elements  Scenario: multiple concurrent node failures – Actions for recovery • Finding and recovering most upstream failed node • Recovering other nodes recursively 16 An Empirical Study of High Availability in DSPS 12/3/2009

IBM Research System Architecture  Remote Execution Coordinator – manage HA protection for distributed jobs  Node Job Management – manage job deployment  Checkpoint Manager FM REC – manage checkpoint tasks according to assigned checkpoint mechanism CM  Failover Manager JMN monitor other nodes and initiate recovery –  Jobs and Processing Nodes – take data from upstream, execute processing tasks, and send results to downstream Job ∩  Features: – A distributed job consists of multiple subjobs, each of which can choose its own specific HA mechanism (AS, PS) ∆ ∑ – The system coordinates the deployment and protection of subjobs among all machines Job 17 An Empirical Study of High Availability in DSPS 12/3/2009

IBM Research Outline  Background and Motivation  Design and Implementation  Performance Evaluation – Experiment Setup – Overhead and Delay Results  Related Work  Conclusions 18 An Empirical Study of High Availability in DSPS 12/3/2009

IBM Research Experiment Setup  Testbed: a cluster environment – Dual Xeon 3.06GHz CPUs, 800MHz, 512KB L2 caches, 4GB memory, 80GB disk – 1Gbps LAN – A distributed job containing 4 subjobs, each having 2 processing nodes running on one machine  Metrics – Recovery delay – Message overhead 19 An Empirical Study of High Availability in DSPS 12/3/2009

IBM Research Avg. Checkpoint Queue Size Comparison 3000 elements/second Sweeping reduces checkpoint size by about 96% 20 20 An Empirical Study of High Availability in DSPS 12/3/2009 12/3/2009

IBM Research Checkpoint Time Comparison checkpoint interval = 500 ms Sweeping reduces checkpoint time by about 75% 21 21 An Empirical Study of High Availability in DSPS 12/3/2009 12/3/2009

IBM Research Message Overhead Comparison AS-AS incurs almost 4 times message overhead vs. PS 22 22 An Empirical Study of High Availability in DSPS 12/3/2009 12/3/2009

IBM Research Recovery Delay Decomposition Detection delay becomes dominant with large heartbeat interval 23 23 An Empirical Study of High Availability in DSPS 12/3/2009 12/3/2009

IBM Research Outline  Background and Motivation  Design and Implementation  Performance Evaluation  Related Work  Conclusions 24 An Empirical Study of High Availability in DSPS 12/3/2009

IBM Research Related Work  Borealis 1. “ Fault tolerance in the Borealis distributed stream processing system ” (SIGMOD ‘05) A variant of AS  Achieving flexible trade-off between availability and consistency by introducing tentative data  concept 2. “ Fast and reliable stream processing over wide area networks ” (ICDE ’07) A variant of AS  Most expensive variant; upstream sending to all downstream replicas  No switch required when failure occurs  3. “ A cooperative, self-configuring high-availability solution for stream processing ” (ICDE ‘07) A variant of PS  Novel checkpoint scheduling and backup assignment  Balances recovery load over multiple servers  4. “ Borealis-R: a replication-transparent stream processing system for wide-area monitoring applications ” (SIGMOD ‘08) A variant of AS  Same technique as in [2]  Novel mechanism to allow replicas execute without coordination but still produce consistent  results 25 25 An Empirical Study of High Availability in DSPS 12/3/2009 12/3/2009

IBM Research Related Work  System S 5. “Towards automatic fault recovery in System-S” (ICAC ‘07) Checkpoint state  Recovery of JMN, not jobs  6. “Failure recovery in cooperative data streaming analysis” (ARES ’07) How to select a backup site on demand, not recovery technique  7. “Online failure forecast for fault-tolerant data stream processing” (ICDE ‘08) Prediction of potential failures, a monitoring technique  Leverages varies system metrics (system productivity, available CPU, etc.) to predict failures  before they occur  Comparison of AS and PS 8. “High-availability algorithms for distributed stream processing” (ICDE ‘05) Valuable summaries of basic tradeoffs  PS variant has large overhead  Evaluation mainly based on simulations  26 26 An Empirical Study of High Availability in DSPS 12/3/2009 12/3/2009

An Empirical Study of High Availability in Stream Processing Systems - PowerPoint PPT Presentation

IBM Research An Empirical Study of High Availability in Stream Processing Systems Yu Gu, Zhe Zhang , Fan Ye, Hao Yang, Minkyong Kim, Hui Lei, Zhen Liu 12/3/2009 IBM Research Stream Processing Model software operators (PEs)

Drupal High Availability High Performance Samstag, 3. November 12 Drupal High Availability

for High Availability Martin Thompson - @mjpt777 What Is High Availability ?

? sync ref chosen as sync source by Listener Stream B: Presentation Stream C: timestamps

Chapter 4: Implementing High Availability and Redundancy in a Campus Network CCNP-RS SWITCH

Stream Ciphers Stream Ciphers 1 Stream Ciphers Generalization of one-time pad Trade

Availability Knob Flexible User-Defined Availability in the Cloud Mohammad Shahrad and David

Extending CSP with tests for availability Gavin Lowe Extending CSP with tests for availability

Phase III Stream Assessment Study: Potential Stream Restoration Projects Strawberry Run and

Contents Introduction Basic Model High Availability, Scalable Storage, Availability

High Availability with the openais project Prepared by: Steven Dake October 2005 Agenda

High Availability with the openais project Prepared by: Steven Dake 7/12/05 Agenda Service

Functional Principal Component Analysis May 14, 2018 Empirical Principal Component FPC for the

Fresh water stream ecosystem Gr ov p 2 The description of stream lives Quadrat 1: Hong Kong Newt

UPLOAD VIDEOS TO MICROSOFT STREAM VIA ACCESSUH To upload a video on Microsoft Stream, go to

Assessing stream and riparian conditions Stream Habitat Assessment Conducted yearly

CS162: Introduction to Computer Science II Streams 1 Streams A stream is a flow of data

Time-Dependent Density Functional Theory and Real-Time Dynamics of Fermi Superfluids Aurel

B U I L D I N G & C O O L I N G S I N G A P O R E I N A N E R A O F C L I M AT E C H A

Accelerator Modeling Through High Performance Computing Z. Li NERSC,LBNL Advanced Computations

Together in the Sandbox: Board and Staff Relationships Welcome! The webinar will begin at 10:00

iRODS Tutorial II. Data Grid Administration iRODS Tutorial Preview I. iRODS

How to Rank Your Website on Page #1 of Google SEARCH ENGINE OPTIMISATION (SEO) Equinet Academy

Some Design Considerations (and a Few Matrix Implementation Details) in PETSc, the Portable,

Goal: Let MPI survive partial system failure 2 Managed by UT-Battelle for the Department of

An Empirical Study of High Availability in Stream Processing Systems - PowerPoint PPT Presentation

IBM Research An Empirical Study of High Availability in Stream Processing Systems Yu Gu, Zhe Zhang , Fan Ye, Hao Yang, Minkyong Kim, Hui Lei, Zhen Liu 12/3/2009 IBM Research Stream Processing Model software operators (PEs)

Drupal High Availability High Performance Samstag, 3. November 12 Drupal High Availability

for High Availability Martin Thompson - @mjpt777 What Is High Availability ?

? sync ref chosen as sync source by Listener Stream B: Presentation Stream C: timestamps

Chapter 4: Implementing High Availability and Redundancy in a Campus Network CCNP-RS SWITCH

Stream Ciphers Stream Ciphers 1 Stream Ciphers Generalization of one-time pad Trade

Availability Knob Flexible User-Defined Availability in the Cloud Mohammad Shahrad and David

Extending CSP with tests for availability Gavin Lowe Extending CSP with tests for availability

Phase III Stream Assessment Study: Potential Stream Restoration Projects Strawberry Run and

Contents Introduction Basic Model High Availability, Scalable Storage, Availability

High Availability with the openais project Prepared by: Steven Dake October 2005 Agenda

High Availability with the openais project Prepared by: Steven Dake 7/12/05 Agenda Service

Functional Principal Component Analysis May 14, 2018 Empirical Principal Component FPC for the

Fresh water stream ecosystem Gr ov p 2 The description of stream lives Quadrat 1: Hong Kong Newt

UPLOAD VIDEOS TO MICROSOFT STREAM VIA ACCESSUH To upload a video on Microsoft Stream, go to

Assessing stream and riparian conditions Stream Habitat Assessment Conducted yearly

CS162: Introduction to Computer Science II Streams 1 Streams A stream is a flow of data

Time-Dependent Density Functional Theory and Real-Time Dynamics of Fermi Superfluids Aurel

B U I L D I N G &amp; C O O L I N G S I N G A P O R E I N A N E R A O F C L I M AT E C H A

Accelerator Modeling Through High Performance Computing Z. Li NERSC,LBNL Advanced Computations

Together in the Sandbox: Board and Staff Relationships Welcome! The webinar will begin at 10:00

iRODS Tutorial II. Data Grid Administration iRODS Tutorial Preview I. iRODS

How to Rank Your Website on Page #1 of Google SEARCH ENGINE OPTIMISATION (SEO) Equinet Academy

Some Design Considerations (and a Few Matrix Implementation Details) in PETSc, the Portable,

Goal: Let MPI survive partial system failure 2 Managed by UT-Battelle for the Department of

B U I L D I N G & C O O L I N G S I N G A P O R E I N A N E R A O F C L I M AT E C H A