State Machine Replication for the Masses with BFT-SM A R T Hsin-Yang - PowerPoint PPT Presentation

State Machine Replication for the Masses with BFT-SM A R T Hsin-Yang Huang Chih-shang Chen

Outline 1.Introduction 2.BFT-SMART Design 3.Implementation 4.Alternative Configuration 5.Evaluation 6.Lessons Learned 7.Conclusion

Introduction Reason: 1.PBFT’s architecture does not fully exploit modern hardware 2.UpRight exhibits a performance significantly lower than other systems. Characteristic: 1.Java-based 2.high-performance and correctness 3.support reconfigurations of the replica

Design principles ● Tunable fault model ○ non-malicious Byzantine-faults ○ malicious Byzantine-faults ○ Simplified SMR protocol ● Simplicity ○ emphasis on protocol correctness ○ avoid optimizations that could bring extra complexity

Design principles ● Modularity ○ uses a well defined consensus primitive in its core ○ easy to implement and reason about

Design principles ● Simple and extensible application programming interface ○ Provide simple API such us invoke(command) and execute(command) ○ Implemented using a set of alternative calls, callbacks or plug-ins (if API not support some methods) ● Multi-core Awareness ○ Take advantage of multi-core architecture of servers

System model Configuration: 1.n ≥ 3f+1 to tolerate Byzantine faults 2.n ≥ 2f+1 to tolerate Crash faults 3.reconfigure replicas at runtime Links: 1.message authentication code(MAC) over TCP/IP 2.Symmetric keys for replica-replica channel 3.Optional signed request for client-replica channels.

Core protocol ● Total order multicast ○ During normal execution, clients send their requests to all replicas and wait for their replies ○ Total order is achieved through consensus protocol

Core protocol (con’t) State Transfer ● to log batches of operations in a single disk ● take snapshots at different points of the execution in different replicas ● perform state transfer in a collaborative way

Core protocol (con’t) Reconfiguration: ● Initiated by View Manager client ● Must be signed with a special private key ● View Manager sends a special message to the replica that is waiting to be added or removed from the system informing the replica.

Implementation 1.Staged message processing 2.Bounded queue Netty thread ● Check unordered or ordered request ● Verify client’s request Proposer thread ● Assemble a batch of requests ● Transmitting the PROPOSE message Sender thread ● Serialize message and produce a MAC ● Send it using TCP sockets

Implementation Receiver thread ● Deserialize message ● Put it on the inqueue Message processor thread ● Fetch messages from the inqueue ● Process message if they belong to current consensus stage ● Put finished decided batch on decide queue Delivery thread ● Remove request on client queue ● Invoke service replica to generate replies

Implementation Reply thread ● Fetch request from reply queue ● Send it back to client Request timer thread ● Activated periodically to verify If some requests remained more Than a predefined time.

Alternative Configurations 1. Crash Fault Tolerance (CFT) Every node that do not give a reply is assumed to be in a crashed state. Tolerance: f < n/2 (simple minority) Sol => bypass WRITE step 2. Malicious Byzantine Faults Malicious leader to lasuch undetectable attacks. Sol => periodic leader changes

Evaluation 1. Raw throughput and Latency 2. Performance in different systems 3. The performance of a BFT-SMART-based system when withstanding faults and reconfiguration.

Raw Throughput and Latency

Raw Throughput and Latency Result 1: CFT setup is always better than BFT Result 2: Payload size increases -> BFT-SMART performance decreases

Raw Throughput and Latency

Performance in Different System

Performance of BFT-SMART-based System Replica 0~3 Replica 1 becomes new leader Replica 3 exits Replica 0~4 Replica 0 recovers

Lessons Learned 1. BFT in Java 2. How To Test BFT 3. Dealing with Heavy Load 4. Maintenance & Robustness

Lessons Learned 1. BFT in Java a. Easy to use b. Feasible implementation of secure software Notice: Need to be used carefully! 2. How To Test BFT a. Test on JUnit b. Identify the malicious behaviors => carefully analyze c. How to inject code for malicious behaviors on replicas => AOP or simple commented code

Lessons Learned 3. Dealing with Heavy Load a. Late f replicas in message processing (cuz only needs n-f to progress) b. non-Ordered requirements c. Thrashing: dropping down throughput under heavy load 4. Maintenance & Robustness a. Complex but completed

Core protocol ● Total order multicast ○ During normal execution, clients send their requests to all replicas and wait for their replies ○ Total order is achieved through consensus protocol

Lessons Learned 3. Dealing with Heavy Load a. Late f replicas in message processing (cuz only needs n-f to progress) b. non-Ordered requirements c. Thrashing: dropping down throughput under heavy load 4. Maintenance & Robustness a. Complex but completed

Conclusions 1. This paper mainly report the process and results in building BFT-SMART library. 2. Describing how to implement the protocol in a safe and efficient way.

Thanks for Listening

State Machine Replication for the Masses with BFT-SM A R T Hsin-Yang - PowerPoint PPT Presentation

State Machine Replication for the Masses with BFT-SM A R T Hsin-Yang Huang Chih-shang Chen Outline 1.Introduction 2.BFT-SMART Design 3.Implementation 4.Alternative Configuration 5.Evaluation 6.Lessons Learned 7.Conclusion Introduction

Asynchronous Replication and Bayou Asynchronous Replication and Bayou Asynchronous Replication

August 23, 2012 Data Replication/ETL: Terms Data Replication : Data Replication is the process of

MySQL Replication Tutorial Mats Kindahl Senior Software Engineer Replication Technology Lars

Asynchronous Replication and Bayou Asynchronous Replication and Bayou Jeff Chase CPS 212, Fall

Asynchronous Replication and Bayou Asynchronous Replication and Bayou Jeff Chase CPS 212, Fall

New features in MySQL Replication Lars Thalmann, Development Manager, Replication & Backup

Todays Topics - Chapter 15 Slide 1 performance enhancement Replication Replication of

Reasoning About Replication: State Machine Approach & Chain Replication Partial slides

Galera Replication Synchronous Multi-Master Replication for InnoDB ...well, why not for any other

Replication and Migration Background, Requirements and Strawman Migration and Replication

Consistency and Replication Chi Zhang czhang@cs.fiu.edu Object Replication (1) Organization of

DRBD 9 Linux Storage Replication Lars Ellenberg LINBIT HA Solutions GmbH Vienna, Austria

Scaling State Machine Replication Fernando Pedone University of Lugano (USI) Switzerland State

Time, Clocks, and State Machine Replication Dan Ports, CSEP 552 Todays question How

Fault Tolerance via the State Machine Replication Approach Favian Contreras Implementing

in Tashkent CSEP 545 Transaction Processing Sameh Elnikety Replication for Performance

Deep Learning in Smart Spaces Markus Loipfinger Advisor(s): Marc-Oliver Pahl, Stefan Liebald

Migrating code with SmaCC John Brant brant@refactoryworkers.com Migration Strategy

http://ars.userfriendly.org/cart oons/?id=20080627 CS 152: Programming Language Paradigms Blocks

Example: List filter -> (val ns (List withAll: (1 2 3 4 5))) List( 1 2 3 4 5 ) -> (ns

Privacy in the Smartphone Age Di Ma NSF US/Mid-East Workshop on Trustworthiness in Emerging

Cyber Security and Privacy Issues in Smart Grids Acknowledgement: Slides by Hongwei Li from

Smart programming languages, smart program analysis Varmo Vene Institute of Cybernetics at TUT

Cloud Tutorial: AWS IoT CSE 520S Spring, Jan. 16, 2020 Ruixuan Dai XaaS: Basics in Cloud

State Machine Replication for the Masses with BFT-SM A R T Hsin-Yang - PowerPoint PPT Presentation

State Machine Replication for the Masses with BFT-SM A R T Hsin-Yang Huang Chih-shang Chen Outline 1.Introduction 2.BFT-SMART Design 3.Implementation 4.Alternative Configuration 5.Evaluation 6.Lessons Learned 7.Conclusion Introduction

Asynchronous Replication and Bayou Asynchronous Replication and Bayou Asynchronous Replication

August 23, 2012 Data Replication/ETL: Terms Data Replication : Data Replication is the process of

MySQL Replication Tutorial Mats Kindahl Senior Software Engineer Replication Technology Lars

Asynchronous Replication and Bayou Asynchronous Replication and Bayou Jeff Chase CPS 212, Fall

Asynchronous Replication and Bayou Asynchronous Replication and Bayou Jeff Chase CPS 212, Fall

New features in MySQL Replication Lars Thalmann, Development Manager, Replication &amp; Backup

Todays Topics - Chapter 15 Slide 1 performance enhancement Replication Replication of

Reasoning About Replication: State Machine Approach &amp; Chain Replication Partial slides

Galera Replication Synchronous Multi-Master Replication for InnoDB ...well, why not for any other

Replication and Migration Background, Requirements and Strawman Migration and Replication

Consistency and Replication Chi Zhang czhang@cs.fiu.edu Object Replication (1) Organization of

DRBD 9 Linux Storage Replication Lars Ellenberg LINBIT HA Solutions GmbH Vienna, Austria

Scaling State Machine Replication Fernando Pedone University of Lugano (USI) Switzerland State

Time, Clocks, and State Machine Replication Dan Ports, CSEP 552 Todays question How

Fault Tolerance via the State Machine Replication Approach Favian Contreras Implementing

in Tashkent CSEP 545 Transaction Processing Sameh Elnikety Replication for Performance

Deep Learning in Smart Spaces Markus Loipfinger Advisor(s): Marc-Oliver Pahl, Stefan Liebald

Migrating code with SmaCC John Brant brant@refactoryworkers.com Migration Strategy

http://ars.userfriendly.org/cart oons/?id=20080627 CS 152: Programming Language Paradigms Blocks

Example: List filter -&gt; (val ns (List withAll: (1 2 3 4 5))) List( 1 2 3 4 5 ) -&gt; (ns

Privacy in the Smartphone Age Di Ma NSF US/Mid-East Workshop on Trustworthiness in Emerging

Cyber Security and Privacy Issues in Smart Grids Acknowledgement: Slides by Hongwei Li from

Smart programming languages, smart program analysis Varmo Vene Institute of Cybernetics at TUT

Cloud Tutorial: AWS IoT CSE 520S Spring, Jan. 16, 2020 Ruixuan Dai XaaS: Basics in Cloud

New features in MySQL Replication Lars Thalmann, Development Manager, Replication & Backup

Reasoning About Replication: State Machine Approach & Chain Replication Partial slides

Example: List filter -> (val ns (List withAll: (1 2 3 4 5))) List( 1 2 3 4 5 ) -> (ns