P2P Systems: Gossip Protocols CS 6410 By Alane Suhr & Danny - PowerPoint PPT Presentation

P2P Systems: Gossip Protocols CS 6410 By Alane Suhr & Danny Adams 1

Outline ❖ Timeline ❖ CAP Theorem ❖ Epidemic algorithms for replicated database maintenance ❖ Managing update conflicts in Bayou, a weakly connected replicated storage system ❖ Conclusion 2 A

Timeline 1978 1982 1985 1987 1990 1995 1998 Lamport Lamport FLP Demers Schneider Terry Lamport Time, Clocks, The Byzantine Impossibility of Epidemic Implementing Managing The part-time and the Generals Distributed algorithms for fault-tolerant update conflicts parliament services using Ordering of Problem Consensus with replicated in Bayou, a the state Events in a One Faulty database weakly machine Distributed Process maintenance connected approach: A System replicated tutorial storage system 3 A

CAP ● Consistency -- all nodes contain the same state ● Availability -- requests are responded to promptly ● Partition ○ part of a system completely independent from the rest of the system ○ ideally should maintain itself autonomously ● Partition tolerance -- system can stay online and functional even when message passing fails 4 A

CAP Theorem ● Paxos: prioritize consistency given a network partition ● Gossip: prioritize availability Paxos given a network partition & Gossip 5 A

Gossip 6 D

Gossip Overview ❏ Authors ❏ Motivations ❏ Epidemic Models ❏ Direct Mail ❏ Anti-Entropy ❏ Rumor mongering ❏ Evaluation ❏ DC’s ❏ Spatial Distribution 7 D

A u t Carl Hauser Alan Demers Dan Greene h PhD Cornell Cornell PARC Washington University Research o State University r Scott Shenker s Doug Terry EECS Amazon Web Berkeley Services 8 D

Motivations ● Unreliable network ● Unreliable nodes ● CAP: *AP* ○ always be able to respond to a (read/write) request ○ eventual consistency 9 D

Epidemic Models 10 A

Proposers and Acceptors ● Proposer ○ In Paxos: clients propose an update to the database ○ Epidemic model: a node infects its neighbors ● Acceptor ○ In Paxos: acceptor accepts an update based on one or more proposals ○ Epidemic model: a node is infected by a neighbor 11 A

Types of Epidemics ❖ Direct Mail ❖ Anti-Entropy ❖ Rumor Mongering A 12

Advantages ➢ Simple algorithms ➢ High Availability ➢ Fault Tolerant ➢ Tunable ➢ Scalable ➢ Works in Partition 13 A

● Notify all neighbors of an update ● Timely and reasonably efficient ● n messages per update Direct Mail 14 D

Direct Mail 15 D

Direct Mail 16 D

Direct Mail Messages sent: O(n) where n is number of neighbors Not fault tolerant -- doesn’t guarantee eventual consistency High volume of traffic with site at the epicenter 17 D

Anti-Entropy ❏ Site chooses random partner to share data ❏ Number of rounds til consistency: O(log n) ❏ Sites use custom protocols to resolve conflicts ❏ Fault tolerant 18 A

Anti-Entropy 19 A

Anti-Entropy 20 A

Anti-Entropy 21 A

Anti-Entropy 22 A

Anti-Entropy 23 A

Anti-Entropy 24 A

Anti-Entropy 25 A

Anti-Entropy 26 A

Anti-Entropy What happens next? 27 A

Mechanism: Push & Pull 28 D

Push vs. Pull Push Pull {A, B} {A, C} {A, B} {A, C} {A, B} {A,B,C} {A, B, C} {A, C} 29 D

{A, B} {A, C} What is Push-Pull? 30 {A, B, C} {A,B,C} D

Propagation times of Push vs. Pull Push: P i+1 = P i e -1 2 Pull: P i+1 = P i Pull is faster!! P= Probability node hasn’t received update after the i th round 31 D

Rumor Mongering 1. Sites choose a random neighbor to share information with 2. Transmission rate is tuneable 3. How long new updates are interesting is also tuneable 4. Can use push or pull mechanisms 32 A

Rumor Mongering Complexity ● O(ln n) rounds leads to consistency with high probability ● Push requires O(n ln n) transmissions until consistency ● Further proved lower bound for all push-pull transmissions: 0(n ln ln n) Karp et al 2000. Randomized rumor spreading. In FOCS. 33 A

Analogy to epidemiology ● Susceptible: site does not know an update yet ● Infective: actively sharing an update ● Removed: updated and no longer sharing Rumor mongering: nodes go from susceptible to infective and eventually (probabilistically) to removed 34 A

Rumor mongering 35 A

Rumor mongering A 41 A

Rumor mongering Pros: Cons: ● ● Fast A site can potentially miss an ● Low call on resources update ● Fault-Tolerant ● Less traffic 42 A

Backups Anti-entropy can be used to ● “update” the network regularly after direct mail or rumor mongering If inconsistency found in ● anti-entropy, run the original algorithm again 43 D

Death Certificates ❖ How are items deleted using epidemic models? 44 D

I DON’T like I like Bread Bread! I like orange juice 45 D

Death Certificates ❖ How to remove items from epidemic model? ❖ Drawbacks ➢ Space ➢ Increases traffic ➢ DC Can be lost ❖ Dormant death certificates & retention 46 D

Evaluating Epidemic Models ➢ Residue: remaining susceptibles when epidemic finishes Traffic: ➢ ➢ Delay: ○ T avg : Average time between start of outbreak and arrival of update @ given site ○ T last : Delay until last update 47 D

Spatial Distribution Helping Or Hurting 48 A

Convergence Times and Traffic ● Linear network: anti entropy ○ Nearest-neighbors ■ O(n) convergence ■ O(1) traffic ○ Random connections ■ O(log(n)) convergence ■ O(n) traffic 49 A

Optimizations for realistic network distributions ● Select connections from list of neighbors sorted by distance ● Treat network as linear ● Compute probabilities based on position in list 50 A

Rumor Mongering Non-Standard Distribution ● Increase k -- number of rounds a rumor is “interesting” ● Use push-pull 51 A

Takeaways ● Availability >> consistency ● Updates can be expensive ● Distribution protocols should be robust ● Network design can hurt overall performance ● Byzantine Behavior not addressed Questions? 52 A

Additional Reading Managing update conflicts in Bayou, a weakly connected replicated storage system 1995 53 D

● Weak consistency makes unstable network applications possible ● Developing good interfaces allows for complex functions like merging to be interchangeable via the application 54 D

Timeline 1978 1982 1985 1987 1990 1995 1998 Lamport Lamport FLP Demers Schneider Terry Lamport Time, Clocks, The Byzantine Impossibility of Epidemic Implementing Managing update The part-time and the Generals Distributed algorithms for fault-tolerant conflicts in parliament services using Bayou, a weakly Ordering of Problem Consensus with replicated the state connected Events in a One Faulty database machine replicated Distributed Process maintenance approach: A storage system System tutorial 55 D

What is Bayou? ● Storage system designed for mobile computing ○ Network is not stable ○ Parts of the network may not be connected all the time ○ Goal: high availability ○ Guarantees weak consistency 56 D

Write (unique ID) Client Anti-Entropy Server Read Request Server Client Data Bayou System Diagram 57 D

Consistent Replicas ● Writes are first tentative ● Eventually they are committed, ordered by time ● Clients can tell whether writes are stable (committed) ● Primary servers deal with committing updates 58 A

Detecting and Resolving Conflicts ● Dependency checks ● Merge procedures ● Described by the clients, application-dependent 59 A

Conclusions ● Distributed systems need a form of consensus ● Effectively choosing the correct consensus model for a system has to be weighed carefully with the attributes of the system 60 A

Acknowledgements Content Inspired by: Ki Suh Lee: “Epidemic Techniques”[2009] Eugene Bagdasaryan: “P2P Gossip Protocols” [2016] Photos www.pixabay.com www.unsplash.com www.1001freedownloads.com/free-cliparts 61 A

P2P Systems: Gossip Protocols CS 6410 By Alane Suhr & Danny - PowerPoint PPT Presentation

P2P Systems: Gossip Protocols CS 6410 By Alane Suhr & Danny Adams 1 Outline Timeline CAP Theorem Epidemic algorithms for replicated database maintenance Managing update conflicts in Bayou, a weakly connected replicated

Gossip and Self-Stabilization Lonnie Princehouse CS 5412 February 28, 2012 Gossip Protocols

CS5412/LECTURE 12 Ken Birman GOSSIP PROTOCOLS CS5412 Spring 2019

CS5412: USING GOSSIP TO BUILD OVERLAY NETWORKS Lecture XX Ken Birman Gossip and Network

Ken Birman i Cornell University. CS5410 Fall 2008. Gossip 201 Last time we saw that gossip

P2P-NEXT EUROPEAN UNION FRAMEWORK 7 PROJECT WWW.P2P-NEXT.ORG Johnathan Ishmael

Middleware for Gossip Protocols Michael Chow and Robbert van Renesse Cornell University Mo:va:on

P2P Combinatorial Optimization Amir H. Payberah (amir@sics.se) P2P Combinatorial Optimization, 13

Heterogeneous Gossip Davide Frey Rachid Guerraoui Anne-Marie Kermarrec Boris Koldehofe Maxime

Gossip Gossip pping in pp pp pping in p g g Bolo Bolo ogna ogna Ozalp Ba Ozalp Ba

CS5412: BIMODAL MULTICAST ASTROLABE Lecture XIX Ken Birman Leiden; Dec 06 Gossip 201 2

Hierarchical Spatial Gossip for Hierarchical Spatial Gossip for Multi- -Resolution

Balancing Gossip Exchanges in Networks with van Renesse and Firewalls L. Rodrigues

Epistemic Gossip Protocols Krzysztof R. Apt CWI Based on joint work with Wiebe van der Hoek

Distributed Gossip Protocols Krzysztof R. Apt CWI and University of Amsterdam Based on joint

Distributed Adaptive Systems (DAS) Unit Self-organising P2P Antonio Bucchiarone Fondazione Bruno

Bar Gossip Aleksander Jurkowski University of Warsaw aj262944@students.mimuw.edu.pl October 18,

Linear Systems of Equations I Example Solve x 1 + 2 x 2 + x 3 = 1 x 1 + 3 x 2

Distributed Systems: Ordering and Consistent Cuts by Maofan (Ted) Yin my428@cornell.edu Time,

Ordering and Consistent Cuts Edward Tremel 11/7/2013 Synchronizing Distributed Systems

(6) a. ERG agreement ABS agreement (not encoded in (3)) SUBJ OBJ [!] F ROM S YNTAX TO E

Intervention tracks scope-taking (in Japanese and English) Michael Yoshitaka Erlewine Hadas

A model selection algorithm for mixture experiments including process variables Hugo Maruri and

Part 10: Vector Space Classification Francesco Ricci 1 Content p Recap on nave Bayes p

Basic Elec. Engr Basic Elec. Engr. Lab . Lab ECS 204 ECS 204 Asst. Prof. Dr. Prapun Suksompong