a content propagation metric for efficient content
play

A Content Propagation Metric for Efficient Content Distribution Ryan - PowerPoint PPT Presentation

A Content Propagation Metric for Efficient Content Distribution Ryan S. Peterson ! , Bernard Wong ! , and Emin Gn Sirer ! Department of Computer Science, Cornell University School of Computer Science, University of Waterloo !


  1. A Content Propagation Metric for Efficient Content Distribution Ryan S. Peterson † ! , Bernard Wong ‡ ! , and Emin Gün Sirer † ! † Department of Computer Science, Cornell University ‡ School of Computer Science, University of Waterloo ! United Networks, LLC August 18, 2011

  2. Content Distribution cache servers origin server end users

  3. BW in Client-Server cache servers origin server end users

  4. BW in Peer-to-Peer cache servers origin server end users in swarms

  5. BW in Antfarm cache servers origin server end users in swarms

  6. Goal cache servers Efficiently use all available bandwidth origin server end users in swarms

  7. Problem Definition • The general multi-swarm content distribution problem • given: hosts, swarms, and swarm memberships • find: allocation of each host’s upload bandwidth among its swarms that maximizes system-wide bandwidth

  8. Approach New metric that steers hosts toward a globally efficient allocation of resources Enables each host to measure its impact on each swarm and adjust its bandwidth allocations accordingly

  9. Approach New metric that steers hosts toward a globally efficient allocation of resources Content Propagation Metric

  10. Outline Evaluation The CPM V-Formation

  11. Benefit of a Block s 1 s 2 p p ’s choice: upload the next block to s 1 or s 2 ? Which swarm will benefit more?

  12. Determining Benefit • What block p uploads • Distribution of blocks in the swarms • Sizes of the swarms • Network conditions among peers • The direct recipient of p ’s block Use history to predict the future

  13. Intuition s 1 s 2 p Measure how “fast” p ’s blocks propagate in each swarm Use the result as an estimate of the benefit that the swarms derive from p ’s blocks

  14. Content Propagation Metric s 1 s 2 p 17/ τ 9/ τ Block propagation bandwidth: rate that an uploaded block propagates in a fixed time interval τ CPM: rolling average of a peer’s recent block propagation bandwidths for a swarm

  15. Using the CPM • Each host measures random uploaded blocks to maintain a CPM value for each swarm • Hosts upload to swarms with the largest CPM values when faced with competing requests • Hosts proactively probe new swarms and swarms with stale CPM values

  16. CPM Case Study s 1 s 2 CPM time

  17. CPM Case Study s 1 s 2 p 1 p 2 competition for CPM block propagation time

  18. CPM Case Study s 1 s 2 p 1 p 2 bandwidth from cache time

  19. CPM Overview • Identifies neediest swarms • Easy to measure • Can allocate bandwidth from a single server • Accounts for interference from competing hosts

  20. Outline Evaluation The CPM V-Formation

  21. V-Formation • Based on our hybrid architecture • A logically centralized coordinator provides efficient bookkeeping • A token protocol enables the coordinator to track blocks and monitor peers

  22. Coordinator • Measures swarm dynamics • tracks block transfers based on spent tokens • Computes peers’ CPM values • periodically sends updates to peers • Provides accountability • detects and blocks misbehaving peers

  23. Wire Protocol Goals • Track block transfers among peers • Disseminate CPM values and peer lists • Enforce peer behavior

  24. Wire Protocol s 1 coordinator join s 1 peerlist

  25. Wire Protocol s 1 tokens coordinator get tokens

  26. Wire Protocol s 1 coordinator want block block

  27. Wire Protocol s 1 coordinator token deposit tokens

  28. Wire Protocol s 1 coordinator coordinator’s state b 1 time

  29. Wire Protocol s 1 CPM value coordinator coordinator’s state announce b 1 time

  30. Coordinator Design stores membership info, processor processor processor processor propagation data, and CPMs distributed, shared state web server web server web server continuously process block propagation data handle peer requests, record block propagation data

  31. Coordinator State • Soft state stored in memcached • Swarm: peers, number of blocks • Peers: addr, swarms, block propagation bandwidths, CPMs • Blocks: swarm, propagation graph with timestamped, peer-identified nodes • Updated via atomic CAS operations

  32. Outline Evaluation The CPM V-Formation

  33. Evaluation • Built and deployed V-Formation as a video- sharing service called FlixQ • Uses the CPM to achieve high performance • Coordinator scales to large deployments

  34. Experimental Setup • Coordinator on Amazon EC2 • 380 peers on PlanetLab with realistic bandwidth capacities • 200 swarms based on IMDb movie popularities and sizes • 20% of peers belong to multiple swarms • 2 caches with different subsets of content

  35. End-to-End Performance BitTorrent Antfarm V-Formation 6000 aggregate bandwidth (KB/s) 4500 3000 1500 0 0 400 800 1200 1600 time (s)

  36. Performance of Heuristics V-Formation Largest swarm Global rarest Random Smallest swarm 8000 aggregate bandwidth (KB/s) 6000 4000 2000 0 0 1000 2000 3000 time (s)

  37. Scalability bandwidth state size 500 30 25 coordinator bandwidth (KB/s) 400 coordinator state (MB) 20 300 15 200 10 100 5 0 0 0 2500 5000 7500 10000 number of peers

  38. Related Work • Content Distribution Networks • Antfarm, Akamai, CoBlitz, CoDeeN, ECHOS, Coral, Slurpie, YouTube, Hulu, GridCast, Tribler, Joost, Huang et al. 2008, Freedman et al. 2008, ... • P2P Swarming • BitTorrent, BitTyrant, PropShare, BitTornado, BASS, Annapureddy et al. 2007, Guo et al. 2005, Pouwelse et al. 2005, Zhang et al. 2011, OneSwarm, ... • Incentives and microcurrencies • Dandelion, BAR Gossip, Samsara, Karma, SHARP , PPay, Kash et al. 2007, Levin et al. 2009, iOwe, ...

  39. Conclusions • New hybrid approach for efficient bandwidth allocation • Decentralized metric enables hosts to measure their global benefit • Centralized implementation drives hosts toward globally efficient use of resources http://flixq.com

Recommend


More recommend