John S. Otto, Mario A. Sanchez, David R. Choffnes*, Fabián E. Bustamante, Georgos Siganos** Northwestern, EECS * U. Wash, CSE ** Telefónica Research http://aqualab.cs.northwestern.edu
2 Otto, Sánchez, Choffnes, Bustamante & Siganos On Blind Mice and the Elephant
A large, global peer-to-peer system � Millions of users exchanging content � Virtually every country in the world � 3 Otto, Sánchez, Choffnes, Bustamante & Siganos On Blind Mice and the Elephant
� System’s measured network impact depends on measurement vantage point – How much of network traffic is from BitTorrent? Germany 37% (ipoque) Eastern Europe No, Germany is 9-15% 57% (ipoque) (Maier et al. IMC’09) South America 20% (ipoque) 4 Otto, Sánchez, Choffnes, Bustamante & Siganos On Blind Mice and the Elephant
� A view from a broad set of end users – To sample its overall network traffic – Understand where it flows – Who pays for it (and how expensive it is) � This work – Relies on end users as vantage points • Captures a sample of all BitTorrent traffic • Reveals traffic’s path through the network – Public view is not sufficient to map most BitTorrent traffic – ISP data provides context to understand cost of BitTorrent traffic 5 Otto, Sánchez, Choffnes, Bustamante & Siganos On Blind Mice and the Elephant
� Representative sample of users – 500,000 users, 3,300 networks, 169 countries � Running extensions (Ono & NEWS) for Vuze BitTorrent client – Anonymously report statistics – Provide application-level data • e.g. session length, per-connection transfer volumes • Log 13 TB of traffic per day – Conduct active measurements to reveal traffic paths • With public view alone, we can map 25% of traffic • Supplemented with traceroutes, we can map 89% 6 Otto, Sánchez, Choffnes, Bustamante & Siganos On Blind Mice and the Elephant
� How BitTorrent is being used – Who is using BitTorrent? – When do people run BitTorrent? – How much traffic does it generate? – Study data from Nov. 2008 to Nov. 2010 � Where the generated traffic flows � Who pays for it and how much 7 Otto, Sánchez, Choffnes, Bustamante & Siganos On Blind Mice and the Elephant
Overall population reduced by 10% � Locations of users change over time � Connected peers EU 52% EU 61% by continent 2009 2010 AF 1% AF 2% OC 2% OC 3% SA 4% SA 4% NA 20% AS 13% NA 19% AS 19% Decrease in Europe � Increase in Asia, Africa and Oceania � 8 Otto, Sánchez, Choffnes, Bustamante & Siganos On Blind Mice and the Elephant
Rate of growth of connected users per continent relative to Nov. 2008 Europe continues to drop � N. America, S. America remain stable since 2009 � 76% growth in Africa and 47% in Asia � 9 Otto, Sánchez, Choffnes, Bustamante & Siganos On Blind Mice and the Elephant
European peers seen on weekdays Normalized number Normalized % of hourly peers seen 100 of peers seen per hour in Europe, 80 depending on time of day 60 40 20 2009 0 06 08 10 12 14 16 18 20 22 00 02 04 06 Shift away from overnight use � Peak usage aligns with evening hours, local time � – Potential impact on ISPs’ costs under burstable billing 10 Otto, Sánchez, Choffnes, Bustamante & Siganos On Blind Mice and the Elephant
European peers seen on weekdays Normalized number Normalized % of hourly peers seen Normalized % of hourly peers seen 100 100 of peers seen per hour in Europe, 80 80 depending on time of day 60 60 40 40 2009 20 20 2010 2009 0 0 06 08 10 12 14 16 18 20 22 00 02 04 06 06 08 10 12 14 16 18 20 22 00 02 04 06 Shift away from overnight use � Peak usage aligns with evening hours, local time � – Potential impact on ISPs’ costs under burstable billing 11 Otto, Sánchez, Choffnes, Bustamante & Siganos On Blind Mice and the Elephant
Hourly Peer Download Volume (MB) 160 140 Per-peer hourly 120 100 download volume 80 (in MB) over the 60 last year 40 20 0 Nov ’09 Jan ’10 Mar ’10 May ’10 Jul ’10 Sep ’10 Nov ’10 25% increase in per-peer hourly download volume � Despite a 20% drop in total connections, � a 12% increase in overall system traffic 12 Otto, Sánchez, Choffnes, Bustamante & Siganos On Blind Mice and the Elephant
� Overall population reduced by 10% – But large increase in Africa and Asia � Peak usage aligns with evening hours � 12% increase in overall system traffic – 25% increase in per-peer hourly download volume So where’s the traffic? 13 Otto, Sánchez, Choffnes, Bustamante & Siganos On Blind Mice and the Elephant
� How “deep” does traffic go in the network? � Who is paying for it? � Traffic path analysis to see which networks carry most BitTorrent traffic – Tier 1: Well-known networks – Tier 2: Large transit providers – Tier 3: Small transit providers – Tier 4: Content/access/hosting providers Enterprise customers Tiers based on Dhamdhere and Dovrolis, IMC 2008 14 Otto, Sánchez, Choffnes, Bustamante & Siganos On Blind Mice and the Elephant
Tier 1 Tier 2 Tier 3 Tier 4 1 Smaller 0.9 0.8 fraction Fraction of each 0.7 CDF [X ≤ x] of traffic peer’s traffic that 0.6 reaches Tier X 0.5 0.4 0.3 0.2 0.1 0 0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1 Proportion of Traffic Most traffic stays at or below Tier 3 � Significant fraction of traffic never reaches Tiers 1 or 2 � – Typically missed by in-network monitoring studies from the core 15 Otto, Sánchez, Choffnes, Bustamante & Siganos On Blind Mice and the Elephant
Tier 1 Tier 2 Tier 3 Tier 4 1 0.9 0.8 Fraction of each 0.7 CDF [X ≤ x] peer’s traffic that 0.6 reaches Tier X 0.5 0.4 0.3 0.2 0.1 0 0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1 Proportion of Traffic Most traffic stays at or below Tier 3 � Significant fraction of traffic never reaches Tiers 1 or 2 � – Typically missed by in-network monitoring studies from the core 16 Otto, Sánchez, Choffnes, Bustamante & Siganos On Blind Mice and the Elephant
Traffic from Tier 2 to Tier 2 Traffic from Tier 3 to Tier 3 1 1 0.9 0.9 0.8 0.8 0.7 0.7 CDF [X ≤ x] 0.6 0.6 0.5 0.5 0.4 0.4 0.3 0.3 Tier 1 0.2 0.2 Tier 1 Tier 2 0.1 0.1 Tier 2 Tier 3 0 0 0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1 0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1 Proportion of Traffic Proportion of Traffic Traffic generally stays in the originating tier � Tier 2 networks do not provide “intermediate” level of � connectivity between Tiers 1 & 3 17 Otto, Sánchez, Choffnes, Bustamante & Siganos On Blind Mice and the Elephant
� How BitTorrent is being used � Where the generated traffic flows – Most traffic is handled at or below Tier 3 � Who pays for it and how much 18 Otto, Sánchez, Choffnes, Bustamante & Siganos On Blind Mice and the Elephant
� Determine BitTorrent cost relative to other traffic – ISP X’s data provides context to interpret traffic sample � Study at granularity of individual network links � Consider common burstable billing model – e.g. 95 th -percentile billing � Data for several Providers ISP A ISP B of ISP X’s links over 1 week ISP X ISP C ISP G Customers 19 Otto, Sánchez, Choffnes, Bustamante & Siganos On Blind Mice and the Elephant
Aggregate link volume for each 5 minute bin � Cost is based on 95 th -percentile bin’s value � Under burstable billing model, not all bytes may have the � same cost – Peak-hour bytes are more expensive than off-peak When value is defined 95 th -percentile value All Traffic 20 Otto, Sánchez, Choffnes, Bustamante & Siganos On Blind Mice and the Elephant
BitTorrent peaks at 3AM BitTorrent peaks at 9PM Other Other BT BT BitTorrent at peak hour is more expensive � Use Shapley value to determine relative cost of BitTorrent � – Shapley value gives the cost contribution of BitTorrent traffic – Compare to other traffic on the network – Is BitTorrent’s cost more than its “fair share” by volume? 21 Otto, Sánchez, Choffnes, Bustamante & Siganos On Blind Mice and the Elephant
BitTorrent peaks at 3AM BitTorrent peaks at 9PM BitTorrent’s contribution to cost Other BT BT Other Other BT Other BT BitTorrent at peak hour is more expensive � Use Shapley value to determine relative cost of BitTorrent � – Shapley value gives the cost contribution of BitTorrent traffic – Compare to other traffic on the network – Is BitTorrent’s cost more than its “fair share” by volume? 22 Otto, Sánchez, Choffnes, Bustamante & Siganos On Blind Mice and the Elephant
Additional cost of BitTorrent traffic, percent above relative cost of 1 ISP A ISP B ISP X BitTorrent traffic is generally more � expensive than other traffic What traffic characteristics result in ISP C ISP G � high relative cost? 23 Otto, Sánchez, Choffnes, Bustamante & Siganos On Blind Mice and the Elephant
Out-of-phase peaks Aligned peaks Small peaks Large peaks High relative cost of BitTorrent � – Large coefficient of variation (“C.V.”, size of peaks in BitTorrent traffic) – Small cross-correlation offset (“X-corr”, alignment with overall traffic) 24 Otto, Sánchez, Choffnes, Bustamante & Siganos On Blind Mice and the Elephant
Recommend
More recommend