measuring the impact of sharing abuse data with web
play

Measuring the Impact of Sharing Abuse Data with Web Hosting - PowerPoint PPT Presentation

Measuring the Impact of Sharing Abuse Data with Web Hosting Providers Marie Vasek , Matthew Weeden, and Tyler Moore University of Tulsa WISCS 24 October 2016 1 of 27 StopBadware Founded in 2006 by Harvards Berkman Klein Center for


  1. Measuring the Impact of Sharing Abuse Data with Web Hosting Providers Marie Vasek , Matthew Weeden, and Tyler Moore University of Tulsa WISCS 24 October 2016 1 of 27

  2. StopBadware • Founded in 2006 by Harvard’s Berkman Klein Center for Internet and Society • Now housed at the University of Tulsa • Provides independent reviews of websites appearing on 3 malware blacklists 3 of 27

  3. Review Requests for Individual URLs 4 of 27

  4. Review Requests for Bulk URLs 5 of 27

  5. Research Questions Does sending bulk reports help? • Short term: ◦ Do reported URLs get cleaned up? ◦ Which URLs are more likely to get cleaned up? • Long term: ◦ Do ASes get better at cleaning URLs after receiving bulk reports? 6 of 27

  6. Overview • Brief overview of study • Define metrics • Direct impact of sharing abuse data • Indirect impact of sharing abuse data • Conclusions 7 of 27

  7. Bulk Requests over Time 5000 # URLs shared 500 50 5 1 2010 2011 2012 2013 2014 2015 Date shared 8 of 27

  8. Summary Statistics • Google Safebrowsing Data used exclusively • 6 year time frame (2010 - 2015) • 69 stakeholders requested reports • 41 web hosting providers in our study ◦ Responsible for entire AS ◦ Sent Google Safebrowsing Data ◦ Had at least a month of data before/after • 28 548 URLs reported 9 of 27

  9. Malware Cleanup Metrics • Clean ◦ Off the blacklist ◦ Stays off for 3 weeks • Recompromise ◦ A previously blacklisted URL is clean and then is reblacklisted 10 of 27

  10. Measuring Direct and Indirect Impact of Reporting • Direct Impact ◦ Are the URLs we shared cleaned up? • Indirect Impact ◦ Are networks “better” after receiving a bulk review from StopBadware? • Do they clean malware URLs faster? • Do they clean malware URLs more effectively? 11 of 27

  11. Measurement Timeline blacklist to clean blacklist to report report to clean blacklisted reported clean 12 of 27

  12. Cleanup of URLs Shared with ASes URLS shared with ASes 1.0 Pr(report to clean days >= X) 0.8 0.6 0.4 0.2 0.0 1 5 50 500 Report to Clean (days) 13 of 27

  13. Measurement Timeline blacklist to clean blacklist to report report to clean blacklisted reported clean 14 of 27

  14. Long Lived Malware Takes Longer to Clean Median Report to Clean (Days) [Bar] Blacklist to Report (Days) [Line] 1000 500 ● 800 400 ● 300 600 ● 400 200 ● 100 200 ● ● ● ● ● 0 0 ● 0− 10− 20− 30− 40− 50− 60− 70− 80− 90− 10% 20% 30% 40% 50% 60% 70% 80% 90% 100% Decile for Blacklist to Report (Days) 15 of 27

  15. Pre- vs. Post-Contact Cleanup Survival probability before and after contact 1.0 pre−contact Pr(blacklist to clean days >=X) post−contact 0.8 0.6 0.4 0.2 0.0 1 2 5 10 50 200 Blacklist to Clean (days) 16 of 27

  16. Pre- vs. Post-Contact Cleanup: Improved AS 17 of 27

  17. Pre- vs. Post-Contact Cleanup: Worsened AS 18 of 27

  18. Pre- vs. Post-Contact Cleanup: Unclear effect AS 19 of 27

  19. Change in Metrics Pre- and Post- Sharing # ∆ days to clean ∆ recomp. rate Improved 13 58 0.010 Worsened 3 -176 0.085 Unclear 17 13 0.008 20 of 27

  20. Comparing Change in Metrics by AS Median recompromise rate pre−sharing − post−sharing 0.15 ● 0.10 ● 0.05 ● 0.00 ● −0.05 ● ● Top Quartile Report to Clean 2nd Quartile Report to Clean −0.10 3rd Quartile Report to Clean Bottom Quartile Report to Clean ● −300 −200 −100 0 100 Median blacklist to clean pre−sharing − post−sharing 21 of 27

  21. Matched Pair Analysis • What would happen if StopBadware had not sent out reviews? • Matched pairs between reported-to ASes and similar ASes • Similar? ◦ Same country ◦ Similar level of badness • Key Assumption: All else equal, ASes would exhibit similar patterns 22 of 27

  22. Measurement Timeline blacklist to clean blacklist to report report to clean blacklisted reported clean 23 of 27

  23. Matched Pair: Cleanup of URLs Shared with ASes URLS shared with ASes 1.0 reported ASes matched pairs Pr(report to clean days >= X) 0.8 0.6 0.4 0.2 0.0 1 5 50 500 Report to Clean (days) 24 of 27

  24. Matched Pair: Pre- vs. Post-Contact Cleanup Survival probability before and after contact 1.0 pre−contact Pr(blacklist to clean days >=X) post−contact pre−contact (mp) 0.8 post−contact (mp) 0.6 0.4 0.2 0.0 1 2 5 10 50 200 Blacklist to Clean (days) 25 of 27

  25. Responsive ASes Improve Long Term after Report Median recompromise rate pre−sharing − post−sharing 0.15 ● 0.10 ● 0.05 ● 0.00 ● −0.05 ● ● Top Quartile Report to Clean 2nd Quartile Report to Clean −0.10 3rd Quartile Report to Clean Bottom Quartile Report to Clean ● −300 −200 −100 0 100 Median blacklist to clean pre−sharing − post−sharing 26 of 27

  26. Conclusions • Directly sharing URLs helps clean up those URLs ◦ Consistent with prior work on individual reports ◦ This work finds it to be true for bulk reporting • No evidence for long term change overall ◦ Improvements on individual providers • Long lived malware a scourge ◦ Lots of efforts concentrating on newly infected websites ◦ Lurking infections continue to harm, perhaps compounding ◦ Current efforts not sufficient for stopping this “immortal” malware 27 of 27

Recommend


More recommend