AdGraph: A Graph-Based Approach to Ad and Tracker Blocking Umar Iqbal, Peter Snyder, Shitong Zhu, Benjamin Livshits, Zhiyun Qian, and Zubair Shafiq IEEE Symposium on Security and Privacy, 2020
Online Advertising Advertising enables “free”content Publishers show content Earn revenue with ads 1
Online Advertising Advertising enables “free”content Publishers show content Earn revenue with ads Interactive Advertising Bureau (IAB) ‘19 2
Online Advertising Advertising enables “free”content Publishers show content Earn revenue with ads Problems with online advertising ecosystem 3
Online Advertising Advertising enables “free”content Publishers show content “I see ad ads for things I dream am ab about.” Earn revenue with ads “M “My y phone is s eave vesd sdropping on me me” Problems with online advertising ecosystem Privacy concerns – Behavioral targeting 4
Online Advertising Advertising enables “free”content Publishers show content Earn revenue with ads Problems with online advertising ecosystem Privacy concerns – Behavioral targeting Performance issues – Slow page load 5
Online Advertising Advertising enables “free”content Publishers show content Earn revenue with ads Problems with online advertising ecosystem Privacy concerns – Behavioral targeting Performance issues – Slow page load Malvertising 6
Online Advertising Advertising enables “free”content Publishers show content Earn revenue with ads Problems with online advertising ecosystem Privacy concerns – Behavioral targeting Performance issues – Slow page load Malvertising Intrusive 7
Online Advertising Advertising enables “free”content Publishers show content Earn revenue with ads Problems with online advertising ecosystem Privacy concerns – Behavioral targeting Performance issues – Slow page load Malvertising Intrusive Solution Ad & tracker blockers 8
Outline State of Ad/Tracker Blocking Ads & Trackers Filter list blocking Machine learning based blocking AdGraph Graph-based representation Machine learning on graph representation Evaluation 9
Outline State of Ad/Tracker Blocking Ads & Trackers Filter list blocking Machine learning based blocking 10
What are Ads and Trackers? 11
What are Ads and Trackers? Ads are audio-visual promotional content 12
What are Ads and Trackers? Ads are audio-visual promotional content Trackers collect sensitive information Tracking Pixel 13
What are Ads and Trackers? Ads are audio-visual promotional content Trackers collect sensitive information They are: Created with JavaScript Requested with HTTP Displayed with HTML Ads and trackers involve HTML, Network, and JavaScript Tracking Pixel 14
JavaScript What are Ads and Trackers? Ads are audio-visual promotional content Trackers collect sensitive information They are: Created with JavaScript Tracking Pixel 15
What are Ads and Trackers? Ads are audio-visual promotional content Trackers collect sensitive information HTTP They are: Created with JavaScript Requested with HTTP Tracking Pixel 16
What are Ads and Trackers? Ads are audio-visual promotional content Trackers collect sensitive information HTML They are: Created with JavaScript Requested with HTTP Displayed with HTML Tracking Pixel 17
JavaScript What are Ads and Trackers? Ads are audio-visual promotional content Trackers collect sensitive information HTML HTTP They are: Created with JavaScript Requested with HTTP Displayed with HTML Ads and trackers involve HTML, Network, and JavaScript Tracking Pixel 18
Filter List Based Blocking Manually curated with crowdsourcing 19
Filter List Based Blocking Manually curated with crowdsourcing Leads to scalability issue s 3 months to add new rules [Iqbal et al. ‘17] 20
Filter List Based Blocking Manually curated with crowdsourcing Leads to scalability issue s 3.8 year to remove rules [Snyder et al. ‘20] 21
Filter List Based Blocking Manually curated with crowdsourcing Leads to scalability issue s 90% rules are useless [Snyder et al. ‘20] 22
Filter List Based Blocking Manually curated with crowdsourcing Leads to scalability issue s Operate at HTML/Network/JS layer in isolation 23
Filter List Based Blocking Manually curated with crowdsourcing Block network request Leads to scalability issue s Operate at HTML/Network/JS layer in isolation Leads to accuracy issues 24
Filter List Based Blocking Manually curated with crowdsourcing Block network request Leads to scalability issue s Operate at HTML/Network/JS layer in isolation Leads to accuracy issues Hide HTML elements 25
Filter List Based Blocking Manually curated with crowdsourcing Block network request Leads to scalability issue s Operate at HTML/Network/JS layer in isolation Leads to accuracy issues Hide HTML elements Block script execution 26
Filter List Based Blocking Manually curated with crowdsourcing Block network request Leads to scalability issue s Operate at HTML/Network/JS layer in isolation Leads to accuracy issues Hide HTML elements Block script execution 27
Filter List Based Blocking Suffer from scalability issues Suffer from accuracy issues 28
Machine Learning Based Blocking Network layer [Bhagavatula et al. 14, Gugelmann et al. ’15] HTTP header properties as features presence of words like “ad” cookies set by response 29
Machine Learning Based Blocking Network layer [Bhagavatula et al. 14, Gugelmann et al. ’15] HTTP header properties as features presence of words like “ad” cookies set by response JavaScript layer [Wu et al. ‘16, Ikram et al. ‘17] JS API names as features document.cookie element.clientWidth 30
Machine Learning Based Blocking Solve scalability issues 31
Machine Learning Based Blocking Solve scalability issues Do not solve accuracy issues 32
Outline AdGraph Graph-based representation Machine learning on graph representation Evaluation 33
AdG AdGra raph ph �������������������� ���� �� ����� � ������� ������ �� ������ ����������������������� �������������������������� ��������������������������� ������������������� �������������������� ��������������������� ��������������������� �������������������������� ���������������������� ������������� ������������������ ����������������� 34
AdGra AdG raph ph Graph-based cross-layer representation of ad/tracker behavior �������������������� ���� �� ����� � ������� ������ �� ������ ����������������������� �������������������������� ��������������������������� ������������������� �������������������� ��������������������� ��������������������� �������������������������� ���������������������� ������������� ������������������ ����������������� 35
AdG AdGra raph ph Graph-based cross-layer ML to automatically learn representation of ad/tracker behavior ad/tracker behavior �������������������� ���� �� ����� � ������� ������ �� ������ ����������������������� �������������������������� ��������������������������� ������������������� �������������������� ��������������������� ��������������������� �������������������������� ���������������������� ������������� ������������������ ����������������� 36
Recommend
More recommend