NoMoATS: Towards Automatic Detection of Mobile Tracking Abstract: - PDF document

Proceedings on Privacy Enhancing Technologies ; 2020 (2):45–66 Anastasia Shuba* and Athina Markopoulou NoMoATS: Towards Automatic Detection of Mobile Tracking Abstract: Today’s mobile apps employ third-party ad- 1 Introduction vertising and tracking (A&T) libraries, which may pose a threat to privacy. State-of-the-art detects and blocks The mobile ecosystem is rife with third-party track- outgoing A&T HTTP/S requests by using manually ing. App developers often integrate with third-party li- curated filter lists ( e.g. EasyList), and recently, using braries, which can be broken into roughly three cate- machine learning approaches. The major bottleneck of gories: advertisement and analytics libraries, social li- both filter lists and classifiers is that they rely on ex- braries ( e.g. Facebook ), and development libraries [1]. perts and the community to inspect traffic and man- These libraries inherit the same permissions as their ually create filter list rules that can then be used to parent app, and can thus collect rich personal and con- block traffic or label ground truth datasets. We propose textual information [2, 3]. NoMoATS – a system that removes this bottleneck by To protect themselves, privacy-conscious users rely reducing the daunting task of manually creating filter on tools such as DNS66 [4] and AdGuard [5]. These rules, to the much easier and scalable task of labeling apps require no rooting and instead rely on VPN APIs A&T libraries. Our system leverages stack trace anal- to intercept outgoing traffic and match it against a list ysis to automatically label which network requests are of rules, such as EasyPrivacy [6]. Such lists are man- generated by A&T libraries. Using NoMoATS, we col- ually curated, by experts and the community, and are lect and label a new mobile traffic dataset. We use this thus difficult to maintain in the quickly changing mo- dataset to train decision tree classifiers, which can be bile ecosystem. More recently, multiple works [7–9] have applied in real-time on the mobile device and achieve proposed to train machine learning models, which are an average F-score of 93%. We show that both our au- more compact and generalize. However, in order to ob- tomatic labeling and our classifiers discover thousands tain ground truth ( i.e. labeled datasets) to train the of requests destined to hundreds of different hosts, pre- machine learning models, current state-of-the-art still viously undetected by popular filter lists. To the best of relies on filter lists [7, 8] or a combination of filter lists our knowledge, our system is the first to (1) automati- and manual labeling [9]. Therefore, obtaining accurate cally label which mobile network requests are engaged ground truth is a crucial part and a major bottleneck of in A&T, while requiring to only manually label libraries both filter-lists and machine learning approaches. to their purpose and (2) apply on-device machine learn- In this paper, we aim to reduce the scope of man- ing classifiers that operate at the granularity of URLs, ual labeling required to identify mobile network requests can inspect connections across all apps, and detect not that are either requesting ads or are tracking the user only ads, but also tracking. (A&T requests). We start by noting that tracking and Keywords: mobile; privacy; tracking; advertising; filter advertising on mobile devices is usually done by third- lists; machine learning party libraries whose primary purpose is advertising or analytics (A&T libraries). Throughout this paper, we DOI 10.2478/popets-2020-0017 will refer to a an HTTP request (or a decrypted HTTPS Received 2019-08-31; revised 2019-12-15; accepted 2019-12-16. request) as an A&T request (or packet), if it was generated by an A&T library. Another key observation is that it is possible to determine if a network request came from the application itself or from a library by examin- ing the stack trace leading to the network API call. More *Corresponding Author: Anastasia Shuba: Broadcom specifically, stack traces contain package names that Inc. (the author was a student at the University of Cali- identify different entities: app vs. library code. Thus, fornia, Irvine at the time the work was conducted), E-mail: ashuba@uci.edu to label which network requests are A&T, we just need Athina Markopoulou: University of California, Irvine, E- a list of libraries that are known to be A&T. mail: athina@uci.edu

NoMoATS: Towards Automatic Detection of Mobile Tracking Abstract: - PDF document

Proceedings on Privacy Enhancing Technologies ; 2020 (2):4566 Anastasia Shuba* and Athina Markopoulou NoMoATS: Towards Automatic Detection of Mobile Tracking Abstract: Todays mobile apps employ third-party ad- 1 Introduction vertising and

People-Tracking-by-Detection and People-Detection-by-Tracking Mykhaylo Andriluka Stefan Roth

Tracking H akan Ard o February 22, 2012 H akan Ard o Tracking February 22, 2012 1

Detection of neutral particles detection of neutrons detection of neutrinons detection of low

MOBILE ADVERTISING Agenda Get off to a mobile start with Media Impact! Why mobile? MI

Automatic Verification of Automatic Verification of Automatic Verification of Automatic

Tracking H akan Ard o March 4, 2013 H akan Ard o Tracking March 4, 2013 1 / 57

Foreground detection and tracking in 2D/3D Jos Luis Landabaso Montse Pards Outline 2D

Lecture 21: Motion and tracking Thursday, Nov 29 Prof. Kristen Grauman Prof. Kristen Grauman

Automatic Defect Detection Andrzej Wasylkowski Overview Automatic Defect Detection

Automatic Disfluency Automatic Disfluency Detection in Multi-party Detection in Multi-party

CVPR 2 CVPR 2019 Tracking and Detection Challenge 16/06/2019 www.motchallenge.net CVPR 2019

Overview Introduction Object Tracking Vehicle Tracking Theory & Implementation

Mobile Capabilities And Credentials Contents Mobile Landscape Mobile Functionalities

Detection vs. tracking Lecture 21: Motion and tracking Thursday, Nov 29 Prof. Kristen

Towards a Unified Framework for Mobile Device Security Wayne A. Jansen, NIST Mobile Device

Automatic Enrollment and Automatic IRAs David C. John The Heritage Foundation The Retirement

Big data: What are you missing? the risks of assuming data equals all David J. Hand Imperial

Earnings Call August 6, 2019 9:00am ET Innophos Holdings, Inc. | March 2019 Forward-Looking

Premises in the Aftermath of Crises UNU-WIDER Conference on Responding to Crises

PROTECTING DIGITAL INFORMATION Roadmap: Fall 2017 But First, An Aside: This is Misleading

Truncated Sums, Matrix Iteration Giacomo Boffi

Variational quantum eigensolver of interacting bosons with NISQ devices Andy C. Y. Li This

Why Im NOT Why Im NOT Jewish/ Christian Atheist Agnostic Hindu Muslim Buddhist

Travel and transport MOTOR CYCLES Exceeding 50 cc MOTOR CARS Light Vehicle Cars including

NoMoATS: Towards Automatic Detection of Mobile Tracking Abstract: - PDF document

Proceedings on Privacy Enhancing Technologies ; 2020 (2):4566 Anastasia Shuba* and Athina Markopoulou NoMoATS: Towards Automatic Detection of Mobile Tracking Abstract: Todays mobile apps employ third-party ad- 1 Introduction vertising and

People-Tracking-by-Detection and People-Detection-by-Tracking Mykhaylo Andriluka Stefan Roth

Tracking H akan Ard o February 22, 2012 H akan Ard o Tracking February 22, 2012 1

Detection of neutral particles detection of neutrons detection of neutrinons detection of low

MOBILE ADVERTISING Agenda Get off to a mobile start with Media Impact! Why mobile? MI

Automatic Verification of Automatic Verification of Automatic Verification of Automatic

Tracking H akan Ard o March 4, 2013 H akan Ard o Tracking March 4, 2013 1 / 57

Foreground detection and tracking in 2D/3D Jos Luis Landabaso Montse Pards Outline 2D

Lecture 21: Motion and tracking Thursday, Nov 29 Prof. Kristen Grauman Prof. Kristen Grauman

Automatic Defect Detection Andrzej Wasylkowski Overview Automatic Defect Detection

Automatic Disfluency Automatic Disfluency Detection in Multi-party Detection in Multi-party

CVPR 2 CVPR 2019 Tracking and Detection Challenge 16/06/2019 www.motchallenge.net CVPR 2019

Overview Introduction Object Tracking Vehicle Tracking Theory &amp; Implementation

Mobile Capabilities And Credentials Contents Mobile Landscape Mobile Functionalities

Detection vs. tracking Lecture 21: Motion and tracking Thursday, Nov 29 Prof. Kristen

Towards a Unified Framework for Mobile Device Security Wayne A. Jansen, NIST Mobile Device

Automatic Enrollment and Automatic IRAs David C. John The Heritage Foundation The Retirement

Big data: What are you missing? the risks of assuming data equals all David J. Hand Imperial

Earnings Call August 6, 2019 9:00am ET Innophos Holdings, Inc. | March 2019 Forward-Looking

Premises in the Aftermath of Crises UNU-WIDER Conference on Responding to Crises

PROTECTING DIGITAL INFORMATION Roadmap: Fall 2017 But First, An Aside: This is Misleading

Truncated Sums, Matrix Iteration Giacomo Boffi

Variational quantum eigensolver of interacting bosons with NISQ devices Andy C. Y. Li This

Why Im NOT Why Im NOT Jewish/ Christian Atheist Agnostic Hindu Muslim Buddhist

Travel and transport MOTOR CYCLES Exceeding 50 cc MOTOR CARS Light Vehicle Cars including

Overview Introduction Object Tracking Vehicle Tracking Theory & Implementation