A Critical Evaluation of Website Fingerprinting Attacks
Marc Juarez¹, Sadia Afroz², Gunes Acar¹, Claudia Diaz¹, Rachel Greenstadt³
¹ KU Leuven, ESAT/COSIC and iMinds, Leuven, Belgium; ² UC Berkeley, US; ³ Drexel University, US
CCS 2014, Scottsdale, AZ, USA, November 4, 2014
Introduction: how does WF work?
[Diagram: a user (Alice) visits a webpage over Tor; a local adversary observing her encrypted traffic tries to infer which page she is visiting.]
Why is WF so important?
● Tor is the most advanced anonymity network
● WF allows an adversary to recover the user's browsing history
● A series of successful attacks has been published
● Low cost to the adversary
[Chart: number of top-conference publications on WF (25)]
Introduction: unrealistic assumptions
● Client settings: e.g., the user's browsing behaviour
● Adversary: e.g., replicability of the victim's conditions
● Web: e.g., staleness of the training data
Contributions
● A critical analysis of the assumptions
● Evaluation of variables that affect accuracy
● An approach to reduce false positives
● A model of the adversary's cost
Methodology
● Based on Wang and Goldberg's methodology
○ Batches and k-fold cross-validation
○ Fast-Levenshtein attack (SVM)
● Comparative experiments
○ Key: isolate the variable under evaluation (e.g., the TBB version)
Comparative experiments: example
● Step 1 (Control): train and test on data with the default value → control accuracy
● Step 2 (Test): train on data with the default value, test on data with the value of interest → test accuracy
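The two-step protocol above can be sketched in code. This is a minimal stand-in, not the paper's implementation: it uses synthetic feature vectors instead of real Tor traces and a nearest-centroid classifier in place of the SVM-based attack, and the "changed setting" is modelled as a simple shift of the features.

```python
# Hypothetical sketch of a comparative experiment: train on traffic features
# collected under the default setting, then test on held-out default data
# (control) and on data collected under the value of interest (test).
import random

random.seed(0)

def make_traces(n_pages, n_visits, shift=0.0):
    """Synthetic per-page feature clusters; `shift` models a changed setting."""
    data = []
    for p in range(n_pages):
        for _ in range(n_visits):
            data.append(([random.gauss(p + shift, 0.3) for _ in range(4)], p))
    return data

def centroids(data):
    """Fit a nearest-centroid model: one mean feature vector per page."""
    sums, counts = {}, {}
    for x, y in data:
        s = sums.setdefault(y, [0.0] * len(x))
        for i, v in enumerate(x):
            s[i] += v
        counts[y] = counts.get(y, 0) + 1
    return {y: [v / counts[y] for v in s] for y, s in sums.items()}

def accuracy(model, data):
    def dist(a, b):
        return sum((u - v) ** 2 for u, v in zip(a, b))
    correct = sum(min(model, key=lambda y: dist(x, model[y])) == y
                  for x, y in data)
    return correct / len(data)

train = make_traces(5, 40)              # default setting
control = make_traces(5, 40)            # step 1: same setting, fresh data
variant = make_traces(5, 40, shift=2)   # step 2: e.g., a different TBB version

model = centroids(train)
print(f"control={accuracy(model, control):.2f} test={accuracy(model, variant):.2f}")
```

The control accuracy is high while the test accuracy collapses, which is the qualitative pattern the comparative experiments measure.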
Datasets
● Alexa Top Sites
● Active Linguistic Authentication Dataset (ALAD)
○ Real-world users (80 users, 40K unique URLs)
○ Training on Alexa and testing on ALAD? 45% of the pages are not in the Alexa top 100 → a prohibitive number of FPs
Experiments: multitab browsing
● Firefox users use 2 or 3 tabs on average
● Experiment with 2 tabs, opened with time gaps of 0.5s, 3s and 5s
● Background page picked at random
● Success: detection of either page (foreground or background)
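The success criterion above can be made concrete with a short sketch. Page names and predictions here are illustrative, not from the experiments: a guess counts as correct if it matches either of the two pages loaded in the pair.

```python
# Evaluation metric for the two-tab experiment: a prediction is a hit if it
# names either the foreground or the background page of that trace.
def multitab_accuracy(predictions, pairs):
    """pairs: list of (foreground, background) page labels per trace."""
    hits = sum(pred in pair for pred, pair in zip(predictions, pairs))
    return hits / len(pairs)

pairs = [("news", "mail"), ("shop", "wiki"), ("blog", "maps")]
print(multitab_accuracy(["mail", "forum", "blog"], pairs))  # 2 of 3 correct
```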
Experiments: multitab browsing
Accuracy for different time gaps between the two tabs:
Control: 77.08% | Test (3s): 9.8% | Test (0.5s): 7.9% | Test (5s): 8.23%
Experiments: TBB versions
● Coexisting Tor Browser Bundle (TBB) versions
● Versions: 2.4.7, 3.5 and 3.5.2.1 (changes in RP, etc.); 3.5.2.1 ships the latest version of RP
Accuracy: Control (3.5.2.1): 79.58% | Test (3.5): 66.75% | Test (2.4.7): 6.51%
Experiments: network conditions
● Three vantage points: VM Leuven (KU Leuven), VM New York and VM Singapore (DigitalOcean virtual private servers)
Experiments: network conditions
Accuracy when training and testing at different locations:
● Control (LVN): 66.95% → Test (NY): 8.83%
● Control (LVN): 66.95% → Test (SI): 9.33%
● Control (SI): 76.40% → Test (NY): 68.53%
Experiments: entry guard config.
● Which entry guard configuration works best for training?
● 3 configurations:
○ Fix 1 entry guard
○ Pick the entry from a list of 3 entry guards (default)
○ Pick the entry from all possible entry guards (Wang and Goldberg)
Experiments: entry guard config.
Accuracy for different entry guard configurations:
any: 70.38% | 3 entry guards: 64.40% | 1 entry guard: 62.70%
Experiments: data staleness
[Plot: accuracy (%) vs. time (days) on our collected data over 90 days — accuracy drops below 50% after 9 days]
Summary
The base rate fallacy: example
● Breathalyzer test:
○ identifies truly drunk drivers with probability 0.88 (true positives)
○ 0.05 false positive rate
● Alice tests positive
○ What is the probability that she is indeed drunk (the BDR)?
○ Is it 0.95? Is it 0.88? Something in between? It is only 0.1!
The base rate fallacy: example
● The circumference represents the world of drivers.
● Each dot represents a driver.
The base rate fallacy: example
● 1% of drivers are driving drunk (the base rate, or prior).
The base rate fallacy: example
● Of the drunk drivers, 88% are identified as drunk by the test.
The base rate fallacy: example
● Of the sober drivers, 5% are erroneously identified as drunk.
The base rate fallacy: example
● Alice must be within the black circumference.
● Ratio of red dots within the black circumference: BDR = 7/70 = 0.1!
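The BDR illustrated by the dot diagram follows directly from Bayes' theorem. A minimal sketch with the slide's rates (with a 1% base rate the exact value comes out to about 0.15; the dot diagram illustrates the same order of magnitude with rounded counts):

```python
# Bayesian detection rate (BDR): the probability that a positive test result
# really corresponds to a drunk driver.
def bdr(tpr, fpr, prior):
    # P(drunk | positive) = P(positive | drunk) P(drunk) / P(positive)
    return (tpr * prior) / (tpr * prior + fpr * (1 - prior))

# Slide numbers: 88% true positive rate, 5% false positive rate, 1% base rate.
print(round(bdr(0.88, 0.05, 0.01), 3))  # ~0.15 — far below 0.88
```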
The base rate fallacy in WF
● The base rate must be taken into account
● In WF:
○ Blue: webpages
○ Red: monitored pages
○ Base rate?
The base rate fallacy in WF
● What is the probability of visiting a monitored page?
● "false positives matter a lot"¹
● Experiment: 35K world
¹ Mike Perry, "A Critique of Website Traffic Fingerprinting Attacks", Tor Project Blog, 2013. https://blog.torproject.org/blog/critique-website-traffic-fingerprinting-attacks
Experiment: BDR in a 35K world
● Uniform world
● Non-popular pages from ALAD
[Plot: BDR vs. size of the world]
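The collapse of the BDR with world size can be sketched numerically. Under a uniform world, the prior of visiting a monitored page is simply (monitored pages) / (world size); the TPR, FPR, and number of monitored pages below are illustrative assumptions, not figures from the experiment.

```python
# As the world grows, the prior shrinks, and the BDR collapses even for an
# accurate classifier — the base rate fallacy applied to WF.
def bdr(tpr, fpr, prior):
    return (tpr * prior) / (tpr * prior + fpr * (1 - prior))

tpr, fpr, monitored = 0.90, 0.01, 100   # assumed attack rates, 100 monitored pages
for world in (1_000, 5_000, 35_000):
    prior = monitored / world           # uniform probability of a monitored visit
    print(world, round(bdr(tpr, fpr, prior), 3))
```

Even with a 1% false positive rate, the BDR falls from over 0.9 in a 1K world to roughly 0.2 in a 35K world under these assumptions.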
Classify, but verify
● A verification step tests the classifier's confidence
● The number of FPs is reduced from 397 to 42 (out of 400)
● But the BDR is still very low for non-popular pages
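The idea of the verification step can be sketched as a confidence threshold: accept the classifier's top guess only when its confidence is high enough, and otherwise output "unknown". The scores, page names, and threshold below are illustrative, not the paper's verifier.

```python
# "Classify, but verify" sketch: rejecting low-confidence guesses trades a few
# true positives for a large reduction in false positives.
def classify_verify(scores, threshold=0.9):
    """scores: dict mapping page label -> classifier confidence for one trace."""
    page, conf = max(scores.items(), key=lambda kv: kv[1])
    return page if conf >= threshold else "unknown"

print(classify_verify({"A": 0.95, "B": 0.05}))  # confident -> accepted
print(classify_verify({"A": 0.55, "B": 0.45}))  # ambiguous -> rejected
```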
Cost for the adversary
● Adversary's cost will depend on:
○ Number of pages
Versions of a page: St Valentine's doodle
[Histogram: total trace size (KBytes, roughly 700–1100) of the page on 13 Feb 2013 vs. 14 Feb 2013 — the doodle shifts the size distribution]
Cost for the adversary
● Adversary's cost will depend on:
○ Number of pages
○ Number of targets
Non-targeted attacks
[Diagram: many Tor users; the adversary sits at an ISP router and observes all of their traffic]
Cost for the adversary
● Adversary's cost will depend on:
○ Number of pages
○ Number of targets
○ Training and testing complexities
● Maintaining a successful WF system is costly
Limitations
● We took samples, which may not be representative of all possible practical scenarios
● Some variables are difficult to control:
○ Time gap
○ Tor circuit
Conclusions
● The WF attack fails in realistic conditions
● We do not completely dismiss the attack
● The attack can be enhanced, but at a greater cost
● Defenses might be cheaper in practice
Thank you for your attention. Questions?