Measurement Research to the Web Calamity's Rescue Gregory BLANC - PowerPoint PPT Presentation

Measurement Research to the Web Calamity's Rescue Gregory BLANC Internet Engineering Laboratory Nara Institute of Science Technology WIDE member 3rd CAIDA-WIDE-CASFI Measurement Workshop April 24-25, 2010, Osaka Sunday, April 25, 2010

What measurement does? • CAIDA: malicious activity analysis, traffic classification, data sharing • CASFI: performance measurement, traffic analysis, data sharing • WIDE-mawi: DNS behavior analysis, traffic measurement, data sharing • overall, deploying probes at the network layer and measuring traffic characteristics 2 Sunday, April 25, 2010

What measurement does? (from the leaders) • Kenjiro CHO ~ “AJAX generates a lot of traffic” • Brad HUFFAKER ~ “HTTP is king” • Sue MOON ~ “The Web admin left” 3 Sunday, April 25, 2010

What measurement can do? • distinguishing application won’t help • we need to look deeper in the application layer • draw statistics of what is actually flowing • collect samples of what interests us 4 Sunday, April 25, 2010

Common Issues in Web Security Research • we often encounter issues when evaluating proposals (systems): • lack of datasets: nothing to play with • homogeneous datasets: too much of the same thing • outdated datasets: remember the KDD Cup 1999? • unbalanced datasets: might not be representing the reality 5 Sunday, April 25, 2010

Existing methods to collect JS samples (1): crawling • merits • JS may represent a small percentage • automated • solution: targeting blacklisted websites • can collect loads of data • user contribution • demerits • do not understand • Example: AJAX • can not mimic • crawler.archive.org accurately the user • target site should be wisely chosen 6 Sunday, April 25, 2010

Existing methods to collect JS samples (2): analysis website • merits • solution: to encourage sharing • only malicious JS • but it will be limited to what users would • often deobfuscated want to contribute • available online • demerits • Example • size depends on user • wepawet.cs.ucsb.edu contribution • jsunpack.jeek.org • dataset is not enough varied • data is not always available 7 Sunday, April 25, 2010

No solution in the wild (1) • we do not capture malicious JS because it is volatile in nature: • volatileness • obfuscation • transience • duplication • redirection • application layer • silent bidirectional communication 8 Sunday, April 25, 2010

No solution in the wild (2) • no efficient crawlers • no attractive sharing platforms • small user contribution • new ways to get samples in the wild: • network probes with deep packet inspection -> overhead • browser monitoring -> privacy • logs 9 Sunday, April 25, 2010

JS measurement • what to measure? is it measurable? • degree of obfuscation of benign Web 2.0 traffic: obfuscation does not indicate maliciousness • spread of JS malware: Samy was fast but noisy • JS malware code collection: overall lack of reliable datasets 10 Sunday, April 25, 2010

Web 2.0 • not only a buzzword • paradigm shift: • shift in the development • shift in the usage 11 Sunday, April 25, 2010

Development Shift • Rich Internet Applications (desktop) • Asynchronous Communication • Cross-domain Interaction • Web Services 12 Sunday, April 25, 2010

Usage Shift • Software Consumption • Collaboration/Participation • Content Sharing • Syndication/Aggregation • Social Networking 13 Sunday, April 25, 2010

Browser Model Shift • To cope with the Web 2.0 offer, the browser model has also changed: • plugins (Flash) • APIs (Ajax, custom, etc.) • interconnection (ActiveX, JavaVM) 14 Sunday, April 25, 2010

15 Sunday, April 25, 2010

User is the new victim This new browser model provides a better user experience but provides the attacker with a wider attack space • server side: too many websites with too many inputs to validate or control • client side: the user is left defenseless even against deemed benign popular sites Attackers prefer to concentrate on the most vulnerable, the end-user: phishing, drive-by attacks,etc. 16 Sunday, April 25, 2010

JS malware (1) • JS is a dynamic prototype-oriented event-drivent scripting language • a good tool to program automated elaborated script that can do massive harm • JS malwre: observed and defined by some security researchers (Brian Hoffman, Jeremiah Grossman, Martin Johns, etc.) 17 Sunday, April 25, 2010

JS malware (2) • propagates like conventional malware • wide category regrouping JS-based malicious code • PoC: XSS tunnel/proxy/botnet • in-the-wild examples: BeEF, BrowserRider, XSS-proxy, Samy worm, Yamanner 18 Sunday, April 25, 2010

Strengths of JS Malware • 1) stealth: property of going unnoticed by the user and the server • use of the XHR object • 2) polymorphism: ability of changing its form dynamically to evade signature • use of prototype hijacking • 3) obfuscation 19 Sunday, April 25, 2010

JavaScript Analysis • dynamic execution [Moshchuk’07] • static/dynamic tainting [Vogt’07] • control flow graph [Guha’09] • semantics [Hou’08] • machine-learning based [Choi’09, Hou’10, Likarish’09] 20 Sunday, April 25, 2010

JavaScript Deobfuscation • manual deobfuscation • semi-automated (Malzilla) • anti-analysis tricks: • recursive obfuscation • anti-crawling traps • argument.callee 21 Sunday, April 25, 2010

Conclusion • Our research area suffers a great lack of reliable and representative data • We have the methods and tools to carry out analysis but no data • Measurement research has made progress not only on collection but also on efficiency • It is time to cooperate! 22 Sunday, April 25, 2010

Overture • JavaScript is not the only matter of concern • VBScript, ActionScript (Flash) • new media of propagation (SNS) • distribution websites structure 23 Sunday, April 25, 2010

Questions / Discussion • Thank you for your attention • Let’s start a cooperation: gregory@is.naist.jp 24 Sunday, April 25, 2010

References • [Moshchuk’07]: SpyProxy: Execution-based Detection of Malicious Web Content, USENIX Security’07 • [Vogt’07]: Cross-Site Scripting Prevention with Dynamic Data Tainting and Static Analysis, NDSS’07 • [Hou’08]: Malicious Webpage Detection by Semantics-Aware Reasoning, ISDA’08 • [Choi’09]: Automatic Detection for JavaScript Attacks in Web Pages through String Pattern Analysis, FGIT’09 • [Guha’09]: Using Static Analysis for Ajax Intrusion Detection, WWW’09 • [Likarish’09]: Malicious Javascript Detection Using Classification Techniques, MALWARE’09 • [Hou’10]: Malicious Web Content Detection by Machine Learning, Expert Systems with Applications #37 25 Sunday, April 25, 2010

Measurement Research to the Web Calamity's Rescue Gregory BLANC - PowerPoint PPT Presentation

Measurement Research to the Web Calamity's Rescue Gregory BLANC Internet Engineering Laboratory Nara Institute of Science Technology WIDE member 3rd CAIDA-WIDE-CASFI Measurement Workshop April 24-25, 2010, Osaka Sunday, April 25, 2010 What

How Calamity Days Affect Summer Retirements 1 50 343c, 6/16/1 Calamity days During

ONTARIO MINE RESCUE ONTARIO MINE RESCUE PROGRAM PROGRAM 3 rd International Mines Rescue

St. Johns County Fire Rescue Master Plan February 2019 Fire Rescue Department Overview 911

CTIF Rescue and Fire Services CTIF Rescue and Fire Services on Airports on Airports CTIFs

(FWE-RRK 3:1/FWE-RRK 5:1) R-ALF RESCUE KIT 3:1 / 5:1 Australia The Ferno R-ALF Rescue Kit is a

Web Services Web Services Towards Web Services Towards Web Services Towards Web Services A

The Development of HKFSDs High Angle Rescue Team Experience and Insight Sharing Swiss Army

POLAND Polish mine rescue system Tasks, Structures of Polish mine rescue system The Act

Wallace Weir Fish Rescue Facility Issue Wallace Weir Fish Rescue Facility Issue Wallace Weir

How competition affects evolutionary rescue: theoretical insight Matthew Osmond Claire de

Some Thoughts on the Financial Crisis THE LONG AND THE SHORT (TERM) OF IT The calamity in the

Climate calamity Psycho-spiritual implications Sensei Kritee (Kanko), Ph.D. Interface, Feb 2016

Strategies to Combat Arsenic Calamity in West Bengal, India as Madhum umita Roy , Sut Sutapa Muk

Web Mining Web Mining Web Mining Web Mining Web mining is the use of data mining techniques

Lecture 1: Semantic Web and RDF Aidan Hogan aidhog@gmail.com THE WEB The Web is now 26 years

Presentation to Ontario Smart Grid Working Group Who is Measurement Canada? Measurement: A part

DAQ application using open source tools for Plasma heating experiment by Rameshkumar Joshi

Web Design Guidelines SWEN-444 Design Principles and Guidelines User Populations (Shared human

PHP and Rich Internet Applications Mike Potter http://www.riapedia.com/ mike.potter@adobe.com

Lecture 10: Floating Point, Digital Design Todays topics: FP arithmetic Intro to

What's Happening in the What's Happening in the Apache Flex Project Apache Flex Project Flex

TryF# building a system for multi-platform access to a managed language Nigel Horspool

RIA Contact Josh Holmes James Ward Microsoft Evangelist Adobe Evangelist

Wt, The Witty Web Toolkit FOSDEM Lightning talk Koen Deforche Wim Dumon Pieter Libin

Sambuz

Useful Links

Newsletter

Mail Us