BotGraph: Large Scale Spamming Botnet Detection Web-account abuse - PowerPoint PPT Presentation

BotGraph: Large Scale Spamming Botnet Detection

Web-account abuse attack recent spamming technic New different approche for sending spam Basing on reputation of email providers Difficult to detect signup detection monitoring users' activity Very difficult to distinguish real user from bot

Solution? tricky, with two challenges 1. designing an algorithm 2. implementing working solution milions of users houndreds of gigabytes activity logs

Solution! bots != user real user bot user Rare and small Tightly connected corelations Spammers never fully Variable and small sent control infected emails per day rate computers Email size varies Higher and steady sent emails rate Emails templates

Problems but... real user bot user mobile users, proxies stealthy and dynamic ips possible counter average is not every technics false positive bot classification unwanted

BotGraph architecture

User login graph simple bot-users login behaviour user login graph vertices - email accounts edges - login from same ip address (ip-day) sharing ip address single bot handles ~50 bot-users single bot-user assigned to many bots over time autonomous systems metric vs dynamic ips and proxies

Giant connected component random graph theorem average degree d = n*p d < 1 => size = O(log n) d > 1 => size = O(n) bot-users forms giant connected component normal users' connected components are small (less then 100 nodes) components varies with sizes bot-users nets may intersect hierarchical extraction (increasing edges weight connection threshold)

legitimate users pruning based on the number of sent emails per day less then 10% users, sent more then 3 emails/day BotGraph consider only nodes, where at least 80% of users sent more then 3 emails/day validation based on emails size, account naming pattern much more effective with users' groups analising

Graph construction & analysis Huge size over 500 milions of login data in one month (220GB) userid, ip address, login timestamp number of edges - hundreds of billions 240 machine cluster 1.5 hours Dryad/DryadLINQ Finding connected component simple divide and conquer 7 minutes on cluster vs 4 hours on single computer

Two methods i.e. "first didn't work" method 1 method 2 partitioning by login ip partition by user ID address direct compare users in one map phase: outputs an partition edge for every two users generating local summaries of sharing an ip from AS used IP-day keys in partition reduce phase: weight and broadcasting them aggregation of edges upon reciving summary, sending related records merging recieved answers for broadcasted summaries

comparison i.e. "why it didn't work" method 1 method 2 sending edges of weight directly computing edge of one. They can not be weight w or more ignored

performance i.e. "how bad it didn't work" method 1 method 2 12.0 TB communication 1.7 TB interrupted 6+ hours 95 min 2.71 TB, 135 min (subset) 460 GB, 28 min 1.02 TB, 116 min 181 GB, 22 min (compression)

Results found 40 bot groups in January 2008 botnet size from few houndrdes up to few milions total of 20.58M of bot-users 16.41M EWMA - 91.83% new findings 8.68M graph-based - 54.10% new findings total of 1.84M of bot-IPs 240 784 EWMA 1.60M graph-based false positive rate estimated: 0.44%

Questions?

BotGraph: Large Scale Spamming Botnet Detection Web-account abuse - PowerPoint PPT Presentation

BotGraph: Large Scale Spamming Botnet Detection Web-account abuse attack recent spamming technic New different approche for sending spam Basing on reputation of email providers Difficult to detect signup detection monitoring users' activity

BotGraph: Large Scale Spamming Botnet Detec5on Yao Zhao Yinglian Xie * , Fang Yu * , Qifa Ke * ,

& 1st large scale oauth stealing botnet & Secure delegation mechanism De-facto

Peer-to-Peer Botnet Detection Using NetFlow Connor Dillon System and Network Engineering

Botnet Detection and Response The Network is the Infection David Dagon dagon@cc.gatech.edu

Anomaly-based Bot Server (and more!) Detection Jim Binkley jrb@cs.pdx.edu Portland State

S MV -H UNTER : Large Scale, Automated Detection of SSL/TLS Man-in-the-Middle Vulnerabilities in

Challenges in Experimenting with Botnet Detection Systems Adam J. Aviv Andreas Haeberlen

MetaNet A botnet with Metasploit integration By : Matan Ramrazker, Guy Gelber What is a Botnet

Evaluation of Amplification attacks in large-scale networks to improve detection performance

25 Million Flows Later Large-scale Detection of DOM-based XSS CCS 2013, Berlin Sebastian

Anomaly Detection and Troubleshooting of Large Scale Systems from Event Logs Presented By Niloy

Data Analysis, Estimation, and Fault detection of Large-Scale Autonomous System of Vehicles Using

Large-scale Evaluation of Distributed Attack Detection Thomas Gamer, Christoph P. Mayer Institut

Botnet Detection through Analyzing Network Traffic using Statistical Signal Processing Methods

Botnet Detection with DNS Monitoring Seminar Future Internet 2014 Christopher Will Advisor:

Large-scale performance monitoring framework for cloud monitoring Run-Time Latency Detection in

FINANCING LARGE SCALE SOLAR Large Scale Solar Conference - Sydney Gloria Chan Director, Large

Link Spam Detection Based on Mass Estimation Zoltn Gyngyi , Pavel Berkhin, Hector

Enhancing Memory Error Detection for Large-Scale Applications and Fuzz testing Wookhyun Han ,

1 Domain Flux-based DGA Botnet Detection Using Feedforward Neural Network Md. Ishtiaq Ashiq

Large-Scale API Protocol Mining for Automated Bug Detection Michael Pradel Department of

Transfer Learning Approach for Botnet Detection based on Recurrent Variational Autoencoder

Deviations in Load Testing of Large Scale Systems Haroon Malik Software Analysis and

A Solution for Densely Annotated Large Scale Object Detection Task Yuan Gao, Hui Shen, Donghong

BotGraph: Large Scale Spamming Botnet Detection Web-account abuse - PowerPoint PPT Presentation

BotGraph: Large Scale Spamming Botnet Detection Web-account abuse attack recent spamming technic New different approche for sending spam Basing on reputation of email providers Difficult to detect signup detection monitoring users' activity

BotGraph: Large Scale Spamming Botnet Detec5on Yao Zhao Yinglian Xie * , Fang Yu * , Qifa Ke * ,

&amp; 1st large scale oauth stealing botnet &amp; Secure delegation mechanism De-facto

Peer-to-Peer Botnet Detection Using NetFlow Connor Dillon System and Network Engineering

Botnet Detection and Response The Network is the Infection David Dagon dagon@cc.gatech.edu

Anomaly-based Bot Server (and more!) Detection Jim Binkley jrb@cs.pdx.edu Portland State

S MV -H UNTER : Large Scale, Automated Detection of SSL/TLS Man-in-the-Middle Vulnerabilities in

Challenges in Experimenting with Botnet Detection Systems Adam J. Aviv Andreas Haeberlen

MetaNet A botnet with Metasploit integration By : Matan Ramrazker, Guy Gelber What is a Botnet

Evaluation of Amplification attacks in large-scale networks to improve detection performance

25 Million Flows Later Large-scale Detection of DOM-based XSS CCS 2013, Berlin Sebastian

Anomaly Detection and Troubleshooting of Large Scale Systems from Event Logs Presented By Niloy

Data Analysis, Estimation, and Fault detection of Large-Scale Autonomous System of Vehicles Using

Large-scale Evaluation of Distributed Attack Detection Thomas Gamer, Christoph P. Mayer Institut

Botnet Detection through Analyzing Network Traffic using Statistical Signal Processing Methods

Botnet Detection with DNS Monitoring Seminar Future Internet 2014 Christopher Will Advisor:

Large-scale performance monitoring framework for cloud monitoring Run-Time Latency Detection in

FINANCING LARGE SCALE SOLAR Large Scale Solar Conference - Sydney Gloria Chan Director, Large

Link Spam Detection Based on Mass Estimation Zoltn Gyngyi , Pavel Berkhin, Hector

Enhancing Memory Error Detection for Large-Scale Applications and Fuzz testing Wookhyun Han ,

1 Domain Flux-based DGA Botnet Detection Using Feedforward Neural Network Md. Ishtiaq Ashiq

Large-Scale API Protocol Mining for Automated Bug Detection Michael Pradel Department of

Transfer Learning Approach for Botnet Detection based on Recurrent Variational Autoencoder

Deviations in Load Testing of Large Scale Systems Haroon Malik Software Analysis and

A Solution for Densely Annotated Large Scale Object Detection Task Yuan Gao, Hui Shen, Donghong

& 1st large scale oauth stealing botnet & Secure delegation mechanism De-facto