CATCHSYNC: CATCHING SYNCHRONIZED BEHAVIOR IN LARGE DIRECTED GRAPHS - PowerPoint PPT Presentation

CATCHSYNC: CATCHING SYNCHRONIZED BEHAVIOR IN LARGE DIRECTED GRAPHS Meng Jiang, Tsinghua University, Beijing, China Joint work with Peng Cui, Alex Beutel, Christos Faloutsos and Shiqiang Yang August 26, 2014 – NYC, USA

2 Fraud Detection: Graph Analysis Problem [www.buyfollowz.org] [buymorelikes.com]

3 Fraud Detection: Graph Analysis Problem [buycheaplikes.com] [reviewsteria.com]

4 Our Goals • Given: A graph (large-scale, directed, etc.) • Find: Frauds = Anomalous edges • Goals: • G1. Find patterns that distinguish fraudsters from normal users • G2. Design algorithms that catch fraudsters

5 OUTLINE 1. Background 2. Fraudulent Pattern 3. The Algorithm 4. Experiments

6 Anomalies in Degree Distributions • Power-law distribution DBLP Flickr Twitter Author -publication User -user Who -follows-whom [konect.uni-koblenz.de/networks/]

7 Anomalies in Degree Distributions 2009 3.17M 0.41M 41M d=20

8 Linear Classifier with “Degree”: Fail =20? +1 3.17M (Fraud) 0.41M Label Out-degree (+1,-1) d=20 classifier ×

9 Graph Structure Distorted 2011 1.91M 117M 0.44M d=64

10 Traditional Fraud Detection Big? Small? Big? Big? Big? +1 (Fraud) Label Out-degree In-degree #tweet #url in #hashtag (+1,-1) tweets in tweets Content-based features classifier

11 Empty Profile?

12 Few Followers?

13 Many Followings?

14 Content: Unavailable? Look Normal? 0, 0, 0… sorry Label Out-degree In-degree #tweet #url in #hashtag (+1,-1) tweets in tweets Content-based features classifier

15 Behavior is the Key Monetary Incentive Content Behavior/ Links what they what they appear to have to behave behave

17 Behavior-based Features Follower Followee behavior behavior ≈ ≈ Out-degree In-degree 1 st left singular vector 1 st right singular vector (Hubness) (Authoritativeness) 2 nd left singular vector 2 nd right singular vector … …

18 Behavior-based Feature Space Follower Followee behavior behavior

19 Fraudulent Behavior Patterns

24 Fraudulent Behavior Patterns • Synchronized • Abnormal

26 Synchronicity and Normality • Synchronicity

27 Synchronicity and Normality • Normality

28 Synchronicity-Normality Plot

29 Theorem • For any distribution, there is a parabolic lower limit in the synchronicity-normality plot. synchronicity normality • Proof. See our paper 

30 CatchSync Algorithm • Distance-based anomaly detection • Fraudsters • Big synchronicity • Small normality • Away from the densest

31 OUTLINE 1. Background 2. Fraudulent Pattern Mining 3. The Algorithm 4. Experiments

32 Experiments • Q1: Does CatchSync remove anomalies? • Degree distribution • Feature space • Q2: Is CatchSync catching actually fraudulent users? • Q3: Is CatchSync robust?

33 Q1: Does CatchSync Remove Anomalies? 2009 3.17M 41M 0.41M d=20

34 Q1: Does CatchSync Remove Anomalies? 2011 117M

35 Before CatchSync Follower Followee behavior behavior

36 After CatchSync Follower Followee behavior behavior

37 Q2: Is CatchSync Catching Actually Fraudulent Users? 173/1,000 237/1,000

38 Q2: Is CatchSync Catching Actually Fraudulent Users? CatchSync 0.813 +SPOT CatchSync 0.751 0.597 SPOT OutRank 0.412 0 0.2 0.4 0.6 0.8 1

39 Q2: Is CatchSync Catching Actually Fraudulent Users? CatchSync 0.785 +SPOT CatchSync 0.694 0.653 SPOT OutRank 0.377 0 0.2 0.4 0.6 0.8 1

40 Q2: Is CatchSync Catching Actually Fraudulent Users? Recall = 80% Precision in Twitter Precision in Tencent Weibo 83.5% 79.4%

41 Q3: Is CatchSync Robust to Camouflage? Target Popular camouflage Random camouflage

42 Q3: Is CatchSync Robust to Camouflage?

43 Q3: Is CatchSync Robust to Camouflage?

44 Q3: Is CatchSync Robust to Camouflage? Popular Random camouflage camouflage

45 Conclusion • Goals • G1. Find patterns that distinguish fraudulent user behavior from normal behavior • A1: Synchronized & Abnormal! • G2. Design algorithms that catch fraudsters • A2: CatchSync! • Remove spikes • Content free • Robust to camouflage

46 Questions? Meng Jiang mjiang89@gmail.com http://www.meng-jiang.com

CATCHSYNC: CATCHING SYNCHRONIZED BEHAVIOR IN LARGE DIRECTED GRAPHS - PowerPoint PPT Presentation

CATCHSYNC: CATCHING SYNCHRONIZED BEHAVIOR IN LARGE DIRECTED GRAPHS Meng Jiang, Tsinghua University, Beijing, China Joint work with Peng Cui, Alex Beutel, Christos Faloutsos and Shiqiang Yang August 26, 2014 NYC, USA 2 Fraud Detection:

Defeating IMSI catchers CCS 2015 10-13-2015 Denver Fabian van den Broek, Roel Verdult and Joeri

Catching the Future Before it f Catches You A 2010 .edu survey Catching the Future Before

Explicit Locks Alma Orucevic-Alagic 2013-11-28 Synchronized Java incorporates a

SHOW ME THE MONEY! THE NEW ERA OF LIVE OTT. Sye has started the synchronized live OTT revolution.

TSMP Time Synchronized Mesh Protocol Seminar in Distributed Computing, FS 2010, ETH Zrich

Loosely Time-Synchronized Snapshots in Object-Based File Systems Jan Stender, Mikael Hgqvist,

BEHAVIOR @ HOME Behavior Basics Simple strategies that can make a big difference! Presented by

Catching Events in Video Streams Mohan M. Trivedi Computer Vision and Robotics Research

Datatracker Testing Making the users happy by catching bugs Contents How test coverage

Communication trade-offs for synchronized distributed SGD with large step size Aymeric DIEULEVEUT

APPLIED BEHAVIOR ANALYSIS Specialization Overview Agenda What is Applied Behavior Analysis

Structure of Talk Workload-sensitive Timing Behavior Anomaly Detection 1 Motivation in Large

GLIMPSES: GLIMPSES: Memory and program behavior GLIMPSES: GLIMPSES: Memory and program behavior

GLAST Large Area Telescope: GLAST Large Area Telescope: Gamma- -ray Large ray Large Gamma

on Synchronized Rhythms in Neuronal Networks with Inhibitory and Excitatory Populations S.-Y. Kim

Emergence of Sparsely Synchronized Rhythms and Their Responses to External Stimuli in An

TimingCamouflage: Improving Circuit Security against Counterfeiting by Unconventional Timing Grace

Budget Preparation and Development Objectives Participants will: Demonstrate

Quarterly Financial Update Through 6/30/2020 Presented to Pullman City Council July 28, 2020

Fisc Fiscal al Ye Year 2020 2020 21 21 and and Fisc Fiscal al Ye Year 2021 2021 22 22

COPE : : S PO POTTING M ONE ONEY L AU AUNDERING B AS ON G RAP ASED ED ON RAPHS Xiangfeng Li 1 ,

Graph Cut Frdo Durand MIT - EECS Thursday, October 29, 2009 Last Tuesday: optimization

Graph-based Segmentation Computer Vision CS 543 / ECE 549 University of Illinois Derek Hoiem

CMPSC443 - Introduction to Computer and Network Security Module: Provenance Professor Patrick

Sambuz

Useful Links

Newsletter

Mail Us