CSE 255 Lecture 3 Data Mining and Predictive Analytics Detecting Social Circles

Oct 03, 2023

CSE 255 – Lecture 3 Data Mining and Predictive Analytics Detecting Social Circles
Social circles
Communities in ego-networks “What are the interest groups or communities among my friends?” NIPS 2012, TKDD 2014 (w/ Leskovec)
Data Why are we friends (facebook)? 200,000 user profiles, in 5,000 hand-labeled communities (we also collect similar data from Google+ and twitter) Facebook app: http://snap.stanford.edu/socialcircles/
Statistics of social circles. Figure panels: disjoint communities (from Adamic & Glance, 2005); hierarchical communities (from Clauset et al., 2005)
Existing approach. Proposal: Edges are more likely between nodes that have many communities in common. Task: Identify communities that maximize the likelihood of the graph.
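The proposal on this slide can be sketched in a few lines. This is a minimal illustration under stated assumptions, not the lecture's actual model: it assumes every shared community independently generates an edge with probability `p_in`, on top of a small background probability `p_background`. The function names and both parameter values are hypothetical.

```python
import math
from itertools import combinations


def edge_probability(shared, p_in=0.3, p_background=0.01):
    """Edge probability for two nodes that share `shared` communities (assumed form)."""
    # No edge appears only if the background process and every shared
    # community all fail to generate one.
    return 1.0 - (1.0 - p_background) * (1.0 - p_in) ** shared


def graph_log_likelihood(nodes, edges, communities, p_in=0.3, p_background=0.01):
    """Log-likelihood of an undirected graph given a list of node sets."""
    edge_set = {frozenset(e) for e in edges}
    total = 0.0
    for u, v in combinations(nodes, 2):
        shared = sum(1 for c in communities if u in c and v in c)
        p = edge_probability(shared, p_in, p_background)
        if frozenset((u, v)) in edge_set:
            total += math.log(p)        # observed edge
        else:
            total += math.log(1.0 - p)  # observed non-edge
    return total
```

Under these assumptions, a community assignment that covers a densely linked group of nodes scores a higher log-likelihood than one that groups unconnected nodes, which is the quantity the task asks to maximize.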
Existing approach: (1) Edges belong inside communities. (2) Non-edges belong outside communities. Circles are highly connected people who also have common attributes. Q: Does this user belong in this circle? A: Yes, because they attended the same high school.
Constructing features from profiles: feature vector = [0,0,0,1,1,0,0,0,0,0,1,0,0,0,0,0,0,0,0,0,0,1,0,0,0,0]
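One way such a binary vector could be built is sketched below. The profile layout (a dict mapping field names to lists of values), the example users, and the function names are all assumptions made for illustration; the slide only shows the resulting vector.

```python
def build_vocabulary(profiles):
    """Collect every (field, value) pair seen in any profile, in a fixed order."""
    pairs = set()
    for profile in profiles.values():
        for field, values in profile.items():
            for value in values:
                pairs.add((field, value))
    return sorted(pairs)


def encode(profile, vocabulary):
    """Binary indicator vector: 1 where the profile has that (field, value) pair."""
    present = {(f, v) for f, vals in profile.items() for v in vals}
    return [1 if pair in present else 0 for pair in vocabulary]


def common_attributes(x, y):
    """Element-wise AND of two encoded profiles: the attributes both users share."""
    return [a & b for a, b in zip(x, y)]


# Hypothetical profiles, for illustration only.
profiles = {
    "alice": {"school": ["Stanford"], "employer": ["Google"]},
    "bob": {"school": ["Stanford"], "employer": ["Intel"]},
}
vocabulary = build_vocabulary(profiles)
alice = encode(profiles["alice"], vocabulary)
bob = encode(profiles["bob"], vocabulary)
shared = common_attributes(alice, bob)
```

Here `shared` marks only the position of the school both users attended, which is the kind of pairwise signal a circle model can weight.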
A better model. Proposal: Learn a similarity metric for each circle: which attributes do x and y have in common, and which attributes are relevant to circle k? Task: Reward edges for belonging to a circle only if they have the relevant attributes in common.
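A hedged sketch of what a per-circle similarity metric could look like: `phi` is a vector marking the attributes x and y have in common, `theta_k` is a weight vector saying which attributes matter for circle k, and their inner product is the similarity. Squashing the total score through a logistic function, and penalizing pairs outside a circle with a factor `alpha`, are assumptions made here for illustration; the slide does not spell out the exact form.

```python
import math


def circle_similarity(phi, theta_k):
    """Inner product of shared attributes with circle k's attribute weights."""
    return sum(p * t for p, t in zip(phi, theta_k))


def edge_probability(phi, thetas, in_circle, alpha=1.0):
    """Probability of an edge between x and y (assumed logistic form).

    in_circle[k] is True when both x and y belong to circle k.
    """
    score = 0.0
    for theta_k, both_inside in zip(thetas, in_circle):
        s = circle_similarity(phi, theta_k)
        # Reward relevant shared attributes inside the circle, penalize them outside.
        score += s if both_inside else -alpha * s
    return 1.0 / (1.0 + math.exp(-score))


# Hypothetical example: three attributes, the third being "same school".
phi = [0, 0, 1]
thetas = [[0.0, 0.0, 2.0],   # a "school friends" circle
          [1.5, 1.5, 0.0]]   # a "co-workers" circle
p_inside = edge_probability(phi, thetas, in_circle=[True, False])
p_outside = edge_probability(phi, thetas, in_circle=[False, False])
```

With these made-up weights, sharing a school raises the edge probability only when both users sit in the circle that cares about schools.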
Model fitting. Repeat steps (1) and (2) until convergence: Step 1: Find circles from circle parameters (solved via pseudo-boolean optimization). Step 2: Find circle parameters from circles (solved via gradient ascent using L-BFGS).
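The alternating scheme can be illustrated with a deliberately simplified sketch. It handles a single circle, and it swaps in much simpler solvers than the ones named on the slide: greedy membership flips stand in for pseudo-boolean optimization, and plain gradient ascent with step halving stands in for L-BFGS. The logistic edge model and all names are assumptions. `edges` and `phis` are keyed by pairs `(u, v)` in the same order that `combinations(nodes, 2)` yields them.

```python
import math
from itertools import combinations


def log_sigmoid(z):
    """Numerically stable log(1 / (1 + exp(-z)))."""
    return -math.log1p(math.exp(-z)) if z >= 0 else z - math.log1p(math.exp(z))


def pair_score(phi, theta, both_inside):
    s = sum(p * t for p, t in zip(phi, theta))
    return s if both_inside else -s


def log_likelihood(nodes, edges, phis, circle, theta):
    total = 0.0
    for u, v in combinations(nodes, 2):
        z = pair_score(phis[(u, v)], theta, u in circle and v in circle)
        total += log_sigmoid(z if (u, v) in edges else -z)
    return total


def update_circle(nodes, edges, phis, circle, theta):
    """Step 1 (greedy stand-in): flip one membership at a time while it helps."""
    circle = set(circle)
    best = log_likelihood(nodes, edges, phis, circle, theta)
    improved = True
    while improved:
        improved = False
        for node in nodes:
            candidate = circle ^ {node}
            ll = log_likelihood(nodes, edges, phis, candidate, theta)
            if ll > best + 1e-12:
                circle, best, improved = candidate, ll, True
    return circle


def update_theta(nodes, edges, phis, circle, theta, steps=50, rate=0.1):
    """Step 2 (stand-in for L-BFGS): gradient ascent, accepting only improving steps."""
    theta = list(theta)
    best = log_likelihood(nodes, edges, phis, circle, theta)
    for _ in range(steps):
        grad = [0.0] * len(theta)
        for u, v in combinations(nodes, 2):
            inside = u in circle and v in circle
            sign = 1.0 if inside else -1.0
            z = pair_score(phis[(u, v)], theta, inside)
            y = 1.0 if (u, v) in edges else 0.0
            residual = y - math.exp(log_sigmoid(z))  # d(log-likelihood)/dz
            for i, p in enumerate(phis[(u, v)]):
                grad[i] += residual * sign * p
        candidate = [t + rate * g for t, g in zip(theta, grad)]
        ll = log_likelihood(nodes, edges, phis, circle, candidate)
        if ll <= best:
            rate *= 0.5  # step too large: halve it and try again
            continue
        theta, best = candidate, ll
    return theta


def fit(nodes, edges, phis, theta, rounds=20, tol=1e-6):
    """Alternate steps 1 and 2 until the log-likelihood stops improving."""
    circle = set(nodes)
    previous = log_likelihood(nodes, edges, phis, circle, theta)
    for _ in range(rounds):
        circle = update_circle(nodes, edges, phis, circle, theta)
        theta = update_theta(nodes, edges, phis, circle, theta)
        current = log_likelihood(nodes, edges, phis, circle, theta)
        if current - previous < tol:
            break
        previous = current
    return circle, theta
```

Because each step only accepts changes that raise the log-likelihood, the objective never decreases from one round to the next, which is the property the "repeat until convergence" loop relies on. A greedy search like this can stop at a local optimum.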
Outcomes – applications (Goal 1). Circle prediction: 43% more accurate than alternatives on facebook (26% on Google+, 16% on twitter). Figure legend: blue/grey = true positive/negative; red/yellow = false positive/negative.
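The four colour categories in the legend correspond to the cells of a confusion matrix for one predicted circle. The sketch below counts them and computes a balanced error rate. Using that particular metric is an assumption for illustration, since the slide does not say how accuracy was measured, and the function names are hypothetical.

```python
def confusion_counts(users, true_circle, predicted_circle):
    """Count true/false positives/negatives for one predicted circle."""
    tp = sum(1 for u in users if u in true_circle and u in predicted_circle)
    fp = sum(1 for u in users if u not in true_circle and u in predicted_circle)
    fn = sum(1 for u in users if u in true_circle and u not in predicted_circle)
    tn = sum(1 for u in users if u not in true_circle and u not in predicted_circle)
    return tp, fp, fn, tn


def balanced_error_rate(tp, fp, fn, tn):
    """Average of the miss rate and the false-alarm rate (0 is perfect)."""
    miss_rate = fn / (tp + fn) if tp + fn else 0.0
    false_alarm_rate = fp / (fp + tn) if fp + tn else 0.0
    return 0.5 * (miss_rate + false_alarm_rate)
```

Averaging the two error rates keeps a small circle from looking well predicted just because most users are correctly left out of it.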
Outcomes – understanding (Goal 2) Circle recommendation: We also generate explanations as to why we recommended each circle to the user
Follow-up: scalability. Q: How can we handle attributes in million-node networks? A: Via a continuous relaxation with convex subproblems. We apply our model to large networks of Google+ users, flickr users, and Wikipedia articles. Figure: two “communities” of wikipedia pages on similar topics. ICDM 2013 (w/ Yang & Leskovec)
Follow-up: directed networks. Directed networks have different semantics than undirected networks and should be modeled differently: • twitter and Google+ communities are people with common followers • Applied to networks from other domains, e.g. PPI and predator-prey networks. (Photo courtesy of Hector Garcia Molina.) WSDM 2014 (w/ Yang & Leskovec)
Conclusion • Existing models tend to focus on graph topology (community detection) or on node features (clustering), but not how the two interact in concert • To detect social circles we need to use both – to find communities that are densely linked around particular attributes that are important to each user • Joint work with Jure Leskovec
