Probabilistic Causal Analysis of Social Influence F. Bonchi 1 F. - PowerPoint PPT Presentation

Introduction Background Problem 1 Problem 2 Experiments Probabilistic Causal Analysis of Social Influence F. Bonchi 1 F. Gullo 2 B. Mishra 3 D. Ramazzotti 4 1 ISI Foundation, Italy and Eurecat, Spain, francesco.bonchi@isi.it 2 UniCredit, R&D Dept., Italy, gullof@acm.org 3 New York University, NY, USA, mishra@nyu.edu 4 Stanford University, CA, USA, daniele.ramazzotti@stanford.edu The 27th ACM International Conference on Information and Knowledge Management (CIKM 2018) October 22-26, 2018 Turin, Italy F. Bonchi, F. Gullo, B. Mishra, D. Ramazzotti Probabilistic Causal Analysis of Social Influence

Introduction Background Motivation Problem 1 Challenges and contributions Problem 2 Outiline Experiments Motivation Social influence : process motivating the actions of a user to induce similar actions from her peers Mastering the dynamics of social influence is crucial for a variety of applications e.g., viral marketing, trust-propagation analysis, personalization, feed ranking, information-propagation analysis Prior work: Estimating the strength of influence in a social network Empirically analyzing the effects of social influence Distinguishing genuine social influence from homophily and other external factors Social influence is a genuine causal process : there is no principled causal-theory-based approach to learn social influence from empirical information-propagation data We fill this gap! F. Bonchi, F. Gullo, B. Mishra, D. Ramazzotti Probabilistic Causal Analysis of Social Influence

Introduction Background Motivation Problem 1 Challenges and contributions Problem 2 Outiline Experiments Challenges and Contributions We devise a principled causal approach to infer social influence from a database of propagation traces Based on Suppes ’ theory of probabilistic causation Output: a set of causal DAGs describing social influence Different DAGs ⇒ different communities, different topics Major challenges: Simpson’s paradox Genuine vs. spurious causes Proposal : a two-step methodology I step : partitioning the input propagation traces, to get rid of Simpson’s paradox II step : inferring minimal causal topology (via MLE), to get rid of spurious causes F. Bonchi, F. Gullo, B. Mishra, D. Ramazzotti Probabilistic Causal Analysis of Social Influence

Introduction Background Motivation Problem 1 Challenges and contributions Problem 2 Outiline Experiments Outline Introduction: motivation, challenges, contributions Background information-propagation traces, hierarchical structure, Suppes’ theory General (twofold) problem statement Problem 1: partitioning the propagation set Problem definition Algorithms Problem 2: learning a minimal causal topology Experiments F. Bonchi, F. Gullo, B. Mishra, D. Ramazzotti Probabilistic Causal Analysis of Social Influence

Introduction Background Input data Problem 1 Hierarchical structure Problem 2 Suppes’ theory Experiments Outline Introduction: motivation, challenges, contributions Background information-propagation traces, hierarchical structure, Suppes’ theory General (twofold) problem statement Problem 1: partitioning the propagation set Problem definition Algorithms Problem 2: learning a minimal causal topology Experiments F. Bonchi, F. Gullo, B. Mishra, D. Ramazzotti Probabilistic Causal Analysis of Social Influence

Introduction Background Input data Problem 1 Hierarchical structure Problem 2 Suppes’ theory Experiments Input data A (directed) social graph G = ( V , A ) A set E of entities A set O of observations Triples � v , φ, t � , where v ∈ V , φ ∈ E , t ∈ N + � v , φ, t � ∈ O means: entity φ is observed at node v at time t Entities cannot be observed multiple times at the same node Example: G : social network (follower-followee relations) E : pieces of multimedia content (posts, photos, videos) � v , φ, t � ∈ O : multimedia item φ enjoyed by user v at time t F. Bonchi, F. Gullo, B. Mishra, D. Ramazzotti Probabilistic Causal Analysis of Social Influence

Introduction Background Input data Problem 1 Hierarchical structure Problem 2 Suppes’ theory Experiments Input data: information-propagation traces Observations O can alternatively be viewed as a database D of propagation traces , i.e., traces left by entities “flowing” over G Propagation trace of an entity φ : all observations {� v , φ ′ , t � ∈ O | φ ′ = φ } involving φ O ⇔ D = { D φ | φ ∈ E} of directed acyclic graphs ( dag s) D φ = ( V φ , A φ ) V φ = { v ∈ V | � v , φ, t � ∈ O } A φ = { ( u , v ) ∈ A | � u , φ, t u � ∈ O , � v , φ, t v � ∈ O , t u < t v } No cycles in D φ ∈ D due to time irreversibility All propagations started at time 0 by a dummy node Ω / ∈ V F. Bonchi, F. Gullo, B. Mishra, D. Ramazzotti Probabilistic Causal Analysis of Social Influence

Introduction Background Input data Problem 1 Hierarchical structure Problem 2 Suppes’ theory Experiments Input data: example G D φ 1 D v φ t Ω v 2 v 1 v 3 Ω φ 1 0 v 2 v 3 2 v 2 φ 1 4 v 3 φ 1 v 4 5 v 4 φ 1 v 4 v 5 φ 1 7 Ω φ 2 0 v 5 v 6 v 5 v 7 v 2 φ 2 1 v 1 v 1 φ 2 3 φ 2 6 v 5 v 1 Ω Ω 7 v 2 v 7 φ 2 8 v 6 φ 2 v 2 9 v 4 v 3 φ 2 v 3 Ω φ 3 0 v 1 φ 3 1 v 7 v 2 φ 3 3 v 6 5 v 6 v 6 φ 3 v 7 v 5 7 v 7 φ 3 8 v 4 φ 3 D φ 2 D φ 3 F. Bonchi, F. Gullo, B. Mishra, D. Ramazzotti Probabilistic Causal Analysis of Social Influence

Introduction Background Input data Problem 1 Hierarchical structure Problem 2 Suppes’ theory Experiments Hierarchical structure Gupte et al. , “Finding hierarchy in directed online social networks” , WWW 2011 Notion of agony to reconstruct a proper hierarchical structure of a graph Ranking r : V → N r ( u ) < r ( v ) means u is “higher” in the hierarchy than v i.e., the smaller r ( u ) is, the more u is an “early-adopter” r ( u ) < r ( v ) ⇒ u → v is expected ⇒ no “ social agony ” r ( u ) ≥ r ( v ) ⇒ u → v leads to agony: u has a higher-ranked follower Given a graph G = ( V , A ) and a ranking r : agony of arc ( u , v ): max { r ( u ) − r ( v ) + 1 , 0 } agony of G : a ( G , r ) = � ( u , v ) ∈ A max { r ( u ) − r ( v ) + 1 , 0 } If r is not provided: look for a ranking minimizing the agony of G Agony of a graph G is ultimately computed as a ( G ) = min r a ( G , r ) it takes O ( | A | 2 ) time [Tatti, ECML PKDD 2014] F. Bonchi, F. Gullo, B. Mishra, D. Ramazzotti Probabilistic Causal Analysis of Social Influence

Introduction Background Input data Problem 1 Hierarchical structure Problem 2 Suppes’ theory Experiments Hierarchical structure: example D φ 1 D φ 2 Ω dag s exhibit no agony (just take temporal ordering as a ranking, i.e., r ( u ) = t u ) Merging dag s may lead to non-zero agony E.g., a k -length cycle (non-overlapping with other cycles) has agony equal to k v 1 Ω Minimum-agony ranking for D φ 1 ∪ D φ 2 : v 2 v 4 ( v 2 :0)( v 1 :1)( v 4 :2)( v 5 :3)( v 7 :4)( v 6 :5)( v 3 :6) v 3 No agony on all arcs but v 3 → v 4 Agony on v 3 → v 4 = length of cycle v 6 v 7 passing through v 3 and v 4 = 5 v 5 D φ 1 ∪ D φ 2 F. Bonchi, F. Gullo, B. Mishra, D. Ramazzotti Probabilistic Causal Analysis of Social Influence

Introduction Background Input data Problem 1 Hierarchical structure Problem 2 Suppes’ theory Experiments Suppes’ probabilistic causation theory Definition (Prima facie causes [Suppes, 1970]) For any two events c ( cause ) and e ( effect ), occurring respectively at times t c and t e , under the mild assumption that the probabilities P ( c ) and P ( e ) of the two events satisfy the condition 0 < P ( c ) , P ( e ) < 1, the event c is called a prima facie cause of the event e if it occurs before e and raises the probability of e , i.e., t c < t e ∧ P ( e | c ) > P ( e | c ). Pros: Principled causal theory Well-established practical effectiveness Computationally light (much lighter than other theories, e.g., Judea Pearl’s one) Cons: No notion of spatial proximity Prima facie causes may be genuine or spurious : the latter is undesirable F. Bonchi, F. Gullo, B. Mishra, D. Ramazzotti Probabilistic Causal Analysis of Social Influence

Introduction Background Input data Problem 1 Hierarchical structure Problem 2 Suppes’ theory Experiments Outline Introduction: motivation, challenges, contributions Background information-propagation traces, hierarchical structure, Suppes’ theory General (twofold) problem statement Problem 1: partitioning the propagation set Problem definition Algorithms Problem 2: learning a minimal causal topology Experiments F. Bonchi, F. Gullo, B. Mishra, D. Ramazzotti Probabilistic Causal Analysis of Social Influence

Introduction Background Input data Problem 1 Hierarchical structure Problem 2 Suppes’ theory Experiments General problem statement Main general goal Given a database of propagation traces , derive a set of causal DAGs that are well-representative of the social-influence dynamics underlying the input propagations Desiderata: Get rid of Simpson’s paradox 1 if the input data spans multiple causal processes, causal claims may be hidden or misinterpreted Overcome Suppes’ theory cons (especially the spurious-cause one) 2 We formulate and solve two problems: Agony-bounded Partitioning , a combinatorial-optimization problem, for Desideratum 1 Minimal Causal Topology , a learning problem, for Desideratum 2 F. Bonchi, F. Gullo, B. Mishra, D. Ramazzotti Probabilistic Causal Analysis of Social Influence

Probabilistic Causal Analysis of Social Influence F. Bonchi 1 F. - PowerPoint PPT Presentation

Introduction Background Problem 1 Problem 2 Experiments Probabilistic Causal Analysis of Social Influence F. Bonchi 1 F. Gullo 2 B. Mishra 3 D. Ramazzotti 4 1 ISI Foundation, Italy and Eurecat, Spain, francesco.bonchi@isi.it 2 UniCredit,

Causal Effect Evaluation and Causal Network Learning Zhi Geng Peking University, China June

Political Science 209 - Fall 2018 Causal Inference Florian Hollenbach 7th September 2018 Causal

Foundations of Causal Discovery Frederick Eberhardt KDD Causality Workshop 2016 Causal Discovery

Causal Inference By: Miguel A. Hern an and James M. Robins Part I: Causal inference without

Causal Programming Causal Programming Joshua Brul Joshua Brul

Few-shot Domain Adaptation 1/12 by Causal Mechanism Transfer Domain adaptation Causal mechanism

Causal Discovery from Observational Data Brady Neal causalcourse.com What if we dont have

Probabilistic model Probabilistic model c Probabilistic model Probabilistic model c c

Social Media and Social Influence Nihar Shah Peter Tu # cs286r 7 November 2012 Nihar Shah

Table of Contents I Probabilistic Reasoning Classical Probabilistic Models Basic Probabilistic

Social influence Conformity Informational influence Influence that produces conformity when a

Causal Realism and Causal Analysis in the Social Sciences Presentation by Daniel Little

Introduction to Causal Inference Lan Liu University of Minnesota at Twin Cities liux3771@umn.edu

Week 5 Video 2 Relationship Mining Causal Mining Causal Data Mining These slides developed in

A Brief Introduction to Causal Inference Brady Neal causalcourse.com What is causal inference?

Randomized Experiments The goal of randomized experiments is to identify The causal

http://cs224w.stanford.edu We are more influenced by our friends than strangers 68% of

TYPE INFERENCE Franois Pottier The Programming Languages Mentoring Workshop @ ICFP August 30,

Search and Inference in AI Planning Hctor Geffner ICREA & Universitat Pompeu Fabra

Graphical-model based estimation and inference for differential privacy Ryan McKenna 1 , Daniel

barnacle Chthalamus stellatus . Lotka Volterra Competition Alfred J. Lotka (1880 1949),

Influence of Array Storage and Access Methods on Performance of Multi-Dimensional Arrays Used in

Bandits Under the Influence Silviu Maniu, Stratis Ioannidis, Bogdan Cautis Universit

Influences of an interdisciplinary global health program on cultural awareness and future global

Probabilistic Causal Analysis of Social Influence F. Bonchi 1 F. - PowerPoint PPT Presentation

Introduction Background Problem 1 Problem 2 Experiments Probabilistic Causal Analysis of Social Influence F. Bonchi 1 F. Gullo 2 B. Mishra 3 D. Ramazzotti 4 1 ISI Foundation, Italy and Eurecat, Spain, francesco.bonchi@isi.it 2 UniCredit,

Causal Effect Evaluation and Causal Network Learning Zhi Geng Peking University, China June

Political Science 209 - Fall 2018 Causal Inference Florian Hollenbach 7th September 2018 Causal

Foundations of Causal Discovery Frederick Eberhardt KDD Causality Workshop 2016 Causal Discovery

Causal Inference By: Miguel A. Hern an and James M. Robins Part I: Causal inference without

Causal Programming Causal Programming Joshua Brul Joshua Brul

Few-shot Domain Adaptation 1/12 by Causal Mechanism Transfer Domain adaptation Causal mechanism

Causal Discovery from Observational Data Brady Neal causalcourse.com What if we dont have

Probabilistic model Probabilistic model c Probabilistic model Probabilistic model c c

Social Media and Social Influence Nihar Shah Peter Tu # cs286r 7 November 2012 Nihar Shah

Table of Contents I Probabilistic Reasoning Classical Probabilistic Models Basic Probabilistic

Social influence Conformity Informational influence Influence that produces conformity when a

Causal Realism and Causal Analysis in the Social Sciences Presentation by Daniel Little

Introduction to Causal Inference Lan Liu University of Minnesota at Twin Cities liux3771@umn.edu

Week 5 Video 2 Relationship Mining Causal Mining Causal Data Mining These slides developed in

A Brief Introduction to Causal Inference Brady Neal causalcourse.com What is causal inference?

Randomized Experiments The goal of randomized experiments is to identify The causal

http://cs224w.stanford.edu We are more influenced by our friends than strangers 68% of

TYPE INFERENCE Franois Pottier The Programming Languages Mentoring Workshop @ ICFP August 30,

Search and Inference in AI Planning Hctor Geffner ICREA &amp; Universitat Pompeu Fabra

Graphical-model based estimation and inference for differential privacy Ryan McKenna 1 , Daniel

barnacle Chthalamus stellatus . Lotka Volterra Competition Alfred J. Lotka (1880 1949),

Influence of Array Storage and Access Methods on Performance of Multi-Dimensional Arrays Used in

Bandits Under the Influence Silviu Maniu, Stratis Ioannidis, Bogdan Cautis Universit

Influences of an interdisciplinary global health program on cultural awareness and future global

Search and Inference in AI Planning Hctor Geffner ICREA & Universitat Pompeu Fabra