Influence Maximization in Dynamic Social Networks Honglei Zhuang, - PowerPoint PPT Presentation

Influence Maximization in Dynamic Social Networks Honglei Zhuang, Yihan Sun, Jie Tang, Jialin Zhang, Xiaoming Sun

Influence ¡Maximiza.on Influence ¡threshold How ¡to ¡find ¡influen4al ¡ users ¡to ¡help ¡promote ¡a ¡ Probability ¡of ¡ B 0.5 new ¡product? influence A 0.8 C 0.1 0.5 Marketer ¡Alice ¡ 0.4 0.6 0.1 0.6 D E F 0.1 Find K nodes (users) in a social network that could maximize the spread of influence (Domingos, 01; Richardson, 02; Kempe, 03)

Influence ¡Maximiza.on • Problem [1] ¡ – Ini.ally ¡all ¡users ¡are ¡considered ¡inac.ve ¡ – Then ¡the ¡chosen ¡users ¡are ¡ac.vated, ¡who ¡may ¡ further ¡influence ¡their ¡friends ¡to ¡be ¡ac.ve ¡as ¡well ¡ • Models ¡ – Linear ¡Threshold ¡model ¡ – Independent ¡Cascading ¡model ¡ [1] ¡D. ¡Kempe, ¡J. ¡Kleinberg, ¡and ¡E. ¡Tardos. ¡Maximizing ¡the ¡spread ¡of ¡influence ¡through ¡a ¡social ¡network. ¡KDD’03, ¡pages ¡137–146, ¡2003.

Approximate ¡Solu.on • NP-‑hard ¡ [1] ¡ The ¡problem ¡is ¡solved ¡by ¡op.mizing ¡ – Linear ¡Threshold ¡Model ¡ a ¡monotonic ¡submodular ¡func.on 00 – Independent ¡Cascading ¡Model ¡ ¡ • Kempe ¡ Prove ¡that ¡approxima.on ¡algorithms ¡can ¡guarantee ¡that ¡the ¡ influence ¡spread ¡is ¡within(1-‑1/e) ¡of ¡the ¡op.mal ¡influence ¡spread. ¡ – Verify ¡that ¡the ¡two ¡models ¡can ¡outperform ¡the ¡tradi.onal ¡heuris.cs ¡ • Recent ¡research ¡focuses ¡on ¡the ¡efficiency ¡improvement ¡ – [2] ¡accelerate ¡the ¡influence ¡procedure ¡by ¡up ¡to ¡700 ¡.mes ¡ • It ¡is ¡s.ll ¡challenging ¡to ¡extend ¡these ¡methods ¡to ¡large ¡data ¡sets ¡ [1] ¡D. ¡Kempe, ¡J. ¡Kleinberg, ¡and ¡E. ¡Tardos. ¡Maximizing ¡the ¡spread ¡of ¡influence ¡through ¡a ¡social ¡network. ¡KDD’03, ¡pages ¡137–146, ¡2003. ¡ ¡ [2] ¡J. ¡Leskovec, ¡A. ¡Krause, ¡C. ¡Guestrin, ¡C. ¡Faloutsos, ¡J. ¡VanBriesen, ¡and ¡N. ¡Glance. ¡Cost-‑effec.ve ¡outbreak ¡detec.on ¡in ¡networks. ¡KDD’07, ¡pages ¡ 420–429, ¡2007. ¡

Influence ¡Maximiza.on ¡in ¡Dynamic ¡Networks ¡ t =0 t =1 Probe Evolve Original edges About ¡6 ¡million ¡links ¡changed ¡on ¡Weibo ¡network ¡ Added edges Removed edges Weibo ¡API ¡limita.on: ¡ ≤ 450 ¡.mes/hr ¡

Problem • Input: ¡For ¡a ¡dynamic ¡social ¡network ¡ { G 0 ,…, G t }, we ¡ have ¡observed ¡ G 0 , ¡but ¡for ¡all ¡ t >0, G t ¡is ¡unknown ¡ • Problem: ¡To ¡probe ¡ b ¡nodes, ¡observe ¡their ¡neighbors ¡to ¡ G t -1 / G 0 ˆ t ˆ obtain ¡an ¡observed ¡network ¡ ¡ ¡ ¡ ¡ ¡ ¡from ¡ ¡ ¡ ¡ ¡ ¡ ¡ ¡ ¡ ¡ ¡ ¡ ¡ ¡ ¡ ¡, ¡such ¡that ¡ G influence ¡maximiza.on ¡on ¡the ¡real ¡network ¡ ¡ ¡ ¡ ¡ ¡can ¡be ¡ t G approximated ¡by ¡that ¡on ¡the ¡observed ¡network. ¡ • Challenge: ¡How ¡to ¡find ¡the ¡ ¡ ¡ ¡influen.al ¡users, ¡if ¡we ¡only ¡ k par.ally ¡observe ¡the ¡update ¡of ¡the ¡social ¡network?

Basic ¡Idea ¡ • Es.mate ¡how ¡likely ¡the ¡neighborhood ¡of ¡a ¡ node ¡will ¡change ¡in ¡a ¡dynamic ¡social ¡network ¡ – Probe ¡nodes ¡that ¡change ¡a ¡lot ¡ • Es.mate ¡how ¡much ¡the ¡influence ¡spread ¡can ¡ be ¡improved ¡by ¡probing ¡a ¡node ¡ – Probe ¡the ¡one ¡maximizes ¡the ¡improvement ¡

Methodologies ¡and ¡Results

Preliminary ¡Theore.cal ¡Analysis ¡ • Formal ¡defini.on ¡of ¡loss ¡ Max ¡seed ¡set ¡on ¡fully ¡observed ¡network ( ) ( ) ⎡ ⎤ = − * * l E Q S Q T ⎣ ⎦ ˆ G G | Max ¡seed ¡set ¡on ¡par4ally ¡observed ¡network • With ¡an ¡specified ¡evolving ¡graph ¡model ¡ – At ¡each ¡.me ¡stamp ¡an ¡edge ¡is ¡chosen ¡uniformly ¡ – and ¡its ¡head ¡will ¡point ¡to ¡a ¡node ¡randomly ¡chosen ¡ with ¡probability ¡propor.onal ¡to ¡the ¡in-‑degree ¡

Preliminary ¡Theore.cal ¡Analysis ¡ • Error ¡bound ¡of ¡Random ¡probing ¡strategy ¡ • Error ¡bound ¡of ¡Degree ¡weighted ¡probing ¡strategy ¡ • In ¡most ¡cases, ¡degree ¡weighted ¡probing ¡strategy ¡ performs ¡beier ¡than ¡random ¡probing ¡strategy ¡

Maximum ¡Gap ¡Probing ¡ • Basic ¡Idea ¡ – Es.mate ¡how ¡much ¡the ¡influence ¡spread ¡can ¡be ¡ improved ¡by ¡probing ¡a ¡node ¡ – Probe ¡the ¡one ¡which ¡maximizes ¡the ¡improvement ¡ • Formally, ¡ ε – For ¡a ¡given ¡tolerance ¡probability ¡ ¡ – The ¡minimum ¡value ¡ ¡ ¡ ¡ ¡ ¡that ¡sa.sfies ¡the ¡following ¡ β ( ) inequality ¡is ¡defined ¡as ¡performance ¡gap ¡ β v ( ) ( ) ( ) ˆ ˆ ⎡ ⎤ − ≥ β ≤ ε P Q S ' v Q S ⎣ ⎦ v o v o Best ¡solu4on ¡ Best ¡solu4on ¡ if ¡ v ¡is ¡probed before ¡probing *To ¡simplify ¡problem, ¡define ¡the ¡quality ¡func.on ¡as ¡the ¡sum ¡of ¡degree ¡in ¡the ¡seed ¡set. ¡

Maximum ¡Gap ¡Probing ¡ • Assume ¡the ¡degree ¡of ¡a ¡node ¡is ¡a ¡mar.ngale. ¡We ¡can ¡ es.mate ¡the ¡degree ¡gap ¡of ¡each ¡node ¡by ¡ ( ) ( ) ⎡ ⎤ − − ≥ − ε ≤ ε t t c P d v d v 2 c ln v ⎣ ⎦ v Last ¡4me ¡when ¡v ¡is ¡probed Defined ¡as ¡ z v • Considering ¡the ¡node ¡to ¡probe ¡is ¡in/not ¡in ¡the ¡current ¡seed ¡ set. ¡ ¡ { } ⎧ ( ) ( ) ˆ ˆ + − ∉ max 0, d v z min d w , v S ⎪ ⎪ v O ∈ ( ) w S β = ⎨ o v { } ( ) ( ) ⎪ ˆ ˆ − + ∈ max 0,max d u d v z , v S ⎪ v O ⎩ ∉ u S o ( ) β v • Each ¡.me, ¡choose ¡the ¡one ¡with ¡maximum ¡gap ¡ ¡ ¡ ¡ ¡ ¡ ¡ ¡ ¡ ¡ ¡ ¡to ¡ probe ¡

MaxG ¡Algorithm Finding ¡nodes ¡to ¡probe ¡ by ¡maximizing ¡the ¡ degree ¡gap Perform ¡the ¡standard ¡ greedy ¡algorithm ¡ (degree ¡discount ¡ heuris.cs) ¡for ¡ influence ¡maximiza.on

Experiment ¡Setup ¡ ¡ • Data ¡sets ¡ Data ¡sets ¡ #Users ¡ #Rela4onships ¡ #Time ¡stamps ¡ Synthe.c ¡ 500 ¡ 12,475 ¡ 200 ¡ Twiier ¡ 18,089,810 ¡ 21,097,569 ¡ 10 ¡ Coauthor [1] ¡ ¡ 1,629,217 ¡ 2,623,832 ¡ 27 ¡ • Evalua.on ¡ – Take ¡op.mal ¡seed ¡set ¡ ¡ ¡ ¡ ¡ ¡obtained ¡from ¡ S ' par.ally ¡observed ¡network ¡ ¡ ¡ – Calculate ¡its ¡influence ¡spread ¡on ¡real ¡network ¡ [1] ¡hip://arnetminer.org/cita.on

Experiment ¡Setup ¡ • Comparing ¡methods ¡ – Rand, ¡Enum: ¡ Uniform ¡probing ¡ – Deg, ¡DegRR: ¡ Degree-‑weighted ¡probing ¡ – BEST: ¡ Suppose ¡network ¡dynamics ¡fully ¡observed ¡ • Configura.ons ¡ – Probing ¡budget: ¡ • b=1,5 ¡for ¡Synthe.c; ¡b=100,500 ¡for ¡Twiier ¡and ¡Coauthor ¡ – Seed ¡set ¡size ¡for ¡influence ¡maximiza.on: ¡ ¡ • k=30 ¡for ¡Synthe.c; ¡k=100 ¡for ¡Twiier ¡and ¡Coauthor ¡ – Independent ¡Cascade ¡Model, ¡with ¡uniform ¡p=0.01 ¡

Experimental ¡Results ¡ • Average ¡influence ¡spread ¡ b ¡ Rand ¡ Enum ¡ Deg ¡ DegRR ¡ MaxG ¡ BEST ¡ Data ¡Set ¡ 1 ¡ 13.83 ¡ 13.55 ¡ 13.78 ¡ 14.30 ¡ 14.79 ¡ 15.95 ¡ Synthe4c ¡ 5 ¡ 15.07 ¡ 15.33 ¡ 15.09 ¡ 15.40 ¡ 15.60 ¡ 100 ¡ 987.74 ¡ 987.62 ¡ 988.41 ¡ 1001.47 ¡ 1005.12 ¡ 1011.15 ¡ TwiRer ¡ 500 ¡ 987.45 ¡ 987.67 ¡ 988.36 ¡ 1006.38 ¡ 1010.61 ¡ 100 ¡ 20.34 ¡ 20.82 ¡ 28.67 ¡ 38.94 ¡ 45.51 ¡ 91.51 ¡ Coauthor ¡ 500 ¡ 20.35 ¡ 22.93 ¡ 44.27 ¡ 56.68 ¡ 61.74 ¡ The ¡large, ¡the ¡best

Influence ¡Maximiza.on ¡Results ¡( b =100) ¡ Twiier ¡ Coauthor ¡

Influence ¡Maximiza.on ¡Results ¡( b =500) ¡ Twiier ¡ Coauthor ¡

Conclusions

Influence Maximization in Dynamic Social Networks Honglei Zhuang, - PowerPoint PPT Presentation

Influence Maximization in Dynamic Social Networks Honglei Zhuang, Yihan Sun, Jie Tang, Jialin Zhang, Xiaoming Sun Influence Maximiza.on Influence threshold How to find influen4al users to help promote a

Presented by WAN, Pengfei Dept. ECE, HKUST Wei Chen, et al, Efficient Influence Maximization in

Submodular Maximization applied to Marketing Over Social Networks Vahab Mirrokni Google

On social influence, topics, and communities Francesco Bonchi www.francescobonchi.com Plan of

Submodular Maximization Seffi Naor Lecture 2 4th Cargese Workshop on Combinatorial Optimization

Submodular Maximization Seffi Naor Lecture 3 4th Cargese Workshop on Combinatorial Optimization

Expectation Maximization CMSC 691 UMBC Outline EM (Expectation Maximization) Basic idea Three

CMU 15-896 Social networks 2: Influence Maximization Teacher: Ariel Procaccia Motivation

Handout: Influence Maximization The study of social processes by which ideas and innovations

Social Media and Social Influence Nihar Shah Peter Tu # cs286r 7 November 2012 Nihar Shah

Media Diffusion: Cascading Behavior in Networks Epidemic Spread Influence Maximization 1

Media Cascading Behavior in Networks Epidemic Spread Influence Maximization Introduction

Scalable Influence Maximization for Prevalent Viral Marketing in Large-Scale Social Networks Wei

Social influence Conformity Informational influence Influence that produces conformity when a

Latent Variable Models and Expectation Maximization Oliver Schulte - CMPT 726 Bishop PRML Ch. 9

Maximization of Submodular Functions Seffi Naor Lecture 1 4th Cargese Workshop on Combinatorial

On the dual problem of utility maximization Yiqing LIN Joint work with L. GU and J. YANG

Dynamic Memory Review: automa3c variables Automa'c variable :

Treatment Regime (iii) 11/07/2007 Discussion for today * * ( ,

C ATCH T ARTAN : Representing and Summarizing Dynamic Multicontextual Behaviors Meng Jiang

What about other storage reclamation schemes? Memory management options Manual /explicit

Controlling Fairness and Bias in Dynamic Learning-to-Rank ACM SIGIR 2020 Marco Morik * ,

Dynamic programming Topics for this week: - backtracking revisited: a brute-force technique that

Rigid body dynamics Rigid body simulation Once we consider an object with spatial extent,

2 6 1 2 3 : black nodes 1 1 1 : red nodes What is the size of the root of the above