Inference of activity dates from a network of dated interactions

Fabrice Rossi & Pierre Latouche, SAMM EA 4543, JDS 2013

General setting

Decorated interaction networks
◮ interactions between “actors”
◮ each interaction is described by some characteristics
◮ multiple interactions between the same actors

Ancient notarial acts
◮ very precise recording of transactions about long-lasting goods (lands, houses, etc.)
◮ not so precise description of the persons involved in the transactions (e.g., only first names)

[Figure: example network with interactions dated 1318, 1345, 1370, and 1370]

Goal

Inference about actors
◮ propagate information associated with interactions to actors
◮ for instance, with notarial acts:
  ◮ dates of acts ⇒ living period
  ◮ geographical position of the goods ⇒ living area
  ◮ status in unbalanced interactions ⇒ social status

Timestamped interaction network
◮ temporal decoration: a time stamp is associated with each interaction
◮ the network may outlive the actors (notarial acts)
◮ estimate a central date of activity for each actor, based on the time stamps of its interactions
◮ an activity interval can be estimated in some situations

Local solution

Simple local solution
◮ “propagate” the characteristics associated with interactions to the actors
◮ summarize the data (if needed)

Activity date
◮ central actor: 1318, 1345, 1370, 1370, with an average of ∼ 1351
◮ other actors: their unique (or repeated) date

Drawbacks
◮ based only on local interactions, and not at all on the absence of interactions
◮ summarizes the characteristics but not the network
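The local solution can be sketched in a few lines of Python. This is a minimal illustration, not the authors' code: the function name `local_activity_dates`, the actor labels, and the star-shaped layout of the example are assumptions; only the four dates come from the slide.

```python
from collections import defaultdict
from statistics import mean


def local_activity_dates(edges):
    """Average the dates of each actor's own interactions.

    `edges` is an iterable of (actor_a, actor_b, date) triples.
    """
    dates = defaultdict(list)
    for a, b, date in edges:
        # Each interaction date is "propagated" to both of its actors.
        dates[a].append(date)
        dates[b].append(date)
    return {actor: mean(ds) for actor, ds in dates.items()}


# Hypothetical star layout: a central actor "c" involved in four dated acts.
edges = [
    ("c", "p1", 1318),
    ("c", "p2", 1345),
    ("c", "p3", 1370),
    ("c", "p4", 1370),
]
estimates = local_activity_dates(edges)
print(estimates["c"])  # average of the four dates
```

Each peripheral actor simply inherits the date of its single act, which illustrates the drawback listed above: nothing about the rest of the network enters the estimate.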

Global solution

Consistency hypotheses
◮ interaction characteristics are close to actor characteristics
◮ interactions happen preferably between actors who share similar characteristics

Generative approach
◮ actor $i$ has characteristics $Z_i \in \mathcal{Z}$ (dissimilarity space)
◮ $i \leftrightarrow j$ with some probability decreasing with $d(Z_i, Z_j)$
◮ if $i \leftrightarrow j$, then the decoration is generated
  ◮ “around” $Z_i$ and $Z_j$ (same space $\mathcal{Z}$)
  ◮ or at least in a way “consistent” with $Z_i$ and $Z_j$ (possibly in another space)

Technicalities (1/2)

General model (single interaction)
◮ data: $A$ adjacency matrix, $D$ decoration table
◮ parameters: $(Z_i)_{1 \le i \le N}$, $\theta$
◮ likelihood:

$$p(A, D \mid Z, \theta) = \prod_{i \neq j,\, A_{ij} = 0} P(A_{ij} = 0 \mid Z_i, Z_j, \theta) \times \prod_{i \neq j,\, A_{ij} = 1} P(A_{ij} = 1 \mid Z_i, Z_j, \theta)\, p(D_{ij} \mid A_{ij} = 1, Z_i, Z_j, \theta).$$

Numerical decorations
◮ logistic connection model (related to Hoff et al., 2002):

$$\log \frac{P(A_{ij} = 1 \mid Z_i, Z_j, \alpha, \beta)}{P(A_{ij} = 0 \mid Z_i, Z_j, \alpha, \beta)} = \alpha - \beta \lVert Z_i - Z_j \rVert_2,$$

◮ Gaussian decoration: $D_{ij} \mid Z_i, Z_j, \Sigma \sim \mathcal{N}\left(\frac{Z_i + Z_j}{2}, \Sigma\right).$
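As an illustration of the likelihood, here is a minimal Python sketch of its logarithm for scalar activity dates. It rests on several assumptions that are not stated on the slide: the network is treated as undirected (each unordered pair is counted once), the distance is read as the unsquared absolute difference (the garbled slide formula could also be read as a squared norm), and the names `softplus` and `log_likelihood` are illustrative.

```python
import math


def softplus(x):
    """Numerically stable log(1 + exp(x))."""
    return x + math.log1p(math.exp(-x)) if x > 0 else math.log1p(math.exp(x))


def log_likelihood(z, edges, alpha, beta, sigma):
    """Log-likelihood of a timestamped network for scalar dates.

    z     : list of activity dates Z_i
    edges : dict mapping a pair (i, j) with i < j to its date D_ij
    """
    total = 0.0
    n = len(z)
    for i in range(n):
        for j in range(i + 1, n):
            # Log-odds of a connection under the logistic model.
            logit = alpha - beta * abs(z[i] - z[j])
            if (i, j) in edges:
                # log P(A_ij = 1) plus the Gaussian log-density of the date.
                total -= softplus(-logit)
                resid = edges[(i, j)] - 0.5 * (z[i] + z[j])
                total -= 0.5 * math.log(2 * math.pi * sigma**2)
                total -= resid**2 / (2 * sigma**2)
            else:
                # log P(A_ij = 0): non-interactions also carry information.
                total -= softplus(logit)
    return total


# Toy check: dates consistent with the single observed act score higher.
edges = {(0, 1): 1305.0}
ll_good = log_likelihood([1300.0, 1310.0, 1390.0], edges, 1.0, 0.05, 20.0)
ll_bad = log_likelihood([1390.0, 1310.0, 1300.0], edges, 1.0, 0.05, 20.0)
```

The parameter values in the toy check are arbitrary; they only show that the function rewards actor dates lying close to the dates of their acts and far from actors they never interact with.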

Technicalities (2/2)

Logistic connection model
◮ connection probability: $P(A_{ij} = 1 \mid Z_i, Z_j, \alpha, \beta) = \dfrac{1}{1 + e^{\beta \lVert Z_i - Z_j \rVert_2 - \alpha}}$
◮ $\dfrac{1}{1 + e^{-\alpha}}$: maximal density of the interaction network
◮ $\dfrac{1}{\beta}$: interaction “radius”

Timestamps
◮ $Z_i \in \mathbb{R}$: (central) activity date, $D_{ij} \sim \mathcal{N}\left(\frac{Z_i + Z_j}{2}, \sigma^2\right)$
◮ $\dfrac{1}{\beta}$ and $\sigma$: lifespan of actors

Estimation
◮ here by maximum likelihood: an optimization problem that is neither convex nor concave, solved by standard techniques
◮ other techniques could be used
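The roles of α and β can be checked numerically. The sketch below assumes scalar dates and an unsquared distance; the function name `connection_probability` and the parameter values are hypothetical.

```python
import math


def connection_probability(z_i, z_j, alpha, beta):
    """P(A_ij = 1) under the logistic connection model."""
    return 1.0 / (1.0 + math.exp(beta * abs(z_i - z_j) - alpha))


alpha, beta = 1.0, 1.0 / 20.0  # illustrative values only

# Two actors with the same activity date: the probability reaches its maximum.
p_max = connection_probability(1300.0, 1300.0, alpha, beta)

# At a distance of alpha / beta the log-odds vanish.
p_half = connection_probability(1300.0, 1300.0 + alpha / beta, alpha, beta)

# Far beyond the "radius" 1 / beta, interactions become very unlikely.
p_far = connection_probability(1300.0, 1500.0, alpha, beta)
```

Here `p_max` equals 1 / (1 + e^{-α}), the maximal density quoted above, and the probability decays at a rate set by β as the dates move apart.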

Experiments

Validation of the model
◮ data generated according to the model
◮ realistic values for β and σ = 20 (lifespan ∼ 80)
◮ α varies to simulate different densities
◮ the $Z_i$ are uniformly distributed in [1200, 1400] (small networks with 100 agents)

Quality criterion
◮ mean squared error (MSE) between the true $Z_i$ and the estimated ones
◮ baseline: local average
◮ quality: reduction in MSE with respect to the baseline
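The experimental protocol can be sketched as follows. This is an illustration under stated assumptions, not the authors' simulation code: the function names are hypothetical, the default values of α and β are placeholders (the slide only gives σ = 20), the distance is taken as unsquared, and the improvement is assumed to be the baseline MSE minus the model MSE.

```python
import math
import random


def simulate_network(n=100, alpha=-2.0, beta=1 / 20, sigma=20.0, seed=0):
    """Draw activity dates, then edges and their dates, from the model."""
    rng = random.Random(seed)
    z = [rng.uniform(1200.0, 1400.0) for _ in range(n)]
    edges = {}
    for i in range(n):
        for j in range(i + 1, n):
            p = 1.0 / (1.0 + math.exp(beta * abs(z[i] - z[j]) - alpha))
            if rng.random() < p:
                # The act is dated around the midpoint of the two actors.
                edges[(i, j)] = rng.gauss(0.5 * (z[i] + z[j]), sigma)
    return z, edges


def mse(true_z, est_z):
    """MSE over the actors that received an estimate (keys of est_z)."""
    return sum((true_z[i] - est) ** 2 for i, est in est_z.items()) / len(est_z)


def mse_improvement(true_z, baseline, model):
    """Positive when the model beats the local-average baseline."""
    return mse(true_z, baseline) - mse(true_z, model)


z, edges = simulate_network(seed=1)
density = 2 * len(edges) / len(z)  # average number of edges per vertex
```

Varying `alpha` changes `density`, which is the quantity on the horizontal axis of the result plots.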

Results

[Figure "Noise free": MSE improvement versus average number of edges per vertex]

Summary
◮ roughly 2200 networks generated
◮ break-even at ∼ 1.3 interactions per actor
◮ (almost) systematic improvement above 2 interactions per actor
◮ some convergence issues (easy to spot)

Robustness
◮ very poor for low-density networks: below 1.1 interactions per actor, the $Z_i$ estimates are frequently very bad
◮ good with respect to misspecification of the date distribution, e.g. using a uniform date distribution rather than a Gaussian one (see the paper)

Noisy networks (1/2)

Imperfect data sets
◮ decorations are assumed to be exact or at least precise
◮ but they can be attached to the wrong pair of actors

Motivation
◮ notarial acts were exact when they were drawn up
◮ but we lack an accurate registry of the persons; in particular, many persons share the same name, and names are the only identifiers in the acts
◮ this leads to ambiguous assignment of persons to acts

Noisy networks (2/2)

Simulated by random rewiring
◮ generate a network
◮ select (randomly) an edge to rewire
◮ choose (randomly) a new endpoint for that edge
◮ keep the original date!
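The rewiring steps above can be sketched as follows. Several details are assumptions rather than facts from the slides: the kept endpoint is chosen at random, a rewired edge never duplicates an existing pair (consistent with the single-interaction model), and the name `rewire` is illustrative. The loop assumes a sparse network, so a free endpoint is always available.

```python
import random


def rewire(edges, n_actors, fraction, seed=0):
    """Rewire a fraction of the edges, keeping each original date.

    edges : dict mapping a pair (i, j) with i < j to its date
    """
    rng = random.Random(seed)
    noisy = dict(edges)
    n_rewire = round(fraction * len(edges))
    for i, j in rng.sample(sorted(edges), n_rewire):
        date = noisy.pop((i, j))      # the date travels with the edge
        keep = rng.choice((i, j))     # endpoint that stays attached
        while True:
            new_end = rng.randrange(n_actors)
            pair = (min(keep, new_end), max(keep, new_end))
            # Reject self-loops, existing pairs, and the original pair.
            if new_end != keep and pair not in noisy and pair != (i, j):
                break
        noisy[pair] = date
    return noisy


edges = {(0, 1): 1300.0, (1, 2): 1310.0, (2, 3): 1320.0, (3, 4): 1330.0}
noisy = rewire(edges, n_actors=10, fraction=0.5, seed=1)
```

Because the dates are untouched, a rewired act can now sit between two actors whose activity dates are far from it, which is exactly the kind of error the model has to withstand.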

Results

[Figure "Noise level: 5%": MSE improvement versus average number of edges per vertex]

Summary
◮ roughly 2200 networks generated, with 5% of the edges rewired
◮ break-even at ∼ 2.1 interactions per actor
◮ good behavior above 3 interactions per actor
◮ more convergence issues (easy to spot)

Robustness
◮ a low level of noise (e.g. 1%) has almost no effect on the estimation
◮ a high level of noise (10%) has strong adverse effects

Summary and conclusion

A generative model for decorated graphs
◮ introduces a way to “push” edge decorations to agents
◮ estimates characteristics that explain both the network and the decorations
◮ exhibits some robustness to misspecification

Future work
◮ real-world data
◮ mixture model: generative model + a noise component (ongoing work)
◮ more complex model: explains the network with the characteristics but also with some structural properties (e.g., block-model-like)
