a versatile probabilistic programming framework for topic
play

A Versatile Probabilistic Programming Framework for Topic Models - PowerPoint PPT Presentation

Latent Topic Networks: A Versatile Probabilistic Programming Framework for Topic Models James Foulds Shachi Kumar Lise Getoor Jack Baskin School of Engineering University of California, Santa Cruz Probabilistic latent variable modeling Data


  1. Latent Topic Networks: A Versatile Probabilistic Programming Framework for Topic Models James Foulds Shachi Kumar Lise Getoor Jack Baskin School of Engineering University of California, Santa Cruz

  2. Probabilistic latent variable modeling Data Complicated, noisy, high-dimensional 2

  3. Probabilistic latent variable modeling Understand, Data explore, predict Complicated, noisy, high-dimensional 3

  4. Probabilistic latent variable modeling Understand, Data explore, predict Complicated, noisy, high-dimensional Latent variable model 4

  5. Probabilistic latent variable modeling Understand, Data explore, predict Low-dimensional, Complicated, noisy, semantically meaningful high-dimensional representations Latent variable model 5

  6. Topic models • Topic models are foundational building blocks for powerful latent variable models – Authorship (Rosen-Zvi et al., 2004) – Conversational Influence (Nguyen et al., 2014) – Knowledge base construction (Movshovitz-Attias and Cohen, 2015) – Machine translation (Mimno et al., 2009) – Political analysis (Grimmer, 2010), (Gerrish and Blei, 2011, 2012) – Recommender systems (Wang and Blei, 2011), (Diao et al., 2014) – Scientific impact (Dietz et al. 2007), (Foulds and Smyth, 2013) – Social network analysis (Chang et al., 2009) – Word-sense disambiguation (Boyd-Graber et al., 2007) – … 6

  7. Custom topic models • Custom latent variable topic models useful for data mining and computational social science • The challenge is scalability 7

  8. Custom topic models • Custom latent variable topic models useful for data mining and computational social science • The challenge is scalability 8

  9. Custom topic models • Custom latent variable topic models useful for data mining and computational social science • The challenge is scalability Sparse, stochastic, collapsed, distributed algorithms, … 9

  10. Custom topic models • Custom latent variable topic models useful for data mining and computational social science • The challenge is scalability Sparse, stochastic, collapsed, distributed algorithms, … There’s no end to speeding up LDA! Max Welling 10

  11. Custom topic models • Custom latent variable topic models useful for data mining and computational social science • The bottleneck is human effort and expertise Design time >> run time 11

  12. Custom topic models Understand, Data explore, predict Low-dimensional, Complicated, noisy, semantically meaningful high-dimensional representations Latent variable model 12

  13. Custom topic models Understand, Data explore, predict Low-dimensional, Complicated, noisy, semantically meaningful high-dimensional representations Latent variable model 13

  14. Custom topic models Understand, Data explore, predict Low-dimensional, Complicated, noisy, semantically meaningful high-dimensional representations Latent variable (Algorithm, model) pair model carefully co-designed for tractability 14

  15. Custom topic models Evaluate, Understand, iterate Data explore, predict Low-dimensional, Complicated, noisy, semantically meaningful high-dimensional representations Latent variable (Algorithm, model) pair model carefully co-designed for tractability 15

  16. Custom topic models Evaluate, Understand, iterate Data explore, predict Low-dimensional, Complicated, noisy, semantically meaningful high-dimensional representations General-purpose modeling framework 16

  17. Our contribution • We introduce latent topic networks – A versatile, general-purpose framework for specifying custom topic models – Models and domain knowledge specified using a simple logical probabilistic programming language – A highly parallelizable EM training algorithm 17

  18. Our contribution • We introduce latent topic networks – A versatile, general-purpose framework for specifying custom topic models – Models and domain knowledge specified using a simple logical probabilistic programming language – A highly parallelizable EM training algorithm 18

  19. Our contribution • We introduce latent topic networks – A versatile, general-purpose framework for specifying custom topic models – Models and domain knowledge specified using a simple logical probabilistic programming language – A highly parallelizable EM training algorithm 19

  20. Latent topic networks 𝛊 Z LDA likelihood 𝚾 W

  21. Latent topic networks Networks of dependencies between topics, distributions over topics 𝛊 𝛊 𝛊 Z LDA likelihood 𝚾 𝚾 𝚾 W 𝚾

  22. Latent topic networks Networks of dependencies between topics, Observed covariates distributions over topics X 𝛊 𝛊 𝛊 X Z LDA likelihood 𝚾 𝚾 𝚾 W 𝚾

  23. Latent topic networks Networks of dependencies between topics, Observed covariates distributions over topics X 𝛊 𝛊 Labeled data 𝛊 Y Y X Z LDA likelihood 𝚾 𝚾 𝚾 W 𝚾

  24. Latent topic networks Networks of dependencies between topics, Observed covariates distributions over topics X 𝛊 𝛊 Labeled data 𝛊 Y Z Latent variables Y X Z LDA likelihood 𝚾 Z 𝚾 𝚾 W 𝚾

  25. Previously… + = Grad student ≈6 months Topic modeling research paper 25

  26. Previously… + = Grad student ≈6 months Topic modeling research paper 26

  27. Previously… + = Grad student ≈6 months Topic modeling research paper 27

  28. Previously… + = Grad student ≈6 months Topic modeling research paper 28

  29. Previously… + = Grad student ≈6 months Topic modeling research paper 29

  30. Latent topic networks + = Grad student ≈6 months New custom topic model 1 weekend Shachi Kumar Master’s student, UCSC 30

  31. Related work Correlations / Observed Additional Constraints Probabilistic Dependencies Covariates Latent Variables Programming Systems for Encoding Domain Knowledge, Covariates, and Correlations      CTM (Blei and Lafferty, 2007)      DMR (Mimno & McCallum, 2008)      Dirichlet Forests (Andzejewski et al., 2009      xLDA (Wahabzada et al., 2010)      SAGE (Eisenstein et al., 2011)      STM (Roberts et al., 2013) Graphical Modeling and Probabilistic Programming Systems      CTRF (Zhu & Xing, 2010)      Fold.all (Andrzejewski et al., 2011)      Logic LDA (Mei et al., 2014)      Latent Topic Networks 31

  32. Related work Correlations / Observed Additional Constraints Probabilistic Dependencies Covariates Latent Variables Programming Systems for Encoding Domain Knowledge, Covariates, and Correlations      CTM (Blei and Lafferty, 2007)      DMR (Mimno & McCallum, 2008)      Dirichlet Forests (Andzejewski et al., 2009      xLDA (Wahabzada et al., 2010)      SAGE (Eisenstein et al., 2011)      STM (Roberts et al., 2013) Graphical Modeling and Probabilistic Programming Systems      CTRF (Zhu & Xing, 2010)      Fold.all (Andrzejewski et al., 2011)      Logic LDA (Mei et al., 2014)      Latent Topic Networks 32

  33. Example: modeling influence in citation networks 33 Foulds and Smyth (2013), EMNLP

  34. Example: modeling influence in citation networks Which are the most important articles? 34 Foulds and Smyth (2013), EMNLP

  35. Example: modeling influence in citation networks What are the influence relationships between articles? 35 Foulds and Smyth (2013), EMNLP

  36. Topical influence regression Latent variables for document influence citation edge influence 36 Foulds and Smyth (2013), EMNLP

  37. Topical influence regression Latent variables for document influence citation edge influence Probabilistic dependencies along the citation graph 37 Foulds and Smyth (2013), EMNLP

  38. Encoding dependencies via logical rules Citing document also has the topic Restrict dependencies Influence and topic to citation graph value are both high 38

  39. Encoding dependencies via logical rules Citing document also has the topic Restrict dependencies Influence and topic to citation graph value are both high 39

  40. Encoding dependencies via logical rules Citing document also has the topic Restrict dependencies Influence and topic to citation graph are both high 40

  41. Encoding dependencies via logical rules Citing document also has the topic Restrict dependencies Influence and topic to citation graph are both high 41

  42. Encoding dependencies via logical rules Citing document also has the topic Restrict dependencies Influence and topic to citation graph are both high Entire model with just 5 rules! 42

  43. Statistical relational learning • An “ interface layer for AI .” – Programming languages for specifying models and encoding domain knowledge – Typically based on first-order logic 43

  44. Probabilistic soft logic (PSL) • A first-order logic-based SRL language 5.0: Logical operators Predicate Rule weight Continuous random variables! • Used to specify hinge-loss MRFs, a class of highly scalable continuous graphical models 44

Recommend


More recommend