Exploiting compositionality to explore a large space of model structures
R. Grosse, R. Salakhutdinov, W. Freeman, & J. Tenenbaum
Best Student Paper at UAI 2012
Jan Gasthaus, Tea talk, 31st Aug 2012
Motivation

Goal: given a data set, determine the right model to use for it.

Ideal approach:
◮ Implement all models ever published
◮ Fit them to the data set
◮ Compare them using some model selection criterion and pick the best

This is mainly a computational problem. Proposed solution:
◮ Pick a rich class of models: matrix decomposition models
◮ Fit more complex models by re-using computations from simpler ones
◮ Use an approximate model selection criterion
◮ Explore the space of structures with a greedy heuristic that exploits compositionality
In A Nutshell

A grammar of generative models for matrix factorization:
◮ Express models as algebraic expressions such as MG + G
◮ Devise a context-free grammar (CFG) that generates these expressions, with production rules like G → GG + G

Search over model structures greedily by applying the production rules, scoring candidates with an approximate lower bound on the model score.

Initialize sampling in each model using a specialized algorithm for each production rule.
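The grammar idea can be sketched as string rewriting: repeatedly substitute a production's right-hand side for a nonterminal in the current expression. This is a minimal illustration; the rule set below is a small stand-in, not the paper's full grammar.

```python
# Minimal sketch of the model grammar: expand expressions by applying
# production rules to the nonterminal G. The rule set is illustrative,
# not the paper's full grammar.
RULES = {
    "G": ["GG+G",   # low-rank factorization
          "MG+G",   # clustering
          "BG+G"],  # binary latent features
}

def expand(expr):
    """Return all expressions obtained by one rule application."""
    out = []
    for i, sym in enumerate(expr):
        for rhs in RULES.get(sym, []):
            out.append(expr[:i] + "(" + rhs + ")" + expr[i + 1:])
    return out

print(expand("G"))  # → ['(GG+G)', '(MG+G)', '(BG+G)']
```

Applying `expand` again to any of these outputs yields the next layer of compound structures, e.g. `((GG+G)G+G)`, which is how the search space grows compositionally.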
Components

Grammar

Models
Inference: Individual Models

◮ Initialize the state using a one-shot algorithm for each rule application
◮ Latent dimensionality is determined during initialization using Bayesian nonparametrics (BNP)
◮ Then run a simple Gibbs sampler (no details provided in the paper)
Initialization
Scoring Candidate Structures

Criterion used: predictive likelihood of held-out rows and columns
◮ Marginal likelihood is not feasible to compute
◮ MSE is not selective enough

Use a (stochastic) lower bound on predictive likelihood, computed using a variational approximation combined with annealed importance sampling (this is about as much detail as the paper gives).
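To give a flavour of the annealed importance sampling (AIS) ingredient, here is a toy sketch that estimates the log normalizer of an unnormalized 1D Gaussian by annealing from a standard normal. It is illustrative only; the paper uses AIS inside a variational scheme to lower-bound predictive likelihood, and all distributions here are my own choices.

```python
import numpy as np

rng = np.random.default_rng(0)

def log_prior(x):            # N(0, 1), normalized
    return -0.5 * x**2 - 0.5 * np.log(2 * np.pi)

def log_target_unnorm(x):    # unnormalized N(2, 0.5^2); true log Z ~ 0.226
    return -0.5 * ((x - 2.0) / 0.5) ** 2

def ais_log_z(n_samples=500, n_steps=100):
    """Stochastic AIS estimate of log Z of the unnormalized target."""
    betas = np.linspace(0.0, 1.0, n_steps + 1)
    x = rng.standard_normal(n_samples)
    log_w = np.zeros(n_samples)
    for b0, b1 in zip(betas[:-1], betas[1:]):
        # Accumulate importance weights between adjacent temperatures
        log_w += (b1 - b0) * (log_target_unnorm(x) - log_prior(x))
        # One Metropolis step targeting the intermediate distribution
        def log_p(y):
            return (1 - b1) * log_prior(y) + b1 * log_target_unnorm(y)
        prop = x + 0.5 * rng.standard_normal(n_samples)
        accept = np.log(rng.random(n_samples)) < log_p(prop) - log_p(x)
        x = np.where(accept, prop, x)
    # log-mean-exp of the weights gives the (stochastic) estimate
    return np.logaddexp.reduce(log_w) - np.log(n_samples)

print(ais_log_z())
```

The estimate is stochastic but concentrates near the true value as the number of intermediate temperatures grows; in the paper this stochastic estimator is what makes the lower bound on predictive likelihood computable.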
Search Over Structures

Greedy search following the grammar:
1. Start with G
2. Expand using all possible rules
3. Fit & score the models
4. Keep the top K models
5. Go to 2

Assumes that good simple models lead to good, more complex models when refined.

The assumption seems warranted: K = 3 yields the same results as K = 1 in the experiments.
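The loop above is a beam search over the grammar, which can be sketched as follows. `expand` and `fit_and_score` are stand-ins for the rule-application and model-fitting/scoring steps; their names are mine, not the paper's.

```python
# Sketch of the greedy structure search: at each depth, expand every
# structure in the beam by all applicable rules, fit & score the
# candidates, and keep the top K.
def greedy_search(start, expand, fit_and_score, K=3, depth=3):
    beam = [start]
    best = (fit_and_score(start), start)
    for _ in range(depth):
        candidates = [m for s in beam for m in expand(s)]
        if not candidates:
            break
        scored = sorted(((fit_and_score(m), m) for m in candidates),
                        reverse=True)
        beam = [m for _, m in scored[:K]]
        best = max(best, scored[0])
    return best
```

With K = 1 this reduces to pure greedy hill-climbing over structures, which is the regime the experiments found sufficient.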
Results on Synthetic Data
Results on Real Data
Computing Predictive Likelihood