MODELING ANNOTATED DATA Reviewer: Saurabh Singh (ss1@uiuc.edu)

Problem • Modeling of associated document items • Images & Annotations • Papers & Bibliographies • Genes & Functions • Documents are considered as pairs of data streams. • One type provides annotation for the other type.

Uses • Retrieval, Clustering, Classification • Automatic annotation • Retrieval of un-annotated data.

This paper Models Images ( r ) and Annotations ( w ) Three primary tasks • Joint distribution of an image and its caption (Clustering, Organization) • Conditional distribution of words given an image. (Automatic annotation, text based retrieval) • Conditional distribution of words given a region of an image. (Automatic labeling of regions)

Modeling K factors or topics • Each a distribution over words • Each a distribution over image regions Latent variables • Topic assignments • Distribution parameters (for components) Features Document: (r, w), N regions, M words Distributions p( r , w ), p(w | r ), p(w | r , r n )

Text annotations Vocabulary: 168 Terms (V) Captions: 2-4 Words per Image Multinomials on V conditioned on topics

Images Composed of 6-10 regions via N-cuts Each region summarized as a feature vector ~40 • Size: Percentage of image • Position: Center of mass [0, 1] • Color: µ, σ of R,G,B, L, a, b etc. • Texture: µ, σ of filter responses • Shape: area/perimeter 2 , moment of inertia etc. Multivariate Gaussian over features: µ , Σ

Models Three hierarchical probabilistic models Gaussian Multinomial mixture 1. Gaussian Multinomial LDA 2. Correspondence LDA 3.

Gaussian Multinomial Mixture µ r N σ z λ w β M D θ d α Z d,n W d,n β k η N D K

Gaussian Multinomial LDA µ z r N σ α θ v w β M D θ d α Z d,n W d,n β k η N D K

Correspondence LDA µ z r α θ N σ y w β M D θ d α Z d,n W d,n β k η N D K

Inference & Estimation • Variational Inference • Exact intractable • Approximate assuming factorizable distribution • Minimize KL-Divergence via iterative updates to parameters • Parameter Estimation • EM algorithm • E: Compute variational posterior. • M: MLE estimate of the model parameters.

Evaluation • 7000 Images and their captions • 75% Training & 25% Testing • Test set likelihood • Automatic annotation • Text based retrieval

Eval: Test set likelihood 650 600 Average negative log probability 550 500 450 400 Corr − LDA GM − Mixture GM − LDA 350 ML 0 50 100 150 200 Number of factors

Eval: Automatic Annotation D M d D perplexity = exp { − m =1 log p ( w m | r d ) / d =1 M d } . d =1 Maximum likelihood Empirical Bayes smoothed 100 100 90 90 Caption perplexity Caption perplexity 80 80 70 70 60 60 50 50 Corr − LDA Corr − LDA GM − Mixture GM − Mixture 40 40 GM − LDA GM − LDA ML ML 30 30 0 50 100 150 200 0 50 100 150 200 Number of factors Number of factors

Eval: Automatic Annotation (Qual.) True caption True caption True caption clouds jet plane fish reefs water scotland water Corr − LDA Corr − LDA Corr − LDA sky plane jet mountain clouds fish water ocean tree coral scotland water flowers hills tree GM − LDA GM − LDA GM − LDA sky water people tree clouds water sky vegetables tree people tree water people mountain sky GM − Mixture GM − Mixture GM − Mixture sky plane jet clouds pattern fungus mushrooms tree flowers leaves water sky clouds sunset scotland

Eval: Automatic Annotation (Qual.) 3 Corr − LDA: GM − LDA: 1. PEOPLE, TREE 1. HOTEL, WATER 4 2 2. SKY, JET 2. PLANE, JET 3. SKY, CLOUDS 3. TUNDRA, PENGUIN 4. SKY, MOUNTAIN 4. PLANE, JET 5. PLANE, JET 5. WATER, SKY 6. PLANE, JET 6. BOATS, WATER 6 5 1

Text Based Retrieval people & fish sunset candy 1.0 1.0 1.0 Corr − LDA Corr − LDA Corr − LDA GM − Mixture GM − Mixture GM − Mixture 0.8 0.8 0.8 GM − LDA GM − LDA GM − LDA 0.6 Precision 0.6 0.6 Precision Precision 0.4 0.4 0.4 0.2 0.2 0.2 0.0 0.0 0.0 0.0 0.2 0.4 0.6 0.8 1.0 0.0 0.2 0.4 0.6 0.8 1.0 0.0 0.2 0.4 0.6 0.8 1.0 Recall Recall Recall

Text Based Retrieval (Qual.) Candy Sunset People & Fish

Conclusion If conditionals are needed, then model them explicitly

MODELING ANNOTATED DATA Reviewer: Saurabh Singh (ss1@uiuc.edu) - PowerPoint PPT Presentation

MODELING ANNOTATED DATA Reviewer: Saurabh Singh (ss1@uiuc.edu) Problem Modeling of associated document items Images & Annotations Papers & Bibliographies Genes & Functions Documents are considered as pairs of data

Artifact 2: Annotated Bibliography, Digital Poster, and Presentation Part 1: Annotated

Paving the Way to a Large-scale Pseudosense-annotated Dataset The problem: Paucity of

The Web as Collective Mind The Web as Collective Mind Building Large Annotated Data Building

BLOOMINGTON INDIANA UDO DIAGNOSIS AND ANNOTATED OUTLINE Summary Project Overview Key

These slides are annotated with notes to help you prepare and present your oral presentation for

Creating and exploiting multimodal annotated corpora Philippe Blache, Roxane Bertrand & Ga

Towards Efficient String Processing of Annotated Events David Woods 1 Tim Fernando 2 Carl Vogel 2

Metaphor Corpus Annotated for Source Target Domain Mappings Ekaterina Shutova 1 Simone Teufel

Modeling of proteins and complexes High resolution Low resolution Modeling of domains Modeling

Virtual Reality Modeling Virtual Reality Modeling from http://www.okino.com/ Modeling Modeling

Language Modeling CSE354 - Spring 2020 Task Language Modeling Probabilistic Modeling

Semi-Streaming Algorithms for Annotated Graph Streams Justin Thaler, Yahoo Labs Data Streaming

Topics Why E Field Modeling What is E Field Modeling Case Studies Questions 2 Why

Outline 1 The topic 2 Decision support systems 3 Modeling 3.3 Advanced modeling

Verilog HDL:Digital Design and Modeling Chapter 5 Gate-Level Modeling Chapter 5 Gate-Level

About (FRBR) Data Modeling: Conceptual Data Modeling In Cultural Heritage Institutions Ronald J.

Algorithms for NLP Parsing III Maria Ryskina CMU Slides adapted from: Dan Klein UC

Follow the brief presentation instructions Sharing PowerPoint slides is an effective way to get

Inconsistency Detection in Semantic Annotation Nora Hollenstein Nathan

Writing Your First Kotlin Compiler Plugin Kevin Most A brief intro Are these basically

Introduction to G Introduction to GATE Developer ATE Developer Ian Roberts University of

Collective Annotation of Linguistic Resources: Basic Principles and a Formal Model Ulle Endriss

Typed Clojure in Ti eory and Practice Ambrose Bonnaire-Sergeant Clojure Dynamic typing \_(

lti

Sambuz

Useful Links

Newsletter

Mail Us