Reading Tea Leaves: How Humans Interpret Topic Models
By Jonathan Chang, Jordan Boyd-Graber, Chong Wang, et al. NIPS 2009
Presented by Stephen Mayhew, Feb 2013
Motivation
• How to evaluate topic models?
• “Anecdotally”, “empirically”
• Intrinsic vs. extrinsic evaluation
Extrinsic evaluation example: SVM document classification on Reuters-21578
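One common extrinsic setup is to use each document's topic proportions as features for a classifier. The sketch below is a minimal illustration of that idea, not the evaluation behind this slide: it substitutes scikit-learn's built-in 20 Newsgroups loader for Reuters-21578 (which has no built-in loader), and the 50-topic LDA setting is just an assumption.

```python
# Minimal sketch of extrinsic evaluation: topic proportions as classifier features.
# 20 Newsgroups is used as a stand-in corpus; the slide's experiment used Reuters-21578.
from sklearn.datasets import fetch_20newsgroups
from sklearn.feature_extraction.text import CountVectorizer
from sklearn.decomposition import LatentDirichletAllocation
from sklearn.svm import LinearSVC
from sklearn.metrics import accuracy_score

train = fetch_20newsgroups(subset="train")
test = fetch_20newsgroups(subset="test")

vectorizer = CountVectorizer(max_features=10000, stop_words="english")
X_train = vectorizer.fit_transform(train.data)
X_test = vectorizer.transform(test.data)

# Fit a 50-topic LDA model and represent each document by its topic proportions.
lda = LatentDirichletAllocation(n_components=50, random_state=0)
theta_train = lda.fit_transform(X_train)
theta_test = lda.transform(X_test)

# Train a linear SVM on the topic proportions and report held-out accuracy.
svm = LinearSVC().fit(theta_train, train.target)
print("accuracy:", accuracy_score(test.target, svm.predict(theta_test)))
```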
Human Metrics
1. Word intrusion
2. Topic intrusion
Crowdsourced approach using Amazon Mechanical Turk.
Evaluating three different models: LDA, pLSI, CTM.
Word Intrusion
“Spot the intruder word”
Process:
1. Select a topic at random
2. Choose the 5 most probable words from the topic
3. Choose an improbable word from this topic (which is probable in another topic)
4. Shuffle
5. Present to subject
Word Intrusion If the topic set is coherent, then the users will agree on the outlier. If the topic set is incoherent, then the users will choose the outlier at random.
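A minimal sketch of how such an intrusion item could be assembled from a fitted topic-word matrix. The function name, the toy random `phi`, and the simple rule for picking the intruder (a top word of another topic that is not among this topic's top words) are illustrative assumptions, not the authors' exact procedure.

```python
import numpy as np

def word_intrusion_item(phi, vocab, rng):
    """Build one word-intrusion item from a topic-word matrix phi (topics x vocab)."""
    n_topics = phi.shape[0]
    k = int(rng.integers(n_topics))                    # 1. pick a topic at random
    top5 = list(np.argsort(phi[k])[::-1][:5])          # 2. its 5 most probable words
    # 3. intruder: a top word of another topic; the paper also requires it to be
    #    improbable in topic k, approximated here by excluding topic k's top words
    other = int(rng.choice([t for t in range(n_topics) if t != k]))
    candidates = [w for w in np.argsort(phi[other])[::-1][:10] if w not in top5]
    intruder = int(rng.choice(candidates))
    shown = [vocab[w] for w in top5 + [intruder]]
    rng.shuffle(shown)                                 # 4. shuffle before presenting
    return shown, vocab[intruder]                      # 5. show `shown`; score vs. the intruder

# Toy usage with a random topic model (purely illustrative).
rng = np.random.default_rng(0)
phi = rng.dirichlet(np.ones(30), size=5)               # 5 topics over a 30-word vocabulary
vocab = [f"word{i}" for i in range(30)]
print(word_intrusion_item(phi, vocab, rng))
```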
Topic Intrusion
“Spot the intruder topic”
Process:
1. Choose a document
2. Choose the three highest-probability topics for this document
3. Choose one low-probability topic for this document
4. Shuffle
5. Present to subject
Topic Intrusion
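Likewise, a minimal sketch of assembling a topic-intrusion item from one document's topic proportions; the helper name and the rule for sampling the intruder from the non-top topics are assumptions for illustration.

```python
import numpy as np

def topic_intrusion_item(theta_d, rng):
    """Build one topic-intrusion item from a document's topic proportions theta_d."""
    order = np.argsort(theta_d)[::-1]       # topics sorted by probability, descending
    top3 = [int(t) for t in order[:3]]      # 2. three highest-probability topics
    # 3. a topic outside the top three (the paper samples among low-probability topics)
    intruder = int(rng.choice(order[3:]))
    shown = top3 + [intruder]
    rng.shuffle(shown)                      # 4. shuffle before presenting
    return shown, intruder                  # 5. show `shown`; score against the intruder

# Toy usage with a random document-topic distribution (purely illustrative).
rng = np.random.default_rng(0)
theta_d = rng.dirichlet(np.ones(50))        # one document's distribution over 50 topics
print(topic_intrusion_item(theta_d, rng))
```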
Word Intrusion: how to measure it
Model Precision (MP):
MP^m_k = ( Σ_s 𝟙( i^m_{k,s} = ω^m_k ) ) / S
where ω^m_k is the true intruder word for topic k under model m, i^m_{k,s} is the word subject s picked, and S is the number of subjects.
Which is just a fancy way of saying:
(number of people correct) / (total number of people)
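In code, MP for a single topic is just an agreement rate; the helper and the example responses below are hypothetical.

```python
def model_precision(choices, true_intruder):
    """Fraction of subjects whose chosen word matches the true intruder (MP for one topic)."""
    return sum(c == true_intruder for c in choices) / len(choices)

# Hypothetical example: 6 of 8 subjects spot the intruder word.
print(model_precision(["floppy"] * 6 + ["dog", "cat"], "floppy"))  # 0.75
```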
Word Intrusion results [figure]: NYT corpus, 50-topic LDA model
Topic Intrusion: how to measure it
Topic Log Odds (TLO):
TLO^m_d = ( Σ_s log θ̂^m_{d, j_{d,*}} − log θ̂^m_{d, j_{d,s}} ) / S
where θ̂^m_d is model m's topic distribution for document d, j_{d,*} is the true intruder topic, j_{d,s} is the topic subject s picked, and S is the number of subjects.
Translation: normalized difference between the log probability mass of the actual "intruder" and of the selected "intruder". Upper bound is 0; higher is better.
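A small sketch of the TLO computation for one document, assuming `theta_d` is the model's topic distribution for that document, `true_intruder` the index of the actual intruder topic, and `chosen` the topics the subjects selected; all names and numbers are illustrative.

```python
import numpy as np

def topic_log_odds(theta_d, true_intruder, chosen):
    """Mean log-odds of the true intruder topic vs. the topics subjects chose (TLO for one doc)."""
    return float(np.mean([np.log(theta_d[true_intruder]) - np.log(theta_d[j]) for j in chosen]))

# Hypothetical example: topic 3 is the intruder; two of three subjects find it.
theta_d = np.array([0.5, 0.3, 0.15, 0.01, 0.04])
print(topic_log_odds(theta_d, true_intruder=3, chosen=[3, 0, 3]))  # ~ -1.30; 0 if all subjects are right
```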
Topic Intrusion results [figure]: Wikipedia corpus, 50-topic LDA model
Problems
Measures homogeneity (synonymy), not topic strength (coherence)
Example document: curling
Possible topic: broom, ice, Canada, rock, sheet, stone
Consider syntactic differences: organization, physicality, proportions, red