Visual Turing Test: defining a challenge
Mateusz Malinowski
Visual Turing Test challenge
• Ask about the content of the image
• The task involves:
‣ Object detection
‣ Spatial reasoning (in front, inside, left, right, on, …)
‣ Natural language understanding
• Example questions and answers:
‣ How many sofas? 3
‣ Where is the lamp? on the table, close to tv
‣ What is behind the largest table? tv
‣ What is the color of the walls? purple
Roadmap
‣ Learning Dependency-Based Compositional Semantics (P. Liang et al., ACL 2011)
‣ Jointly Learning to Parse and Perceive: Connecting Natural Language to the Physical World (J. Krishnamurthy et al., TACL 2013)
‣ Some ideas?
[Figure: diagrams from the two papers: a question x mapped to a logical form z ∼ p_θ(z | x) and evaluated in a world w as y = ⟦z⟧_w (e.g. "state with the largest area" ⇒ Alaska), and example phrases such as "monitor to the left of the mugs" with their logical forms]
Two dimensions of language understanding
[Figure: plot with axes "precision" and "recall"; labeled points "Old AI", "Google", "Percy's work", and "our dream"]
Semantic parser: the big picture
What is the most populous city in California? → [system + database] → Los Angeles
Two kinds of supervision:
‣ Expensive: logical forms [Zelle & Mooney, 1996; Zettlemoyer & Collins, 2005; Wong & Mooney, 2007; Kwiatkowski et al., 2010]
  What is the most populous city in California? ⇒ argmax(λx. city(x) ∧ loc(x, CA), λx. population(x))
  How many states border Oregon? ⇒ count(λx. state(x) ∧ border(x, OR))
‣ Cheap: answers [Clarke et al., 2010; this work]
  What is the most populous city in California? ⇒ Los Angeles
  How many states border Oregon? ⇒ 3
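To make the execution side concrete, here is a minimal sketch (toy database and helper names of my own, not Liang et al.'s system) of how a logical form such as argmax(λx. city(x) ∧ loc(x, CA), λx. population(x)) can be evaluated against a database to produce the answer:

# Toy geography database: unary predicates as sets, relations as sets of pairs.
CITY = {"los_angeles", "san_francisco", "portland"}
LOC = {("los_angeles", "CA"), ("san_francisco", "CA"), ("portland", "OR")}
POPULATION = {"los_angeles": 3_900_000, "san_francisco": 870_000, "portland": 650_000}

def city(x): return x in CITY
def loc(x, y): return (x, y) in LOC
def population(x): return POPULATION.get(x, 0)

def argmax(restriction, score, domain):
    """Entity in `domain` that satisfies `restriction` and maximizes `score`."""
    candidates = [x for x in domain if restriction(x)]
    return max(candidates, key=score) if candidates else None

# "What is the most populous city in California?"
# argmax(λx. city(x) ∧ loc(x, CA), λx. population(x))
print(argmax(lambda x: city(x) and loc(x, "CA"), population, CITY))  # los_angeles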
The probabilistic framework
‣ Semantic parsing: p(z | x, θ) maps a question x (e.g. "capital of California?") to a latent logical form z (a DCS tree over the predicates capital and CA), with parameters θ.
‣ Interpretation: p(y | z, w) evaluates the logical form z against the world/database w to produce the answer y (e.g. Sacramento).
‣ Objective: max_θ Σ_z p(y | z, w) p(z | x, θ), marginalizing over the latent logical form z.
‣ Learning: enumerate and score a k-best list of DCS trees (each tree scored via a feature vector, e.g. (0.2, −1.3, …, 0.7)), then optimize θ numerically with L-BFGS.
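A minimal sketch of this objective (names and toy data are my own; the real system differentiates this quantity and feeds the gradients to L-BFGS): p(z | x, θ) is a log-linear model over a k-best list of candidate trees, and p(y | z, w) is 1 exactly when executing z against the world w yields the gold answer y.

import math

def dot(theta, feats):
    return sum(theta.get(f, 0.0) * v for f, v in feats.items())

def logsumexp(xs):
    m = max(xs)
    return m + math.log(sum(math.exp(x - m) for x in xs))

def log_marginal_likelihood(kbest, theta, gold_answer, world, execute):
    """log sum_z p(y | z, w) p(z | x, theta), restricted to a k-best list.

    kbest: list of (feature_dict, logical_form) candidates for one question x.
    execute: function evaluating a logical form against the world w.
    """
    scores = [dot(theta, feats) for feats, _ in kbest]   # log-linear scores
    log_norm = logsumexp(scores)                          # normalizer of p(z | x, theta)
    correct = [s for s, (_, z) in zip(scores, kbest)
               if execute(z, world) == gold_answer]       # p(y | z, w) = 1 for these z
    if not correct:
        return float("-inf")                              # no candidate reaches the answer
    return logsumexp(correct) - log_norm

# Toy usage: two candidate "logical forms" modeled as callables over the world.
kbest = [({"f1": 1.0}, lambda w: w["capital_of_CA"]),
         ({"f2": 1.0}, lambda w: w["largest_city_in_CA"])]
world = {"capital_of_CA": "Sacramento", "largest_city_in_CA": "Los Angeles"}
theta = {"f1": 0.5, "f2": -0.3}
print(log_marginal_likelihood(kbest, theta, "Sacramento", world, lambda z, w: z(w)))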
Challenges of semantic parsing
Many candidate logical forms can be proposed for the same question; only the answer (Los Angeles) tells the learner which ones are right:
‣ What is the most populous city in California? ⇒ λx. city(x) ∧ loc(x, CA)
‣ What is the most populous city in California? ⇒ λx. state(x) ∧ border(x, CA)
‣ What is the most populous city in California? ⇒ argmax(λx. city(x) ∧ loc(x, CA), λx. population(x))
Challenges of semantic parsing: words to predicates (lexical semantics)
What is the most populous city in CA?
Lexical triggers:
1. String match: CA ⇒ CA
2. Function words (20 words): most ⇒ argmax
3. Nouns/adjectives: city ⇒ city, state, river, population (each open-class word triggers all unary predicates)
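A minimal sketch of the three trigger rules (word lists are assumed, not the actual lexicon); note that an open-class word like "city" triggers every unary predicate, and learning must pick the right one:

# Lexical triggers: which predicates may a word map to?
DATABASE_ENTITIES = {"CA", "OR"}                           # 1. string match
FUNCTION_WORDS = {"most": ["argmax"], "many": ["count"]}   # 2. small closed set of function words
UNARY_PREDICATES = ["city", "state", "river", "population"]
OPEN_CLASS = {"city", "state", "river", "populous"}        # 3. nouns/adjectives in the question

def trigger_predicates(word):
    candidates = []
    if word in DATABASE_ENTITIES:
        candidates.append(word)                            # CA => CA
    if word in FUNCTION_WORDS:
        candidates.extend(FUNCTION_WORDS[word])            # most => argmax
    if word in OPEN_CLASS:
        candidates.extend(UNARY_PREDICATES)                # city => city, state, river, population
    return candidates

for w in "What is the most populous city in CA ?".split():
    print(w, "=>", trigger_predicates(w))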
Dependency-based compositional semantics
"most populous city in California"
Superlatives are handled by the mark–execute construction: the argmax triggered by "most" is marked at its syntactic scope in the DCS tree and executed at the appropriate semantic scope.
[Figure: DCS tree for the phrase, with nodes for city, population, loc and CA and an argmax marker]
Results
On Geo (600 training examples, 280 test examples):

System   Description                                     Test accuracy
zc05     CCG [Zettlemoyer & Collins, 2005]               79.3%
zc07     relaxed CCG [Zettlemoyer & Collins, 2007]       86.1%
kzgs10   CCG w/ unification [Kwiatkowski et al., 2010]   88.9%
dcs      our system                                      88.6%
dcs+     our system                                      91.1%

The baseline CCG systems are trained from annotated logical forms; dcs and dcs+ are trained from question–answer pairs only.
Roadmap
‣ Learning Dependency-Based Compositional Semantics (P. Liang et al., ACL 2011)
‣ Jointly Learning to Parse and Perceive: Connecting Natural Language to the Physical World (J. Krishnamurthy et al., TACL 2013)
‣ Some ideas?
Grounding problem
[Figure: phrases such as "the mugs" and "a mug left of the monitor" are grounded to sets of objects detected in the image; the denotation of a phrase is the set of objects (or object tuples) it refers to]
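A minimal sketch (toy detections and a crude spatial test, not the LSP model) of what grounding means here: the denotation of "a mug left of the monitor", λx.∃y. mug(x) ∧ left-rel(x, y) ∧ monitor(y), is the set of detected objects that satisfy the formula.

# Toy detections: object id -> (category, x-center of the bounding box)
DETECTIONS = {1: ("monitor", 0.60), 2: ("mug", 0.30), 3: ("mug", 0.80)}

def is_a(obj, name):
    return DETECTIONS[obj][0] == name

def left_rel(a, b):
    # crude spatial relation: a is left of b if its x-center is smaller
    return DETECTIONS[a][1] < DETECTIONS[b][1]

# Denotation of λx.∃y. mug(x) ∧ left-rel(x, y) ∧ monitor(y)
grounding = {x for x in DETECTIONS
             if is_a(x, "mug")
             and any(is_a(y, "monitor") and left_rel(x, y) for y in DETECTIONS)}
print(grounding)  # {2}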
Question answering problem
How high is the highest point in the largest state? ⇒ 6,000 m
Pipeline: question Q → (semantic parsing) → logical form T → (evaluation in universe W) → answer A
P. Liang, M. Jordan, D. Klein. Learning Dependency-Based Compositional Semantics. ACL 2011.
J. Berant, A. Chou, R. Frostig, P. Liang. Semantic Parsing on Freebase from Question-Answer Pairs. EMNLP 2013.
Question answering problem (on images)
What is in front of sofa in image 1? ⇒ table
The same pipeline: question Q → (semantic parsing) → logical form T → (evaluation in universe W) → answer A, but the universe is a knowledge base produced by scene analysis:
sofa(1, brown, image 1, X, Y, Z)
table(1, brown, image 1, X, Y, Z)
wall(1, white, image 1, X, Y, Z)
bed(1, white, image 2, X, Y, Z)
chair(1, brown, image 4, X, Y, Z)
chair(2, brown, image 4, X, Y, Z)
chair(1, brown, image 5, X, Y, Z)
…
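A minimal sketch (toy facts, invented coordinates, and a crude depth-based predicate, not the actual system) of evaluating "What is in front of sofa in image 1?" against such a knowledge base:

# Facts produced by scene analysis: (category, instance, color, image, x, y, z)
FACTS = [
    ("sofa",  1, "brown", "image 1", 1.0, 0.0, 3.0),
    ("table", 1, "brown", "image 1", 1.0, 0.0, 2.0),
    ("wall",  1, "white", "image 1", 1.0, 0.0, 5.0),
    ("bed",   1, "white", "image 2", 0.5, 0.0, 1.0),
]

def objects(image):
    return [f for f in FACTS if f[3] == image]

def in_front_of(a, b):
    # crude spatial predicate: a is in front of b if it is closer to the camera (smaller z)
    return a[6] < b[6]

def what_is_in_front_of(category, image):
    refs = [f for f in objects(image) if f[0] == category]
    return {f[0] for f in objects(image)
            if f[0] != category and any(in_front_of(f, r) for r in refs)}

print(what_is_in_front_of("sofa", "image 1"))  # {'table'}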
Results
Examples (environment d, language z, predicted logical form ℓ, predicted vs. true grounding):
‣ monitor to the left of the mugs ⇒ λx.∃y. monitor(x) ∧ left-rel(x, y) ∧ mug(y); predicted {(2,1), (2,3)}, true {(2,1), (2,3)}
‣ mug to the left of the other mug ⇒ λx.∃y. mug(x) ∧ left-rel(x, y) ∧ mug(y); predicted {(3,1)}, true {(3,1)}
‣ objects on the table ⇒ λx.∃y. object(x) ∧ on-rel(x, y) ∧ table(y); predicted {(1,4), (2,4), (3,4)}, true {(1,4), (2,4), (3,4)}
‣ two blue cups are placed near to the computer screen ⇒ λx. blue(x) ∧ cup(x) ∧ comp.(x) ∧ screen(x); predicted {(1)}, true {(1,2), (3,2)}

Results on the SCENE data set:

Denotation   0 rel.   1 rel.   other   total
LSP-CAT      0.94     0.45     0.20    0.51
LSP-F        0.89     0.81     0.20    0.70
LSP-W        0.89     0.77     0.16    0.67

Grounding    0 rel.   1 rel.   other   total
LSP-CAT      0.94     0.37     0.00    0.42
LSP-F        0.89     0.80     0.00    0.65
LSP-W        0.89     0.70     0.00    0.59

% of data    23       56       21      100
Roadmap
‣ Learning Dependency-Based Compositional Semantics (P. Liang et al., ACL 2011)
‣ Jointly Learning to Parse and Perceive: Connecting Natural Language to the Physical World (J. Krishnamurthy et al., TACL 2013)
‣ Some ideas?
Current limitations
• Language
‣ At most 1 relation
‣ Doesn't model more complex phenomena (negations, superlatives, …)
• Vision
‣ Dataset is restricted
‣ No uncertainty
Examples of the simple sentences in the current setting:
‣ A computer system is on the table
‣ There are items on the desk
‣ There are two cups on the table
‣ The computer is off
Our suggestions
The same limitations (at most 1 relation; no negations or superlatives; restricted dataset; no uncertainty) motivate harder questions. Instead of simple statements like:
‣ A computer system is on the table
‣ There are items on the desk
‣ There are two cups on the table
‣ The computer is off
ask questions such as:
‣ What is the object in front of the photocopying machine attached to the wall?
‣ What is the object that is placed on the middle rack of the stand that is placed close to the wall?
‣ What is the time showing on the clock?