Learning to Map Sentences to Logical Form: Structured Classification with Probabilistic Categorial Grammars
Luke Zettlemoyer and Michael Collins, MIT CSAIL
The Problem: Learning to Map Sentences to Logical Form
  "Texas borders Kansas"  →  borders(texas,kansas)
Several potential applications
• Natural Language Interfaces to Databases
• Dialogue Systems
• Machine Translation
Some Training Examples
Input:  What states border Texas?
Output: λx.state(x) ∧ borders(x,texas)
Input:  What is the largest state?
Output: argmax(λx.state(x), λx.size(x))
Input:  What states border the largest state?
Output: λx.state(x) ∧ borders(x, argmax(λy.state(y), λy.size(y)))
Our Approach
Learn lexical information (syntax/semantics) for words:
• Texas  | syntax = noun phrase (NP) : semantics = texas
• states | syntax = noun (N) : semantics = λx.state(x)
Learn to parse to logical form:
Input:  What states border Texas?
Output: λx.state(x) ∧ borders(x,texas)
Background
• Combinatory Categorial Grammar (CCG)
  • Lexicon
  • Parsing Rules (Combinators)
• Probabilistic CCG (PCCG)
CCG Lexicon
Words          Category (Syntax : Semantics)
Texas          NP : texas
borders        (S\NP)/NP : λx.λy.borders(y,x)
Kansas         NP : kansas
Kansas city    NP : kansas_city_MO
Parsing Rules (Combinators)
• Forward application:   X/Y : f    Y : a    =>   X : f(a)
  Example:  (S\NP)/NP : λx.λy.borders(y,x)    NP : texas    =>   S\NP : λy.borders(y,texas)
• Backward application:  Y : a    X\Y : f    =>   X : f(a)
  Example:  NP : kansas    S\NP : λy.borders(y,texas)    =>   S : borders(kansas,texas)
• Additional rules: Composition, Type Raising
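Below is a minimal sketch, in Python, of the two application combinators; it is not the authors' code. Categories are encoded as nested tuples and the semantics as plain lambdas, an encoding chosen here purely for illustration. It reproduces the "Texas borders Kansas" derivation shown on the next slide.

```python
# A sketch of CCG function application (not the authors' code).
# Categories are nested tuples: an atom like "NP", or (slash, result, argument),
# e.g. ("/", ("\\", "S", "NP"), "NP") encodes (S\NP)/NP.
# Semantics are plain Python lambdas; constants are strings.

def forward_apply(left, right):
    """X/Y : f   Y : a   =>   X : f(a)"""
    (slash, result, arg), f = left
    syn_r, a = right
    assert slash == "/" and arg == syn_r
    return (result, f(a))

def backward_apply(left, right):
    """Y : a   X\\Y : f   =>   X : f(a)"""
    syn_l, a = left
    (slash, result, arg), f = right
    assert slash == "\\" and arg == syn_l
    return (result, f(a))

TRANSITIVE_VERB = ("/", ("\\", "S", "NP"), "NP")   # (S\NP)/NP

# Lexical entries for the running example.
borders = (TRANSITIVE_VERB, lambda x: lambda y: ("borders", y, x))
texas, kansas = ("NP", "texas"), ("NP", "kansas")

vp = forward_apply(borders, kansas)   # S\NP : λy.borders(y, kansas)
s = backward_apply(texas, vp)         # S    : borders(texas, kansas)
print(s)                              # ('S', ('borders', 'texas', 'kansas'))
```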
CCG Parsing
Sentence: Texas borders Kansas
  Texas    =>  NP : texas
  borders  =>  (S\NP)/NP : λx.λy.borders(y,x)
  Kansas   =>  NP : kansas
  borders Kansas        =>  S\NP : λy.borders(y,kansas)      (forward application)
  Texas borders Kansas  =>  S : borders(texas,kansas)        (backward application)
Parsing a Question
Sentence: What states border Texas
  What    =>  S/(S\NP)/N : λf.λg.λx.f(x) ∧ g(x)
  states  =>  N : λx.state(x)
  border  =>  (S\NP)/NP : λx.λy.borders(y,x)
  Texas   =>  NP : texas
  What states              =>  S/(S\NP) : λg.λx.state(x) ∧ g(x)
  border Texas             =>  S\NP : λy.borders(y,texas)
  What states border Texas =>  S : λx.state(x) ∧ borders(x,texas)
Probabilistic CCG (PCCG)
Log-linear model:
• A CCG for parsing
• Features f_i(L,T,S): the number of times lexical item i is used in the parse T that maps sentence S to logical form L
• A parameter vector θ with an entry for each f_i
PCCG Distributions
Log-linear model:
• Defines a joint distribution over logical forms L and parse trees T given a sentence S:
  P(L,T | S; θ) = exp(θ · f(L,T,S)) / Σ_{(L',T')} exp(θ · f(L',T',S))
• Parses are a hidden variable:
  P(L | S; θ) = Σ_T P(L,T | S; θ)
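The following is a minimal sketch of these two distributions, assuming each candidate parse of a sentence is supplied as a (logical form, feature counts) pair; the data layout and function names are illustrative guesses, not the paper's implementation.

```python
# A sketch of the PCCG distributions (assumed data layout, not the paper's code):
# each candidate parse of a sentence S is a (logical_form, feature_counts) pair,
# and theta maps feature names to weights.
import math
from collections import Counter

def joint_probs(parses, theta):
    """P(L, T | S; theta) for every parse: a softmax over the scores theta . f(L, T, S)."""
    scores = [sum(theta.get(feat, 0.0) * count for feat, count in feats.items())
              for _, feats in parses]
    z = sum(math.exp(s) for s in scores)      # partition function
    return [math.exp(s) / z for s in scores]

def marginal_prob(parses, theta, logical_form):
    """P(L | S; theta): sum the joint over the hidden parses T that yield L."""
    return sum(p for (l, _), p in zip(parses, joint_probs(parses, theta))
               if l == logical_form)

# Toy example: two parses yield L1, one yields L2. Feature values are the
# counts of lexical items used in the parse, as in the model above.
parses = [("L1", Counter({"Texas := NP : texas": 1})),
          ("L1", Counter({"Texas := N : texas": 1})),
          ("L2", Counter({"Texas := NP : kansas": 1}))]
theta = {"Texas := NP : texas": 1.5, "Texas := N : texas": 0.2}
print(marginal_prob(parses, theta, "L1"))
```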
Learning
• Generating Lexical Items
• Learning a complete PCCG
Lexical Generation
Input (Training Example)
  Sentence: Texas borders Kansas
  Logical Form: borders(texas,kansas)
Output (Lexicon)
  Words      Category
  Texas      NP : texas
  borders    (S\NP)/NP : λx.λy.borders(y,x)
  Kansas     NP : kansas
  ...        ...
GENLEX
• Input: a training example (S_i, L_i)
• Computation:
  1. Create all substrings of words in S_i
  2. Create categories from L_i
  3. Create lexical entries that are the cross product of these two sets
• Output: Lexicon Λ
Step 1: GENLEX Words
Input Sentence: Texas borders Kansas
Output Substrings:
  Texas
  borders
  Kansas
  Texas borders
  borders Kansas
  Texas borders Kansas
Step 2: GENLEX Categories
Input Logical Form: borders(texas,kansas)
Output Categories:
  NP : texas
  NP : kansas
  (S\NP)/NP : λx.λy.borders(y,x)
Two GENLEX Rules
Input Trigger               Output Category
a constant c                NP : c
an arity-two predicate p    (S\NP)/NP : λx.λy.p(y,x)
Example
Input: borders(texas,kansas)
Output Categories:  NP : texas,  NP : kansas,  (S\NP)/NP : λx.λy.borders(y,x)
All of the Category Rules
Input Trigger                                 Output Category
a constant c                                  NP : c
an arity-one predicate p                      N : λx.p(x)
an arity-one predicate p                      S\NP : λx.p(x)
an arity-two predicate p                      (S\NP)/NP : λx.λy.p(y,x)
an arity-two predicate p                      (S\NP)/NP : λx.λy.p(x,y)
an arity-one predicate p                      N/N : λg.λx.p(x) ∧ g(x)
an arity-two predicate p and a constant c     N/N : λg.λx.p(x,c) ∧ g(x)
an arity-two predicate p                      (N\N)/NP : λx.λg.λy.p(y,x) ∧ g(y)
an arity-one function f                       NP/N : λg.argmax/argmin(g, λx.f(x))
an arity-one function f                       S/NP : λx.f(x)
Step 3: GENLEX Cross Product
Input (Training Example)
  Sentence: Texas borders Kansas
  Logical Form: borders(texas,kansas)
Output Substrings: Texas, borders, Kansas, Texas borders, borders Kansas, Texas borders Kansas
Output Categories: NP : texas,  NP : kansas,  (S\NP)/NP : λx.λy.borders(y,x)
Output Lexicon: GENLEX is the cross product of these two output sets
GENLEX: Output Lexicon
Words                   Category
Texas                   NP : texas
Texas                   NP : kansas
Texas                   (S\NP)/NP : λx.λy.borders(y,x)
borders                 NP : texas
borders                 NP : kansas
borders                 (S\NP)/NP : λx.λy.borders(y,x)
...                     ...
Texas borders Kansas    NP : texas
Texas borders Kansas    NP : kansas
Texas borders Kansas    (S\NP)/NP : λx.λy.borders(y,x)
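A minimal sketch of GENLEX (Steps 1-3) follows. It implements only two of the category rules from the table above, and it assumes the logical form is given as a simple (predicate, arguments) pair, which is my own simplification rather than the paper's representation.

```python
# A sketch of GENLEX (Steps 1-3), implementing just two of the category rules
# from the table above; the (predicate, arguments) encoding of the logical
# form is a simplification for illustration.

def substrings(words):
    """Step 1: all contiguous substrings of the input sentence."""
    return [" ".join(words[i:j])
            for i in range(len(words)) for j in range(i + 1, len(words) + 1)]

def categories(logical_form):
    """Step 2: categories triggered by the logical form.
    Rules covered:  constant c            ->  NP : c
                    arity-two predicate p ->  (S\\NP)/NP : λx.λy.p(y,x)"""
    predicate, args = logical_form
    cats = ["NP : " + c for c in args]
    if len(args) == 2:
        cats.append("(S\\NP)/NP : λx.λy.%s(y,x)" % predicate)
    return cats

def genlex(sentence, logical_form):
    """Step 3: the cross product of substrings and categories."""
    return {(w, c) for w in substrings(sentence.split())
                   for c in categories(logical_form)}

lexicon = genlex("Texas borders Kansas", ("borders", ["texas", "kansas"]))
print(len(lexicon))          # 6 substrings x 3 categories = 18 entries
for entry in sorted(lexicon):
    print(entry)             # e.g. ('borders', '(S\\NP)/NP : λx.λy.borders(y,x)')
```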
A Simple Algorithm
Inputs: Initial lexicon Λ_0
The initial lexicon has two types of entries:
• Domain Independent. Example: What | S/(S\NP)/N : λf.λg.λx.f(x) ∧ g(x)
• Domain Dependent. Example: Texas | NP : texas
A Simple Algorithm
Inputs:
  Initial lexicon Λ_0
  Training examples E = {(S_i, L_i) : i = 1...n}
Initialization:
  Create lexicon Λ* = Λ_0 ∪ ⋃_{i=1...n} GENLEX(S_i, L_i)
  Create features f
  Create initial parameters θ_0
Computation:
  Estimate parameters θ = STOCGRAD(E, θ_0, Λ*)
Output: PCCG(Λ*, θ, f)
The Final Algorithm
Inputs: Λ_0, E
Initialization: Create Λ*, f, θ^0
Computation:
  For t = 1...T
    1. Prune Lexicon:
       • For each (S_i, L_i) ∈ E:
           Set λ = Λ_0 ∪ GENLEX(S_i, L_i)
           Calculate π = MAXPARSE(S_i, L_i, λ, θ^(t-1)), the set of highest scoring correct parses
           Define λ_i to be the lexical items in a parse in π
       • Set Λ^t = Λ_0 ∪ ⋃_{i=1...n} λ_i
    2. Estimate parameters: θ^t = STOCGRAD(E, θ^(t-1), Λ^t)
Output: PCCG(Λ^T, θ^T, f)
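A high-level sketch of this training loop is shown below. MAXPARSE, STOCGRAD, GENLEX, and the extraction of lexical items from a parse are treated as black boxes passed in by the caller; the names follow the slide, but the signatures are my own guesses rather than the paper's interface.

```python
# A high-level sketch of the final training loop (assumed interfaces).

def learn_pccg(examples, lexicon0, theta0, T,
               genlex, maxparse, stocgrad, lexical_items):
    """examples: list of (sentence, logical_form) pairs.
    Returns the final lexicon and parameter vector."""
    theta = theta0
    lexicon = set(lexicon0)
    for t in range(T):
        # Step 1: Prune. Rebuild the lexicon from the initial entries plus the
        # lexical items that appear in a highest-scoring correct parse of some
        # training example under the current parameters.
        lexicon = set(lexicon0)
        for sentence, logical_form in examples:
            candidates = set(lexicon0) | genlex(sentence, logical_form)
            for parse in maxparse(sentence, logical_form, candidates, theta):
                lexicon |= lexical_items(parse)
        # Step 2: Re-estimate the parameters by stochastic gradient ascent on
        # the training-set likelihood, restricted to the pruned lexicon.
        theta = stocgrad(examples, theta, lexicon)
    return lexicon, theta
```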
Related Work
• CHILL (Zelle and Mooney, 1996): learns a deterministic parser; assumes a semantic lexicon as input (e.g., borders | borders(_,_))
• WOLFIE (Thompson and Mooney, 2002): learns a complete lexicon; deterministic parsing
• COCKTAIL (Tang and Mooney, 2001): best previous results; statistical parsing; assumes a semantic lexicon
Experiments
Two database domains:
• Geo880: 600 training examples, 280 test examples
• Jobs640: 500 training examples, 140 test examples
Evaluation
Test for completely correct semantics:
• Precision: # correct / total # parsed
• Recall: # correct / total # sentences
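A minimal sketch of this evaluation, assuming a hypothetical `parser` callable that returns a logical form for a sentence or None when it fails to parse:

```python
# A sketch of the evaluation metrics; `parser` is an assumed callable that
# returns a logical form or None on parse failure.

def evaluate(parser, test_set):
    """test_set: list of (sentence, gold_logical_form) pairs."""
    parsed = correct = 0
    for sentence, gold in test_set:
        prediction = parser(sentence)
        if prediction is None:
            continue                         # no parse produced
        parsed += 1
        correct += int(prediction == gold)   # completely correct semantics only
    precision = correct / parsed if parsed else 0.0
    recall = correct / len(test_set)
    return precision, recall
```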
Results
              Geo880                 Jobs640
              Precision   Recall     Precision   Recall
Our Method    96.25       79.29      97.36       79.29
COCKTAIL      89.92       79.40      93.25       79.84
Example Learned Lexical Entries
Words          Category
states         N : λx.state(x)
major          N/N : λg.λx.major(x) ∧ g(x)
population     N : λx.population(x)
cities         N : λx.city(x)
river          N : λx.river(x)
run through    (S\NP)/NP : λx.λy.traverse(y,x)
the largest    NP/N : λg.argmax(g, λx.size(x))
rivers         N : λx.river(x)
the highest    NP/N : λg.argmax(g, λx.elev(x))
the longest    NP/N : λg.argmax(g, λx.len(x))
...            ...
Error Analysis
Low recall: GENLEX is not general enough
• Fails to parse 10% of training examples
Some unparsed examples:
• Through which states does the Mississippi run?
• If I moved to California and learned SQL on Oracle could I find anything for 30000 on Unix?
Future Work
• Improve recall
• Explore robust parsing techniques for ungrammatical input
• Develop new domains
• Integrate with a dialogue system
The End Thanks
Convergence: Some Guarantees
1. Prune Lexicon
   • Will not decrease accuracy on the training set
2. Estimate parameters
   • Should increase the likelihood of the training set