The CKY algorithm part 1: Recognition Syntactic parsing 2018-01-17 - PowerPoint PPT Presentation

The CKY algorithm part 1: Recognition Syntactic parsing 2018-01-17 Sara Stymne Department of Linguistics and Philology Mostly based on slides from Marco Kuhlmann

Phrase structure trees root (top) S leaves (bottom) NP VP Pro Verb NP I prefer Det Nom a Nom Noun Noun flight morning

Ambiguity S NP VP Pro Verb NP I booked Det Nom a Nom PP Noun from LA flight

Ambiguity S NP VP Pro Verb NP PP I booked Det Nom from LA a Noun flight

Parsing as search • Parsing as search: search through all possible parse trees for a given sentence • bottom–up: build parse trees starting at the leaves • top–down: build parse trees starting at the root node

Overview of the CKY algorithm • The CKY algorithm is an efficient bottom-up parsing algorithm for context-free grammars. • It was discovered at least three (!) times and named after Cocke, Kasami, and Younger. • It is one of the most important and most used parsing algorithms.

Applications The CKY algorithm can be used to compute many interesting things. Here we use it to solve the following tasks: • Recognition: Is there any parse tree at all? • Probabilistic parsing: What is the most probable parse tree?

Restrictions • The original CKY algorithm can only handle rules that are at most binary: C → w i , C → C 1 C 2 . • It can easily be extended to also handle unit productions: C → w i , C → C 1 , C → C 1 C 2 . • This restriction is not a problem theoretically, but requires preprocessing (binarization) and postprocessing (debinarization). • A parsing algorithm that does away with this restriction is Earley’s algorithm (Lecture 5 and J&M 13.4.2).

Restrictions - details • The CKY algorithm originally handles grammars in CNF (Chomsky normal form): C → w i , C → C 1 C 2 , (S → ε ) • ε is normally not used in natural language grammars • This is what you will use in assignment 2 • We will also discuss allowing unit productions, C → C 1 • Extended CNF • Easy to integrate into CKY, gives easier grammar conversions

Conversion to CNF • Eliminate mixed rules: • VP->V to VP -- VP->V INF VP , INF->to • Elimainate n-ary branching subtrees, with n>2, by inserting additional nodes • VP->V INF VP -- VP->V X1, X1->INF V • Eliminate unary branching by merging nodes • S-> NP VP , NP->PRON, PRON->you -- NP->you

Conversion to CNF • Eliminate mixed rules: • VP->V to VP -- VP->V INF VP , INF->to • Eliminate n-ary branching subtrees, with n>2, by inserting additional nodes • VP->V INF VP -- VP->V X1, X1->INF V VP |V, VP|V->INF VP more readable: VP->V • Eliminate unary branching by merging nodes • S-> NP VP , NP->PRON, PRON->you -- NP->you more readable: NP->NP+PRON VP , NP+PRON->you

Conversion to CNF • The preceding slide showed how to convert a grammar to CNF • It is also possible to convert a treebank to CNF • You will do this in task 1

Conventions • We are given a context-free grammar G and a sequence of word tokens w = w 1 … w n . • We want to compute parse trees of w according to the rules of G . • We write S for the start symbol of G .

Fencepost positions We view the sequence w as a fence with n holes, one hole for each token w i , and we number the fenceposts from 0 till n . morning want flight a I 0 1 2 3 4 5

Structure • Is there any parse tree at all? • What is the most probable parse tree?

Recognition

Recognition Recognizer A computer program that can answer the question Is there any parse tree at all for the sequence w according to the grammar G ? is called a recognizer. In practical applications one also wants a concrete parse tree, not only an answer to the question whether such a parse tree exists.

Recognition Parse trees S NP VP Pro Verb NP I booked Det Nom a Nom PP Noun from LA flight

Recognition Preterminal rules and inner rules • preterminal rules: rules that rewrite a part-of-speech tag to a token, i.e. rules of the form C → w i Pro → I, Verb → booked, Noun → flight • inner rules: rules that rewrite a syntactic category to other categories: C → C 1 C 2 , ( C → C 1 ) S → NP VP , NP → Det Nom, (NP → Pro)

Recognition Recognizing small trees w i

Recognition Recognizing small trees C → w i w i

Recognition Recognizing small trees C w i

Recognition Recognizing small trees C covers all words between i – 1 and i

Recognition Recognizing big trees C 1 C 2 covers all words covers all words btw min and mid btw mid and max

Recognition Recognizing big trees C → C 1 C 2 C 1 C 2 covers all words covers all words btw min and mid btw mid and max

Recognition Recognizing big trees C C 1 C 2 covers all words covers all words btw min and mid btw mid and max

Recognition Recognizing big trees C covers all words between min and max

Recognition Questions • How do we know that we have recognized that the input sequence is grammatical? • How do we need to extend this reasoning in the presence of unary rules: C → C 1 ?

Recognition Signatures • The rules that we have just seen are independent of a parse tree’s inner structure. • The only thing that is important is how the parse tree looks from the ‘outside’. • We call this the signature of the parse tree. • A parse tree with signature [ min , max , C ] is one that covers all words between min and max and whose root node is labeled with C .

Recognition Questions • What is the signature of a parse tree for the complete sentence? • How many different signatures are there? • Can you relate the runtime of the parsing algorithm to the number of signatures?

Implementation

Implementation Data structure • The standard implementation represents signatures by means of a three-dimensional array chart . • Initially, all entries of chart should be set to false . • Whenever we have recognized a parse tree that spans all words between min and max and whose root node is labeled with C , we set the entry chart [ min ][ max ][ C ] to true .

Implementation Pseudo code • Informal high-level description, of how a computer program or algorithm works • Meant to be read and understood by humans, not machines • Can be augmented: • Natural language descriptions • Compact mathemtical notation • Efficient description of key principles of an algorithm, indeendently of programming languages and environments • Will be used to describe parsing algorithms on slides, and in books • Your assingment task 1 is to ”translate” pseudo code to python

Implementation Preterminal rules for each w i from left to right for each preterminal rule C -> w i chart[i - 1][i][C] = true

Implementation Binary rules for each max from 2 to n for each min from max - 2 down to 0 for each syntactic category C for each binary rule C -> C 1 C 2 for each mid from min + 1 to max - 1 if chart[min][mid][C 1 ] and chart[mid][max][C 2 ] then chart[min][max][C] = true

Implementation Numbering of categories • In order to use standard arrays, we need to represent syntactic categories by numbers. • We write m for the number of categories; we number them from 0 till m – 1. • We choose our numbers such that the start symbol S gets the number 0.

Implementation CKY in python • A three-dimensional array might not be the most suitable choice in python (even though it’d work). • It is quite possible to use more python-lika data structures like dictionaries, or variants such as defaultdict • Use tuples as keys, e.g. (i,j,S) ; ex: (2,3,”Pron”) • Lookup in chart: chart[i,j,S] • No need to numberize categories in this solution

Implementation Questions • In what way is this algorithm bottom–up? • Why is that property of the algorithm important? • How do we need to extend the code if we wish to handle unary rules C → C 1 ? • Why would we want to do that?

Summary • The CKY algorithm is an efficient parsing algorithm for context-free grammars. • Today: Recognizing whether there is any parse tree at all. • Next time: Probabilistic parsing – computing the most probable parse tree.

Reading • Recap of the introductory lecture: J&M chapter 12.1-12.7 and 13.1-13.3 • CKY recognition: J&M section 13.4.1 • CKY probabilistic parsing, for next week: J&M section 14.1-14.2

The CKY algorithm part 1: Recognition Syntactic parsing 2018-01-17 - PowerPoint PPT Presentation

The CKY algorithm part 1: Recognition Syntactic parsing 2018-01-17 Sara Stymne Department of Linguistics and Philology Mostly based on slides from Marco Kuhlmann Phrase structure trees root (top) S leaves (bottom) NP VP Pro Verb NP I

CKY Algorithm, Chomsky Normal Form Scott Farrar CLMA, University of Washington January 13, 2010

The CKY algorithm part 1: Recognition Syntactic analysis (5LN455) 2016-11-10 Sara Stymne

CKY & Earley Parsing Ling 571 Deep Processing Techniques for NLP January 13, 2016 No Class

EVALB, Improving CKY Parsing, Hw3 Evaluating parsers Hw3 Optimization: tips and tricks Scott

A summary of deep models for face recognition Qianli Liao Face recognition Face recognition:

8-Speech Recognition Speech Recognition Concepts Speech Recognition Approaches

Overview CKY algorithm: explores all analyses in parallel bottom-up From well-formed

Lecture 16: The CKY parsing algorithm Kai-Wei Chang CS @ University of Virginia kw@kwchang.net

Lecture 9: The CKY parsing algorithm Julia Hockenmaier juliahmr@illinois.edu 3324 Siebel

SI485i : NLP Set 8 PCFGs and the CKY Algorithm PCFGs We saw how CFGs can model English (sort

SI425 : NLP Set 8 PCFGs and the CKY Algorithm PCFGs We saw how CFGs can model English (sort

Odds Algorithm An Online Algorithm Group Fibonado 20. Dec 2016 Group Fibonado Odds Algorithm

EMPLOYEE RECOGNITION OBJECTIVES Types of recognition Creating a culture of recognition

License Plate Recognition License Plate Recognition License Plate Recognition License Plate

Instance-level Recognition Pingmei Xu Object Recognition Friends SE01EP02 Recognition: Find the

Face detection and recognition Detection Recognition Sally Face detection &

Strong Nouns M&R 2662 ENG240Y Old English / Wed 22 Sep 2010 Recap: strong noun

Implementation of Round Colliding Beams Concept at VEPP-2000 Dmitry Shwartz BINP, Novosibirsk

Universals Across Languages E Stabler, E Keenan MTS@10 ESSLI 2007 E Stabler, E Keenan

Nominal PROPs Samuel Balco Alexander Kurz University of Leicester Chapman University

Delexicalized Parsing Daniel Zeman, Rudolf Rosa April 3, 2020 NPFL120 Multilingual Natural

Interpretation cannot determine the source of multiple sluicing in Hungarian Eszter Ronai &

Overview Last Time Grammatical Structure Context-Free Grammar Treebanks

Software Quality you know it when you see it Erik Doernenburg ThoughtWorks Software Quality

The CKY algorithm part 1: Recognition Syntactic parsing 2018-01-17 - PowerPoint PPT Presentation

The CKY algorithm part 1: Recognition Syntactic parsing 2018-01-17 Sara Stymne Department of Linguistics and Philology Mostly based on slides from Marco Kuhlmann Phrase structure trees root (top) S leaves (bottom) NP VP Pro Verb NP I

CKY Algorithm, Chomsky Normal Form Scott Farrar CLMA, University of Washington January 13, 2010

The CKY algorithm part 1: Recognition Syntactic analysis (5LN455) 2016-11-10 Sara Stymne

CKY &amp; Earley Parsing Ling 571 Deep Processing Techniques for NLP January 13, 2016 No Class

EVALB, Improving CKY Parsing, Hw3 Evaluating parsers Hw3 Optimization: tips and tricks Scott

A summary of deep models for face recognition Qianli Liao Face recognition Face recognition:

8-Speech Recognition Speech Recognition Concepts Speech Recognition Approaches

Overview CKY algorithm: explores all analyses in parallel bottom-up From well-formed

Lecture 16: The CKY parsing algorithm Kai-Wei Chang CS @ University of Virginia kw@kwchang.net

Lecture 9: The CKY parsing algorithm Julia Hockenmaier juliahmr@illinois.edu 3324 Siebel

SI485i : NLP Set 8 PCFGs and the CKY Algorithm PCFGs We saw how CFGs can model English (sort

SI425 : NLP Set 8 PCFGs and the CKY Algorithm PCFGs We saw how CFGs can model English (sort

Odds Algorithm An Online Algorithm Group Fibonado 20. Dec 2016 Group Fibonado Odds Algorithm

EMPLOYEE RECOGNITION OBJECTIVES Types of recognition Creating a culture of recognition

License Plate Recognition License Plate Recognition License Plate Recognition License Plate

Instance-level Recognition Pingmei Xu Object Recognition Friends SE01EP02 Recognition: Find the

Face detection and recognition Detection Recognition Sally Face detection &amp;

Strong Nouns M&amp;R 2662 ENG240Y Old English / Wed 22 Sep 2010 Recap: strong noun

Implementation of Round Colliding Beams Concept at VEPP-2000 Dmitry Shwartz BINP, Novosibirsk

Universals Across Languages E Stabler, E Keenan MTS@10 ESSLI 2007 E Stabler, E Keenan

Nominal PROPs Samuel Balco Alexander Kurz University of Leicester Chapman University

Delexicalized Parsing Daniel Zeman, Rudolf Rosa April 3, 2020 NPFL120 Multilingual Natural

Interpretation cannot determine the source of multiple sluicing in Hungarian Eszter Ronai &amp;

Overview Last Time Grammatical Structure Context-Free Grammar Treebanks

Software Quality you know it when you see it Erik Doernenburg ThoughtWorks Software Quality

CKY & Earley Parsing Ling 571 Deep Processing Techniques for NLP January 13, 2016 No Class

Face detection and recognition Detection Recognition Sally Face detection &

Strong Nouns M&R 2662 ENG240Y Old English / Wed 22 Sep 2010 Recap: strong noun

Interpretation cannot determine the source of multiple sluicing in Hungarian Eszter Ronai &