Bare-Bones Dependency Parsing: A Case for Occam's Razor?

Joakim Nivre
Uppsala University, Department of Linguistics and Philology
joakim.nivre@lingfil.uu.se
Introduction

◮ Syntactic parsing of natural language:
  ◮ Who does what to whom?
◮ Dependency-based syntactic representations:
  ◮ Binary, asymmetric relations between words
  ◮ Long tradition in descriptive linguistics
  ◮ Increasingly popular in computational linguistics
Varieties of Dependency Parsing

◮ Dependencies as internal representations (for parsers):
  ◮ Dependency relations useful for disambiguation
  ◮ Incorporated into head-lexicalized grammars
◮ Example: The Collins Parser [Collins 1997]
◮ Dependencies as final representations (for applications):
  ◮ Information extraction [Culotta and Sorensen 2004]
  ◮ Question answering [Bouma et al. 2005]
  ◮ Machine translation [Ding and Palmer 2004]
◮ Example: The Stanford Parser [Klein and Manning 2003]
◮ Dependencies as the one and only representation:
  ◮ If we only want a dependency tree, why do more?
  ◮ Bare-bones dependency parsing [Eisner 1996]
◮ Occam's razor: pluralitas non est ponenda sine necessitate (plurality should not be posited without necessity)
Outline

◮ Basic concepts of dependency parsing:
  ◮ Representations, metrics, benchmarks
◮ Parsing methods for bare-bones dependency parsing:
  ◮ Chart parsing techniques
  ◮ Parsing as constraint satisfaction
  ◮ Transition-based parsing
  ◮ Hybrid methods
◮ Comparative evaluation:
  ◮ Different types of parsers evaluated on dependency output
  ◮ Can we really appeal to Occam's razor?
Basic Concepts

Dependency Graphs

◮ A dependency graph for a sentence S = w_1, ..., w_n is a directed graph G = (V, A), where:
  ◮ V = {1, ..., n} is the set of nodes, representing tokens,
  ◮ A ⊆ V × V is the set of arcs, representing dependencies.
◮ Note:
  ◮ Arc i → j is a dependency with head w_i and dependent w_j
  ◮ Arc i → j may be labeled with a dependency type r ∈ R
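To make the definitions concrete, here is a small sketch (an illustration, not from the slides) that encodes the labeled dependency graph for the example sentence "who did you see" as a set of arcs and checks the tree constraint (single head per node, all nodes reachable from an artificial root 0):

```python
# Minimal sketch: a labeled dependency graph G = (V, A) stored as arc triples.
sentence = ["who", "did", "you", "see"]       # tokens w_1..w_n
V = set(range(1, len(sentence) + 1))          # nodes 1..n
# Arcs (head i, dependent j, relation r); node 0 is an artificial root.
A = {(4, 1, "OBJ"), (0, 2, "ROOT"), (2, 3, "SBJ"), (2, 4, "VG")}

def is_tree(nodes, arcs):
    """Tree constraint: every node has exactly one head, and every node
    reaches the root 0 by following head links (hence no cycles)."""
    heads = {}
    for h, d, _ in arcs:
        if d in heads:            # more than one incoming arc -> not a tree
            return False
        heads[d] = h
    if set(heads) != nodes:       # some node has no head
        return False
    for d in nodes:               # follow the head chain up to the root
        seen, cur = set(), d
        while cur != 0:
            if cur in seen:       # cycle
                return False
            seen.add(cur)
            cur = heads[cur]
    return True

print(is_tree(V, A))  # True
```

Dropping the single-head check (and requiring only connectedness and acyclicity) yields the DAG representations mentioned on a later slide.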
Constraints on Dependency Graphs

◮ G must be a projective tree:
  ◮ All subtrees have a contiguous yield
  ◮ Simple conversion from/to phrase structure trees
  ◮ Hard to represent long-distance dependencies
◮ G must be a tree:
  ◮ Subtrees may have a discontiguous yield
  ◮ Allows non-projective arcs for long-distance dependencies
  ◮ Prague Dependency Treebank [Hajič et al. 2001] (25% of trees)
◮ G must be connected and acyclic (a DAG):
  ◮ A node may have more than one incoming arc
  ◮ Allows multiple heads for deep syntactic relations
  ◮ Danish Dependency Treebank [Kromann 2003]
Parsing Problem

◮ Input: S = w_1, ..., w_n
◮ Output: G* = argmax_{G ∈ G(S)} F(S, G)
◮ Note:
  ◮ F(S, G) is the score of G for S
  ◮ G(S) is the space of possible dependency graphs for S
  ◮ Nodes are given by the input; only the arcs need to be found
  ◮ With the tree constraint, parsing is the assignment of a head h_i and relation r_i to each node i

Example:

  Input    Word w_i ∈ S           who   did    you   see
           PoS tag                WP    VBD    PRP   VB
           Node i ∈ V             1     2      3     4
  Output   Head h_i ∈ V ∪ {0}     4     0      2     2
           Relation r_i ∈ R       OBJ   ROOT   SBJ   VG
Evaluation Metrics

◮ Accuracy on individual arcs:
  ◮ Recall (R) = |PARSED ∩ GOLD| / |GOLD|
  ◮ Precision (P) = |PARSED ∩ GOLD| / |PARSED|
  ◮ Attachment score (AS) = P = R (only for trees)
◮ All metrics can be labeled (L) or unlabeled (U)
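A minimal sketch of these metrics (names illustrative): arcs are (head, dependent, label) triples for the labeled variants; drop the label for the unlabeled ones. Because both outputs are trees over the same tokens, |PARSED| = |GOLD| = n, so precision, recall, and attachment score coincide:

```python
# Arc-level evaluation: compare the parsed arc set to the gold arc set.
def metrics(parsed, gold):
    correct = len(parsed & gold)            # arcs in both sets
    recall = correct / len(gold)
    precision = correct / len(parsed)
    return precision, recall

gold   = {(0, 2, "ROOT"), (2, 3, "SBJ"), (2, 4, "VG"), (4, 1, "OBJ")}
parsed = {(0, 2, "ROOT"), (2, 3, "SBJ"), (2, 4, "VG"), (2, 1, "OBJ")}  # one wrong head

print(metrics(parsed, gold))  # (0.75, 0.75) -- P = R = attachment score
```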
Benchmark Data Sets

◮ Penn Treebank (PTB) [Marcus et al. 1993]:
  ◮ Phrase structure annotation converted to dependencies
  ◮ Penn2Malt – projective trees [Nivre 2006]
  ◮ Stanford – projective trees or graphs [de Marneffe et al. 2006]
◮ Prague Dependency Treebank (PDT) [Hajič et al. 2001]:
  ◮ Native dependency annotation – non-projective trees
◮ CoNLL Shared Tasks [Buchholz and Marsi 2006, Nivre et al. 2007]:
  ◮ CoNLL-06: 13 languages (trees, mostly non-projective)
  ◮ CoNLL-07: 10 languages (trees, mostly non-projective)
Parsing Methods

◮ Parsing methods for bare-bones dependency parsing:
  ◮ Chart parsing techniques
  ◮ Parsing as constraint satisfaction
  ◮ Transition-based parsing
  ◮ Hybrid methods
Chart Parsing Techniques

◮ Context-free dependency grammar: H → L_1 ... L_m h R_1 ... R_n
◮ Parsing methods:
  ◮ Standard chart parsing techniques (CKY, Earley, etc.)
  ◮ Goes back to the 1960s [Hays 1964, Gaifman 1965]
  ◮ Grammar can be augmented/replaced with a statistical model
  ◮ Efficiency gains thanks to dependency tree constraints
Eisner's Algorithm

◮ In standard CKY-style parsing, chart items are trees
◮ Eisner's algorithm [Eisner 1996, Eisner 2000]:
  ◮ Split-head representation
  ◮ Chart items are (complete or incomplete) half-trees

  CKY:     C[i, h, l, h′, j]  ⇒  O(n^5)
  Eisner:  C[h, h′, j]        ⇒  O(n^3)
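A hedged sketch of the algorithm for first-order (arc-factored) scores, computing only the best tree score (a real parser would also keep backpointers to recover the tree). The score matrix and variable names are illustrative; score[h][d] is the score of arc h → d, node 0 is an artificial root, and disallowed arcs get -inf:

```python
# Sketch of Eisner's O(n^3) dynamic program over split-head half-trees.
def eisner(score):
    n = len(score) - 1
    # C[s][t][d][c]: best score for span s..t
    #   d = 0: head at right end t;   d = 1: head at left end s
    #   c = 0: incomplete half-tree;  c = 1: complete half-tree
    C = [[[[0.0, 0.0] for _ in range(2)] for _ in range(n + 1)]
         for _ in range(n + 1)]
    for k in range(1, n + 1):                 # span width
        for s in range(n - k + 1):
            t = s + k
            # incomplete: join two facing complete half-trees, add arc s<->t
            best = max(C[s][r][1][1] + C[r + 1][t][0][1] for r in range(s, t))
            C[s][t][0][0] = best + score[t][s]    # arc t -> s
            C[s][t][1][0] = best + score[s][t]    # arc s -> t
            # complete: extend an incomplete span with a complete one
            C[s][t][0][1] = max(C[s][r][0][1] + C[r][t][0][0]
                                for r in range(s, t))
            C[s][t][1][1] = max(C[s][r][1][0] + C[r][t][1][1]
                                for r in range(s + 1, t + 1))
    return C[0][n][1][1]      # best projective tree rooted at node 0

NEG = float("-inf")
# toy 2-token sentence: root->1 = 5, root->2 = 1, 1->2 = 4, 2->1 = 2
toy = [[NEG, 5, 1], [NEG, NEG, 4], [NEG, 2, NEG]]
print(eisner(toy))  # 9.0  (root -> 1, 1 -> 2)
```

The chart has O(n^2) spans with O(n) split points each, giving the O(n^3) bound from the table above.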
Statistical Models

◮ Chart parsing requires a factorized scoring function F:

  T* = argmax_{T ∈ T(S)} F(S, T)
  F(S, T) = Σ_{g ∈ T} f(S, g)

◮ The size of the subgraph g determines model complexity:

  Model       Time complexity   PTB    Reference
  1st-order   O(n^3)            90.9   [McDonald et al. 2005a]
  2nd-order   O(n^3)            91.5   [McDonald and Pereira 2006]
  3rd-order   O(n^4)            93.0   [Koo and Collins 2010]
Beyond Projective Trees

◮ Context-free techniques are limited to projective trees
◮ Extension to mildly non-projective trees:
  ◮ Well-nested trees with gap degree 1 in O(n^7) time [Kuhlmann and Satta 2009, Gómez-Rodríguez et al. 2009]
◮ Post-processing techniques:
  ◮ 2nd-order model + hill-climbing [McDonald and Pereira 2006]
  ◮ Can handle non-projective arcs as well as multiple heads
  ◮ Top-scoring model in CoNLL-06 [MSTParser]
Parsing as Constraint Satisfaction

◮ Constraint dependency grammar [Maruyama 1990]:
  ◮ Variables h_1, ..., h_n with domain {0, 1, ..., n}
  ◮ Grammar G = a set of boolean constraints
  ◮ Parsing = search for a tree in {T ∈ T(S) | ∀c ∈ G: c(S, T)}
◮ Adding soft weighted constraints [Menzel and Schröder 1998]:

  T* = argmax_{T ∈ T(S)} Σ_{c: ¬c(S,T)} f(c)

◮ Characteristics:
  ◮ Non-projective trees easily accommodated
  ◮ Constraints not inherently restricted to local subgraphs
  ◮ Exact inference intractable except in restricted cases
Approaches to Inference

◮ Maximum spanning tree parsing [McDonald et al. 2005b]:
  ◮ First-order model: constraints restricted to single arcs
  ◮ T* = maximum spanning tree in the complete graph
  ◮ Exact parsing with non-projective trees in O(n^2) time
  ◮ "An island of tractability" (D. Smith)
◮ Approximate inference for higher-order models:
  ◮ Transformational search [Foth et al. 2004]
  ◮ Gibbs sampling [Nakagawa 2007]
  ◮ Loopy belief propagation [Smith and Eisner 2008]
  ◮ Linear programming [Riedel and Clarke 2006, Martins et al. 2009]
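The maximum spanning tree view can be illustrated by brute force on a toy instance (a sketch only: enumeration is exponential, whereas real parsers use the Chu-Liu-Edmonds algorithm for the quoted O(n^2) bound; all names here are illustrative). With an arc-factored score the best tree, projective or not, is the highest-scoring head assignment that forms a tree:

```python
from itertools import product

def heads_form_tree(heads):
    """heads[d-1] = head of node d; True iff every node reaches root 0."""
    for d in range(1, len(heads) + 1):
        seen, cur = set(), d
        while cur != 0:
            if cur in seen:           # cycle (includes self-loops)
                return False
            seen.add(cur)
            cur = heads[cur - 1]
    return True

def brute_force_mst(score):           # score[h][d], node 0 = root
    """Exhaustive first-order parsing: try every head assignment."""
    n = len(score) - 1
    best, best_heads = float("-inf"), None
    for heads in product(range(n + 1), repeat=n):
        if not heads_form_tree(heads):
            continue
        s = sum(score[h][d] for d, h in enumerate(heads, start=1))
        if s > best:
            best, best_heads = s, heads
    return best, best_heads

NEG = float("-inf")
toy = [[NEG, 5, 1], [NEG, NEG, 4], [NEG, 2, NEG]]
print(brute_force_mst(toy))  # (9, (0, 1)): root -> 1, 1 -> 2
```

Unlike the chart-based sketch, nothing here enforces projectivity, which is exactly why first-order non-projective parsing is "an island of tractability".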
Transition-Based Approaches

◮ Transition-based dependency parsing:
  ◮ Define a transition system for dependency parsing
  ◮ Train a classifier to predict the next transition
  ◮ Use the classifier to do deterministic parsing
◮ Open-source implementation:
  ◮ MaltParser [Nivre et al. 2006] http://maltparser.org
◮ Characteristics:
  ◮ Highly efficient – linear time complexity for projective trees
  ◮ History-based feature models with unrestricted scope
  ◮ Sensitive to local prediction errors and error propagation
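As a sketch of this scheme, here is a minimal arc-standard transition system (one common choice; the slide does not commit to a particular system). A gold-tree oracle stands in for the trained classifier, and the example sentence and all names are illustrative:

```python
# Arc-standard: SHIFT moves the next token onto the stack; LEFT-ARC and
# RIGHT-ARC attach the two topmost stack items and pop the dependent.
def parse(n, next_transition):
    stack, buffer, arcs = [0], list(range(1, n + 1)), set()
    while buffer or len(stack) > 1:
        t = next_transition(stack, buffer, arcs)
        if t == "SHIFT":                 # oracle never shifts on empty buffer
            stack.append(buffer.pop(0))
        elif t == "LEFT-ARC":            # top is head of second-top
            d = stack.pop(-2)
            arcs.add((stack[-1], d))
        elif t == "RIGHT-ARC":           # second-top is head of top
            d = stack.pop()
            arcs.add((stack[-1], d))
    return arcs

def oracle_for(gold_heads):              # gold_heads[d] = head of node d
    """Stand-in for the trained classifier: predicts from the gold tree."""
    def next_transition(stack, buffer, arcs):
        if len(stack) >= 2:
            s1, s2 = stack[-1], stack[-2]
            if gold_heads.get(s2) == s1:
                return "LEFT-ARC"
            # RIGHT-ARC only once s1 has collected all its dependents
            if gold_heads.get(s1) == s2 and all(
                    (h, d) in arcs for d, h in gold_heads.items() if h == s1):
                return "RIGHT-ARC"
        return "SHIFT"
    return next_transition

# toy projective sentence "the dog barks": the <- dog <- barks <- root
gold = {1: 2, 2: 3, 3: 0}
print(parse(3, oracle_for(gold)))
```

Each token is shifted once and popped once, which is the source of the linear time complexity; the price is that a single wrong transition can propagate, and (in this system) non-projective trees are out of reach.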