Why NLP Needs Theoretical Syntax (It in Fact Already Uses It)
Owen Rambow
Center for Computational Learning Systems, Columbia University, New York City
rambow@ccls.columbia.edu
Key Issue: Representation
• Aravind Joshi to statisticians (adapted): “You know how to count, but we tell you what to count”
• Linguistic representations are not naturally occurring!
• They are devised by linguists
• Example: the English Penn Treebank (schematic bracketing below)
  – Beatrice Santorini (thesis: historical syntax of Yiddish)
  – Lots of linguistic theory went into the PTB
  – The PTB annotation manual is a comprehensive descriptive grammar of English
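To make this concrete, here is a schematic PTB II-style bracketing (constructed for illustration, not quoted from the corpus). Choices like the function tag -SBJ and the co-indexed empty category *T* are theoretical commitments encoded by the annotators, not raw data:

```
(S (NP-SBJ (NNP Kim))
   (VP (VBZ knows)
       (SBAR (WHNP-1 (WP what))
             (S (NP-SBJ (NNP Pat))
                (VP (VBD bought)
                    (NP (-NONE- *T*-1)))))))
```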
What Sort of Representations for Syntax?
• Syntax: links between text and meaning
• Text consists of words -> lexical models
  – Lexicalized formalisms
  – Note: bi- and monolexical versions of CFG
• Need to link to meaning (for example, PropBank)
  – Extended domain of locality to locate predicate-argument structure (see the sketch below)
  – Note: importance of dash tags (function tags) etc. in PTB II
• Tree Adjoining Grammar! (but CCG is also cool, and LFG has its own appeal)
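To make “extended domain of locality” concrete, here is a minimal sketch (the Node class and all names are hypothetical, not from any TAG toolkit) of a lexicalized elementary tree for a transitive verb: the verb and both of its argument slots live in a single tree object, so predicate-argument structure can be read off locally.

```python
# Minimal sketch of a lexicalized TAG elementary tree (illustration only).
from dataclasses import dataclass, field

@dataclass
class Node:
    label: str                      # category: "S", "NP", "VP", "V"
    children: list = field(default_factory=list)
    subst: bool = False             # substitution site = argument slot
    anchor: str = ""                # lexical anchor (the word)

def transitive_tree(verb: str) -> Node:
    """Alpha tree for a transitive verb: S(NP0! VP(V<verb> NP1!))."""
    subj = Node("NP", subst=True)               # NP0: subject slot
    obj = Node("NP", subst=True)                # NP1: object slot
    return Node("S", [subj, Node("VP", [Node("V", anchor=verb), obj])])

def argument_slots(tree: Node) -> list:
    """Collect all substitution sites. They sit in this one elementary
    tree: that is the extended domain of locality at work."""
    found = [tree] if tree.subst else []
    for child in tree.children:
        found += argument_slots(child)
    return found

tree = transitive_tree("devour")
print(len(argument_slots(tree)), "argument slots co-located with 'devour'")
# -> 2 argument slots co-located with 'devour'
```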
Why isn’t everyone using TAG?
• The PTB is not annotated with a TAG
• Need to do linguistic interpretation on the PTB to extract a TAG (Chen 2001, Xia 2001)
• This is not surprising: all linguistic representations need to be interpreted (Rambow 2010)
  – Extraction of a (P)CFG is simple and requires little interpretation
  – Extraction of a bilexical (P)CFG is not: it requires head percolation (sketched below), which is interpretation
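As an illustration of what that interpretation involves, here is a minimal sketch of head percolation in the style of Magerman/Collins head rules. The rule table is a tiny fragment I am assuming for the example, not the actual tables used for PTB extraction.

```python
# Sketch of head percolation: the "interpretation" needed to turn
# PTB constituency trees into bilexical dependencies.

HEAD_RULES = {
    # parent label: (search direction, preferred child labels in order)
    "S":  ("left",  ["VP", "S", "SBAR"]),
    "VP": ("left",  ["VBD", "VBZ", "VB", "VP"]),
    "NP": ("right", ["NN", "NNS", "NNP", "NP"]),
}

def head_child(label, children):
    """Pick the head child of a constituent via percolation rules."""
    direction, prefs = HEAD_RULES.get(label, ("left", []))
    kids = children if direction == "left" else list(reversed(children))
    for pref in prefs:
        for kid in kids:
            if kid[0] == pref:
                return kid
    return kids[0]  # fallback: first child in search direction

def lexical_head(tree):
    """Percolate head words up. tree = (label, [children]) or (tag, word)."""
    label, rest = tree
    if isinstance(rest, str):          # preterminal: (POS tag, word)
        return rest
    return lexical_head(head_child(label, rest))

# (S (NP (NNP John)) (VP (VBD saw) (NP (NNP Mary))))
tree = ("S", [("NP", [("NNP", "John")]),
              ("VP", [("VBD", "saw"), ("NP", [("NNP", "Mary")])])])
print(lexical_head(tree))  # -> "saw": the clause is headed by the verb
```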
Why isn’t everyone using TAG Parsers?
• Unclear how well they are performing
  – Phrase-structure (Parseval) evaluation is irrelevant here
• MICA parser (Bangalore et al. 2009):
  – High 80s on a linguistically motivated predicate-argument dependency structure (illustrated below)
  – MALT does slightly better on the same representation
  – But MICA output comes fully interpreted; MALT output does not
• Once we have a good syntactic pred-arg structure, tasks like semantic role labeling (PropBank) are easier
  – 95% on arguments given gold pred-arg structure (Chen and Rambow 2002)
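To see why the representation matters, consider a hypothetical contrast (not actual MICA or MALT output) between a surface dependency and a predicate-argument dependency for the raising sentence “John seems to sleep”: surface syntax attaches “John” to “seems”, but the pred-arg structure makes “John” an argument of “sleep”, which is what tasks like semantic role labeling consume.

```python
# Hypothetical dependency triples (head, dependent, relation) for
# "John seems to sleep"; labels are illustrative, not from any treebank.
surface_deps = [("seems", "John", "subj"), ("seems", "sleep", "xcomp")]
predarg_deps = [("seems", "sleep", "comp"), ("sleep", "John", "arg0")]

# Evaluating on predarg_deps rewards recovering who-did-what-to-whom.
for name, deps in [("surface", surface_deps), ("pred-arg", predarg_deps)]:
    for head, dep, rel in deps:
        print(f"{name}: {dep} --{rel}--> {head}")
```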
What Have We Learned About TAG Parsing?
• A large TAG grammar is not easy to manage computationally (MICA: 5,000 trees, 1,200 used in parsing)
• Small TAG grammars lose too much information
• Need to investigate:
  – Dynamic creation of TAG grammars: trees created in response to need (note: LTAG-spinal, Shen 2006)
  – “Bushes”: underspecified trees
  – Metagrammars (Kinyon 2003) (see the sketch after this list)
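A rough sketch of the metagrammar idea (hypothetical, not the actual Kinyon 2003 or MICA machinery): factor the grammar into a few orthogonal dimensions and generate the tree family combinatorially, so that thousands of elementary trees are maintained through a handful of statements.

```python
# Metagrammar sketch: generate a tree family from factored dimensions.
from itertools import product

SUBCAT  = ["intransitive", "transitive", "ditransitive"]
VOICE   = ["active", "passive"]
EXTRACT = ["none", "wh-subject", "wh-object", "relative"]

def compatible(subcat, voice, extraction):
    """Rule out incoherent combinations (illustrative constraints)."""
    if voice == "passive" and subcat == "intransitive":
        return False
    if extraction == "wh-object" and subcat == "intransitive":
        return False
    return True

family = [combo for combo in product(SUBCAT, VOICE, EXTRACT)
          if compatible(*combo)]
print(len(family), "tree descriptions from",
      len(SUBCAT) + len(VOICE) + len(EXTRACT), "factored statements")
# Changing one dimension updates the whole family consistently,
# instead of hand-editing thousands of individual trees.
```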
What about All Those Other Languages?
• Can’t build treebanks for 3,000 languages
• Need to understand cross-linguistic variation and use that understanding in computational models
  – Cross-linguistic variation: theoretical syntax
  – Models: NLP
  – Link: metagrammars for TAG
Summary
• Treebanks already encode insights from theoretical syntax
• They require interpretation for non-trivial models
• Applications beyond Parseval-style parsing require richer representations (and richer evaluations)
• But English is probably not the right language in which to argue for the need for richer syntactic knowledge
• The real coming bottleneck: NLP for 3,000 languages