
BLEUÂTRE: Flattening Syntactic Dependencies for MT Evaluation



  1. BLEUÂTRE: Flattening Syntactic Dependencies for MT Evaluation
     Dennis N. Mehay and Chris Brew
     Department of Linguistics, The Ohio State University
     {mehay,cbrew}@ling.osu.edu
     Theoretical and Methodological Issues in MT (2007), Skövde, Sweden

  2. Outline
     1. Target Language-based MT Evaluation: The Basic Regime
     2. A Tour of Other Approaches: Motivating BLEUÂTRE
        BLEU and NIST: N-gram-based MT Evaluation
        METEOR
        Syntax-based Approaches
     3. BLEUÂTRE: Flattening and Using Word-word Dependencies
     4. Experiments with LDC TIDES Multiple Translation "Chinese"

  3. Comparing Candidates to References (Thompson, 1991)
     The reference (target-language) corpus is a one-time investment.
     Comparison is consistent and (potentially) fast, cheap, etc.

  4. Ways of Comparing Candidates to References
     Word-based methods are well represented: (Thompson, 1991; Brew and Thompson, 1994), BLEU (Papineni et al., 2002), METEOR (Banerjee and Lavie, 2005), etc.
     Syntax-based methods are gaining traction: (Liu and Gildea, 2005), (Owczarzak et al., 2007).

  5. Simulating Parsing: Combining Syntax- and Word-based Technologies
     Is there a middle ground? How do you use parse information from the references without parsing the candidates?
     Cf. TextRunner (Banko et al., 2007), which simulates parsing by training word- and POS-fed classifiers to recognise dependencies in strings.
     We want to simulate parsing in a similar way.

  6. Our Approach: BLEUÂTRE ("Bluish")
     Use syntactic information from the reference set, and "compile" it down to a form suitable for word-based comparison.
     Motivation: draw on the strengths of both word- and syntax-based approaches; avoid parsing where possible; but look only for syntactically relevant word matches.
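The talk defines the actual flattening and scoring procedure later; as a purely illustrative sketch (an assumption on our part, not the authors' exact method), one can imagine flattening each reference dependency to an unordered head-dependent word pair and scoring a candidate by the fraction of such pairs whose words it realises:

```python
# Illustrative only: NOT the paper's exact scoring scheme. Assumes reference
# dependencies are already available as (head, dependent) word pairs; the
# candidate is credited for each flattened pair whose two words both occur
# in the candidate string, regardless of their surface order.
def flattened_dep_recall(candidate_tokens, ref_dependencies):
    cand = set(candidate_tokens)
    matched = sum(1 for head, dep in ref_dependencies
                  if head in cand and dep in cand)
    return matched / len(ref_dependencies) if ref_dependencies else 0.0

# Hypothetical dependencies for the reference "Please fill your name in":
deps = [("fill", "please"), ("fill", "name"), ("fill", "in"), ("name", "your")]
print(flattened_dep_recall("please fill in your name".split(), deps))  # 1.0
```

Because the pairs are unordered, the legitimate reordering "Please fill in your name" still matches every flattened dependency, which is the intuition the approach is after.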

  7. BLEU and NIST
     Measure translation quality by n-gram overlap with the reference(s), typically for 1 ≤ n ≤ 4 or 5.
     Strengths: simple, fast and cheap (only word matching); portable (only tokenisers have to be ported or developed); the reference set is virtually the only investment.
     Shortcomings: scores sometimes do not correlate with human judgments (Callison-Burch et al., 2006); behaviour is unreliable in the presence of (good and bad) word-order variation.
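The n-gram overlap at the heart of BLEU can be sketched as modified (clipped) n-gram precision; this is a minimal illustration, not the official implementation, which additionally combines precisions over n by geometric mean and applies a brevity penalty:

```python
# Minimal sketch of BLEU-style modified n-gram precision: each candidate
# n-gram count is clipped by its maximum count in any single reference.
from collections import Counter

def ngrams(tokens, n):
    return [tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1)]

def modified_precision(candidate, references, n):
    cand_counts = Counter(ngrams(candidate, n))
    max_ref = Counter()
    for ref in references:
        for gram, count in Counter(ngrams(ref, n)).items():
            max_ref[gram] = max(max_ref[gram], count)
    clipped = sum(min(c, max_ref[g]) for g, c in cand_counts.items())
    total = sum(cand_counts.values())
    return clipped / total if total else 0.0

ref = "please fill your name in".split()
cand = "please fill in your name".split()
print(modified_precision(cand, [ref], 1))  # 1.0: every unigram is in the reference
print(modified_precision(cand, [ref], 2))  # 0.5: only 2 of 4 bigrams match
```

Note that the perfectly good paraphrase already loses half its bigram credit, which anticipates the word-order problem on the next slides.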

  8.-11. BLEU and NIST: How to Break Them
     Some words can "move around"; some cannot. BLEU and NIST do not distinguish the two cases.
     Reference: Please fill your name in
     Candidates:
       c1: Fill please your name in   ⇐ this scores higher
       c2: Please fill in your name   ⇐ perfectly good
       c3: Please fill your name in
     (Key: unigram, bigram, trigram and 4-gram matches.)
     (Callison-Burch et al., 2006): with respect to one reference, there can be more than 10^73 permutations of a sentence with the same BLEU score (or better).
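The slide's example can be checked directly. In this sketch (a simple unclipped n-gram precision over lowercased tokens, assumed for illustration), the garbled c1 and the fluent c2 tie on unigrams and bigrams, but c1 wins on trigrams:

```python
# Reproducing the slide's example: the scrambled candidate c1 outscores the
# perfectly good reordering c2 once trigram matches are counted.
def precision(cand, ref, n):
    ref_grams = {tuple(ref[i:i + n]) for i in range(len(ref) - n + 1)}
    cand_grams = [tuple(cand[i:i + n]) for i in range(len(cand) - n + 1)]
    return sum(g in ref_grams for g in cand_grams) / len(cand_grams)

ref = "please fill your name in".split()
c1 = "fill please your name in".split()   # bad: 'please' is misplaced
c2 = "please fill in your name".split()   # legitimate word-order variation

for n in (1, 2, 3):
    print(n, precision(c1, ref, n), precision(c2, ref, n))
# n=1 and n=2: the two candidates tie (1.0 and 0.5 respectively);
# n=3: c1 keeps the trigram "your name in" while c2 matches no trigram at all.
```

This is exactly the failure mode the slide highlights: pure string matching rewards preserved contiguity, not grammaticality.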

  12. METEOR: Susceptible to the Same Word-order Pitfalls
     Computes unigram precision and recall, and penalises crossing alignments with a fragmentation penalty:
         Penalty = γ · (#chunks / #unigram matches)^β
     But it incorporates no notion of better or worse crossing alignments.
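The penalty's effect can be sketched as follows, using the published form of the score (F-mean scaled by one minus the penalty); the parameter values here are the commonly cited defaults from Banerjee and Lavie (2005), assumed for illustration:

```python
# Sketch of METEOR's fragmentation penalty: the same number of unigram
# matches is scored lower when the matches fall into more chunks
# (i.e., more crossing/fragmented alignments).
def meteor_score(matches, cand_len, ref_len, chunks,
                 alpha=0.9, beta=3.0, gamma=0.5):
    if matches == 0:
        return 0.0
    p = matches / cand_len                         # unigram precision
    r = matches / ref_len                          # unigram recall
    f_mean = p * r / (alpha * p + (1 - alpha) * r) # recall-weighted harmonic mean
    penalty = gamma * (chunks / matches) ** beta
    return f_mean * (1 - penalty)

# Five matches out of five tokens, but split into 1 vs. 3 chunks:
print(meteor_score(matches=5, cand_len=5, ref_len=5, chunks=1))  # higher
print(meteor_score(matches=5, cand_len=5, ref_len=5, chunks=3))  # lower
```

The penalty counts how fragmented the alignment is, but every crossing costs the same: a meaning-preserving reorder and a meaning-destroying one are penalised identically, which is the slide's complaint.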

  13.-15. Syntax-based Approaches: (Liu and Gildea, 2005) and (Owczarzak et al., 2007)
     Compare at the constituent or dependency level, so a candidate is no longer punished for legitimate word-order variation.
     But: MT output is messy. How do you parse ill-formed input? (E.g., Fill please your name in.)
