Translating Negation: Induction, Search and Model Errors

Sub-constituents of negation
在同一个急诊的值班中，我两次没有发现病患得了盲肠炎。
During my emergency duty, I haven't diagnosed a patient with appendicitis twice.
• Cue: the morpheme, word or multi-word unit inherently expressing negation
  – im-possible, breath-less-ness, 不要脸 ("shameless"), 不少 ("quite a few"), …
  – by no means, save, …
• Event: the lexical unit the cue directly refers to
• Scope: all the elements whose falsity would prove the negation to be false
  – The event is included in the scope
www.inf.ed.ac.uk

What kind of errors?
• Manual analysis of the errors involved in translating negation (Fancellu & Webber, 2015 – ExProM @ NAACL '15)
  – Annotation of the sub-constituents of negation
  – HMEANT (Lo & Wu, 2010) to calculate precision (P), recall (R) and F1
  – Classification of the errors into deletion, reordering and insertion errors
  – Results:
    • The cue is the easiest to translate, followed by the event; the scope is difficult
    • Deletion errors occur across all categories
    • Reordering errors affect the scope
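As a rough illustration of the P, R and F1 figures mentioned above, the sketch below computes them from counts of correctly translated negation elements. This is plain precision/recall arithmetic, not an implementation of HMEANT; the function name and the example counts are hypothetical.

```python
def precision_recall_f1(correct: int, in_output: int, in_reference: int) -> tuple[float, float, float]:
    """Return (P, R, F1) from the number of correctly translated elements,
    the number of elements in the MT output and the number in the reference."""
    precision = correct / in_output if in_output else 0.0
    recall = correct / in_reference if in_reference else 0.0
    if precision + recall == 0:
        return precision, recall, 0.0
    f1 = 2 * precision * recall / (precision + recall)
    return precision, recall, f1


# Hypothetical counts, for illustration only:
# 8 elements translated correctly, 10 produced by the system, 16 expected.
p, r, f1 = precision_recall_f1(correct=8, in_output=10, in_reference=16)
```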

What is the source of these errors?
• Rule/phrase table: the best translation cannot be generated because its necessary phrases/rules are absent from the search space → induction errors
• Search space: the most probable output is absent from the search space → search errors
• Model: the model scores a sub-optimal translation higher than an optimal one → model errors

Constrained decoding
• Tries to reconstruct the reference
• Reference reachability as a proxy to analyze errors during decoding
• Implemented as a feature in Moses:
  – 1 if the hypothesis is a sub-string of the reference
  – -inf if the hypothesis is not a sub-string of the reference
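The feature described above can be sketched in a few lines. This is a standalone illustration, not the Moses feature function itself: the function names are hypothetical, and the sketch assumes whitespace-tokenized strings and a token-level (rather than character-level) sub-string test.

```python
import math


def is_contiguous_sublist(hyp: list[str], ref: list[str]) -> bool:
    """True if the tokens of hyp occur contiguously, in order, inside ref."""
    n = len(hyp)
    if n == 0:
        return True
    return any(ref[i:i + n] == hyp for i in range(len(ref) - n + 1))


def constrained_decoding_score(hypothesis: str, reference: str) -> float:
    """1 if the (partial) hypothesis is a sub-string of the reference, -inf otherwise."""
    hyp_tokens = hypothesis.split()
    ref_tokens = reference.split()
    return 1.0 if is_contiguous_sublist(hyp_tokens, ref_tokens) else -math.inf


reference = "they did not give up the government"
score_ok = constrained_decoding_score("did not give up", reference)
score_bad = constrained_decoding_score("gave up", reference)
```

A score of -inf effectively prunes every hypothesis that cannot be extended into the reference, so the decoder either reconstructs the reference or produces nothing.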

Constrained decoding
• If the reference is reconstructed:
  – Search vs. model errors (Wisniewski and Yvon, 2013), where e is the 1-best hypothesis and ê is the reconstructed reference:
    • if p(e) < p(ê): search error
    • if p(e) > p(ê): model error
• If the reference cannot be reconstructed:
  – Increase the translation option limit (Auli and Lopez, 2009)
    • if the reference can now be reconstructed → induction error
  – Increase the cube pruning pop limit
    • if the reference can now be reconstructed → search error
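The decision procedure above can be written down as a small function. This is a minimal sketch under stated assumptions: the names `ErrorType` and `classify_error` are hypothetical, the scores may be probabilities or log-probabilities (only their order matters), and the two cases the slide does not cover (equal scores, or a reference that stays unreachable) return `None`.

```python
from enum import Enum
from typing import Optional


class ErrorType(Enum):
    SEARCH = "search error"
    MODEL = "model error"
    INDUCTION = "induction error"


def classify_error(
    reference_reached: bool,
    p_best: Optional[float] = None,   # model score of the 1-best hypothesis e
    p_ref: Optional[float] = None,    # model score of the reconstructed reference ê
    reached_with_more_options: bool = False,      # after raising the translation option limit
    reached_with_larger_pop_limit: bool = False,  # after raising the cube pruning pop limit
) -> Optional[ErrorType]:
    if reference_reached:
        if p_best is None or p_ref is None:
            raise ValueError("both scores are needed when the reference is reached")
        if p_best < p_ref:
            return ErrorType.SEARCH   # the better-scoring reference was not found
        if p_best > p_ref:
            return ErrorType.MODEL    # the model prefers the worse translation
        return None                   # equal scores: not covered by the slide
    if reached_with_more_options:
        return ErrorType.INDUCTION
    if reached_with_larger_pop_limit:
        return ErrorType.SEARCH
    return None                       # still unreachable: cause left undetermined
```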

Locality issues
• Negation is usually a local phenomenon
  就拿住在村东南一个小弯子里的湾家人来说吧，虽然那一家子的家长有点不要脸，我们伟大的中村不是照样会罩着这一家吗？
  (Roughly: "Take the Wan family, who live in a small bend in the south-east of the village: although the parents of the family are somewhat shameless, doesn't our great Zhongcun look after this family all the same?")
• If we fail to reconstruct a whole reference, it is unclear whether this is because of negation
• Solution: isolate the parts containing negation and use them as input to CD
  那一家子的家长有点不要脸
  the parents of the family are somewhat shameless

Results
• We could generate at most 16 out of 54 sentences (29%)
• Enlarging the translation option limit and the cube pruning pop limit leads to only a small improvement
  – Just a few induction/search errors
• p(e) is always > p(ê) → model errors

Discussion
• Interim conclusion: one should enhance the model
• However:
  – We are basing our results on fewer than half of the test sentences
    • CD relies on only one or a few references, whereas there are virtually infinite ways of translating a sentence
  – If these are model errors, which score component is most responsible?
  – CD treats decoding as a "black box"
  – It is hard to connect CD with deletion and reordering errors

Chart analysis
• Analysis of each step during decoding
• Access to hypothesis stacks and sub-scores
  – In-depth analysis of model errors
• We can understand the causes of deletion and reordering errors
• We can analyze the translation of cue, event and scope separately
• We can analyze patterns of translation amongst these elements

How does it work?
• Input → decoding chart trace
• A good translation of negation needs to meet four conditions:
  1. The cue has to be translated (deletion)
  2. The event has to be translated (deletion)
  3. The cue has to refer to the right event (reordering)
  4. The scope elements should be placed in the correct negation scope (reordering)

How does it work? – Cont'd
• Assuming we know the elements of negation on the source side, a cell has to satisfy a given condition if it covers one or more of those elements
• Example: 他们没有放弃政府 ("they did not give up the government"). Depending on the elements a cell covers:
  – the event needs to be translated
  – the scope element has to be attached to the right event
  – the cue needs to be translated
  – the cue should refer to the right event
  – ✓ all elements should be translated and correctly related to each other
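One way to picture the cell-level checks is a function that maps a chart cell's source span to the conditions it must satisfy. The sketch below is an assumption-laden illustration, not the authors' implementation: the names are hypothetical, spans are half-open token ranges, the scope annotation for the example sentence is assumed, and the two "relational" conditions are assumed to apply only when the cell also covers the event.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class NegationAnnotation:
    cue: frozenset[int]    # source token positions of the cue
    event: frozenset[int]  # source token positions of the event
    scope: frozenset[int]  # source token positions of the scope (includes the event)


def conditions_for_cell(start: int, end: int, ann: NegationAnnotation) -> list[str]:
    """List the conditions a cell covering source tokens [start, end) must satisfy."""
    covered = set(range(start, end))
    has_cue = bool(covered & ann.cue)
    has_event = bool(covered & ann.event)
    has_other_scope = bool(covered & (ann.scope - ann.event))

    conditions = []
    if has_cue:
        conditions.append("cue translated")
    if has_event:
        conditions.append("event translated")
    if has_cue and has_event:
        conditions.append("cue refers to the right event")
    if has_other_scope and has_event:
        conditions.append("scope elements attached to the right event")
    return conditions


# 他们(0) 没有(1) 放弃(2) 政府(3); annotation assumed for illustration.
ann = NegationAnnotation(cue=frozenset({1}), event=frozenset({2}), scope=frozenset({0, 2, 3}))
cue_cell = conditions_for_cell(1, 2, ann)
cue_event_cell = conditions_for_cell(1, 3, ann)
full_cell = conditions_for_cell(0, 4, ann)
```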

Stack analysis – model errors
• Analysis of whether one component is more responsible than the others for model errors
• Source: 他们 没有 放弃 政府
• Hypothesis stack:
  1. gave up | p(e|f) p(f|e) p(LM) p lex. … ✖
  2. not | p(e|f) p(f|e) p(LM) p lex. …
  [ … ]
  10. did not give up | p(e|f) p(f|e) p(LM) p lex. … ✓
• 10 meets all conditions, 1 does not, so their sub-scores are compared:
  1: p(e|f) p(f|e) p(LM) p lex (e|f) p lex (e|f)
  10: p(e|f) p(f|e) p(LM) p lex (e|f) p lex (e|f)
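A simple way to compare the sub-scores of two stack entries is to look at the weighted difference per component. The sketch below assumes a log-linear model with log-scores and feature weights; the function name, the feature names, the weights and all numbers are invented for illustration and do not come from the experiments.

```python
def most_responsible_component(
    preferred: dict[str, float],
    correct: dict[str, float],
    weights: dict[str, float],
) -> tuple[str, dict[str, float]]:
    """Return the component whose weighted score most favours the wrongly
    preferred hypothesis, together with all per-component contributions."""
    contributions = {
        name: weights[name] * (preferred[name] - correct[name]) for name in weights
    }
    worst = max(contributions, key=contributions.get)
    return worst, contributions


# Hypothetical log-scores for hypothesis 1 ("gave up") and 10 ("did not give up").
weights = {"p(e|f)": 0.2, "p(f|e)": 0.2, "p(LM)": 0.5, "p_lex": 0.1}
hyp_1 = {"p(e|f)": -1.0, "p(f|e)": -1.5, "p(LM)": -2.0, "p_lex": -1.0}
hyp_10 = {"p(e|f)": -1.2, "p(f|e)": -1.4, "p(LM)": -4.0, "p_lex": -1.5}

component, contributions = most_responsible_component(hyp_1, hyp_10, weights)
```

With these made-up numbers the language model term dominates; on real decoder output the same comparison would show which sub-score pushes the incorrect hypothesis ahead.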

Stack analysis – search/induction errors
他们没有放弃政府
• The cue has to be translated in all marked cells

Translating Negation: Induction, Search and Model Errors
Federico Fancellu & Bonnie Webber
School of Informatics, University of Edinburgh
f.fancellu@sms.ed.ac.uk, bonnie@inf.ed.ac.uk
