Retrieving Comparative Arguments using Deep Pre-trained Language Models and NLU


  1. Retrieving Comparative Arguments using Deep Pre-trained Language Models and NLU
  Viktoriia Chekalina, Alexander Panchenko
  Skolkovo Institute of Science and Technology, Philips Innovations Lab RUS
  23 September 2020

  2. Task & Methodology
  Task:
  ◮ Given a set of topics, each with a comparative query, i.e. a question like "What is better, X or Y?"
  ◮ For each topic:
  ◮ retrieve documents from the ClueWeb12 [1] corpus using the ChatNoir [2] search engine;
  ◮ find the documents that answer the proposed comparative question most fully and reasonably.
  Methodology:
  ◮ for each topic t_i and every retrieved document d_j, compute a similarity measure s_ij = sim(t_i, d_j);
  ◮ rank the documents according to these similarities (see the sketch below).
  [1] https://lemurproject.org/clueweb12
  [2] https://www.chatnoir.eu/doc
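
A minimal sketch of this ranking step (Python); the sim callable and the plain-string topic and document representation are illustrative assumptions, not the authors' code:

    # Sketch of the per-topic ranking loop; `sim` stands for any of the
    # similarity measures described on the following slides.
    def rank_documents(topic, documents, sim):
        """Score every retrieved document against the topic and sort."""
        scored = [(doc, sim(topic, doc)) for doc in documents]
        # Higher similarity means more relevant, so sort in descending order.
        return sorted(scored, key=lambda pair: pair[1], reverse=True)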

  3. Methodology & Approaches
  [Diagram] Scheme of our methods: candidate documents and their scores come from ChatNoir (an inverted index + BM25); topic-document (query) similarity is then computed, and the documents are sorted by similarity score. In other words, candidates obtained with a traditional inverted-index IR system are re-ranked on the basis of topic-document similarity (a retrieval sketch follows this slide).
  Approaches:
  ◮ inverted-index relevance estimation from the ChatNoir system;
  ◮ approaches based on SOTA NLP models: LSTM, BERT, Transformers;
  ◮ comparative feature-based approaches: the number of comparative structures in a document and the density of comparative sentences.
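
A hedged sketch of the candidate-retrieval step. The endpoint and payload fields are assumptions based on ChatNoir's public API documentation (https://www.chatnoir.eu/doc), not code from the authors:

    import requests

    def retrieve_candidates(query, api_key, size=50):
        """Fetch candidate documents for one topic from ChatNoir."""
        response = requests.post(
            "https://www.chatnoir.eu/api/v1/_search",  # assumed endpoint
            json={"apikey": api_key, "query": query,
                  "index": ["cw12"], "size": size},    # assumed fields
        )
        response.raise_for_status()
        # Each hit carries a BM25-based relevance score (cn_ij on slide 4).
        return response.json()["results"]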

  4. Approaches
  ChatNoir built-in ranking:
  ◮ ChatNoir has a custom BM25 ranking function [3] based on an inverted index;
  ◮ similarity s_ij = cn_ij, where cn_ij is the score provided by ChatNoir.
  LSTM-ULMFiT:
  ◮ we use the pre-trained bi-directional LSTM language model ULMFiT;
  ◮ similarity s_ij = cos(h_i, h_j), where h_i and h_j are the hidden states of the model after it processes the topic title and the document body, respectively (see the sketch below).
  [3] https://www.elastic.co/guide/en/elasticsearch/reference/current/index-modules-similarity.html
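
A minimal sketch of the ULMFiT-based similarity; the encode helper, which would run the pre-trained LSTM and pool its final hidden state into one vector, is a hypothetical stand-in for the fastai language-model machinery:

    import numpy as np

    def cosine(u, v):
        """Cosine similarity between two pooled hidden-state vectors."""
        return float(np.dot(u, v) / (np.linalg.norm(u) * np.linalg.norm(v)))

    def ulmfit_similarity(encode, topic_title, document_body):
        # `encode` is a hypothetical helper returning a 1-D numpy vector.
        h_topic = encode(topic_title)
        h_doc = encode(document_body)
        return cosine(h_topic, h_doc)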

  5. Approaches
  Bidirectional Encoder Representations from Transformers (BERT):
  ◮ the BERT model provides word embeddings;
  ◮ similarity s_ij = cos(e_topic_title, e_doc_title), where e_topic_title and e_doc_title are the averages of the word embeddings in the topic's title and in the retrieved document's title, respectively (a sketch follows below).
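
A sketch of this title-to-title similarity using the Hugging Face transformers library; the bert-base-uncased checkpoint is an assumption, since the slides do not name the exact model:

    import torch
    from transformers import AutoModel, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained("bert-base-uncased")
    model = AutoModel.from_pretrained("bert-base-uncased")

    def average_embedding(text):
        """Mean of the last-layer token embeddings of a piece of text."""
        inputs = tokenizer(text, return_tensors="pt", truncation=True)
        with torch.no_grad():
            hidden = model(**inputs).last_hidden_state  # (1, seq_len, 768)
        return hidden.mean(dim=1).squeeze(0)

    def bert_title_similarity(topic_title, doc_title):
        e_topic = average_embedding(topic_title)
        e_doc = average_embedding(doc_title)
        return torch.nn.functional.cosine_similarity(e_topic, e_doc, dim=0).item()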

  6. Approaches
  Attention of a Transformer-based language model:
  ◮ we concatenate the topic title and the document body into one sequence;
  ◮ we take the output of the self-attention layers of a pre-trained Transformer model;
  ◮ based on the obtained token-to-token responses, we build attention maps;
  ◮ in these maps, we select the area that describes the responses of tokens from one sentence to the other; for attention heads 3, 4, 10 and 11, we sum over this area, and the resulting sum is the similarity (see the sketch below).
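
A sketch of the attention-based similarity, again assuming a Hugging Face BERT checkpoint; the slide names heads 3, 4, 10 and 11 but not the layer, so taking the last layer's attention maps is an assumption:

    import torch
    from transformers import AutoModel, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained("bert-base-uncased")
    model = AutoModel.from_pretrained("bert-base-uncased", output_attentions=True)

    def attention_similarity(topic_title, doc_body, heads=(3, 4, 10, 11)):
        # Encode the pair as one sequence: [CLS] topic [SEP] document [SEP].
        inputs = tokenizer(topic_title, doc_body, return_tensors="pt",
                           truncation=True)
        with torch.no_grad():
            attentions = model(**inputs).attentions  # one tensor per layer
        # token_type_ids mark topic tokens (0) vs document tokens (1).
        segment = inputs["token_type_ids"][0]
        topic_idx = (segment == 0).nonzero().squeeze(-1)
        doc_idx = (segment == 1).nonzero().squeeze(-1)
        last = attentions[-1][0]  # (num_heads, seq_len, seq_len); assumed layer
        # Sum the topic-to-document area of the map in the selected heads.
        return sum(last[h][topic_idx][:, doc_idx].sum().item() for h in heads)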

  7. Approaches
  [Figure] The attention map for the input "[CLS]" + "Which is better, a laptop or a desktop?" + "[SEP]" + "Laptop is preferable than desktop because it is more portable." + "[SEP]". Areas highlighted in red participate in the similarity calculation.

  8. Approaches
  Comparative features:
  For every topic-document pair we compute an additive-multiplicative term r_ij based on comparative features; this term describes how comparative the document is. Initially r_ij = 1.
  ◮ From the topic we extract the objects under comparison and look for the same objects in the document.
  ◮ The term is updated as:
    r_ij = r_ij * 1.2 if we find one of the objects,
    r_ij = r_ij * 1.5 if we find both objects.
  ◮ If we find l aspects (features of comparison) and predicates in the document, we increase r_ij by l (see the sketch below).
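
A sketch of the r_ij update; reading the slide's "increase r_ij to l" as an additive step by l is our interpretation, so treat the exact arithmetic as an assumption:

    def comparative_term(objects_found, aspects_and_predicates):
        """Comparative-feature term r_ij for one topic-document pair.

        objects_found: how many of the topic's comparison objects
        (0, 1 or 2) also occur in the document.
        aspects_and_predicates: the number l of comparison aspects and
        predicates found in the document.
        """
        r = 1.0  # initially r_ij = 1
        if objects_found == 1:
            r *= 1.2
        elif objects_found == 2:
            r *= 1.5
        r += aspects_and_predicates  # assumed additive update by l
        return r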

  9. Approaches
  Comparative features (continued):
  ◮ using a classifier of comparative sentences based on InferSent, we count the number n of comparative sentences in the document;
  ◮ r_ij = r_ij + n.
  We combine the proposed term with ChatNoir's and ULMFiT's scores, so the similarity becomes s_ij = cn_ij * r_ij (a combined sketch follows below).
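
A sketch combining the pieces above into the final similarity score, following the update rules on slides 8 and 9:

    def final_score(chatnoir_score, r, n_comparative_sentences):
        """Combine the ChatNoir score cn_ij with the comparative term r_ij."""
        r = r + n_comparative_sentences  # r_ij = r_ij + n
        return chatnoir_score * r        # s_ij = cn_ij * r_ij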

  10. Results
  The evaluation was carried out manually and is based on the presence of relevant information in the documents. [Table] DCG@5 (discounted cumulative gain at rank 5) scores for the approaches described above.
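
For reference, a minimal implementation of the metric; this is the standard DCG formulation, with the graded relevance values coming from the manual judgements:

    import math

    def dcg_at_5(relevances):
        """DCG@5: graded relevance discounted by log2 of the rank."""
        return sum(rel / math.log2(rank + 1)
                   for rank, rel in enumerate(relevances[:5], start=1))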

  11. Discussion
  ◮ We applied several approaches to the Argument Retrieval shared task: the baseline of a search engine, information from pre-trained NLP models, and the presence of comparative structures in documents.
  ◮ The best scores are given by the model based on inverted indices.
  ◮ The model that takes comparative structures into account gives better results than the ones based only on SOTA NLP models.
