

  1. Task: This work focuses on a cloze-style reading comprehension task over fairy stories, which is highly challenging due to diverse semantic patterns with personified expressions and references. The cloze-style task can be described as a triple <D, Q, A>, where D is a document (context), Q is a query over the contents of D in which a word or phrase is replaced with a placeholder, and A is the answer to Q.
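For illustration, one instance in the <D, Q, A> format might look like the sketch below. The document and query strings are invented here (loosely based on the frog-and-rabbit example from a later slide) purely to show the data layout, not taken from the actual datasets.

```python
# A hypothetical cloze-style instance in the <D, Q, A> format (illustrative only).
instance = {
    "D": "The frog and the little white rabbit went to the fair together.",
    "Q": "The frog went to the fair with the little white XXXXX .",  # placeholder token
    "A": "rabbit",
}
```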

  2. Representation challenges
  • Representation difficulty and computational complexity due to the large vocabulary and data sparsity.
  • Out-of-vocabulary (OOV) word issues, especially when the ground-truth answers contain rare words or named entities, which are hardly ever fully recorded in the vocabulary. There are over 13,000 characters in Chinese, while English has only 26 letters (disregarding punctuation marks). If a reading comprehension system cannot effectively handle OOV issues, its performance will not be semantically accurate for the task.

  3. Two common levels of embedding
  Character-level: 青 | 蛙 | 和 | 小 | 白 | 兔 | 去 | 赶 | 集
  Word-level: 青蛙 | 和 | 小白兔 | 去 | 赶集
  (The example sentence means "the frog and the little white rabbit go to the fair".)
  • Word-level representations are good at capturing global context and dependency relationships between words. However, rare words are often represented poorly due to data sparsity.
  • Character embeddings are more expressive for modeling sub-word morphologies, which is beneficial for dealing with rare words.
  • However, the minimal meaningful unit below the word is usually not the character, which motivates exploring an intermediate unit (the subword) between character and word to model sub-word morphologies or lexical semantics.

  4. Framework • Given the triple <D, Q, A>, the system is built in the following steps.

  5. BPE Subword Segmentation
  Words in most languages can usually be split into meaningful subword units, regardless of the writing form. For example, "indispensable" could be split into <in, disp, ens, able>.
  The generalized framework: first, all input sequences (strings) are tokenized into sequences of single-character subwords; then we repeat (a runnable sketch follows this list):
  1. Count all bigrams under the current segmentation status of all sequences.
  2. Find the bigram with the highest frequency and merge it in all sequences; note that the segmentation status is updated at this point.
  3. If the number of merges has not reached the specified limit, go back to step 1; otherwise the algorithm ends.
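The merge loop can be sketched in a few lines of Python. This is a generic illustration of the BPE procedure described above, not the authors' released implementation; function and variable names are chosen here for clarity.

```python
from collections import Counter

def bpe_segment(corpus, num_merges):
    """Learn BPE merges over a list of strings; return the segmented corpus."""
    # Start from single-character subwords.
    segmented = [list(seq) for seq in corpus]
    for _ in range(num_merges):
        # 1. Count all adjacent subword bigrams under the current segmentation.
        counts = Counter()
        for seq in segmented:
            counts.update(zip(seq, seq[1:]))
        if not counts:
            break
        # 2. Merge the most frequent bigram in every sequence.
        best = max(counts, key=counts.get)
        merged = []
        for seq in segmented:
            out, i = [], 0
            while i < len(seq):
                if i + 1 < len(seq) and (seq[i], seq[i + 1]) == best:
                    out.append(seq[i] + seq[i + 1])
                    i += 2
                else:
                    out.append(seq[i])
                    i += 1
            merged.append(out)
        segmented = merged
    return segmented

# Example: bpe_segment(["青蛙和小白兔去赶集", "小白兔回家"], num_merges=5)
```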

  6. Subword-augmented Word Embedding
  An augmented embedding (AE) straightforwardly integrates the word embedding WE(w) and the subword embedding SE(w) for a given word w. In this work, we investigate concatenation (concat), element-wise summation (sum) and element-wise multiplication (mul). The subword embedding SE(w) is generated by taking the final outputs of a bidirectional gated recurrent unit (GRU) run over the subword sequence of w. A sketch of the three combination operators is given below.
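A minimal PyTorch sketch of the combination step, assuming embedding sizes are chosen so that SE(w) matches WE(w) in dimension (required for sum and mul). The class name, dimensions and hyper-parameters here are illustrative assumptions, not the paper's exact settings.

```python
import torch
import torch.nn as nn

class SubwordAugmentedEmbedding(nn.Module):
    """Sketch: combine WE(w) with a BiGRU-based SE(w) via concat / sum / mul."""

    def __init__(self, word_vocab, subword_vocab, word_dim=200, mode="mul"):
        super().__init__()
        assert word_dim % 2 == 0
        self.word_emb = nn.Embedding(word_vocab, word_dim)
        self.subword_emb = nn.Embedding(subword_vocab, word_dim // 2)
        # BiGRU over the subword sequence of each word; concatenating its final
        # forward and backward states gives SE(w) with the same size as WE(w).
        self.gru = nn.GRU(word_dim // 2, word_dim // 2,
                          batch_first=True, bidirectional=True)
        self.mode = mode

    def forward(self, word_ids, subword_ids):
        # word_ids: (batch,)   subword_ids: (batch, max_subwords)
        we = self.word_emb(word_ids)                      # (batch, word_dim)
        _, h_n = self.gru(self.subword_emb(subword_ids))  # h_n: (2, batch, word_dim//2)
        se = torch.cat([h_n[0], h_n[1]], dim=-1)          # (batch, word_dim)
        if self.mode == "concat":
            return torch.cat([we, se], dim=-1)
        if self.mode == "sum":
            return we + se
        return we * se                                    # "mul" (element-wise)
```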

  7. Short list lookup
  Motivation: insufficient training for UNK words.
  Technique:
  • Sort the dictionary by word frequency from high to low.
  • A frequency filter ratio γ is set to filter out the low-frequency words (rare words) from the lookup table.
  • For example, if γ is 0.9, then the last 10% of low-frequency words will be mapped to UNK.
  • Thus, AE(w) can be rewritten accordingly (a sketch of the lookup follows).
  [Slide figure: a frequency-sorted vocabulary with trainable embeddings for the top 90% high-frequency words (e.g. 的, 了, 一, 我) and the bottom 10% low-frequency words (e.g. 药膏, 彩虹曲) mapped to UNK when γ = 0.9.]
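A minimal sketch of the short-list filtering, assuming a simple frequency-sorted vocabulary built from a tokenized corpus. The function name and UNK handling are illustrative, not the released code.

```python
from collections import Counter

def build_short_list(tokenized_corpus, gamma=0.9, unk="<UNK>"):
    """Keep only the top-gamma fraction of the frequency-sorted vocabulary."""
    freq = Counter(w for sent in tokenized_corpus for w in sent)
    vocab = [w for w, _ in freq.most_common()]      # sorted high -> low frequency
    keep = vocab[: int(len(vocab) * gamma)]         # short list (e.g. top 90%)
    word2id = {unk: 0}
    for w in keep:
        word2id[w] = len(word2id)

    def lookup(w):
        # Low-frequency words fall back to the UNK id; their representation
        # then relies mainly on the subword embedding SE(w).
        return word2id.get(w, word2id[unk])

    return word2id, lookup
```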

  8. Attention Module
  • Contextual representations of the document and query
  • Gated attention (a sketch follows below)
  • Probability of each candidate word being the answer
  • The predicted answer
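The slide only names the stages, so the following NumPy sketch shows one common way to realize gated attention and an attention-sum answer probability, in the style of the gated-attention reader family. The shapes, the softmax placement and the function names are assumptions rather than the paper's exact equations.

```python
import numpy as np

def softmax(x, axis=-1):
    e = np.exp(x - x.max(axis=axis, keepdims=True))
    return e / e.sum(axis=axis, keepdims=True)

def gated_attention(doc, query):
    """doc: (T_d, h) contextual document states; query: (T_q, h) query states."""
    alpha = softmax(doc @ query.T, axis=-1)   # (T_d, T_q): doc-to-query attention
    q_tilde = alpha @ query                   # (T_d, h): query-aware summary per token
    return doc * q_tilde                      # element-wise gate on the document

def answer_probability(doc_final, query_vec, doc_words, candidates):
    """Attention-sum: aggregate token scores over each candidate's occurrences."""
    scores = softmax(doc_final @ query_vec)   # (T_d,) score per document position
    return {c: float(scores[[w == c for w in doc_words]].sum()) for c in candidates}
```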

  9. Dataset and hyper-parameters
  • Three Chinese machine reading comprehension datasets, namely CMRC-2017, People's Daily (PD) and Children Fairy Tales (CFT).
  • We also use the Children's Book Test (CBT) dataset (Hill et al., 2015) to test generalization in the multilingual case.

  10. Main results
  • Our SAW Reader (mul) outperforms all other single models.
  • mul might be more informative than the concat and sum operations.

  11. Accuracy on the CBT dataset
  Our model outperforms most previously published works.

  12. Analysis
  • The models obtain the best performance when the vocabulary size is 1k and γ = 0.9.
  • For a task like reading comprehension, subwords, as a highly flexible granularity between character and word, tend to be more like characters than words.
  • The balance between word and character is quite critical, and an appropriate grain of character-word segmentation can essentially improve the word representation.

  13. Subword-Augmented Representations
  • In CMRC-2017, we observe that questions with OOV answers (denoted "OOV questions") account for 17.22% of the errors made by the best Word + Char embedding based model.
  • With BPE subword embedding, 12.17% of these "OOV questions" could be correctly answered.
  • This shows that subword representations can be essentially useful for modeling rare and unseen words.

  14. Conclusion
  • This paper presents an effective neural architecture, called subword-augmented word embedding, to enhance model performance on the cloze-style reading comprehension task.
  • The proposed SAW Reader uses subword embedding to enhance the word representation and limits the word frequency spectrum to train rare words efficiently.
  • With the help of the short list, the model size is also reduced and training is sped up.
  • Achieving state-of-the-art performance on multiple benchmarks, the proposed reader proves effective for learning joint representations at both the word and subword level and for alleviating OOV difficulties.

  15. Thanks! Q & A
