The Thirty-sixth International Conference on Machine Learning
Empirical Analysis of Beam Search Performance Degradation in Neural Sequence Models
Eldan Cohen, J. Christopher Beck
Poster: Pacific Ballroom #47
Motivation

• Beam search is the most commonly used inference algorithm for neural sequence decoding
• Intuitively, increasing the beam width should lead to better solutions
• In practice, larger beams cause performance degradation: while the search finds solutions that are more probable, they tend to have lower evaluation scores
• One of the six main challenges in machine translation (Koehn & Knowles, 2017)
Beam Search Performance Degradation

Task           Dataset    Metric   B=1     B=3     B=5     B=25    B=100   B=250
Translation    En-De      BLEU4    25.27   26.00   26.11   25.11   23.09   21.38
Translation    En-Fr      BLEU4    40.15   40.77   40.83   40.52   38.64   35.03
Summarization  Gigaword   R-1 F    33.56   34.22   34.16   34.01   33.67   33.23
Captioning     MSCOCO     BLEU4    29.66   32.36   31.96   30.04   29.87   29.79

• Degradation appears across different tasks: translation, summarization, image captioning
• Previous work highlighted potential explanations:
  • Machine translation: source copies (Ott et al., 2018)
  • Image captioning: training set predictions (Vinyals et al., 2017)
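For concreteness, here is a minimal sketch of the plain beam search decoder that the beam-width column B refers to. This is not the authors' implementation; log_probs_fn is a hypothetical interface to a trained model's next-token log-probabilities.

```python
import heapq

def beam_search(log_probs_fn, sos_id, eos_id, beam_width, max_len=50):
    """Plain beam search: at each step, expand every hypothesis in the
    beam and keep the beam_width extensions with the highest cumulative
    log-probability. log_probs_fn(prefix) is assumed to return
    (token_id, log_prob) pairs for the next token."""
    beam = [(0.0, [sos_id])]              # (cumulative log-prob, token ids)
    finished = []
    for _ in range(max_len):
        candidates = []
        for score, seq in beam:
            if seq[-1] == eos_id:         # hypothesis already complete
                finished.append((score, seq))
            else:
                for tok, lp in log_probs_fn(seq):
                    candidates.append((score + lp, seq + [tok]))
        if not candidates:                # every hypothesis has finished
            break
        beam = heapq.nlargest(beam_width, candidates, key=lambda c: c[0])
    finished.extend(beam)                 # fall back to whatever remains
    return max(finished, key=lambda c: c[0])
```

With a real model's softmax output plugged in as log_probs_fn, the beam_width argument corresponds to the B columns in the table above.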
Analytical Framework: Search Discrepancies

• Inspired by search discrepancies in combinatorial search (Harvey & Ginsberg, 1995)
• A search discrepancy occurs at sequence position t when the chosen token y_t is not the most probable one:

  log P_θ(y_t | x; {y_0, ..., y_{t−1}}) < max_{y ∈ V} log P_θ(y | x; {y_0, ..., y_{t−1}})

• The discrepancy gap at position t is the log-probability difference (equivalently, the probability ratio) between the most probable token and the chosen token:

  max_{y ∈ V} log P_θ(y | x; {y_0, ..., y_{t−1}}) − log P_θ(y_t | x; {y_0, ..., y_{t−1}})
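Both quantities are easy to compute from a decoded sequence. A minimal sketch, assuming step_logprobs holds the model's per-position log-probability vectors (NumPy arrays over the vocabulary) and chosen_ids the tokens the search actually selected; the names are ours:

```python
import numpy as np

def find_discrepancies(step_logprobs, chosen_ids):
    """Return (position, gap) pairs for every search discrepancy:
    positions where the chosen token was not the argmax of the
    model's next-token distribution."""
    results = []
    for t, (logp, y_t) in enumerate(zip(step_logprobs, chosen_ids)):
        gap = float(np.max(logp) - logp[y_t])  # discrepancy gap, >= 0
        if gap > 0:                            # chosen token != argmax
            results.append((t, gap))
    return results
```

Note that greedy decoding (B = 1) produces no discrepancies by construction; wider beams can afford to keep lower-ranked tokens, which is what the analysis below measures.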
Empirical Analysis (WMT’14 En-De)

[Figure: search discrepancies vs. sequence position]
• Increasing the beam width leads to more early discrepancies
• For larger beam widths, these discrepancies are more likely to be associated with degraded solutions
Empirical Analysis (WMT’14 En-De)

[Figure: discrepancy gap vs. sequence position]
• As we increase the beam width, the gap of early discrepancies in degraded solutions grows
Discrepancy-Constrained Beam Search

Example expansion of the prefix "<sos> comment":

  Candidates:        vas [-0.69]   est [-0.92]   venu [-2.99]   ...
  Discrepancy gap:   0             0.23          2.30           ...   (constrained to ≤ N)
  Candidate rank:    1             2             3              ...   (constrained to ≤ M)

• N and M are hyper-parameters, tuned on a held-out validation set
• The methods successfully eliminate the performance degradation
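A sketch of the constrained expansion step, under the same assumptions as the earlier sketches; the function and parameter names (gap_limit for N, rank_limit for M) are ours, not the paper's:

```python
import numpy as np

def constrained_candidates(logp, gap_limit, rank_limit):
    """Filter next-token candidates before the usual top-B selection:
    keep token y only if its discrepancy gap (max logp - logp[y]) is
    at most gap_limit (N) and its rank is at most rank_limit (M).
    logp is the model's log-probability vector at the current step."""
    order = np.argsort(-logp)            # tokens by decreasing probability
    best = logp[order[0]]
    kept = []
    for rank, y in enumerate(order, start=1):
        if rank > rank_limit:
            break
        if best - logp[y] > gap_limit:
            break                        # gaps only grow down the ranking
        kept.append((int(y), float(logp[y])))
    return kept
```

Using this as the candidate generator inside the earlier beam_search sketch bounds how large the discrepancies taken by the search can get, mirroring the two constraints on the slide.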
Summary

• Analytical framework based on search discrepancies
• Performance degradation is associated with early, large search discrepancies
• We propose two heuristics based on constraining the search discrepancies; they successfully eliminate the performance degradation
• In the paper:
  • Detailed analysis of the search discrepancies
  • Our results generalize previous observations on copies (Ott et al., 2018) and training set predictions (Vinyals et al., 2017)
  • Discussion of the biases that can explain the observed patterns