Unsupervised Label Noise Modeling and Loss Correction
International Conference on Machine Learning, Long Beach, June 2019
Eric Arazo*, Diego Ortego*, Paul Albert, Noel O’Connor, and Kevin McGuinness
eric.arazo@insight-centre.org, diego.ortego@insight-centre.org
Outline
● Motivation
● Observations
● Proposed method
  ○ Label noise modeling
  ○ Loss correction approach
● Results
Motivation: why label noise?
● Top-performing DNN models require strong supervision
● Labeled data is a scarce resource
● Several alternatives relax strong supervision:
  ○ Semi-supervised learning: combine labeled and unlabeled data
  ○ Automatic labeling: cheap labels, but introduces label noise (a mix of correctly and incorrectly labeled samples)
Observations
● “Deep neural networks easily fit random labels” [1]
[Figure: fitting random labels on CIFAR-10, source: [1]]
[1] Zhang et al., “Understanding Deep Learning Requires Re-thinking Generalization”, ICLR 2017.
Observations
● Noisy samples take longer to learn
  ○ “Simple patterns are learned first” [2]
  ○ “Small loss” samples tend to be the clean ones [3]
  ○ “High learning rate prevents memorization” [4]
[Figure: per-sample cross-entropy loss vs. epoch, CIFAR-10 with 80% uniform label noise]
[2] Arpit et al., “A Closer Look at Memorization in Deep Networks”, ICML 2017.
[3] Yu et al., “How does Disagreement Help Generalization against Label Corruption?”, ICML 2019.
[4] Tanaka et al., “Joint Optimization Framework for Learning with Noisy Labels”, CVPR 2018.
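These observations can be made concrete by recording each sample's loss during the early epochs. The sketch below is illustrative only (hypothetical helper name, PyTorch assumed; not taken from the slides or the released code):

    # Hedged sketch: record per-sample cross-entropy losses over the training
    # set so the clean/noisy separation can be inspected before memorization.
    import torch
    import torch.nn.functional as F

    @torch.no_grad()
    def per_sample_losses(model, loader, device="cuda"):
        model.eval()
        losses = []
        for x, y in loader:                      # loader should iterate in a fixed order
            logits = model(x.to(device))
            loss = F.cross_entropy(logits, y.to(device), reduction="none")
            losses.append(loss.cpu())
        return torch.cat(losses)                 # one loss value per training sample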
Label noise modeling
● Before label noise memorization: clean and noisy samples are (to some extent) distinguishable in the loss
● A two-component mixture model over the per-sample loss suits the problem
[Figure: per-sample loss vs. epoch, with clean and noisy samples forming two modes before memorization]
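A rough sketch of fitting such a two-component mixture to the per-sample losses follows. It assumes the losses are min-max normalized to (0, 1), uses a Beta mixture with a method-of-moments EM update, and its function names and initialization values are illustrative rather than the authors' released implementation:

    # Hedged sketch: fit a two-component Beta mixture to normalized per-sample losses.
    import numpy as np
    from scipy import stats

    def fit_beta_mixture(losses, n_iters=10, eps=1e-4):
        losses = np.clip(losses, eps, 1 - eps)            # keep the Beta pdf finite at 0/1
        alphas = np.array([1.0, 2.0])                     # low-mean component = "clean"
        betas = np.array([2.0, 1.0])                      # high-mean component = "noisy"
        weights = np.array([0.5, 0.5])
        for _ in range(n_iters):
            # E-step: responsibility of each component for each sample
            pdf = np.stack([w * stats.beta.pdf(losses, a, b)
                            for w, a, b in zip(weights, alphas, betas)])
            resp = pdf / (pdf.sum(axis=0, keepdims=True) + eps)
            # M-step: weighted method-of-moments update of the Beta parameters
            for k in range(2):
                m = np.average(losses, weights=resp[k])
                v = np.average((losses - m) ** 2, weights=resp[k]) + eps
                common = max(m * (1 - m) / v - 1, eps)
                alphas[k], betas[k] = m * common, (1 - m) * common
                weights[k] = resp[k].mean()
        return alphas, betas, weights

    def posterior_noisy(losses, alphas, betas, weights, eps=1e-4):
        # probability that each sample belongs to the high-loss ("noisy") component
        losses = np.clip(losses, eps, 1 - eps)
        pdf = np.stack([w * stats.beta.pdf(losses, a, b)
                        for w, a, b in zip(weights, alphas, betas)])
        noisy = int(np.argmax(alphas / (alphas + betas)))  # component with the higher mean
        return pdf[noisy] / (pdf.sum(axis=0) + eps)

The posterior of the high-loss component then serves as a per-sample probability of being mislabeled, which the loss correction step on the next slides can use.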
Loss correction approach
● Bootstrapping loss correction [5] + mixup data augmentation [6]
● Our Beta Mixture Model drives our learning approach a step further by:
  ○ Preventing memorization
  ○ Correcting noisy labels to learn from them
[5] Reed et al., “Training Deep Neural Networks on Noisy Labels with Bootstrapping”, ICLR 2015.
[6] Zhang et al., “mixup: Beyond Empirical Risk Minimization”, ICLR 2018.
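Below is a hedged sketch of how the mixture posterior can drive a dynamic (soft) bootstrapping loss combined with mixup. Function names, the soft-bootstrapping variant, and the mixup alpha are assumptions for illustration, not the exact training code:

    # Hedged sketch: dynamic soft bootstrapping weighted by the BMM posterior,
    # combined with mixup. `w_noisy` is the per-sample probability of being
    # mislabeled, obtained from the Beta mixture fit above.
    import torch
    import torch.nn.functional as F

    def dynamic_bootstrap_loss(logits, targets, w_noisy):
        # mix the given label with the network's own prediction, per sample
        probs = F.softmax(logits, dim=1)
        one_hot = F.one_hot(targets, num_classes=logits.size(1)).float()
        # likely-clean samples keep their label; likely-noisy ones lean on the prediction
        soft_target = (1 - w_noisy)[:, None] * one_hot + w_noisy[:, None] * probs.detach()
        return -(soft_target * F.log_softmax(logits, dim=1)).sum(dim=1).mean()

    def mixup_bootstrap_step(model, x, y, w_noisy, alpha=32.0):
        # mixup two shuffled views of the batch, then apply the bootstrapped
        # loss to both mixed label sets
        lam = torch.distributions.Beta(alpha, alpha).sample().item()
        idx = torch.randperm(x.size(0), device=x.device)
        x_mix = lam * x + (1 - lam) * x[idx]
        logits = model(x_mix)
        return (lam * dynamic_bootstrap_loss(logits, y, w_noisy)
                + (1 - lam) * dynamic_bootstrap_loss(logits, y[idx], w_noisy[idx]))

Intuitively, samples the mixture flags as likely noisy rely on the network's own prediction instead of their given label, while mixup discourages memorization of any single (possibly corrupted) example.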
Loss correction approach
● Standard training (left) vs. proposed training (right)
[Figure: per-sample loss vs. epoch for both settings; CIFAR-10, 80% uniform label noise]
Loss correction approach
● Original labels (left) vs. labels predicted after training (right)
Results
● CIFAR-10 results
● Code on GitHub: https://git.io/svE
For more details and discussion... come to our poster! (Pacific Ballroom #176)
Thanks!