Generalized Cross Entropy Loss for Noisy Labels



  1. Generalized Cross Entropy Loss for Noisy Labels
Zhilu Zhang and Mert R. Sabuncu, Cornell University

  2. Motivation
Deep neural networks:
• often need lots of clean labeled data, which can be expensive to obtain
• can overfit to noisy labels [Zhang et al. 2016]

  3. Symmetric Loss
• A loss function is symmetric if, for some constant \(C\), \(\sum_{j=1}^{c} \mathcal{L}(f(\mathbf{x}), j) = C\) for all \(\mathbf{x}\) and all classifiers \(f\)
• Symmetric losses can be tolerant to noisy labels [Ghosh et al. 2017]
• MAE for classification with probabilistic outputs is symmetric
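To make the symmetry condition concrete, here is a small numerical check (our illustration, not from the slides; the function names are ours): for MAE on probabilistic outputs the per-class losses sum to the constant 2(c - 1) no matter what the model predicts, while for CCE the sum depends on the prediction.

```python
import numpy as np

# Check the symmetry condition sum_j L(f(x), j) = constant.
# MAE on probabilistic outputs: L(f(x), j) = ||e_j - f(x)||_1 = 2 * (1 - f_j(x)).
# CCE: L(f(x), j) = -log f_j(x).

def mae(probs, j):
    onehot = np.zeros_like(probs)
    onehot[j] = 1.0
    return np.abs(onehot - probs).sum()

def cce(probs, j):
    return -np.log(probs[j])

rng = np.random.default_rng(0)
c = 5  # number of classes
for _ in range(3):
    p = rng.dirichlet(np.ones(c))                # a random probabilistic output f(x)
    print(sum(mae(p, j) for j in range(c)),      # always 2 * (c - 1) = 8.0
          sum(cce(p, j) for j in range(c)))      # varies with p
```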

  4. Limitations of MAE
• MAE is noise-robust but can converge to lower accuracy
[Figure: ResNet on CIFAR-10. MAE shows much slower convergence than CCE and a slight gap in test accuracy.]

  5. Limitations of MAE
• MAE is noise-robust but can converge to lower accuracy
[Figure: ResNet on CIFAR-100. With MAE, the highest test accuracy achieved in 2000 epochs is 38.29%; CCE achieved better performance after only 7 epochs.]

  6. Generalized Cross Entropy (Lq Loss)
• CCE: good convergence, but prone to label noise
• MAE: more noise-robust, but bad convergence
• Use the Box-Cox transformation to combine them (a sketch of the derivation follows)
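For reference, a sketch of how the Box-Cox power transform bridges the two losses; the notation follows the paper's standard definition and is assumed here rather than copied from the slide:

```latex
\[
  T_q(z) = \frac{z^{q} - 1}{q}, \qquad \lim_{q \to 0} T_q(z) = \log z,
\]
\[
  \mathcal{L}_q\bigl(f(\mathbf{x}), e_j\bigr)
    = -\,T_q\bigl(f_j(\mathbf{x})\bigr)
    = \frac{1 - f_j(\mathbf{x})^{q}}{q}.
\]
```

Taking \(q \to 0\) recovers CCE, \(-\log f_j(\mathbf{x})\), while \(q = 1\) gives \(1 - f_j(\mathbf{x})\), i.e., MAE up to a constant factor of 2.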

  7. Generalized Cross Entropy (Lq Loss)
• \(\mathcal{L}_q(f(\mathbf{x}), e_j) = \frac{1 - f_j(\mathbf{x})^q}{q}\), with \(q \in (0, 1]\); \(q \to 0\) recovers CCE and \(q = 1\) gives MAE
• The Lq loss has a bounded sum of losses for nonzero q
• The tighter the bound, the more noise-robust the Lq loss
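A minimal PyTorch sketch of the Lq loss as defined above (the function and argument names are ours, not from the paper):

```python
import torch

def lq_loss(logits: torch.Tensor, targets: torch.Tensor, q: float = 0.7) -> torch.Tensor:
    """L_q(f(x), e_j) = (1 - f_j(x)^q) / q, averaged over the batch."""
    probs = torch.softmax(logits, dim=1)
    # Probability assigned to the (possibly noisy) label of each sample.
    f_j = probs.gather(1, targets.unsqueeze(1)).squeeze(1)
    return ((1.0 - f_j.pow(q)) / q).mean()

# Usage: a drop-in replacement for torch.nn.functional.cross_entropy.
logits = torch.randn(8, 10, requires_grad=True)
targets = torch.randint(0, 10, (8,))
lq_loss(logits, targets, q=0.7).backward()
```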

  8. Generalized Cross Entropy (Lq Loss)
[Figure: ResNet on CIFAR-10, with the Lq loss (q in (0, 1]) spanning CCE (q → 0) to MAE (q = 1).]

  9. Truncated Lq Loss
• Propose the truncated Lq loss
• Often has a tighter bound
• Use an alternating convex search algorithm for optimization (a minimal sketch follows)
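A hedged sketch of the truncated Lq loss and the alternating optimization it calls for, assuming the paper's formulation with a threshold k and binary per-sample weights w (all names here are ours): samples whose labeled class gets probability at most k receive the constant loss L_q(k), so they contribute no gradient.

```python
import torch

def truncated_lq_loss(logits, targets, w, q=0.7, k=0.5):
    """w * L_q(f(x), e_j) + (1 - w) * L_q(k), averaged over the batch."""
    probs = torch.softmax(logits, dim=1)
    f_j = probs.gather(1, targets.unsqueeze(1)).squeeze(1)
    lq = (1.0 - f_j.pow(q)) / q      # ordinary Lq term
    lq_k = (1.0 - k ** q) / q        # constant L_q(k) caps the loss of pruned samples
    return (w * lq + (1.0 - w) * lq_k).mean()

@torch.no_grad()
def update_weights(logits, targets, k=0.5):
    # With the network fixed, the optimal weight is w_i = 1 iff f_{y_i}(x_i) > k:
    # keep confident samples, prune the rest.
    probs = torch.softmax(logits, dim=1)
    f_j = probs.gather(1, targets.unsqueeze(1)).squeeze(1)
    return (f_j > k).float()

# Training alternates: fix w and take gradient steps on truncated_lq_loss,
# then periodically refresh w with update_weights.
```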

  10. Experiments
• ResNet on CIFAR-10, CIFAR-100 and Fashion-MNIST with synthetic noise
• Consistent improvements over CCE and MAE

Test accuracy (%):
Dataset     Noise   CCE     MAE     Lq (q = 0.7)   Trunc Lq
CIFAR-10    20%     86.98   83.72   89.83          89.70
CIFAR-10    40%     81.88   67.00   87.13          87.62
CIFAR-100   20%     58.72   15.80   66.81          67.61
CIFAR-100   40%     48.20   9.03    61.77          62.64

  11. Thank you very much for your attention! Hope to see you at Poster #101.
