SELFIE: Refurbishing Unclean Samples for Robust Deep Learning
Hwanjun Song†, Minseok Kim†, Jae-Gil Lee†*
† Graduate School of Knowledge Service Engineering, KAIST
* Corresponding Author
• Standard Supervised Learning Setting
  – Assume training data {(x_i, y_i)}_{i=1}^N, where y_i is the true label
  – In practical settings, y_i is replaced by a noisy label ỹ_i, because label annotation is difficult: it is costly and time-consuming, requires expert knowledge, and is unattainable at scale
• Learning with Noisy Labels
  – Suffers from poor generalization on test data
  – [Figure: train and test error of VGG-19 on CIFAR-10 over 100 epochs under 0%, 20%, and 40% label noise]
• Loss Correction
  – Modifies the loss ℒ of all samples before the backward step
  – Suffers from noise accumulated by false corrections → fails to handle heavily noisy data
• Sample Selection (recent direction)
  – Selects low-loss (easy) samples as the clean samples 𝓒 for SGD (see the sketch below)
  – Explores only part of the training data → ignores useful hard samples classified as unclean
  – [Figure: (a) loss correction updates on all corrected samples; (b) sample selection updates on the selected samples only]
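The small-loss selection idea can be sketched in a few lines. This is a hypothetical illustration, not the authors' code; `model`, `noise_rate`, and the mini-batch tensors `x`, `y_noisy` are assumed inputs.

```python
import torch
import torch.nn.functional as F

def select_clean(model, x, y_noisy, noise_rate):
    """Pick the (1 - noise_rate) fraction of lowest-loss samples as the clean set C."""
    with torch.no_grad():
        per_sample_loss = F.cross_entropy(model(x), y_noisy, reduction="none")
    num_clean = int((1.0 - noise_rate) * len(per_sample_loss))
    # Low loss is treated as a proxy for a clean label (Han et al., 2018).
    return torch.topk(per_sample_loss, k=num_clean, largest=False).indices
```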
• SELFIE (SELectively reFurbIsh unclEan samples)
  – Hybrid of loss correction and sample selection
  – Introduces refurbishable samples 𝓡: samples whose labels can be "corrected with high precision"
  – Modified update equation on a mini-batch {(x_i, ỹ_i)}_{i=1}^b (sketched in code below):
    • Correct the losses of the samples in 𝓡
    • Combine them with the losses of the clean samples in 𝓒
    • Exclude the samples not in 𝓡 ∪ 𝓒

    θ_{t+1} = θ_t − α∇( |𝓡 ∪ 𝓒|^{−1} ( Σ_{x∈𝓡} ℒ(x, y^{refurb}) + Σ_{x∈𝓒∖𝓡} ℒ(x, ỹ) ) )
    (first sum: corrected losses; second sum: selected clean losses)
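A minimal sketch of this update, assuming the clean indices (𝓒), refurbishable indices (𝓡), and refurbished labels come from the selection steps on the next slide; all names are illustrative, not the authors' implementation.

```python
import torch
import torch.nn.functional as F

def selfie_step(model, optimizer, x, y_noisy, clean_idx, refurb_idx, refurb_labels):
    logits = model(x)
    y_used = y_noisy.clone()
    y_used[refurb_idx] = refurb_labels      # corrected labels for samples in R
    keep = torch.zeros(len(x), dtype=torch.bool, device=x.device)
    keep[clean_idx] = True                  # clean samples C keep their given labels
    keep[refurb_idx] = True                 # refurbishable samples R use corrected labels
    # Samples outside C ∪ R are excluded; the mean divides by |C ∪ R| as in the equation.
    loss = F.cross_entropy(logits[keep], y_used[keep])
    optimizer.zero_grad()
    loss.backward()
    optimizer.step()
    return loss.item()
```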
• Clean Samples 𝓒 from the Mini-batch 𝓜
  – Adopt loss-based separation (Han et al., 2018)
  – 𝓒 ← the (100 − noise rate)% of low-loss samples in 𝓜
• Refurbishable Samples 𝓡 from 𝓜
  – 𝓡 ← the samples with consistent label predictions across recent epochs (see the sketch below)
  – Replace each such sample's label with its most frequently predicted label: ỹ_i → y_i^{refurb}
  – Example: a sample predicted as "dog, cat, dog, dog, dog, dog, dog, cat" is consistent enough, so its refurbished label is "dog"
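The "consistent label predictions" test can be sketched as below. Measuring consistency as the normalized entropy of the predicted-label histogram, and the threshold `epsilon`, are assumptions made for illustration; the history length and threshold are hyperparameters.

```python
import numpy as np

def refurbish(pred_history, num_classes, epsilon=0.05):
    """pred_history: predicted labels for one sample over the recent epochs."""
    counts = np.bincount(np.asarray(pred_history), minlength=num_classes)
    probs = counts / counts.sum()
    nonzero = probs[probs > 0]
    uncertainty = -(nonzero * np.log(nonzero)).sum() / np.log(num_classes)
    if uncertainty <= epsilon:        # predictions are consistent enough
        return int(counts.argmax())   # refurbished label = most frequent prediction
    return None                       # not refurbishable
```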
• Synthetic Noise: pair and symmetric
  – Injected the two widely used noise types (see the sketch below)
• Realistic Noise
  – Built the ANIMAL-10N dataset with real-world noise:
    • Crawled 5 pairs of confusing animals, e.g., {(cat, lynx), (jaguar, cheetah), …}
    • Educated 15 participants for one hour
    • Asked the participants to annotate the labels
  – Summary:
    # Training: 50,000    # Test: 5,000    # Classes: 10
    Resolution: 64x64 (RGB)    Noise Rate: 8% (estimated)    Created: April 2019
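For reference, pair and symmetric noise are commonly injected as below. This is a generic sketch (the flip target used for pair noise, class c → c + 1, is an assumption), not the exact protocol of the benchmark.

```python
import numpy as np

def inject_noise(labels, noise_rate, num_classes, kind="symmetric", seed=0):
    rng = np.random.default_rng(seed)
    labels = np.asarray(labels)
    noisy = labels.copy()
    flip = rng.random(len(labels)) < noise_rate
    if kind == "symmetric":
        # Flip to any *other* class uniformly at random.
        offsets = rng.integers(1, num_classes, size=len(labels))
        noisy[flip] = (labels[flip] + offsets[flip]) % num_classes
    else:  # "pair": flip each class to one fixed confusing class, e.g., c -> c + 1
        noisy[flip] = (labels[flip] + 1) % num_classes
    return noisy
```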
• Results with the two synthetic noises (CIFAR-10, CIFAR-100)
  – [Figure: (a) varying pair noise and (b) varying symmetric noise, on CIFAR-10 and CIFAR-100]
• Results with realistic noise (ANIMAL-10N)
  – [Figure: (a) DenseNet (L=25, k=12) and (b) VGG-19]