Does Data Augmentation Lead to Positive Margin? Dimitris Po-Ling - PowerPoint PPT Presentation

Sep 04, 2023 •210 likes •379 views

Does Data Augmentation Lead to Positive Margin? Dimitris Po-Ling Loh Shashank Rajput* Zhili Feng* Zachary Charles Papailiopoulos * Equal Contribution Data Augmentation (DA) DA means increasing the training set artificially. Used to

Does Data Augmentation Lead to Positive Margin? Dimitris Po-Ling Loh Shashank Rajput* Zhili Feng* Zachary Charles Papailiopoulos * Equal Contribution
Data Augmentation (DA) • DA means increasing the training set artificially. • Used to train state of the art deep models. Rotations, crops Noise
Why use Data Augmentation (DA)? Aim: Build a model that is robust to slight perturbations of input Idea: Train on perturbed versions of the inputs! Works in practice! But can we prove it?
Setup Learning DA S' w' S • What margin does w’ achieve with Augmented Model Training respect to S ? Dataset Set
Setup Learning DA S' w' S • What margin does w’ achieve? Augmented Model Training Dataset Set Blackbox learner – Outputs ANY classifier that fits the training set No DA • Enforces no margin è Not robust
Setup Learning DA S' w' S • What margin does w’ achieve? Augmented Model Training Dataset Set Blackbox learner – Outputs ANY classifier that fits the training set No DA With DA • Enforces no margin è Not robust • Enforces some margin è Robust
Can we use DA to enforce margin?
Can we use DA to enforce margin? Idea: Create an ε-net of DA points. Problem: ε-net requires exponentially many points
What is the minimum number of points we need? Class 1 Class 2 Theorem : d+1 points necessary and sufficient to get max - margin .
What is the minimum number of points we need? Class 1 Class 2 Theorem : d+1 points necessary and sufficient to get max - margin . Caveat: You need to know the max margin classifier – Beats the purpose!
Random DA: Points on the sphere δ δ • What should the radius δ be? • How many DA points?
Random DA: Points on the sphere Max margin = ! * Margin Achieved δ = " ( ! *) " ( 2 % ) #DA Points
Random DA: Points on the sphere Max margin = ! * $ ( ! * √# ) Margin Achieved δ = " ( ! *) δ = " " ( 2 ( ) " ( poly ( # )) #DA Points
Beyond Linear Classifiers • Similar results for classifiers which “respect” local convex hulls of training points. • Example: Nearest neighbor classifier. Future Work: More structured augmentation • How much robustness do cropping, rotation etc. add? Adaptive augmentation • What margin does Adaptive Data Augmentation (Adversarial Training) achieve?
Thank you • Poster #155 • 6:30 – 9:00 PM, Today • Pacific Ballroom • Emails: rajput3@wisc.edu, zfeng49@cs.wisc.edu

Recommend

Data Augmentation in NLP 2020-03-21 Xiachong Feng Outline Why we need Data Augmentation?

Data Augmentation in NLP 2020-03-21 Xiachong Feng Outline Why we need Data Augmentation? Data Augmentation in CV Widely Used Methods EDA Back-Translation Contextual Augmentation Methods based on Pre-trained Language

483 views • 46 slides

Population Based Augmentation Efficient Learning of Augmentation Policy Schedules Daniel Ho , Eric

Population Based Augmentation Efficient Learning of Augmentation Policy Schedules Daniel Ho , Eric Liang, Ion Stoica, Pieter Abbeel, Xi Chen Efficiently learn data augmentation policies to improve neural network performance. Data Augmentation

335 views • 8 slides

Max Margin-Classifier Oliver Schulte - CMPT 726 Bishop PRML Ch. 7 Maximum Margin Criterion Math

Maximum Margin Criterion Math Maximizing the Margin Non-Separable Data Max Margin-Classifier Oliver Schulte - CMPT 726 Bishop PRML Ch. 7 Maximum Margin Criterion Math Maximizing the Margin Non-Separable Data Outline Maximum Margin

582 views • 55 slides

Support Vector Machines Greg Mori - CMPT 419/726 Bishop PRML Ch. 7 Maximum Margin Criterion

Maximum Margin Criterion Math Maximizing the Margin Non-Separable Data Support Vector Machines Greg Mori - CMPT 419/726 Bishop PRML Ch. 7 Maximum Margin Criterion Math Maximizing the Margin Non-Separable Data Outline Maximum Margin

1.13k views • 37 slides

image-augmentation April 9, 2019 1 Image Augmentation In [1]: % matplotlib inline import d2l

image-augmentation April 9, 2019 1 Image Augmentation In [1]: % matplotlib inline import d2l import mxnet as mx from mxnet import autograd, gluon, image, init, nd from mxnet.gluon import data as gdata, loss as gloss, utils as gutils import sys

109 views • 8 slides

Becky Coffin Kingfisher plc Net Positive 2 Net Positive 3 Net Positive 4 Creating the

Achieving Net Positive through responsible sourcing Becky Coffin Kingfisher plc Net Positive 2 Net Positive 3 Net Positive 4 Creating the leader Net Positive 5 Net Positive 6 WHAT HAVE WE LEARNED ABOUT RESPONSIBLE SOURCING?

450 views • 19 slides

Galileo Local Element Augmentation System Galileo Local Element Augmentation System (GALILEA)

Galileo Local Element Augmentation System Galileo Local Element Augmentation System (GALILEA) (GALILEA) Galileo Workshop for SMEs SMEs Galileo Workshop for organised by

354 views • 12 slides

About this class Maximizing the Margin Maximum margin classifiers Picture of large and small

About this class Maximizing the Margin Maximum margin classifiers Picture of large and small margin hyperplanes SVMs: geometric derivation of the primal prob- lem Intuition: large margin condition acts as a reg- ularizer and should generalize

219 views • 9 slides

ECE 417 Fall 2018 Lecture 19: Mini-Batch Training and Data Augmentation Mark Hasegawa-Johnson

Annealing Mini-Batch Training Data Augmentation Conclusions ECE 417 Fall 2018 Lecture 19: Mini-Batch Training and Data Augmentation Mark Hasegawa-Johnson University of Illinois October 25, 2018 Annealing Mini-Batch Training Data

501 views • 28 slides

SwitchOut: An Efficient Data Augmentation for Neural Machine Translation Xinyi Wang , Hieu

SwitchOut: An Efficient Data Augmentation for Neural Machine Translation Xinyi Wang , Hieu Pham , Zihang Dai, Graham Neubig November 2, 2018 :equal contribution 1 / 41 Data Augmentation Neural models are data hungry, while

782 views • 41 slides

Convolutional Neural Networks with Data Augmentation against Jitter-Based Countermeasures Eleonora

Convolutional Neural Networks with Data Augmentation against Jitter-Based Countermeasures Convolutional Neural Networks with Data Augmentation against Jitter-Based Countermeasures Eleonora Cagli 1 , 3 ecile Dumas 1 C Emmanuel Prouff 2 , 3 1

1.28k views • 100 slides

Topic #28 Nyquist plots: Gain and phase margin Reference textbook : Control Systems, Dhanesh N.

ME 779 Control Systems Topic #28 Nyquist plots: Gain and phase margin Reference textbook : Control Systems, Dhanesh N. Manik, Cengage Publishing, 2012 1 Nyquist plots: Gain and Phase margin Gain Margin and Phase Margin phase crossover

580 views • 21 slides

Keep Lead from Keep Lead from Lurking Lurking Lead Testing and Lead Testing and Healthy

Keep Lead from Keep Lead from Lurking Lurking Lead Testing and Lead Testing and Healthy Healthy Homes to Help Prevent Childhood Lead Poisoning Homes to Help Prevent Childhood Lead Poisoning Oct 30, 2019 Recharge for Resilience Conference

397 views • 20 slides

Improving Molecular Design by Stochastic Iterative Target Augmentation Kevin Yang, Wengong Jin,

Improving Molecular Design by Stochastic Iterative Target Augmentation Kevin Yang, Wengong Jin, Kyle Swanson, Regina Barzilay, Tommi Jaakkola 15-Second Overview Data augmentation approach: improve molecular optimization SOTA by > 10%

621 views • 50 slides

IPR/Reservoir Augmentation Reservoir Storage Permitting Issues Michael R. Welch, Ph.D., P.E.

IPR/Reservoir Augmentation Reservoir Storage Permitting Issues Michael R. Welch, Ph.D., P.E. Focus of todays discussion: Present overview of reservoir-related regulations for indirect potable reuse/reservoir augmentation (IPR/RA)

758 views • 26 slides

Federal Aviation Administration Overview Wide Area Augmentation System (WAAS) Status

Federal Aviation Administration Overview Wide Area Augmentation System (WAAS) Status Local Area Augmentation System (LAAS) Status Automated Dependent Surveillance Broadcast (ADS-B) Status eLoran Royal Institute of Navigation

460 views • 30 slides

Bidens new consolidated lead in the ba battlegr ttleground ound Phone Poll August 24, 2020

Bidens new consolidated lead in the ba battlegr ttleground ound Phone Poll August 24, 2020 Biden moves up to 10-point lead in battleground, with Democratic sweep possible PRESIDENTIAL, HOUSE & SENATE BALLOTS BATTLEGROUND Democratic

480 views • 13 slides

Inquiry methods that lead students through the process of authentic scientific discovery Sarah

Inquiry methods that lead students through the process of authentic scientific discovery Sarah Richardson DePaul University Biological Sciences Understanding Learning a body how new of knowledge knowledge is created in VS. discipline

592 views • 12 slides

Lead Us Not Into Temptation Lead Us Not Into Temptation The Sixth Petition Why would God ever

Lead Us Not Into Temptation Lead Us Not Into Temptation The Sixth Petition Why would God ever lead us into temptation? Why would God ever lead us into temptation? God is not the one who tempts people. God is not the one who tempts people.

256 views • 9 slides

1 QRS: Wider is Better Variability in Electrical Activation Sequence Across all patients

Overview CRT in the non-LBBB patient When to Consider LV lead Placement in the Non-LBBB IVCD Patient ? What is the real issue here? Jag Singh MD DPhil FHRS Is the concern secondary to Associate Chief, Cardiology Division patient

437 views • 5 slides

How to Push Extreme Limits of Performance and Scale with Vector Packet Processing Technology

A feedback loop where all outputs of a process are available as causal inputs to that process How to Push Extreme Limits of Performance and Scale with Vector Packet Processing Technology Maciek Konstantynowicz FD.io CSIT Tech Project

644 views • 16 slides

April 2017 Webinar Welcome The LEAD Growth & Performance System is designed to recognize,

LEAD GROWTH & PERFORMANCE SYSTEM April 2017 Webinar Welcome The LEAD Growth & Performance System is designed to recognize, support, and empower leaders in ways that result in every student achieving success (Denver Plan 2020). The

668 views • 13 slides

TypeWell Conference 2013: Listen, Learn & Lead Find ing, Fund ing a nd Dev elop ing Op p

TypeWell Conference 2013: Listen, Learn & Lead Find ing, Fund ing a nd Dev elop ing Op p ortunities for Continuing Ed uca tion, Mentoring a nd Professiona l Dev elop m ent Round Table Discussion: 1. Has your site continued to provide

292 views • 18 slides

CSBG O RG . S TANDARDS / H EAD S TART P ERFORMANCE S TANDARDS I MPLEMENTATION T OOLS 2 1

9/26/2019 Empowering Your Board to Lead: Resources for Board Training & Orientation Thursday, September 26, 2019 PRESENTED BY: Community Action Program Legal Services www.caplaw.org National Community Action Partnership

594 views • 24 slides