Does Data Augmentation Lead to Positive Margin? Dimitris Po-Ling - - PowerPoint PPT Presentation

does data augmentation lead to positive margin
SMART_READER_LITE
LIVE PREVIEW

Does Data Augmentation Lead to Positive Margin? Dimitris Po-Ling - - PowerPoint PPT Presentation

Does Data Augmentation Lead to Positive Margin? Dimitris Po-Ling Loh Shashank Rajput* Zhili Feng* Zachary Charles Papailiopoulos * Equal Contribution Data Augmentation (DA) DA means increasing the training set artificially. Used to


slide-1
SLIDE 1

Does Data Augmentation Lead to Positive Margin?

Zhili Feng* Zachary Charles Po-Ling Loh Shashank Rajput* Dimitris Papailiopoulos * Equal Contribution

slide-2
SLIDE 2

Data Augmentation (DA)

  • DA means increasing the training set artificially.
  • Used to train state of the art deep models.

Rotations, crops Noise

slide-3
SLIDE 3

Why use Data Augmentation (DA)?

Aim:

Build a model that is robust to slight perturbations of input

Idea:

Train on perturbed versions of the inputs! Works in practice! But can we prove it?

slide-4
SLIDE 4

Setup

S' S

DA Training Set

w'

Model Augmented Dataset Learning

  • What margin does w’ achieve with

respect to S ?

slide-5
SLIDE 5

Setup

S' S

DA Training Set

w'

Model Augmented Dataset Learning

  • What margin does w’ achieve?

No DA

  • Enforces no margin è Not robust

Blackbox learner – Outputs ANY classifier that fits the training set

slide-6
SLIDE 6

Setup

S' S

DA Training Set

w'

Model Augmented Dataset Learning

  • What margin does w’ achieve?

No DA

  • Enforces no margin è Not robust

With DA

  • Enforces some margin è Robust

Blackbox learner – Outputs ANY classifier that fits the training set

slide-7
SLIDE 7

Can we use DA to enforce margin?

slide-8
SLIDE 8

Can we use DA to enforce margin?

Idea: Create an ε-net of DA points. Problem: ε-net requires exponentially many points

slide-9
SLIDE 9

Class 1 Class 2

What is the minimum number of points we need?

Theorem: d+1 points necessary and sufficient to get max-margin.

slide-10
SLIDE 10

Class 1 Class 2

Caveat: You need to know the max margin classifier – Beats the purpose!

Theorem: d+1 points necessary and sufficient to get max-margin.

What is the minimum number of points we need?

slide-11
SLIDE 11

Random DA: Points on the sphere

  • What should the radius δ be?
  • How many DA points?

δ δ

slide-12
SLIDE 12

Random DA: Points on the sphere

Max margin = !* #DA Points Margin Achieved

δ = "(!*)

"( 2%)

slide-13
SLIDE 13

Max margin = !* #DA Points Margin Achieved

δ = "(!*) "(poly(#)) δ = " $(!*√#)

"( 2()

Random DA: Points on the sphere

slide-14
SLIDE 14

Beyond Linear Classifiers

  • Similar results for classifiers which “respect” local convex hulls of training points.
  • Example: Nearest neighbor classifier.

Future Work: More structured augmentation

  • How much robustness do cropping, rotation etc. add?

Adaptive augmentation

  • What margin does Adaptive Data Augmentation (Adversarial Training) achieve?
slide-15
SLIDE 15

Thank you

  • Poster #155
  • 6:30 – 9:00 PM, Today
  • Pacific Ballroom
  • Emails: rajput3@wisc.edu, zfeng49@cs.wisc.edu