metapoison learning to craft poison
play

METAPOISON: LEARNING TO CRAFT POISON W. Ronny Huang,* Jonas - PowerPoint PPT Presentation

METAPOISON: LEARNING TO CRAFT POISON W. Ronny Huang,* Jonas Geiping,* Liam Fowl,^ Tom Goldstein *Equal Contribution ^Speaker University of Maryland NeurIPS MetaLearn 2019 DATA POISONING Training data Testing example Plane Frog Base


  1. METAPOISON: LEARNING TO CRAFT POISON W. Ronny Huang,* Jonas Geiping,* Liam Fowl,^ Tom Goldstein *Equal Contribution ^Speaker University of Maryland NeurIPS MetaLearn 2019

  2. DATA POISONING Training data Testing example Plane Frog Base

  3. DATA POISONING Training data Testing example Plane Frog Base Poison! + =

  4. DATA POISONING Training data Testing example Plane Frog Base Poison! + =

  5. LEARNING TO CRAFT Initial weights Training phase Poison Updated Forward + Backward weights Testing phase Target Forward Adversarial loss

  6. LEARNING TO CRAFT Initial weights Training phase Poison Updated Forward + Backward weights Testing phase Target Forward Adversarial loss Backprop to the poison!

  7. POISONED TRAINING DYNAMICS Weight space θ i θ i +1 θ i − 1 ⋯ without poison data Low ⋯ θ N θ 0 training loss NeurIPS Metalearn 19 (spotlight) Huang* , Geiping*, Fowl, Taylor, Goldstein, “MetaPoison: Learning to...”

  8. POISONED TRAINING DYNAMICS Weight space θ i θ i +1 θ i − 1 ⋯ without poison data Low ⋯ θ N θ 0 training ⋯ with poison data loss ⋯ θ N NeurIPS Metalearn 19 (spotlight) Huang* , Geiping*, Fowl, Taylor, Goldstein, “MetaPoison: Learning to...”

  9. POISONED TRAINING DYNAMICS Weight space θ i θ i +1 θ i − 1 ⋯ without poison data Low ⋯ θ N θ 0 training ⋯ with poison data loss ⋯ θ N Low adversarial loss NeurIPS Metalearn 19 (spotlight) Huang* , Geiping*, Fowl, Taylor, Goldstein, “MetaPoison: Learning to...”

  10. adversarial class (81%) Victim model ResNet18 Validation accuracy 85% (no drop) true class (1%) Target 5000 poisons (10%) cause classified as

  11. adversarial class 92% Victim model ResNet18 Validation accuracy true class 3% 85% (no drop) Target 5000 poisons (10%) cause classified as

Recommend


More recommend