SLIDE 19 19
Modify victim model prediction
Reduce prediction granularity[1]
- Full vector → top-1 / top-N
- Model extraction still effective with top-1[3]
Alter predictions
- Modify probabilities in prediction vector[6]
- Model extraction still effective with top-1[3]
- Large rate of wrong predictions is effective[5]
Degrade model utility for legitimate clients : car (car,0.85) : (boat,0.1) (cat,0.05) (car,0.75) (boat,0.1) (cat,0.05) : (plane,0.04) (dog,0.01) (fish,0.01) … Top-1 Top-n + probability Full prediction vector
[1] Tramèr et al. - Stealing Machine Learning Models via Prediction APIs. USENIX’16 (https://arxiv.org/abs/1609.02943) [3] Juuti et al. - PRADA: Protecting against DNN Model Stealing Attacks. EuroS&P’19 (https://arxiv.org/abs/1805.02628) [5] Atli et al. - Extraction of Complex DNN Models: Real Threat or Boogeyman? AAAI-EDSMLS’20 (https://arxiv.org/pdf/1910.05429.pdf) [6] Lee et al. - Defending Against NN Model Stealing Attacks Using Deceptive Perturbations. S&PW’19 (https://arxiv.org/abs/1806.00054)