leveraging frequency analysis for deep fake image
play

Leveraging Frequency Analysis for Deep Fake Image Recognition Joel - PowerPoint PPT Presentation

Leveraging Frequency Analysis for Deep Fake Image Recognition Joel Frank , Thorsten Eisenhofer, Lea Schnherr, Asja Fischer, Dorothea Kolossa, Thorsten Holz Which Face is Real? Which Face is Real? Which Face is Real? N 1 1 N 2 1 I x ,


  1. Leveraging Frequency Analysis for Deep Fake Image Recognition Joel Frank , Thorsten Eisenhofer, Lea Schönherr, Asja Fischer, Dorothea Kolossa, Thorsten Holz

  2. Which Face is Real?

  3. Which Face is Real?

  4. Which Face is Real? N 1 − 1 N 2 − 1 I x , y cos [ N 1 ( x + 1 2 ) k x ] cos [ N 2 ( y + 1 2 ) k y ] . π π ∑ ∑ D k x , k y = x =0 y =0

  5. Which Face is Real? N 1 − 1 N 2 − 1 I x , y cos [ N 1 ( x + 1 2 ) k x ] cos [ N 2 ( y + 1 2 ) k y ] . π π ∑ ∑ D k x , k y = x =0 y =0

  6. Which Face is Real? N 1 − 1 N 2 − 1 I x , y cos [ N 1 ( x + 1 2 ) k x ] cos [ N 2 ( y + 1 2 ) k y ] . π π ∑ ∑ D k x , k y = x =0 y =0

  7. Specific to StyleGAN?

  8. Specific to StyleGAN? Stanford Dogs BigGAN ProGAN SN-DCGAN StyleGAN

  9. Specific to StyleGAN? Stanford Dogs BigGAN ProGAN SN-DCGAN StyleGAN LSUN Bedrooms Nearest Neighbour Bilinear Binomial

  10. Advantages of the Frequency Domain

  11. Advantages of the Frequency Domain Domain Accuracy Image 75.78% Frequency 100.00%

  12. Advantages of the Frequency Domain Domain Accuracy Image 75.78% Frequency 100.00% • Experiments on corrupted data

  13. Advantages of the Frequency Domain Domain Accuracy Image 75.78% Frequency 100.00% • Experiments on corrupted data • Blurring, cropping, jpeg compression, noise, combination

  14. Advantages of the Frequency Domain Domain Accuracy Image 75.78% Frequency 100.00% • Experiments on corrupted data • Blurring, cropping, jpeg compression, noise, combination • Frequency representation performs better (bar one exception)

  15. Advantages of the Frequency Domain Domain Accuracy Image 75.78% Frequency 100.00% • Experiments on corrupted data • Blurring, cropping, jpeg compression, noise, combination • Frequency representation performs better (bar one exception) • When trained on corrupted data, frequency representation recovers higher accuracy

  16. Frequency Domain N 1 − 1 N 2 − 1 I x , y cos [ N 1 ( x + 1 2 ) k x ] cos [ N 2 ( y + 1 2 ) k y ] . π π ∑ ∑ D k x , k y = x =0 y =0

  17. Frequency Domain Discrete Cosine Transform

  18. Frequency Domain

  19. Frequency Domain N 1 − 1 N 2 − 1 I x , y cos [ N 1 ( x + 1 2 ) k x ] cos [ N 2 ( y + 1 2 ) k y ] . π π ∑ ∑ D k x , k y = x =0 y =0

  20. Frequency Domain N 1 − 1 N 2 − 1 I x , y cos [ N 1 ( x + 1 2 ) k x ] cos [ N 2 ( y + 1 2 ) k y ] . π π ∑ ∑ D k x , k y = x =0 y =0

  21. Frequency Domain N 1 − 1 N 2 − 1 I x , y cos [ N 1 ( x + 1 2 ) k x ] cos [ N 2 ( y + 1 2 ) k y ] . π π ∑ ∑ D k x , k y = x =0 y =0

  22. Frequency Domain Frequencies in x direction Frequencies in y direction N 1 − 1 N 2 − 1 I x , y cos [ N 1 ( x + 1 2 ) k x ] cos [ N 2 ( y + 1 2 ) k y ] . π π ∑ ∑ D k x , k y = x =0 y =0

  23. Specific to StyleGAN? Stanford dogs BigGAN ProGAN SN-DCGAN StyleGAN

  24. Specific to GANs? Cascaded Refinement Networks Implicit Maximum Likelihood Estimation Wang, et al., "CNN-generated images are surprisingly easy to spot... for now", CVPR 2020

  25. Upsampling? Latent 4x4 8x8 16x16 32x32 64x64 ... 1024x1024

  26. Upsampling? Odena, et al., "Deconvolution and Checkerboard Artifacts", Distill 2016 Latent 4x4 8x8 16x16 32x32 64x64 ... 1024x1024

  27. Upsampling? Odena, et al., "Deconvolution and Checkerboard Artifacts", Distill 2016 Latent 4x4 8x8 16x16 32x32 64x64 ... Strided Transposed Convolution → Upsampling + Convolution 1024x1024

  28. Upsampling? Odena, et al., "Deconvolution and Checkerboard Artifacts", Distill 2016 Latent 4x4 8x8 16x16 32x32 64x64 ... Strided Transposed Convolution → Upsampling + Convolution 1024x1024 Durall, et al., "Watch your Up-Convolution: CNN Based Generative Deep Neural Networks are Failing to Reproduce Spectral Distributions", CVPR 2020

  29. Advantages of the Frequency Domain Domain Accuracy Image 75.78% Frequency 100.00%

  30. Advantages of the Frequency Domain • Frequency domain enables linear separability

  31. Advantages of the Frequency Domain Nearest Neighbour Bilinear Binomial

  32. Advantages of the Frequency Domain • Frequency domain enables linear separability • Still artifacts for more elaborate upsampling techniques

  33. Advantages of the Frequency Domain • Frequency domain enables linear separability • Still artifacts for more elaborate upsampling techniques • For existing source attribution tasks, we can reduce the error rate by up to 75%

  34. Advantages of the Frequency Domain • Frequency domain enables linear separability • Still artifacts for more elaborate upsampling techniques • For existing source attribution tasks, we can reduce the error rate by up to 75% • Neural network training is easier and needs less training data

  35. Advantages of the Frequency Domain • Frequency domain enables linear separability • Still artifacts for more elaborate upsampling techniques • For existing source attribution tasks, we can reduce the error rate by up to 75% • Neural network training is easier and needs less training data • Experiments on corrupted data

Recommend


More recommend