Implementing the Projected Spatial Rich Features on a GPU
Andrew Ker, adk@cs.ox.ac.uk
Department of Computer Science, University of Oxford
SPIE/IS&T Electronic Imaging, San Francisco, 4 February 2014
Background
Features for binary classification steganalysis in raw images.

                 dimension   extraction time (1Mpix image)
  WAM  [2006]        27      negligible   moments of noise residuals
  SPAM [2009]       686      0.25 s       co-occurrences of noise residuals
  SRM  [2012]     12753+     12 s         co-occurrences of diverse noise residuals
  PSRM [2013]      12870     25 m         histograms of randomly projected, diverse, noise residuals

An experiment with 1 million images takes 50 years.
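That headline figure follows directly from the table: 10^6 images × 25 min/image = 2.5×10^7 minutes ≈ 47.5 years.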
Projected residuals
Each noise residual is convolved (⊛) with a random kernel, then quantized, and the central values are counted into 6 histogram bins.
Kernel width and height are uniform on {1,…,8}; entries are Gaussian, scaled to unit norm.
The residual is also convolved with the flipped kernel, quantized, and counted into a further 6 histogram bins.
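A minimal host-side sketch of how such a kernel could be generated, following the description above (width and height uniform on {1,…,8}, i.i.d. Gaussian entries, scaled to unit norm); the function names and the Box-Muller sampler are my own, not from the reference code:

    #include <stdlib.h>
    #include <math.h>

    /* Box-Muller: one standard Gaussian sample from two uniforms. */
    static double gaussian(void) {
        double u1 = (rand() + 1.0) / (RAND_MAX + 2.0);
        double u2 = (rand() + 1.0) / (RAND_MAX + 2.0);
        return sqrt(-2.0 * log(u1)) * cos(6.283185307179586 * u2);
    }

    /* Fill k[0..w*h-1] with a random projection kernel. */
    void random_kernel(double k[64], int *w, int *h) {
        *w = 1 + rand() % 8;            /* width  uniform on {1,...,8} */
        *h = 1 + rand() % 8;            /* height uniform on {1,...,8} */
        double norm2 = 0.0;
        for (int i = 0; i < *w * *h; i++) {
            k[i] = gaussian();
            norm2 += k[i] * k[i];
        }
        double s = 1.0 / sqrt(norm2);   /* scale to unit norm */
        for (int i = 0; i < *w * *h; i++)
            k[i] *= s;
    }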
PSRM features
[Diagram: raw image → 30 filters and min/max operations → 168 residuals → each residual convolved (⊛) with random kernels and quantized → histograms summed and concatenated to 12870 features.]
168·55·8 convolutions & histograms: 168 residuals from 30 filters, average kernel size 20 pixels.
~1.2 TFLOPs per 1Mpix image.
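A rough sanity check of that figure (my own back-of-envelope, not from the talk): 168·55·8 ≈ 7.4×10^4 convolutions, each touching ~20 kernel pixels at each of ~10^6 image positions, gives ~1.5×10^12 multiply-accumulates, the same order as the quoted ~1.2 TFLOPs.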
GPU architecture
We target the NVIDIA Tesla K20 card (GK110 GPU), costing $2800, programmed in CUDA.
Execution is in warps: 32 simultaneous identical instructions per multiprocessor (MP).
Communicating warps are grouped into blocks; blocks are interleaved concurrently on 13 MPs.
2496 FP processors: ~3.52 TFLOP/s peak.
… but memory bandwidth & latency are limiting.
GPU architecture

                   latency        size
  Registers        zero           64K words per MP
  Shared memory    ~10 cycles     ~48KB for all concurrent blocks
  Global memory    ~200 cycles    ~5GB

Global access latency is hidden by concurrently-running blocks (with immediate context switching)… parallelism vs register exhaustion.
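That trade-off can be steered explicitly. A minimal CUDA sketch (the kernel body and numbers are illustrative, not from the talk): each MP's 64K registers are shared by all resident threads, so a kernel using R registers per thread keeps at most 65536/R threads in flight to hide the ~200-cycle global latency; __launch_bounds__ caps register use to raise occupancy:

    // 256 threads per block, at least 8 resident blocks per MP:
    // the compiler must then fit each thread into 65536/(256*8) = 32 registers.
    __global__ void __launch_bounds__(256, 8)
    scale_kernel(const float *in, float *out, int n)
    {
        int i = blockIdx.x * blockDim.x + threadIdx.x;
        if (i < n) out[i] = 2.0f * in[i];   // trivial body, for illustration
    }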
GPU-PSRM features
[Diagram: the same pipeline as PSRM — raw image → residuals via min/max operations → convolve (⊛) & quantize → sum and concatenate to 12870 features — but with fixed 4×4 kernels, and the same 55 kernels reused for all residuals.]
Also consider fewer projections per residual.
Tiles
[Diagram: the image is split into padded tiles; threads 0-31 of one warp each take a column of a 32×64 tile ("pixels used by thread 1" highlighted); one block comprises 32 × Θ threads.]
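A minimal CUDA sketch of the tiling idea: each block stages a padded tile of the image in shared memory, so the convolution reads hit ~10-cycle shared memory rather than ~200-cycle global memory. All names and dimensions (TILE_W, TILE_H, PAD, …) are illustrative, not from the reference implementation:

    #define TILE_W 32          // one warp spans the tile width
    #define TILE_H 16          // rows per block (the Theta of the slide)
    #define PAD     8          // halo for the largest kernel

    __global__ void convolve_tile(const float *img, float *out,
                                  int width, int height,
                                  const float *kernel, int kw, int kh)
    {
        __shared__ float tile[TILE_H + PAD][TILE_W + PAD];

        int x0 = blockIdx.x * TILE_W;
        int y0 = blockIdx.y * TILE_H;

        // Cooperative load of the padded tile (clamped at image borders).
        for (int dy = threadIdx.y; dy < TILE_H + PAD; dy += blockDim.y)
            for (int dx = threadIdx.x; dx < TILE_W + PAD; dx += blockDim.x) {
                int sx = min(x0 + dx, width  - 1);
                int sy = min(y0 + dy, height - 1);
                tile[dy][dx] = img[sy * width + sx];
            }
        __syncthreads();

        int gx = x0 + threadIdx.x;
        int gy = y0 + threadIdx.y;
        if (gx >= width || gy >= height) return;

        // Each thread convolves one pixel against the (kw x kh) kernel.
        float acc = 0.0f;
        for (int j = 0; j < kh; j++)
            for (int i = 0; i < kw; i++)
                acc += kernel[j * kw + i] * tile[threadIdx.y + j][threadIdx.x + i];
        out[gy * width + gx] = acc;
    }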
One thread
[Diagram: the thread slides the convolution kernel
    A B C D
    E F G H
    I J K L
    M N O P
across its pixels, and likewise the flipped kernel
    M N O P
    I J K L
    E F G H
    A B C D
At each position: convolve (⊛), quantize, truncate, increment a histogram bin.]
The quantize/truncate/increment step, translated directly:

    bin = (int)floor(x);
    histogram[bin]++;

But a dynamically-indexed per-thread array is spilled to slow local memory. Replacing the indexed increment with a chain of conditionals lets the compiler keep the histogram bins in registers:

    bin = (int)floor(x);
    if (bin == 0) histogram[0]++;
    if (bin == 1) histogram[1]++;
    ...
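A self-contained sketch of this trick with the six bins held in named scalars, so that nothing can spill to local memory; all names (hist_kernel, the quantization step q, …) are mine, not from the reference code:

    __global__ void hist_kernel(const float *proj, int n, float q, int *out)
    {
        // Per-thread histogram in scalar registers: no array indexing at all.
        int h0 = 0, h1 = 0, h2 = 0, h3 = 0, h4 = 0, h5 = 0;

        int stride = gridDim.x * blockDim.x;
        for (int i = blockIdx.x * blockDim.x + threadIdx.x; i < n; i += stride) {
            int bin = (int)floorf(proj[i] / q);   // quantize
            bin = max(0, min(bin, 5));            // truncate to [0,5]
            if (bin == 0) h0++;
            if (bin == 1) h1++;
            if (bin == 2) h2++;
            if (bin == 3) h3++;
            if (bin == 4) h4++;
            if (bin == 5) h5++;
        }

        // Combine per-thread counts into a global 6-bin histogram.
        atomicAdd(&out[0], h0); atomicAdd(&out[1], h1);
        atomicAdd(&out[2], h2); atomicAdd(&out[3], h3);
        atomicAdd(&out[4], h4); atomicAdd(&out[5], h5);
    }

The slide's if-chain over histogram[i] achieves the same effect when the compiler unrolls it and promotes the array to registers; the scalar form just makes that guarantee explicit.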
Benchmarks
Machine: 16-core 2.0 GHz Sandy Bridge Xeon.

  Implementation                      wallclock extraction time for 1Mpix image
  Reference C++                       29588 s
  Reference MATLAB (single-thread)     1554 s
  Reference MATLAB (multi-thread)      1100 s  (2186 s CPU)
  Optimized CUDA (1 Tesla K20)            2.6 s  (potentially <1 s)
Accuracy
Steganalysis experiment: 10000 BOSSBase v1.01 cover images (256Kpix); HUGO embedding, 0.4bpp. Measure Ensemble FLD error on disjoint testing sets.

                   # projections                 testing      extraction time
                   per residual    dimension     error rate   (256Kpix image)
  Reference PSRM        55           12870       12.98%       491 s
  GPU-PSRM              55           12870       14.34%       0.59 s
                        40            9360       14.75%       0.45 s
                        30            7020       14.78%       0.36 s
                        20            4680       14.88%       0.27 s
                        10            2340       15.71%       0.20 s

This single experiment: 2732 core hours. Costs £136 ($223) on the Oxford University cluster (internal prices); would cost twice as much on EC2.
Conclusions
PSRM features require massive amounts of computation; a GPU implementation is the only possibility for a quick result.
GPU-PSRM features are slightly modified to be optimization-friendly: they lose a little variety, but cost only 1% additional error, and are 400-1000 times faster than current CPU implementations.
We should consider cost/benefit analysis of new features: a practitioner might prefer speed to accuracy.
Optimize implementations of previous-generation features (SRM/JRM)? Need not necessarily involve a GPU.
Source will be available from http://www.cs.ox.ac.uk/andrew.ker/gpu-psrm/