SPIE/IS&T Electronic Imaging, San Francisco, 25 January 2012 - PowerPoint PPT Presentation

adk @ cs.ox.ac.uk Department of Computer Science, Oxford University pevnak @ gmail.com Agent Technology Center, Czech Technical University in Prague SPIE/IS&T Electronic Imaging, San Francisco, 25 January 2012

cover objects stego objects

classified ‘cover’ classified ‘stego’ classified ‘stego’ classified ‘cover’ • Stego training set requires knowledge of embedding algorithm. • Does not ‘identify a steganographer’.

‘Actor 1’ ‘Actor 2’ ‘Actor 3’ ‘Actor 4’ ‘Actor 5’ • Many actors, transmitting many objects each. • Different actors’ sources have different characteristics: model mismatch is guaranteed!

‘Actor 1’ ‘Actor 2’ ‘Actor 3’ ‘Actor 4’ ‘Actor 5’ 1. Extract features. Use each actor’s output to estimate their overall distribution.

‘Actor 1’ ‘Actor 2’ ‘Actor 3’ ‘Actor 4’ ‘Actor 5’ 1. Extract features. Use each actor’s output to estimate their overall distribution. 2. Compute a distance between each pair of actors. 3. Identify the steganographer(s).

Features • ‘PF274’ features: 274-dimensional features for JPEGs. Same method should work with any stego-sensitive features. Distance between actors • Maximum Mean Discrepancy: Has simple consistent estimator. Identification of steganographer(s) • Previous work: hierarchical clustering. Worked well with 13 cameras as actors. Does not work well on real-world data.

Features • ‘PF274’ features: 274-dimensional features for JPEGs. Same method should work with any stego-sensitive features. Distance between actors • Maximum Mean Discrepancy: Has simple consistent estimator. Identification of steganographer(s) • New work: local outlier factor (LOF). Compares local density with density around k-nearest neighbours.

Advantages • Scale invariant. • Has only one parameter ( k ), to which it is relatively insensitive. • Allows actors to be ranked by ‘degree of being an outlier’. Even if you cannot identify the guilty actor uniquely, can hope to include them in a ‘top n most-suspicious’ list.

On a leading social networking site… • some users permit global access to images they appear in; • we can click next image or see more of user (if user permits). Automated process of following links, restricted to ‘Oxford University’ users, resulted in 4,051,928 images from 78,107 uploaders. Ethics • All data anonymized. • Kept only images, grouped by ‘owner’, no personal information. • All images globally visible at the time of download.

On a leading social networking site… • some users permit global access to images they appear in; • we can click next image or see more of user (if user permits). Automated process of following links, restricted to ‘Oxford University’ users, resulted in 4,051,928 images from 78,107 uploaders. Data set • Selected 200 images from each of 4000 uploaders (actors). • Filtered only for triviality and standard JPEG quality factor. + Same quality factor, similar size (around 1 Mpix). - Mixture of sources, even within actors. - Resampling & double-compression artefacts. - Some already-tampered images; hopefully no stego images.

• Select { 50, 100, 200 } images from { 100, 200, 400, 800, 1600, 3200 } actors at random. • One is the guilty steganographer: embed using nsF5 in some of their images. • Perform identification: 1. Compute PF274 features of every image, 2. MMD distance between each actor, 3. Rank actors by LOF (k = 10). • Measure how often guilty actor appears in top 10 or top 1% most suspicious.

PF274 features have different scales, and must be pre-processed: • scaling (zero mean, unit variance) • whitening (PCA) Also allows comparison with methods from literature, projecting features onto 1 dimension (distance to separating hyperplane): • 1-class SVM [Farid, Pevný et al.] v-SVM (v=0.01) trained on 6000 randomly-selected observed features. • 2-class SVM [many authors!] 20 C-SVMs trained on 6000 randomly-selected cover/stego features from a disjoint training set… … not a fair comparison because requires knowledge of embedding algorithm, as well as expensive training.

1 guilty actor using nsF5 0.7bpnc in 70% of images 0.6bpnc in 60% of images 0.5bpnc in 50% of images 0.4bpnc in 40% of images 0.3bpnc in 30% of images number of actors

1 guilty actor using nsF5 0.7bpnc in 70% of images 0.6bpnc in 60% of images 0.5bpnc in 50% of images 0.4bpnc in 40% of images 0.3bpnc in 30% of images

• Identifying steganographer(s) means working on the level of actors, not individual images. Allows us to identify a ‘typical’ level of model mismatch. Make heterogeneous data work for us, not against us. • Outlier analysis is more flexible than clustering. Can rank by suspicion. We are only identifying steganographer(s) if the feature set is stego- sensitive. • This method works on real-world data. • Potential for universal unsupervised steganalysis… …effectively self-training as long as the majority of actors are innocent.

• Is linear MMD really the best distance metric? Surely not. • How to deal with multiple guilty actors? LOF has some problems if k too small. • How few images (per actor) is sufficient? Fewer than 50? • How to refine features to make them more stego-sensitive? Massive feature sets are probably no good for this application.

SPIE/IS&T Electronic Imaging, San Francisco, 25 January 2012 - PowerPoint PPT Presentation

adk @ cs.ox.ac.uk Department of Computer Science, Oxford University pevnak @ gmail.com Agent Technology Center, Czech Technical University in Prague SPIE/IS&T Electronic Imaging, San Francisco, 25 January 2012 cover objects stego objects

PROCEEDINGS OF SPIE SPIEDigitalLibrary.org/conference-proceedings-of-spie Ultrasonic medical

Nuclear Imaging Medical Imaging Medical Imaging Nuclear Imaging Nuclear Imaging Nuclear

City and County of San Francisco City and County of San Francisco San Francisco Planning

EunGyoung Han 2009 SPIE Medical Imaging Conference Scientific Computing and Imaging Institute,

SPIE INDUSTRY EVENTS FREE SEMINARS, KEYNOTES, AND PANEL SESSIONS. WWW.SPIE.ORG/BUSINESS

Desktop Scanning to sRGB Presented at IS&T and SPIE's 12th Annual Symposium on Electronic

San Francisco Transportation Plan (SFTP) and Early Action Plan October 2013 SAN FRANCISCO COUNT

SAN FRANCISCO HEALTH REFORM TASK FORCE FINAL REPORT Presentation to the San Francisco Health

High Risk Emergency Medicine San Francisco May 21-24, 2014 Westin Market Street San Francisco,

NORTHERN CALIFORNIA REGION Local Policy Maker Group August 27, 2020 SAN FRANCISCO TO SAN JOSE

The Coral Reef Satellite Mission Robert F. Goeke, Member SPIE Center for Space Research

ANSI ASC OP Task Force 6: IR Materials Standards and SPIE IR Materials Standards Working

Assessment of Fair Housing San Mateo Bidders Conference South San Francisco County of San

Transportation: San Francisco Capital Planning City and County of San Francisco 10-Year Capital

Transportation: San Francisco Capital Planning City and County of San Francisco 10-Year Capital

San Francisco San Francisco Draft Rate Application Funding the SF Recycling Program and

Automatic Data Analysis in Visual Analytics Selected Methods Multimedia Information Systems 2

HTTP Web eb and d URLs Web page consists of objects Addressable by a URL Can be HTML

Design Considerations for a DECADE SDT draft-kutscher-decade-protocol-00

Do HiPS yourself! HiPS tutorial ASTERICS Heidelberg - 17 june 2016 P. Fernique & G.

N e u r a l M o d e l s f o r M u l t i - S e n s o r I n t e g r

Networking Overview CS 161: Computer Security Prof. Vern Paxson TAs: Jethro Beekman, Mobin

AMS RTI Q. Yan / IHEP V. Choukto / MIT RTI Introduction 1: RTI record each second AMS global

MulticoreBSP for C a high-performance library for shared-memory parallel programming Albert-Jan

Sambuz

Useful Links

Newsletter

Mail Us

SPIE/IS&T Electronic Imaging, San Francisco, 25 January 2012 - PowerPoint PPT Presentation

adk @ cs.ox.ac.uk Department of Computer Science, Oxford University pevnak @ gmail.com Agent Technology Center, Czech Technical University in Prague SPIE/IS&T Electronic Imaging, San Francisco, 25 January 2012 cover objects stego objects

PROCEEDINGS OF SPIE SPIEDigitalLibrary.org/conference-proceedings-of-spie Ultrasonic medical

Nuclear Imaging Medical Imaging Medical Imaging Nuclear Imaging Nuclear Imaging Nuclear

City and County of San Francisco City and County of San Francisco San Francisco Planning

EunGyoung Han 2009 SPIE Medical Imaging Conference Scientific Computing and Imaging Institute,

SPIE INDUSTRY EVENTS FREE SEMINARS, KEYNOTES, AND PANEL SESSIONS. WWW.SPIE.ORG/BUSINESS

Desktop Scanning to sRGB Presented at IS&amp;T and SPIE's 12th Annual Symposium on Electronic

San Francisco Transportation Plan (SFTP) and Early Action Plan October 2013 SAN FRANCISCO COUNT

SAN FRANCISCO HEALTH REFORM TASK FORCE FINAL REPORT Presentation to the San Francisco Health

High Risk Emergency Medicine San Francisco May 21-24, 2014 Westin Market Street San Francisco,

NORTHERN CALIFORNIA REGION Local Policy Maker Group August 27, 2020 SAN FRANCISCO TO SAN JOSE

The Coral Reef Satellite Mission Robert F. Goeke, Member SPIE Center for Space Research

ANSI ASC OP Task Force 6: IR Materials Standards and SPIE IR Materials Standards Working

Assessment of Fair Housing San Mateo Bidders Conference South San Francisco County of San

Transportation: San Francisco Capital Planning City and County of San Francisco 10-Year Capital

Transportation: San Francisco Capital Planning City and County of San Francisco 10-Year Capital

San Francisco San Francisco Draft Rate Application Funding the SF Recycling Program and

Automatic Data Analysis in Visual Analytics Selected Methods Multimedia Information Systems 2

HTTP Web eb and d URLs Web page consists of objects Addressable by a URL Can be HTML

Design Considerations for a DECADE SDT draft-kutscher-decade-protocol-00

Do HiPS yourself! HiPS tutorial ASTERICS Heidelberg - 17 june 2016 P. Fernique &amp; G.

N e u r a l M o d e l s f o r M u l t i - S e n s o r I n t e g r

Networking Overview CS 161: Computer Security Prof. Vern Paxson TAs: Jethro Beekman, Mobin

AMS RTI Q. Yan / IHEP V. Choukto / MIT RTI Introduction 1: RTI record each second AMS global

MulticoreBSP for C a high-performance library for shared-memory parallel programming Albert-Jan

Sambuz

Useful Links

Newsletter

Mail Us

Desktop Scanning to sRGB Presented at IS&T and SPIE's 12th Annual Symposium on Electronic

Do HiPS yourself! HiPS tutorial ASTERICS Heidelberg - 17 june 2016 P. Fernique & G.