Detecting Threats, Not Sandboxes (Characterizing Network Environments to Improve Malware Classification)

Detecting Threats, Not Sandboxes (Characterizing Network Environments to Improve Malware Classification)
Blake Anderson (blake.anderson@cisco.com), David McGrew (mcgrew@cisco.com)
FloCon 2017, January 2017

Data Collection and Training
[Diagram labels: Malware Sandbox (multiple), Malware Records, Benign Records, Training/Storage, Classifier/Rules]
• Metadata
• Packet lengths
• TLS
• DNS
• HTTP

Deploying Classifier/Rules
[Diagram labels: Classifier/Rules, Enterprise A ... Enterprise N]

Problems with this Architecture
• Models will not necessarily translate to new environments
• Will be biased towards the artifacts of the malicious / benign collection environments
• Collecting data from all possible end-point/network environments is not always possible

Network Features in Academic Literature
• 2016 – IMC / USENIX Security / NDSS
  • Packet sizes
  • Length of URLs
• 2012–2015 – CCS / SAC / ACSAC / USENIX Security
  • Time between ACKs
  • Packet sizes in each direction
  • Number of packets in each direction
  • Number of bytes in each direction
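The per-direction features listed above can be sketched as a small aggregation over a flow's packets. This is only an illustration: the `Packet` record and its field names are hypothetical and are not taken from the slides or from any particular capture tool.

```python
from dataclasses import dataclass


@dataclass
class Packet:
    # Hypothetical record type used only for this sketch.
    outbound: bool  # True if the packet was sent by the client
    length: int     # packet length in bytes


def flow_features(packets):
    """Count packets and bytes in each direction for one flow."""
    feats = {"pkts_out": 0, "pkts_in": 0, "bytes_out": 0, "bytes_in": 0}
    for p in packets:
        direction = "out" if p.outbound else "in"
        feats["pkts_" + direction] += 1
        feats["bytes_" + direction] += p.length
    return feats


flow = [Packet(True, 100), Packet(False, 1400), Packet(False, 600)]
features = flow_features(flow)
```

Because every one of these values is derived from raw packet sizes and counts, anything that changes how a flow is packetized also changes the feature vector.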

Network/Transport-Level Robustness

Ideal TCP Session

Inbound Packet Loss

Multi-Packet Messages

Collection Points / MTU / Source Ports
• Collection points significantly affect packet sizes
  • Same flow collected within a VM and on the host machine will look very different
• Path MTU can alter individual packet sizes
• Source ports are very dependent on underlying OS
  • WinXP: 1024-5000
  • NetBSD: 49152-65535
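A rough sketch of two of these effects follows. It assumes IPv4 and TCP headers without options (40 bytes in total) when deriving segment sizes from the MTU, and it uses only the two port ranges quoted on the slide. The function names are made up for illustration.

```python
# Assumption: 20-byte IPv4 header + 20-byte TCP header, no options.
IP_TCP_HEADER_BYTES = 40

# Ephemeral source-port ranges as quoted on the slide.
EPHEMERAL_PORT_RANGES = {
    "WinXP": (1024, 5000),
    "NetBSD": (49152, 65535),
}


def segment_lengths(message_len, mtu):
    """Payload sizes produced when one message is split to fit the MTU."""
    mss = mtu - IP_TCP_HEADER_BYTES
    full, rest = divmod(message_len, mss)
    return [mss] * full + ([rest] if rest else [])


def consistent_os(source_port):
    """Names of the listed OSes whose port range contains source_port."""
    return [name for name, (lo, hi) in EPHEMERAL_PORT_RANGES.items()
            if lo <= source_port <= hi]


# The same 3000-byte message yields different packet-length sequences.
at_1500 = segment_lengths(3000, 1500)
at_1400 = segment_lengths(3000, 1400)
```

A classifier trained on the first packet-length sequence would see a different input for the same application behavior on a path with the smaller MTU.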

Application-Level Robustness

TLS Handshake Protocol
• Client → Server: ClientHello
• Server → Client: ServerHello / Certificate
• Client → Server: ClientKeyExchange / ChangeCipherSpec
• Server → Client: ChangeCipherSpec
• Application Data

TLS Client Fingerprinting
• ClientHello fields: Record Headers, Random Nonce, [Session ID], Cipher suites, Compression Methods, Extensions
• Figure annotation: Indicative of TLS Client
• OpenSSL Versions: 1.0.2, 1.0.1, 1.0.0, 0.9.8
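One way to turn a ClientHello into a client fingerprint is sketched below. It assumes that the offered cipher suites, compression methods, and extensions are the fields that identify the client, and that the random nonce and session ID are ignored because they change per session. The dictionary layout, the function name, and the use of SHA-256 are choices made for this sketch, not the authors' method.

```python
import hashlib


def client_fingerprint(hello):
    """Hash the ordered, client-chosen ClientHello parameters."""
    parts = [
        ",".join(str(c) for c in hello["cipher_suites"]),
        ",".join(str(c) for c in hello["compression_methods"]),
        ",".join(str(e) for e in hello["extensions"]),
    ]
    return hashlib.sha256("|".join(parts).encode("ascii")).hexdigest()


# Illustrative values only; not tied to any specific library version.
hello_a = {
    "random": "aa" * 32,
    "cipher_suites": [0xC02F, 0x002F, 0x0035],
    "compression_methods": [0],
    "extensions": [0, 10, 11],
}
# Same client parameters, different per-session nonce.
hello_b = dict(hello_a, random="bb" * 32)
# Same suites offered in a different order.
hello_c = dict(hello_a, cipher_suites=[0x002F, 0xC02F, 0x0035])

same = client_fingerprint(hello_a) == client_fingerprint(hello_b)
differs = client_fingerprint(hello_a) != client_fingerprint(hello_c)
```

Order is kept deliberately: two clients that offer the same suites in a different preference order produce different fingerprints.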

TLS Dependence on Environment
• 73 unique malware samples were run under both WinXP and Win7
• 4 samples used the exact same TLS client parameters in both environments
• 69 samples used the library provided by the underlying OS (some also had custom TLS clients)
• Affects the distribution of TLS parameters
• Also has secondary effects w.r.t. packet lengths

HTTP Dependence on Environment
• 152 unique malware samples were run under both WinXP and Win7
• 120 samples used the exact same set of HTTP fields in both environments
• 132 samples used the HTTP fields provided by the underlying OS’s library
• Affects the distribution of HTTP parameters
• Also has secondary effects w.r.t. packet lengths

Solutions

Potential Solutions
• Collect training data from target environment
  • Ground truth is difficult
  • Models do not translate
• Discard biased samples
  • Not always obvious which features are network/endpoint-independent
• Train models on network/endpoint-independent features
  • Not always obvious which features are network/endpoint-independent
  • This often ignores interesting behavior
• Modify existing training data to mimic target environment
  • Not always obvious which features are network/endpoint-independent
  • Can capture interesting network/endpoint-dependent behavior
  • Can leverage previously captured/curated datasets

Results
• Model: L1-logistic regression (both feature sets)
• Meta + SPLT + BD
  • 0.01% FDR: 1.3%
  • Total Accuracy: 98.9%
• Meta + SPLT + BD + TLS
  • 0.01% FDR: 92.8%
  • Total Accuracy: 99.6%
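The slides name the model but not how it was fit. As a minimal sketch, L1-regularized logistic regression can be trained with proximal gradient descent, where a soft-threshold step drives uninformative weights to exactly zero. The optimizer, the hyperparameters, and the toy data below are assumptions for illustration and are not the authors' setup.

```python
import math


def soft_threshold(w, t):
    """Proximal operator of the L1 penalty: shrink w toward zero by t."""
    if w > t:
        return w - t
    if w < -t:
        return w + t
    return 0.0


def train_l1_logreg(X, y, lam=0.1, lr=0.1, epochs=500):
    """Fit weights w and bias b by proximal gradient descent (bias unpenalized)."""
    n, d = len(X), len(X[0])
    w, b = [0.0] * d, 0.0
    for _ in range(epochs):
        grad_w, grad_b = [0.0] * d, 0.0
        for xi, yi in zip(X, y):
            z = b + sum(wj * xj for wj, xj in zip(w, xi))
            err = 1.0 / (1.0 + math.exp(-z)) - yi  # predicted probability minus label
            for j in range(d):
                grad_w[j] += err * xi[j] / n
            grad_b += err / n
        # Gradient step on the logistic loss, then the L1 shrinkage step.
        w = [soft_threshold(wj - lr * gj, lr * lam) for wj, gj in zip(w, grad_w)]
        b -= lr * grad_b
    return w, b


# Toy data: feature 0 determines the label, feature 1 carries no signal.
X = [[1.0, 1.0], [1.0, -1.0], [-1.0, 1.0], [-1.0, -1.0]]
y = [1, 1, 0, 0]
weights, bias = train_l1_logreg(X, y)
```

On this toy data the weight for the uninformative feature stays at zero while the informative one grows, which is the sparsity behavior that makes the L1 penalty useful for seeing which features a model relies on.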

Results (without Schannel)
• Model: L1-logistic regression (both feature sets)
• Meta + SPLT + BD
  • 0.01% FDR: 0.9%
  • Total Accuracy: 98.5%
• Meta + SPLT + BD + TLS
  • 0.01% FDR: 87.2%
  • Total Accuracy: 99.6%
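For reference, the two quantities named in the results can be computed from confusion-matrix counts as sketched below. The slides do not expand "FDR"; this sketch assumes it means false discovery rate, the fraction of flagged samples that are actually benign. The counts are invented for the example and are unrelated to the reported numbers.

```python
def total_accuracy(tp, fp, tn, fn):
    """Fraction of all samples that were classified correctly."""
    return (tp + tn) / (tp + fp + tn + fn)


def false_discovery_rate(tp, fp):
    """Fraction of samples flagged as malicious that are actually benign."""
    flagged = tp + fp
    return fp / flagged if flagged else 0.0


# Made-up counts: 100 samples flagged, 10 of them wrongly.
acc = total_accuracy(tp=90, fp=10, tn=890, fn=10)
fdr = false_discovery_rate(tp=90, fp=10)
```

A classifier can have high total accuracy and still be unusable at a strict FDR target, which is why the results report both figures.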

Conclusions
• It is necessary to understand and account for the biases present in different environments
• Helps to create more robust models
• Models can be effectively deployed in new environments
• We can reduce the number of false positives related to environment artifacts
• Data collection was performed with: Joy

Thank You
