Security Testing Hard to Reach Code Mathias Payer - PowerPoint PPT Presentation

Security Testing Hard to Reach Code Mathias Payer <mathias.payer@epfl.ch> https://hexhive.github.io 1

Vulnerabilities everywhere? 2

Challenge: broken abstractions C/C++ void log( int a) { printf("A: %d", a); } void vuln( char *str) { char *buf[4]; void (* fun )( int ) = &log; strcpy (buf, str); ... fun (15); } ASM log: ... fun : .quad log vuln: movq log(%rip), 16(%rsp) ... call strcpy ... call 16(%rsp) 3

Challenge: software complexity Google Chrome: 76 MLoC Chrome and OS ~100 mLoC, 27 lines/page, Gnome: 9 MLoC 0.1mm/page ≈ 370m Xorg: 1 MLoC glibc: 2 MLoC Linux kernel: 17 MLoC Margaret Hamilton Brian Kernighan holding with code for Apollo Lion’s commentary on Guidance Computer BSD 6 (Bell Labs, ‘77) (NASA, ‘69) 4

Defense: Testing OR Mitigating? Software Testing Mitigations C/C++ vuln("AAA"); void log( int a) { printf("A: %d", a); vuln("ABC"); } void vuln( char *str) { vuln("AAAABBBB"); char *buf[4]; void (* fun )( int ) = &log; strcpy_chk(buf, 4, str); strcpy (buf, str); CHECK(fun, tgtSet); fun (15); } 5

Status of deployed defenses ● Data Execution Prevention (DEP) Memory 0x4?? R-X ● Address Space Layout Randomization (ASLR) text ● Stack canaries 0x8?? RW- ● Safe exception handlers data ● Control-Flow Integrity (CFI): Guard indirect control-flow 0xf?? RW- stack 6

Software testing: discover bugs security 7

Fuzz testing ● A random testing technique that mutates input to improve test coverage Input Generation Coverage Debug Exe Tests Crashes ● State-of-art fuzzers use coverage as feedback to evolutionarily mutate the input

Fuzzing as bug finding approach ● Fuzzing finds bugs effectively (CVEs) – Proactive defense, part of testing – Preparing offense, part of exploit development

Academic fuzzing research

Fuzzing frontiers

Fuzzing frontiers Mine Mine Cross Cross existing existing unknown unknown code code borders borders Explore Explore new new paths paths

Exploring hidden program paths

Challenges for Fuzzers ● Challenges Shallow code paths Shallow code paths – Shallow coverage start – Hard to find “deep” bugs Deep code paths Deep code paths check1 ● Root cause check2 – Fuzzer-generated inputs check3 cannot bypass complex sanity checks in the bug target program end

Limitations of existing approaches ● Existing approaches focus on input generation – AFL improvements (seed corpus generation) – Driller (selective concolic execution) – VUzzer (taint analysis, data-/control-flow analysis) – QSYM, Angora (symbolic/concolic analysis) ● Limitations: high overhead, not scalable ● Unable to bypass “hard” checks – Checksum values – Crypto-hash values

Non-Critical Checks (NCC) ● Some checks are not intended to prevent bugs – Checks on magic values, checksum, or hashes ● Removing NCCs – Won’t incur erroneous bugs, simplifies fuzzing void main () { int fd = open (...); char *hdr = read_header (fd); if ( strncmp (hdr, “ELF", 3) == 0) { // main program logic // ... } else { error (); } }

Fuzzing by Program Transformation ● Fuzzer fuzzes Transformed Programs ● When stuck – Detect NCC candidates Inputs Fuzzer Program (AFL) Transformer – Remove NCCs – Repeat Crashing inputs ● Verify crashes in original program Bug Reports Crash Analyzer False Positives

Detecting NCCs: imprecision Covered Node ● Approximate NCCs as Uncovered Node edges connecting NCC Candidate covered and uncovered nodes in CFG – Over approximate, may contain false positives – Lightweight and simple to implement 19

Program transformation ● Our approach: negate JCCs – Easy to implement: static binary patching – Zero runtime overhead in resulting target program – CFG/trace/path constraints remains the same start start Flipped Check A == B A != B True branch False branch True branch False branch end end

Crash analysis: false positives? False Positive Collect path Timeout constraints of Path SAT? original program constraints (concolic tracing on crashing input) Input to crash original program

NCC example Collected path constraints { x > 0, y == 0xdeadbeef } int main (){ int main (){ int x = read_input(); int x = read_input(); int y = read_input(); int y = read_input(); if (x > 0) { if (x > 0) { if (y == 0xdeadbeef) if (y != 0xdeadbeef) bug(); bug(); } } } } Original Program Transformed Program

NCC example Collected path constraints SAT True BUG { x > 0, y != 0xdeadbeef } flip int main (){ int main (){ int x = read_input(); int x = read_input(); int y = read_input(); int y = read_input(); if (x > 0) { if (x > 0) { Flipped if (y == 0xdeadbeef) if (y != 0xdeadbeef) check bug(); bug(); } } } } Original Program Transformed Program

NCC example 2 Collected path constraints { i > 0, i <= 0 } int main (){ int main (){ int i = read_input(); int i = read_input(); if (i > 0) { if (i > 0) { func(i); func(i); } } } } void func( int i) { void func( int i) { if (i <= 0) { if (i > 0) { bug(); bug(); } } //... //... } } Original Program Transformed Program

NCC example 2 Collected path constraints UNSAT False BUG { i > 0, i > 0 } int main (){ int main (){ int i = read_input(); int i = read_input(); flip if (i > 0) { if (i > 0) { func(i); func(i); } } } } void func( int i) { void func( int i) { Flipped if (i <= 0) { if (i > 0) { check bug(); bug(); } } //... //... } } Original Program Transformed Program

Comparison to Driller ● Fuzzing explores code paths ● Concolic execution explores Fuzzer new code paths when “stuck” Constraint Mutation solving ● Limitations Inputs – Constraints solving is slow – Unable to bypass “hard” checks program Crashes

T-Fuzzing ● Constraint solving and fuzzing are decoupled Fuzzer ● Constraint solving only for crashes Inputs Program ● T-Fuzz detects bug for Transformation “hard” checks, but program program program program cannot verify it Constraint solving Crashes

Limitations ● NCC selection: transformation explosion ● False bugs: fault before bug ● Crash analyzer: constraint solving is hard – Length of trace – Number of constraints – Non-termination

Case study: CROMU_00030 (CGC) void main() { Total time to find int step = 0; the bug: ~4h Packet packet; while (1) { memset(packet, 0, sizeof (packet)); if (step >= 9) { char name[5]; Stack Buffer overflow bug int len = read(stdin, name, 128); printf( "Well done, %s \n " , name); return SUCCESS; } read(stdin, &packet, sizeof (packet)); C1: check on magic if (strcmp((char *)&packet, "1212" ) == 0) return FAIL; C2: checksum if (compute_checksum(&packet) != packet.checksum) return FAIL; if (handle_packet(&packet) != 0) C3: authenticate user info return FAIL; step ++; } }

T-Fuzz summary ● Core idea: mutate both program and input ● T-Fuzz outperforms state-of-art fuzzers – Improvement over Driller/AFL by 45%/58% – Bugs: 1 in LAVA-M and 3 in real-world programs ● T-Fuzz future work – LLVM-based program transformation – Crash analyzer: optimize constraint solving – NCC detection through static analysis

Security-testing binary-only code

RetroWrite: static binary rewriting Processing Reassemblable Reg. Allocation Optimization Assembly Symbolization

afl-retrowrite ● Instrument basic blocks to update coverage map ● To show interoperability, we reuse afl-gcc – afl-gcc / afl-clang instruments assembly files – Our symbolized assembly files follow the format of compiler-generated ASM files – Enables reuse of existing transformations!

Binary-only ASan (retrowrite-asan) ● RetroWrite API to identify instrumentation sites ● Two kinds of instrumentation: – Allocation Instrumentation – Memory Check Instrumentation If 0x100 is poisoned: terminate(); var = access(0x100);

RetroWrite: static binary rewriting Processing Reassemblable Reg. Allocation Optimization Assembly Symbolization

Two-ended peripheral testing

USBFuzz: explore peripheral space QEMU/KVM Virtual Environment Linux Kernel Fake USB Device User-mode agent Fuzzer: Input Generation

USBFuzz Evaluation ● ~60 new bugs discovered in recent kernels ● 36 memory bugs (UaF / BoF) ● 8 bugs fixed (with CVEs) ● Bug reporting in progress

Security testing hard-to-reach code ● Fuzzing is an effective way to automatically test programs for security violations (crashes) – Key idea: optimize for throughput – Coverage guides mutation ● T-Fuzz: mutate code and input ● RetroWrite: efficient static rewriting ● USBFuzz: enable fuzzing of peripherals https://github.com/HexHive

Security Testing Hard to Reach Code Mathias Payer - PowerPoint PPT Presentation

Security Testing Hard to Reach Code Mathias Payer <mathias.payer@epfl.ch> https://hexhive.github.io 1 Vulnerabilities everywhere? 2 Challenge: broken abstractions C/C++ void log( int a) { printf("A: %d", a); } void vuln(

COMPANY PRESENTATION Company Presentation Agenda REACH The Profile REACH The Values REACH The

Property-Based Testing Matt Bachmann @mattbachmann Testing is Important Testing is Important

DNS and Security DNS and Security DNS and Security DNS and Security DNS and Security DNS and

SECURITY TESTING Towards a safer web world AGENDA 1. 3 WS OF SECURITY TESTING 2. SECURITY

Presentation 4Q2014 REACH SUBSEA ASA IN BRIEF Reach Subsea was established in 2008 in Norway as a

Levels of Testing Chapter 12 Beyond unit testing Developer Testing stages Unit testing

Testing Terminology System testing Types of errors Function testing Structure

Security and the .NET Framework Code Access Security Enforces security policy on code

Software Testing Overview What is software testing? General testing criteria Testing

Software testing Software Testing Introduction Testing levels Automated testing Principles and

1. Test page This page is for testing. This page is for testing. This page is for testing.

HydroCare HC-44 HydroCare HC-44 Hard Water Problems Hard Water Problems Hard Water Costs You

6/18/2018 When Family Life Gets Hard 1 6/18/2018 When Family Life Gets Hard God

Code Generation Machine code generation cs4713 1 Machine code generation machine Intermediate

{Sequential Code} {Sequential Code} {Sequential Code} {Sequential Code} {Sequential Code}

20 16 Residential & Hard-to-Reach Standard Offer Program s Workshop October 28, 2015 1 AEP

Jessica McCarty-Kern MICHIGAN TECH Jessica L. . McCarty , jmccarty@mtu.edu RESEARCH FORUM

D-F -FACTOR OR: : A Quant Quantit itativ ive e Per erfor ormance mance Model odel of

Federated Observational and Simulation Data in the NASA Center for Climate Simulation Data

Design, Implementation, and Experiences of Third-Party Software Administration at the ORNL NCCS

Plugins and Filters and Data Science: Displaying R Markdown in Drupal Today we are going to talk

Guarding Vulnerable Code: Module 1: Sanitization Mathias Payer, Purdue University

The Titan Tools Experience Michael J. Brim, Ph.D. Computer Science Research, CSMD/NCCS Petascale

Together in the Sandbox: Board and Staff Relationships Welcome! The webinar will begin at 10:00

Security Testing Hard to Reach Code Mathias Payer - PowerPoint PPT Presentation

Security Testing Hard to Reach Code Mathias Payer <mathias.payer@epfl.ch> https://hexhive.github.io 1 Vulnerabilities everywhere? 2 Challenge: broken abstractions C/C++ void log( int a) { printf("A: %d", a); } void vuln(

COMPANY PRESENTATION Company Presentation Agenda REACH The Profile REACH The Values REACH The

Property-Based Testing Matt Bachmann @mattbachmann Testing is Important Testing is Important

DNS and Security DNS and Security DNS and Security DNS and Security DNS and Security DNS and

SECURITY TESTING Towards a safer web world AGENDA 1. 3 WS OF SECURITY TESTING 2. SECURITY

Presentation 4Q2014 REACH SUBSEA ASA IN BRIEF Reach Subsea was established in 2008 in Norway as a

Levels of Testing Chapter 12 Beyond unit testing Developer Testing stages Unit testing

Testing Terminology System testing Types of errors Function testing Structure

Security and the .NET Framework Code Access Security Enforces security policy on code

Software Testing Overview What is software testing? General testing criteria Testing

Software testing Software Testing Introduction Testing levels Automated testing Principles and

1. Test page This page is for testing. This page is for testing. This page is for testing.

HydroCare HC-44 HydroCare HC-44 Hard Water Problems Hard Water Problems Hard Water Costs You

6/18/2018 When Family Life Gets Hard 1 6/18/2018 When Family Life Gets Hard God

Code Generation Machine code generation cs4713 1 Machine code generation machine Intermediate

{Sequential Code} {Sequential Code} {Sequential Code} {Sequential Code} {Sequential Code}

20 16 Residential &amp; Hard-to-Reach Standard Offer Program s Workshop October 28, 2015 1 AEP

Jessica McCarty-Kern MICHIGAN TECH Jessica L. . McCarty , jmccarty@mtu.edu RESEARCH FORUM

D-F -FACTOR OR: : A Quant Quantit itativ ive e Per erfor ormance mance Model odel of

Federated Observational and Simulation Data in the NASA Center for Climate Simulation Data

Design, Implementation, and Experiences of Third-Party Software Administration at the ORNL NCCS

Plugins and Filters and Data Science: Displaying R Markdown in Drupal Today we are going to talk

Guarding Vulnerable Code: Module 1: Sanitization Mathias Payer, Purdue University

The Titan Tools Experience Michael J. Brim, Ph.D. Computer Science Research, CSMD/NCCS Petascale

Together in the Sandbox: Board and Staff Relationships Welcome! The webinar will begin at 10:00

20 16 Residential & Hard-to-Reach Standard Offer Program s Workshop October 28, 2015 1 AEP