Automatic Testing of Symbolic Execution Engines via Program - PowerPoint PPT Presentation

Automatic Testing of Symbolic Execution Engines via Program Generation and Differential Testing Timotej Kapus, Cristian Cadar Department of Computing Imperial College London

if (x > 2294967295) { assert(false); } printf("x: %u\n", x); 2

Symbolic execution ● Used in industry: ○ IntelliTest ○ SAGE ○ KLOVER ○ SPF ○ Apollo ● Active research field 3

Symbolic execution 1 unsigned int x = 5; 2 int main() { 3 if (x > 2294967295) { 4 assert(false); 5 } 6 printf("x: %u\n",x); 7 } 4

Symbolic execution x = * 1 unsigned int x = 5; 2 int main() { TRUE FALSE x > 2294967295 3 make_symbolic(&x); 4 if (x > 2294967295) { 5 assert(false); x ≤ 2294967295 x > 2294967295 6 } 7 printf("x: %u\n",x); assert(false); printf("x: %d", x); 8 } Assertion x: 2 5 fail

Symbolic executors ● Many available open source ● Complex pieces of software ○ Accurate interpreter or precise instrumentation ○ Accurate constraint solving ○ Constraint gathering ○ Scheduling Angr ○ Effective optimizations such as caching, fast solving, etc. 6

Symbolic executors ● Many available open source ● Complex pieces of software ○ Accurate interpreter or precise instrumentation ○ Accurate constraint solving ○ Constraint gathering ○ Scheduling Angr ○ Effective optimizations such as caching, fast solving, etc. 7

Bugs in symbolic executors 1 unsigned int x = 5; ● Particularly bad 2 int main() { ● Lead to false sense of security 3 make_symbolic(&x); ● Examples: 4 if(x > 2294967295) { ○ Missing a branch 5 assert(false); ○ Exploring spurious branches 6 } 7 printf("x: %u\n",x); 8 } 8

Differential testing of symbolic execution Randomly generated program Compile Compile Execute Symbolically execute Compare 9

Testing symbolic executors ● Compare two executions (native/symbolic) in 3 different modes: ○ Concrete - tests interpretation/instrumentation ○ Single Path - tests constraint gathering and solving ○ Multi Path - tests scheduling, test case generation 10

Concrete mode x: 5 1 unsigned int x = 5; 2 int main() { 3 if (x > 2294967295) { 4 assert(false); 5 } 6 printf("x: %u\n",x); 7 } x: 343 11

Single-Path mode 1 unsigned int x = 5; x: 5 2 int main() { 3 make_symbolic(&x); 4 CONSTRAIN(x, 5); 5 if(x > 2294967295) { 6 assert(false); 7 } 8 printf("x: %u\n",x); 9 } Assertion fail 12

Single-Path mode: Constrainers 1 unsigned int x = 5; x: 5 2 int main() { 3 make_symbolic(&x); 4 CONSTRAIN(x, 5); CONSTRAIN(x, 5); 5 if(x > 2294967295) { 6 assert(false); if(x < 5) silent_exit(0); 7 } 8 printf("x: %u\n",x); if(x > 5) silent_exit(0); 9 } Assertion fail 13

Single-Path mode: Constrainers 1 unsigned int x = 5; x: 5 2 int main() { 3 make_symbolic(&x); 4 CONSTRAIN(x, 5); CONSTRAIN(x, 5); 5 if(x > 2294967295) { 6 assert(false); 7 } if(x != 5) silent_exit(0); 8 printf("x: %u\n",x); 9 } Assertion fail 14

Multi-Path mode Test case: Test case: 1 unsigned int x = 5; x = 7 x = 23 2 int main() { 3 make_symbolic(&x); 4 if(x > 2294967295) { 5 assert(false); 6 } 7 printf("x: %u\n",x); 8 } x: 7 x: 7 x: 23 x: 23 15 MATCH!

Multi-Path mode int x = 5; Test case: Test case: x = -7 x = 23 void main() { make_symbolic(&x); if(x < 0) printf("x: %d", -x); else printf("x: %d", x); } x: 7 x: 7 x: 23 x: -23 16

Testing symbolic executors ● Built a pipeline ● Run experiments in batches ● Avoid bugs found in previous batches 17

Instrumentation supports ● Csmith ○ Random program generator ○ Found many bugs in compilers ○ Doesn’t generate programs with undefined behaviour ● Instrumentation supports: ○ Marking variables as symbolic ○ Oracles ○ Constraining 18

Versions correspond to mode: ● Concrete mode - native version ● Single path mode - single path version ● Multi path mode - multi path version 19

Oracles can check: 1. Executor doesn’t crash 2. Function call chain 3. Output (values of all global variables) matches 4. Coverage achieved on the random program 20

Finally: ● Gather mismatches ● Reduce interesting ones ● Report bugs 21

CREST Case Studies ● Concolic execution KLEE ● Instrumentation instead of interpretation ● Main case study ● Doesn’t generate test cases ○ Familiarity ○ Flexibility ● Built on top of LLVM FuzzBALL ● Keeps all paths in memory ● Binary level executor ● Doesn’t generate test cases 22

0 0 0 0 28

Example bug: Crest Expected output Actual output 1 unsigned int a; 2 int main() { a: 6 a: 6 3 make_symbolic(&a); Assertion fail a: 23 4 if(a > 2294967295) { 5 assert(false); 6 } 7 printf("a: %d\n",a); 8 } 29

Example bug: KLEE Expected Actual output output 1 int g_10 = 0; loop loop 2 int main() { loop 3 make_symbolic(&g_10); loop loop 4 do { loop loop 5 printf("loop\n"); loop 6 g_10 &= 2; ... 7 } while(!((3 ^ g_10) / 1)); 8 } 30

Example bug: FuzzBALL Expected Actual output output 1 unsigned int g_54 = 0; 2 unsigned int g_56 = 0; g_56: 0 Strange term cast (cast(t2:reg32t)L:reg8t)U: 3 reg32t ^ 0xbc84814c:reg32t 4 void main ( void ) { 5 make_symbolic(&g_54); 6 CONSTRAIN(g_54, 0); 7 g_56 ^= 0 < g_54; 8 printf("g_56: %u\n", *(&g_56)); 9 } 31

Conclusions ● Developed techniques that test many aspects of symbolic executors ● Applied them to 3 different symbolic executors ● Total bugs found: ○ 14 in KLEE ○ 3 in Crest ○ 3 in FuzzBALL 32

Constrainers 34

CREST bug ● 14 in KLEE (9 fixed) ● 3 in Crest (1 fixed) ● 3 in FuzzBALL (3 fixed) ● found within first 5000 runs of a batch 37

Single-Path Mode Compare native execution, with symbolic execution constrained to the exact same path as native execution. 38

Symbolic execution ● Mark some inputs as symbolic ● Runs the program, while gathering constraints on the symbolic data ● Forks at branch points when both sides are feasible ● Upon hitting a terminal state (ie. error), solves the gathered constraints, to produce an input leading the program to the same state 39

Configuration includes: ● Program generation options ○ size/complexity of the program ○ language features to use ● Compilation options ● Mode ● Oracles to use 40

Instrumentation supports ● Marking variables as symbolic ● Oracles ● Constraining 41

V ersions correspond to mode used: ● Concrete mode - native version ● Single path mode - single path version ● Multi path mode - multi path version 42

Oracles can check: 1. Executor doesn’t crash 2. Function call chain 3. Output (values of all global variables) matches 4. Coverage achieved on the program 43

Finally: ● Gather mismatches ● Reduce interesting ones ● Report bugs 44

Expected output Actual output 1 void foo(unsigned int x) { x: 6 x: 6 2 if( x > 2294967295) { Assertion fail x: 23 3 assert(false); 4 } 5 printf("x: %u\n", x); 6 } 45

Automatic Testing of Symbolic Execution Engines via Program - PowerPoint PPT Presentation

Automatic Testing of Symbolic Execution Engines via Program Generation and Differential Testing Timotej Kapus, Cristian Cadar Department of Computing Imperial College London if (x > 2294967295) { assert(false); } printf("x:

Symbolic Execution of Linux binaries About Symbolic Execution Dynamically explore all

Symbolic Execution: Applications Symbolic execution is widely used in practice. Tools based on

Symbolic execution as search, and the rise of solvers Search and SMT Symbolic execution is

Decidability Decidability and Symbolic Symbolic Verification Symbolic Symbolic Verification

Concolic Testing Dynamic Symbolic Execution Marco Probst Albert-Ludwigs-Universitt Freiburg

Demo Symbolic Execution Probabilistic Symbolic Execution (Materials kindly provided by Willem

Symbolic Execution Emina Torlak emina@cs.washington.edu Outline What is symbolic execution?

Symbolic Execution of Maintainer Scripts Nicolas Jeannerod and Ralf Treinen joint work with

Symbolic execution for binary-level security / 50 3 A number of shades of symbolic execution /

Symbolic Execution Mathy Vanhoef @vanhoefm HITB DXB 2018, Dubai, 27 November 2018 Overview

Learning to Fuzz from Symbolic Execution with Application to Smart Contracts Jingxuan Mislav

Constraint Solving in Symbolic Execution Cristian Cadar Department of Computing Imperial College

Game Engines 1 Overview Game engines are a significant part of the modern games industry

Set 10 Search Engines & SEO Outline How do search engines work? Basic operation

Set 10 Search Engines & SEO Outline How do search engines work? Basic operation

Set11 Search Engines & SEO Outline How do search engines work? Basic operation

Thermodynamic Geometry of Yang-Mills Gauge Theory Bhupendra Nath Tiwari University of Information

Recent Developments in The Black Hole Microstate Geometry Program Masaki Shigemori Strings and

A Potential Astrophysical Test of Quantum Gravity Ue-Li Pen CITA November 12, 2013 U. Pen A

Describing the interior of a black hole using holography Marika Taylor Mathematical Sciences and

SimuVEX Using VEX in Symbolic Analysis Yan Shoshitaishvili yans@cs.ucsb.edu 2014 Who am I? My

Doing the unexpected with binary black holes ICERM Brown University Steve Liebling Work

Running a Stress (ish) -Free Company Thanks for joining todays talk and Ill be explaining

The bright side of black holes Vtor Cardoso [Tcnico & CERN] Fundamental questions a.

Automatic Testing of Symbolic Execution Engines via Program - PowerPoint PPT Presentation

Automatic Testing of Symbolic Execution Engines via Program Generation and Differential Testing Timotej Kapus, Cristian Cadar Department of Computing Imperial College London if (x > 2294967295) { assert(false); } printf("x:

Symbolic Execution of Linux binaries About Symbolic Execution Dynamically explore all

Symbolic Execution: Applications Symbolic execution is widely used in practice. Tools based on

Symbolic execution as search, and the rise of solvers Search and SMT Symbolic execution is

Decidability Decidability and Symbolic Symbolic Verification Symbolic Symbolic Verification

Concolic Testing Dynamic Symbolic Execution Marco Probst Albert-Ludwigs-Universitt Freiburg

Demo Symbolic Execution Probabilistic Symbolic Execution (Materials kindly provided by Willem

Symbolic Execution Emina Torlak emina@cs.washington.edu Outline What is symbolic execution?

Symbolic Execution of Maintainer Scripts Nicolas Jeannerod and Ralf Treinen joint work with

Symbolic execution for binary-level security / 50 3 A number of shades of symbolic execution /

Symbolic Execution Mathy Vanhoef @vanhoefm HITB DXB 2018, Dubai, 27 November 2018 Overview

Learning to Fuzz from Symbolic Execution with Application to Smart Contracts Jingxuan Mislav

Constraint Solving in Symbolic Execution Cristian Cadar Department of Computing Imperial College

Game Engines 1 Overview Game engines are a significant part of the modern games industry

Set 10 Search Engines &amp; SEO Outline How do search engines work? Basic operation

Set 10 Search Engines &amp; SEO Outline How do search engines work? Basic operation

Set11 Search Engines &amp; SEO Outline How do search engines work? Basic operation

Thermodynamic Geometry of Yang-Mills Gauge Theory Bhupendra Nath Tiwari University of Information

Recent Developments in The Black Hole Microstate Geometry Program Masaki Shigemori Strings and

A Potential Astrophysical Test of Quantum Gravity Ue-Li Pen CITA November 12, 2013 U. Pen A

Describing the interior of a black hole using holography Marika Taylor Mathematical Sciences and

SimuVEX Using VEX in Symbolic Analysis Yan Shoshitaishvili yans@cs.ucsb.edu 2014 Who am I? My

Doing the unexpected with binary black holes ICERM Brown University Steve Liebling Work

Running a Stress (ish) -Free Company Thanks for joining todays talk and Ill be explaining

The bright side of black holes Vtor Cardoso [Tcnico &amp; CERN] Fundamental questions a.

Set 10 Search Engines & SEO Outline How do search engines work? Basic operation

Set 10 Search Engines & SEO Outline How do search engines work? Basic operation

Set11 Search Engines & SEO Outline How do search engines work? Basic operation

The bright side of black holes Vtor Cardoso [Tcnico & CERN] Fundamental questions a.