SlowFuzz: Automated Domain-Independent Detection of Algorithmic Complexity Vulnerabilities
Theofilos Petsios, Jason Zhao, Angelos D. Keromytis, and Suman Jana
Columbia University
ACM Conference on Computer and Communications Security (CCS) 2017, Dallas, Texas
COMPLEXITY VULNERABILITIES
‣ Difference between average-case and worst-case complexity
  - CPU, memory, space, etc.
  - User-controlled inputs
  - Exploitability & Denial of Service (DoS)
‣ Several instances seen in the wild
DOMAIN-INDEPENDENT DETECTION OF COMPLEXITY VULNERABILITIES
‣ Heavily dependent on application logic
‣ Algorithmic worst case vs. implementation worst case
  - Minor changes often drastically change complexity (e.g., pivot selection in quicksort)
‣ Reasoning about the problem in the generic case is hard:
  - Theoretical analysis is often non-trivial
  - Implementations vary
  - Domain-specific tools predominantly require expert knowledge
EXAMPLE: QUICKSORT
▸ Average-case O(n log n) vs. worst-case O(n²) complexity
▸ The implementation largely affects performance (sketch below)
▸ How do we reason about the performance of a given implementation?
▸ How do we test it in a domain-agnostic manner?
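To make the implementation sensitivity concrete, here is a minimal, hand-written quicksort (not one of the implementations evaluated in the talk) that always picks the first element as its pivot; already-sorted input then makes every partition maximally unbalanced and drives it from the average O(n log n) into the quadratic worst case.

```cpp
// Minimal sketch, assuming nothing about the paper's test subjects: a
// textbook quicksort with a naive first-element pivot. On already-sorted
// input each partition puts zero elements on one side, so the recursion
// degenerates and the running time becomes O(n^2).
#include <algorithm>
#include <cstdio>
#include <utility>
#include <vector>

static void quicksort(std::vector<int>& a, int lo, int hi) {
    if (lo >= hi) return;
    int pivot = a[lo];                       // naive pivot choice: first element
    int i = lo;
    for (int j = lo + 1; j <= hi; ++j)       // Lomuto-style partition
        if (a[j] < pivot) std::swap(a[++i], a[j]);
    std::swap(a[lo], a[i]);                  // pivot lands at its final index i
    quicksort(a, lo, i - 1);                 // empty on sorted input...
    quicksort(a, i + 1, hi);                 // ...while this side keeps n-1 elements
}

int main() {
    std::vector<int> v(10000);
    for (int i = 0; i < (int)v.size(); ++i) v[i] = i;   // sorted input is the worst case here
    quicksort(v, 0, (int)v.size() - 1);
    std::printf("sorted: %d\n", (int)std::is_sorted(v.begin(), v.end()));
}
```

Swapping in a median-of-three or randomized pivot removes this particular worst case, which is why the worst case is a property of the implementation rather than of "quicksort" in the abstract.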
EVOLUTIONARY TESTING
‣ Domain-independent test input generation
‣ Known to perform well in grey-box settings
‣ Very effective in modern fuzzers targeting crash/memory-corruption bugs
  - No expert knowledge required
  - Production tools compete with domain-specific engines
EVOLUTIONARY TESTING
‣ Can we steer evolutionary testing towards complexity bugs?
‣ Coverage is irrelevant in this scenario
‣ Re-use existing fuzzing infrastructure
SLOWFUZZ PROTOTYPE
‣ Maintain and evolve an input corpus towards slower executions
SLOWFUZZ KEY IDEAS
▸ Three key controls: instrumentation, fitness function, mutations
▸ The fitness function should favor inputs that introduce slowdowns
▸ Mutation operations designed with locality in mind
▸ Avoid getting stuck!
SLOWFUZZ KEY IDEAS
‣ Fitness function: maximize the number of CPU instructions executed (see the sketch below)
‣ Mutation strategies:
  - Random
  - Offset priority
  - Mutation priority
  - Hybrid
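The loop behind these ideas can be pictured as a hill climb on a cost signal. The sketch below is illustrative only: SlowFuzz itself reuses libFuzzer's engine, measures executed CPU instructions through its instrumentation, maintains a whole corpus, and applies the mutation strategies listed above. Here the "target" is a small insertion sort, the cost is its comparison count, and the mutation is a single random byte flip, so the loop evolves a 64-byte input towards the sort's quadratic worst case.

```cpp
// Minimal sketch of slowdown-directed evolutionary search. All names here
// (target_cost, mutate) are illustrative stand-ins, not SlowFuzz APIs.
#include <cstdint>
#include <cstdio>
#include <random>
#include <utility>
#include <vector>

using Input = std::vector<uint8_t>;

// Stand-in "target": insertion sort. The returned comparison count plays the
// role of SlowFuzz's instruction-count fitness signal.
static uint64_t target_cost(Input in) {
    uint64_t comparisons = 0;
    for (size_t i = 1; i < in.size(); ++i)
        for (size_t j = i; j > 0 && ++comparisons && in[j - 1] > in[j]; --j)
            std::swap(in[j - 1], in[j]);
    return comparisons;
}

// Stand-in mutation operator: overwrite one random byte with a random value.
static Input mutate(Input in, std::mt19937& rng) {
    in[rng() % in.size()] = static_cast<uint8_t>(rng());
    return in;
}

int main() {
    std::mt19937 rng(1);
    Input best(64, 0);                        // seed: 64 zero bytes
    uint64_t best_cost = target_cost(best);

    for (int gen = 0; gen < 20000; ++gen) {
        Input cand = mutate(best, rng);
        uint64_t cost = target_cost(cand);
        if (cost > best_cost) {               // fitness: keep only slower inputs
            best_cost = cost;
            best = std::move(cand);
        }
    }
    // The theoretical worst case for 64 elements is 64*63/2 = 2016 comparisons.
    std::printf("evolved cost: %llu comparisons\n", (unsigned long long)best_cost);
}
```

Keeping only the single best input is the simplest thing that works for this toy target; the locality-aware mutation strategies and corpus management on the slide exist precisely because such a greedy climb gets stuck on real programs.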
USECASE: SORTING
▸ Insertion sort & quicksort implementations
▸ Quadratic worst-case performance
▸ How close do we get to the theoretical worst-case slowdown?
▸ Generated inputs reach 84.97% and 83.74% of the theoretical worst case
USECASE: SORTING / REAL-WORLD EXAMPLES
▸ Slowdowns achieved against real-world sort implementations:
  - Apple: 3.34x
  - OpenBSD: 3.3x
  - GNU: 26.36%
  - NetBSD: 8.7%
ENGINE PROPERTIES
‣ Fitness function: CPU instructions vs. code coverage vs. time-based tracing
‣ Mutation strategies:
  - Random
  - Offset priority
  - Mutation priority
  - Hybrid
ENGINE EVALUATION / MUTATION STRATEGIES - OPENBSD QUICKSORT (figure)
ENGINE EVALUATION / FITNESS FUNCTIONS - OPENBSD QUICKSORT (figure)
EVALUATION
‣ Evolutionary testing for complexity bugs is promising
‣ Test cases: common instances of complexity vulnerabilities
  - Hash tables
  - Regular expression parsers
  - Compression/decompression routines
USECASE: PHP’S DJBX33A HASH
▸ Hash function used for string keys in PHP
▸ Known worst-case performance
▸ Has been exploited in the wild
▸ For ‘ab’ and ‘cd’ to collide it must hold that c = a + n ∧ d = b − 33 ∗ n, n ∈ Z (sanity-check sketch below)
▸ If two equal-length strings A and B collide, then the strings xAy and xBy also collide
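As a quick sanity check of the collision rule, the sketch below re-implements DJBX33A for illustration (it mirrors, but is not, PHP's source). With a = 'E', b = 'z', and n = 1 the rule gives c = 'F' and d = 'Y', so "Ez" and "FY" collide, and wrapping both in the same prefix and suffix preserves the collision.

```cpp
// Illustrative re-implementation of the DJBX33A ("times 33 and add") string
// hash, not PHP's actual source. "Ez" and "FY" collide per the rule on the
// slide (c = a + n, d = b - 33*n with n = 1), and so do "xEzy" and "xFYy".
#include <cstdint>
#include <cstdio>
#include <string>

static uint64_t djbx33a(const std::string& s) {
    uint64_t h = 5381;                        // DJB's traditional starting value
    for (unsigned char c : s)
        h = h * 33 + c;
    return h;
}

int main() {
    std::printf("Ez   -> %llu\n", (unsigned long long)djbx33a("Ez"));
    std::printf("FY   -> %llu\n", (unsigned long long)djbx33a("FY"));
    // Equal-length colliding substrings keep colliding inside larger keys,
    // which is what lets an attacker mass-produce colliding hashtable keys.
    std::printf("xEzy -> %llu\n", (unsigned long long)djbx33a("xEzy"));
    std::printf("xFYy -> %llu\n", (unsigned long long)djbx33a("xFYy"));
}
```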
USECASE: PHP’S DJBX33A HASH
▸ 64 hashtable entries & 64 insertions
▸ SlowFuzz generated inputs causing monotonically increasing numbers of collisions
▸ No knowledge of the internals of the hash function
USECASE: REGEX PARSERS
▸ Multiple instances of ReDoS in the wild
▸ Backtracking can be catastrophic
▸ Both arguments of regex_match(regex, string) matter:
  - Evil regexes
  - Slowdowns on given inputs for a fixed regex
▸ Identifying evil regexes is a hard problem
  - Widely varying complexity: linear to exponential
  - Focus on super-linear & exponential matching
USECASE: REGEX PARSERS / PCRE
▸ Can SlowFuzz find evil regexes given a fixed input?
  - Yes! Without any knowledge of the regex logic
▸ Example: (b+)+c (see the sketch below)
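A back-of-the-envelope view of why (b+)+c is evil: on an input of n b's with no trailing c, a backtracking matcher may have to try every way of splitting the run of b's among repetitions of the inner group before it can report a mismatch. The counter below (a sketch, not a regex engine) enumerates those splits; they are the compositions of n and grow as 2^(n-1).

```cpp
// Sketch of the search space behind catastrophic backtracking for (b+)+c.
// Each candidate parse of (b+)+ over a run of n b's is a way of cutting the
// run into non-empty chunks; counting them shows the exponential blow-up a
// backtracking engine faces when no final 'c' exists to satisfy the match.
#include <cstdint>
#include <cstdio>

// Number of ways to split a run of n b's into one or more non-empty chunks.
static uint64_t candidate_parses(int n) {
    if (n == 0) return 1;                     // everything consumed: one parse
    uint64_t total = 0;
    for (int chunk = 1; chunk <= n; ++chunk)  // size of the next (b+) repetition
        total += candidate_parses(n - chunk);
    return total;
}

int main() {
    for (int n = 1; n <= 24; ++n)             // grows as 2^(n-1)
        std::printf("n=%2d  candidate parses=%llu\n",
                    n, (unsigned long long)candidate_parses(n));
}
```

A naive backtracking engine can end up exploring a search space of roughly this size, which is why the slides distinguish super-linear from exponential matching behavior.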
USECASE: REGEX PARSERS / PCRE
▸ 100 runs of 1 million generations each
▸ Regexes of 10 characters or less
▸ At least 31 regexes causing a slowdown, with 90% probability
▸ At least 2 regexes with super-linear matching, with 90% probability
▸ At least 1 regex with exponential matching, with 45.45% probability
USECASE: REGEX PARSERS / PCRE
▸ Can SlowFuzz find inputs causing a slowdown on a fixed regex?
  - Regexes taken from production WAFs
  - 8-25% slowdowns
USECASE: DECOMPRESSION / BZIP
▸ bzip2
▸ 250-byte inputs
▸ 300x slowdown for a fixed input size (harness sketch below)
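For a target like this, the harness looks just like an ordinary libFuzzer harness. The sketch below is hypothetical (not the harness from the paper): it hands each generated buffer to bzip2's one-shot decompression API and lets the slowdown-directed engine rank inputs by how expensive the call was.

```cpp
// Hypothetical libFuzzer-style harness for bzip2 decompression; illustrative
// only. It bounds inputs at 250 bytes to mirror the experiment's setup and
// ignores the return code, since malformed inputs are expected and only the
// cost of the call matters to a slowdown-directed fuzzer.
#include <bzlib.h>
#include <cstdint>
#include <vector>

extern "C" int LLVMFuzzerTestOneInput(const uint8_t* data, size_t size) {
    if (size == 0 || size > 250) return 0;     // 250-byte inputs, as on the slide
    std::vector<char> out(1 << 20);            // generous output buffer
    unsigned int outLen = static_cast<unsigned int>(out.size());
    BZ2_bzBuffToBuffDecompress(out.data(), &outLen,
                               const_cast<char*>(reinterpret_cast<const char*>(data)),
                               static_cast<unsigned int>(size),
                               /*small=*/0, /*verbosity=*/0);
    return 0;
}
```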
CONCLUSION
‣ SlowFuzz: automated detection of complexity bugs through fuzzing
‣ Found non-trivial issues in highly performant code
  - PHP’s hashtable implementation
  - The PCRE regular expression library
  - bzip2
‣ Evolutionary fuzzing as a generic means of code exploration
  - Different objectives for different bug types
  - Beyond code-coverage maximization
  - Objective vs. controls: instrumentation, fitness functions, mutations