Retrofitting Security in input parsing routines Jayakrishna Menon, - PowerPoint PPT Presentation

Retrofitting Security in input parsing routines Jayakrishna Menon, Christophe Hauser, Yan Shoshitaishvili, Stephen Schwab {jmenon, hauser, schwab}@isi.edu yans@asu.edu

Modern defenses Vulnerabilities Many programs are still ● OS defenses (ASLR, DEP). ● written in unsafe Compiler-level defenses ● languages like C/C++. (e.g., stack canaries). Memory corruption ● Code audit tools. ● vulnerabilities remain prominent.

parsers Directly exposed to user input. ● Many custom implementations in unsafe languages (C/C++). ● Over 170 vulnerabilities reported in various parsing ● mechanisms since 1999. Varying semantics and the abundance of string ● manipulations make their implementation error-prone.

Solution space

Design time post-design security security Parser libraries. ● Code audits. ● Parser generators. ● Refactoring/inserting ● correct parsers. Formal methods. ● No source code? ●

Binary-level approach Source code not always ● available (legacy code, uncooperative editors, untrusted IoT devices). What you see is not what ● you execute: compiler bugs, compiler “backdoors” WYSINWYX e.g., XCodeGhost (linking malicious code into executables).

challenges

Scaling problem Program analysis techniques are difficult to automate in a scalable and precise manner.

Static analysis Symbolic execution Precise. ● Scalable. ● Unscalable. ● Imprecise. ●

Dynamic analysis Precise. ● Low coverage. ●

Source code Binary Types. ● Registers. ● Variable names. ● Memory locations. ● Functions. ● Basic blocks. ● ... ● ... ●

How to scale to real world programs?

template-based approach … to discover vulnerabilities based on templates corresponding to common classes of security bugs. … to retrofit security by patching programs at the binary-level.

Initial approach classes/templates Focuses on overflows in ● buffers allocated Unconstrained input. ● statically on the stack. Under-constrained input ● size. template-based: ● Unchecked termination ● categorize causes of condition. vulnerabilities into ... ● three classes. Combines static analysis ● and symbolic execution.

Unconstrained Improper usage of functions that do not check for sizes such as input. strcpy, sprintf etc.

Example 1: CVE-2003-0390 int opt_atoi( char *s) { char buf[1024]; char *fmt = "String [ %s ] is not valid"; sprintf(buf, fmt, s); }

Under-constrained Improper validation of size field in functions such as memcpy. input size.

Example 2: CVE-2015-3329 void phar_set_inode( phar_entry_info *entry) { char tmp[1024]; memcpy(tmp, entry -> phar -> fname, entry->phar->fname_len); }

Unchecked Performing operations on termination (possibly) incorrectly terminated strings. condition.

2-step Analysis approach Symbolic analysis Static analysis } Identify string Identify } program paths. } CFG manipulation destination functions. buffers (sinks). SE Dangerous Identify user Analyze backward DDG input. data-dependency. Path constraints. (Memory corruption caused by unsafe buffer manipulation)

Analysis results Static Analysis Symbolic Overall execution False positive rate 6.6% 0% 0% * False negative rate 40% 0% * 40% Time 1-260s 1-400s 2-660s

New bugs 2 new bugs found in the binary code of common opensource projects and libraries (in a semi-automatic setting)

Retrofitting security: binary patching

Remember: we focus on stack ● Adding the missing buffers. On the identified program ● checks paths, we constrain the user input such that: user_input_size < stack_buffer_size

When the constraints are Adding the missing violated, we crash the program. checks This is equivalent to e.g., __sprintf_chk()

Static reassembly problems: breaking internal program references. Patching the binary Partial solution: inject trampoline gadgets in padding bytes between functions (up to 15 consecutive NOPs).

Inserting checks int opt_atoi(char *s) int opt_atoi(char *s) if(strlen(s)>1024) sprintf(buf, fmt, s); exit() sprintf(buf, fmt, s);

More templates

New template Memory allocation errors … authentication errors. … misuses of cryptographic APIs. … information leakage.

New bugs 12 new bugs found in the binary code of common opensource programs and libraries (in a fully automated setting).

discussion Lightweight and scalable approach. … but high rate of false negatives. … limited patching capabilities.

Data structure recovery. Stumbling blocks Pointer aliasing.

Future work Improve data dependence tracking. ● Leverage static reassembly techniques. ● More vulnerability templates. ● Apply to large corpus of IoT firmware. ●

Key takeaways - Templates per vulnerability class. - Scalable, two-level approach based on a combination of static analysis + symbolic execution. - High-precision: we can infer semantic-agnostic patches for each class. - New bugs.

Retrofitting Security in input parsing routines Jayakrishna Menon, - PowerPoint PPT Presentation

Retrofitting Security in input parsing routines Jayakrishna Menon, Christophe Hauser, Yan Shoshitaishvili, Stephen Schwab {jmenon, hauser, schwab}@isi.edu yans@asu.edu Modern defenses Vulnerabilities Many programs are still OS defenses

Introduction to Bottom-Up Parsing Shift-reduce parsing The LR parsing algorithm

CSC 4181 Compiler Construction Parsing 1 1 Outline Top-down v.s. Bottom-up Top-down parsing

Robust Incremental Neural Semantic Graph Parsing Jan Buys and Phil Blunsom Dependency Parsing vs

Basic Parsing Algorithms Chart Parsing Seminar Recent Advances in Parsing Technology WS

Bottom-up parsing LR parsing Construct parse tree for input from leaves up LR( k ) parsing

File Input and Output File Input and Output 1 / 9 File input/output input function reads values

Input Input devices Text entry Positional input Input Devices 1 iPod Wheel Input Devices 2

Mathematics Teaching and Learning Robert Q. Berry, III, PhD NCTM President rberry@nctm.org

Kindergarten Daily Routines (With Formative Assessment Questions Removed) 2015-03-11

1st Grade Daily Routines without formative assessment questions 2015-03-10 www.njctl.org Slide

DNS and Security DNS and Security DNS and Security DNS and Security DNS and Security DNS and

Tra ffi c Management as a Service | Ghent, Belgium INPUT PROCESS OUTPUT INPUT PROCESS OUTPUT

Input Input devices Text entry Positional input Input Devices 1 MacBook Wheel (The Onion) -

Advanced Systems Security Retrofitting for Security Trent Jaeger Systems and Internet

Models of Human Parsing Experimental Data 2 Informatics 2A: Lecture 22 Eye-tracking Reading

Outline LR Parsing Review of bottom-up parsing LALR Parser Generators Computing the

Allocation for Social Good Auditing Mechanisms for Utility Maximization Taylor Lundy 1 , Alexander

David Rook Agnitio Security code review swiss army knife Hack in Paris, Paris Friday, 17 June

Development of Web Applications Principles and Practice Vincent Simonet, 2013-2014 Universit

Internet Systems Programming NFS: Protocols, Programming, and Implementation Erez Zadok

The 10 Commandments of Release Engineering Dinah McNutt Google, Inc. Overview This talk is

Dyninst: A Binary Analysis and Modification Framework Jeffrey K. Hollingsworth Ray Chen

Outline t t Why static analysis? t t What is it? Underlying technology t Some

CSE306 Software Quality in Practice Dr. Carl Alphonce alphonce@buffalo.edu 343 Davis Hall