Symbolic Verification of Programs with Pointers using Tree Automata - PowerPoint PPT Presentation

Symbolic Verification of Programs with Pointers using Tree Automata ı ˇ Jiˇ r´ Sim´ aˇ cek Universit´ e Joseph Fourrier (France) Brno University of Technology (Czech Republic) 1

Ph.D. • 1st year of doctoral degree programme at Brno University of Technology – supervised by Tom´ aˇ s Vojnar • joint supervision under the cotutelle agreement with Universit´ e Joseph Fourrier – supervised by Yassine Lakhnech – co-supervised by Radu Iosif • the topic of the research: Advanced Symbolic Verification Methods Using Finite-State Automata and Related Formalisms 2

General Program Structure • a computer program can combine various constructions such as: – arithmetic, – array manipulation, – pointer manipulation, – recursion, – parallel execution, etc. • verification of each of the above requires different approaches (which can be combined in the ideal case) • we focus on programs with pointers – bugs in pointer manipulation can be very tricky when using low level programming languages (C/C++) – yet the pointers allow construction of useful data structures (list, trees, etc.) 3

Programs with Pointers • we restrict to the following statements ( x , y are pointer variables, next(i) denotes i-th selector): – new(x) (heap allocation) – x := null (nil assignement) – x := y (simple assignement) – x := y.next(i) (assignement with dereference of source) – x.next(i) := y (assignement with dereference of destination) – if/while (x = y) (conditional branching) – delete(x) (heap deallocation – optional) • no C-style pointer arithmetic ( p++ , *(p+3) ) 4

Programs with Pointers – Verification • safety – a pointer variable has to point to some memory cell when dereferenced, i.e. it has to be assigned a valid address before – a memory cell released by calling delete is never used in the future (and also never released again) – user specified assertions • termination (liveness) – a program terminates for any input 5

Related Work • 3-valued predicate logic with transitive closure – [Sagiv, Reps, Wilhelm ’96] • separation logic – [Reynolds ’02] • regular model checking – [Kesten, Maler, Marcus, Pnueli, Shahar ’97] • many other approaches exist 6

3-valued Predicate Logic with Transitive Closure • at a given program point, a single pointer variable can point to a (possibly infinite) set of structures (in all possible executions of a program) • the aim of the analysis is to create a finite representation of the heap • it does so by using shape graphs , which consist of an abstract state , an abstract heap , and a sharing information for abstract locations 7

Separation Logic • the heap often consists of indipendent parts which are not interconnected or which are interconnected in a bounded way • separation logic extends Hoare logic in order to reason about different parts of the heap locally – heap configurations are represented by formulae in separation logic (data structures are described using recursive predicates) – an execution of the program statements is replaced by a Hoare-style reasoning and a generating of invariants 8

Seperation Logic – Example • list segment predicate: ⇒ E � = F ∧ ( E �→ F ∨ ( ∃ x ′ .E �→ x ′ ∗ ls ( x ′ , F ))) ls ( E, F ) ⇐ • list reversal ( u points to a singly-linked list at the beginning): 1: while (u � = null) do { ls ( u, ⊥ ) } 2: w := u.next; 3: u.next := v; 4: v := u; u := w; 5: od { ls ( u, ⊥ ) ∗ ls ( v, ⊥ ) } (inv.) 6: { ls ( v, ⊥ ) } • things to verify: – no null pointer dereference occurs, – the program eventually terminates, – v contains the reversal of u at the end 9

Regular Model Checking • heap configurations are represented by finite automata (over words or trees) • program statements are interpreted over these automata (usually using transducers) • it is possible to use CEGAR approach • some modifications (ARTMC) allow verification of more complex structures than trees by using tree automata only – [Bouajjani, Habermehl, Rogalewicz, Vojnar ’06] • it is possible to verify: – operations on doubly linked lists, – operations on different kind of trees, – Deutsch-Schorr-Waite algorithm, etc. 10

A New Method of Verification based on Tree Automata • why? – separation logic: often requires the specification of recursive predicates (e.g. for a singly-linked list) and invariant generation rules over these predicates; only a limited ability to handle something more complex than lists – regular model checking: the invariant generation is automated, but the heap is represented by a single automaton; doesn’t scale well on very complex structures • we want to combine advantages of both methods • we want to handle more general structures than lists or trees • we want to avoid using transducers for symbolic execution of statements (overhead) 11

Heap Representation • the heap can be viewed as a directed graph, where nodes represent memory cells and edges represent the selectors • an example ( ⊥ denotes null value, x , y are pointer variables, memory cells contain selectors 1, 2) y x 2 2 1 1 1 ⊥ 2 1 2 1 2 2 ⊥ 1 2 ⊥ 2 1 1 ⊥ ⊥ ⊥ 1 2 1 2 ⊥ ⊥ ⊥ ⊥ 12

Tree-based Heap Decomposition and Cut-points • the heap is a general directed graph, but we have tree automata only – graph automata exist, but operations are too hard • the heap can be decomposed into trees by using cut-points , which are nodes pointed to by a variable or nodes that contain more than one incoming edge (are pointed to by more than one selector) • example ( x , y point to c 1 and c 2 respectively): c 1 c 4 2 1 2 1 1 c 3 ⊥ c 2 2 1 2 1 2 2 ⊥ 1 2 ⊥ 2 1 1 1 ⊥ ⊥ ⊥ 2 1 2 ⊥ ⊥ ⊥ ⊥ 13

Representing Memory Configurations by Tree Automata • an accepting run (bottom-up) of the automaton describes a part of one heap configuration (memory cells and content of their selectors); the complete configuration is obtained by combining runs of several such automata • each cut-point can appear at most once (as an accepting state) in a run (it represents only a single memory cell) • the automaton contains leaf rules for ⊥ and for each cut-point • an example (a singly-linked list): c ′ 1( q 1 ) → 1 1 1 1 x . . . 1( q 1 ) → q 1 1( c 1 ) → q 1 1 (leaf rule: a → c 1 ) 14

Introducing hierarchy • what about a doubly-linked list? 1 1 1 2 1 ⊥ ⊥ x . . . 2 2 2 • we get an unbounded number of cut-points in the tree decomposition! c i c 1 c k . . . 1 2 . . . 1 2 1 2 ⊥ ⊥ c 2 c i +1 c i − 1 c k − 1 15

Introducing Hierarchy • try to hide some of the cut-points in the hierarchically structured automata • in the case of doubly-linked lists, create a box consisting of 2 automata – DLL ( out : c 1 , in : c 2 ) : 1 c ′ A 1 : 1( c 2 ) → 1 c 2 c 1 A 2 : 2( c 1 ) → c ′ 2 2 • use this box as a symbol on a higher level: c ′ � DLL, 2 � ( q 1 , ⊥ ) → 1 DLL ( q 1 ) → q 1 1( ⊥ ) → q 1 16

Introducing Hierarchy – Example • consider the doubly-linked list: 1 1 1 1 2 ⊥ x ⊥ 2 2 2 • the run of the corresponding automaton looks as follows (without leaf rules): 1 DLL DLL DLL ⊥ − → − → − → − → c ′ q 1 q 1 q 1 1 2 ⊥ − → 17

Main Challenges • language inclusion ( ⊆ ) – we don’t know how to complement hierarchical tree automata but we know how to test inclusion on tree automata without complementing [Bouajjani, Habermehl, Holik, Touili, Vojnar ’08] – we don’t know how to do the inclusion in general (yet) – there are some safe approximations though (top-level inclusion checking) • the other automata operations ( ∪ , ∩ ) • invariant generation 18

Low Level Symbolic Representation • automata tend to grow too much to fit in a memory • there are ways how to store them efficiently using symbolic representation – BDDs, – sparse matrices, . . . • already used in ARTMC (MONA) • current implementations usually targets deterministic automata only 19

Future Directions • an ability to handle dynamic structures containing data • an automated learning of the hierarchy • function calls – heap summaries – the recursion • multi-threaded programs – an ability to lock each node separately • a tool that scales 20

Thank You 21

Symbolic Verification of Programs with Pointers using Tree Automata - PowerPoint PPT Presentation

Symbolic Verification of Programs with Pointers using Tree Automata Ji r Sim a cek Universit e Joseph Fourrier (France) Brno University of Technology (Czech Republic) 1 Ph.D. 1st year of doctoral degree programme at

Decidability Decidability and Symbolic Symbolic Verification Symbolic Symbolic Verification

Pointers to Functions C doesn t require that pointers point only to data; it s also

CS162 - POINTERS Lecture: Pointers and Dynamic Memory What are pointers Why dynamically

Topic 7 1. Defining and using pointers 2. Arrays and pointers 3. C and C++ strings 4.

Symbolic verification of cryptographic protocols using Tamarin Part 2 : Symbolic Verification

Fundamentals of Programming Lecture 15 Hamed Rasifard 1 Outline Types of Pointers

Pointers and Dynamic Memory Allocation Pointers 2 Pointers Pointer-type variables allow

Pointers II 1 Outline Pointers arithmetic and others Functions & pointers 2 Pointer

Pointers & Dynamic Memory Review C Pointers Introduce C++ Pointers Data Abstractions

Formal Verification Methods 2: Symbolic Simulation John Harrison Intel Corporation

CS 241: Systems Programming Lecture 25. Function Pointers Spring 2020 Prof. Stephen Checkoway 1

Class Five You havent run screaming yet... Lets do pointers! pointers are one of the

C Programming for Engineers Pointers ICEN 360 Spring 2017 Prof. Dola Saha 1 Pointers

Programming for Engineers Pointers ICEN 200 Spring 2018 Prof. Dola Saha 1 Pointers

Pointers II 1 Outline Pointers arithme.c and others

Pointers and dynamic objects Topics Pointers Memory addresses Declaration

Multiple tree automata a new model of tree automata Gwendal Collet (TU Wien), Julien David (LIPN)

Results of the Golden 1960s Wolfgang Thomas Francqui Lecture, Mons, April 2013 Golden Times

Finite Automata on Unranked Trees: Extensions by Arithmetical and Equality Constraints Karianto

Graph Automata Jan Leike July 2nd, 2012 Motivation We want an automata model that Motivation

Automata-Based Analysis of Recursive Concurrent Programs Markus Mller-Olm Westflische

Polishness of some topologies related to automata Olivier Finkel Joint work with Olivier Carton

Automata-based analysis of recursive cryptographic protocols Thomas Wilke Joint work with Ralf

What do you do if a computational object fails a specification? Target ... We have

Symbolic Verification of Programs with Pointers using Tree Automata - PowerPoint PPT Presentation

Symbolic Verification of Programs with Pointers using Tree Automata Ji r Sim a cek Universit e Joseph Fourrier (France) Brno University of Technology (Czech Republic) 1 Ph.D. 1st year of doctoral degree programme at

Decidability Decidability and Symbolic Symbolic Verification Symbolic Symbolic Verification

Pointers to Functions C doesn t require that pointers point only to data; it s also

CS162 - POINTERS Lecture: Pointers and Dynamic Memory What are pointers Why dynamically

Topic 7 1. Defining and using pointers 2. Arrays and pointers 3. C and C++ strings 4.

Symbolic verification of cryptographic protocols using Tamarin Part 2 : Symbolic Verification

Fundamentals of Programming Lecture 15 Hamed Rasifard 1 Outline Types of Pointers

Pointers and Dynamic Memory Allocation Pointers 2 Pointers Pointer-type variables allow

Pointers II 1 Outline Pointers arithmetic and others Functions &amp; pointers 2 Pointer

Pointers &amp; Dynamic Memory Review C Pointers Introduce C++ Pointers Data Abstractions

Formal Verification Methods 2: Symbolic Simulation John Harrison Intel Corporation

CS 241: Systems Programming Lecture 25. Function Pointers Spring 2020 Prof. Stephen Checkoway 1

Class Five You havent run screaming yet... Lets do pointers! pointers are one of the

C Programming for Engineers Pointers ICEN 360 Spring 2017 Prof. Dola Saha 1 Pointers

Programming for Engineers Pointers ICEN 200 Spring 2018 Prof. Dola Saha 1 Pointers

Pointers II 1 Outline Pointers arithme.c and others

Pointers and dynamic objects Topics Pointers Memory addresses Declaration

Multiple tree automata a new model of tree automata Gwendal Collet (TU Wien), Julien David (LIPN)

Results of the Golden 1960s Wolfgang Thomas Francqui Lecture, Mons, April 2013 Golden Times

Finite Automata on Unranked Trees: Extensions by Arithmetical and Equality Constraints Karianto

Graph Automata Jan Leike July 2nd, 2012 Motivation We want an automata model that Motivation

Automata-Based Analysis of Recursive Concurrent Programs Markus Mller-Olm Westflische

Polishness of some topologies related to automata Olivier Finkel Joint work with Olivier Carton

Automata-based analysis of recursive cryptographic protocols Thomas Wilke Joint work with Ralf

What do you do if a computational object fails a specification? Target ... We have

Pointers II 1 Outline Pointers arithmetic and others Functions & pointers 2 Pointer

Pointers & Dynamic Memory Review C Pointers Introduce C++ Pointers Data Abstractions