Extracting a Data Flow Analyser in Constructive Logic David - PowerPoint PPT Presentation

Extracting a Data Flow Analyser in Constructive Logic David Cachera, Thomas Jensen, David Pichardie and Vlad Rusu APPSEM’04, Tallinn

Static program analysis The goals of static program analysis ◮ To prove properties about the run-time behaviour of a program ◮ In a fully automatic way ◮ Without actually executing this program

Static program analysis The goals of static program analysis ◮ To prove properties about the run-time behaviour of a program ◮ In a fully automatic way ◮ Without actually executing this program Solid foundations for designing an analyser ◮ Formalization and correctness proof by abstract interpretation ◮ Resolution of constraints on lattices by iteration and symbolic computation

Formalization approximation domain programming language (lattice) + semantics correctness analysis specification proof

Resolution program to analyze (in)equation generation abstract domains (in)equation system computable resolution informations about the run-time behaviour of the program

So what’s the problem ?

Formalization part � P c . Cousot

Implementation part Formalization part int main(int argc, char **argv) { int i, j, t, parent_a, parent_b; int **swap, **newpop, **oldpop; double *fit, *normfit; get_options(argc, argv, options, help_string); srandom(seed); read_specs(specs); size += (size / 2 * 2 != size); newpop = xmalloc(sizeof(int *) * size); oldpop = xmalloc(sizeof(int *) * size); fit = xmalloc(sizeof(double) * size); normfit = xmalloc(sizeof(double) * size); for(i = 0; i < size; i++) { newpop[i] = xmalloc(sizeof(int) * len); oldpop[i] = xmalloc(sizeof(int) * len); for(j = 0; j < len * 2; j++) random_solution(oldpop[i]); } for(t = 0; t < gens; t++) { compute_fitness(oldpop, fit, normfit); dump_stats(t, oldpop, fit); for(i = 0; i < size; i += 2) { parent_a = select_one(normfit); parent_b = select_one(normfit); reproduce(oldpop, newpop, parent_a, parent_b, i); } swap = newpop; newpop = oldpop; oldpop = swap; } exit(0); } � P c . Cousot

Implementation part Formalization part int main(int argc, char **argv) { int i, j, t, parent_a, parent_b; int **swap, **newpop, **oldpop; double *fit, *normfit; get_options(argc, argv, options, help_string); srandom(seed); read_specs(specs); size += (size / 2 * 2 != size); newpop = xmalloc(sizeof(int *) * size); oldpop = xmalloc(sizeof(int *) * size); Do both parts talk about the same ? fit = xmalloc(sizeof(double) * size); normfit = xmalloc(sizeof(double) * size); for(i = 0; i < size; i++) { newpop[i] = xmalloc(sizeof(int) * len); oldpop[i] = xmalloc(sizeof(int) * len); for(j = 0; j < len * 2; j++) random_solution(oldpop[i]); } for(t = 0; t < gens; t++) { compute_fitness(oldpop, fit, normfit); dump_stats(t, oldpop, fit); for(i = 0; i < size; i += 2) { parent_a = select_one(normfit); parent_b = select_one(normfit); reproduce(oldpop, newpop, parent_a, parent_b, i); } swap = newpop; newpop = oldpop; oldpop = swap; } exit(0); } � P c . Cousot

Static Analysis for real-life languages Example of real-life language : bytecode JavaCard ◮ 180 instructions ◮ Real need of static analysis to verify properties about security, memory management, ... For this kind of languages, ◮ Abstract domains can be complex ◮ Correctness proofs become long and tiresome ◮ Implementation and maintenance of the analyser become a software engineering task

In this work We propose a technique based on the Coq proof assistant ◮ To specify a static analysis, ◮ To prove its correctness wrt. the semantics of the language, ◮ To extract a static analyser from the proof of existence of a correct program analysis result Program-as-proofs paradigm: Write a function f which ⇒ Make a constructive proof verifies a specification P ⇐ of ∀ x , ∃ y , P ( x , y ) ∀ x , P ( x , f ( x ))

Outline ◮ Motivation ◮ A Static Analysis for Carmel ◮ Building a certified static analyser ◮ Conclusion

Case study : a static analysis for Carmel e Rydhof Hansen 1 We follow the analysis proposed by Ren´ ◮ Carmel : an intermediate representation of Java Card byte code ◮ Construction of a certified data flow analyser for Carmel 1 Ren´ e Rydhof Hansen. Flow Logic for Carmel. SECSAFE-IMM-001, 2002

Syntax of Carmel Instruction ::= nop  push c   stack manipulation pop numop op � load x local variables manipulation store x � if pc jump goto pc  new cl  putfield f  heap manipulation getfield f � invokevirtual m id method call and return return

Semantic domains Val ::= num n n ∈ N ref r r ∈ Reference null Val ∗ Stack = LocalVar = Var → Val Frame = PointProg × NameMethod × LocalVar × Stack Frame ∗ CallStack = Object = FieldName → Val Heap = Reference → Object ⊥ State = Heap × CallStack Example : ( H , � m , pc , L , v :: S � :: SF )

Dynamic semantics Operational semantics with rules like instructionAt P ( m , pc ) = push c ( H , � m , pc , L , S � :: SF ) ⇒ ( H , � m , pc + 1 , L , c :: S � :: SF ) instructionAt P ( m , pc ) = invokevirtual m id m ′ = methodLookup ( m id , h ( loc )) f ′ � m ′ , 1 , V , ε � = f ′′ = � m , pc , l , s � ( h , � m , pc , l , loc :: V :: s � :: sf ) ⇒ ( h , f ′ :: f ′′ :: sf )

A Static Analysis for Carmel � � H , ˆ ˆ L , ˆ We want to calculate an approximation S on the domain � � � State = � � × NameMethod × PointProg → Heap LocalVar � � NameMethod × PointProg → � × Stack ◮ An approximation for all reachable heaps ◮ For each program points, an approximation of the operand stack and the local variables ◮ An object is abstracted to its class ◮ Numeric values are abstracted using Killdall’s Constant Propagation domain

Analysis specification � � H , ˆ ˆ L , ˆ Each instruction impose constraints on S . Example 0 : push 1 1 : push 2 2 : store 0 3 : load 0 4 : numop mult 5 : goto 1

Analysis specification � � H , ˆ ˆ L , ˆ Each instruction impose constraints on S . Example nil ⊑ ˆ b ⊤ ⊑ ˆ S ( m , 0 ) L ( m , 0 ) 0 : push 1 1 : push 2 2 : store 0 3 : load 0 4 : numop mult 5 : goto 1

Analysis specification � � H , ˆ ˆ L , ˆ Each instruction impose constraints on S . Example nil ⊑ ˆ b ⊤ ⊑ ˆ S ( m , 0 ) L ( m , 0 ) 0 : push 1 � push (ˆ 1 , ˆ S ( m , 0 )) ⊑ ˆ ˆ L ( m , 0 ) ⊑ ˆ S ( m , 1 ) L ( m , 1 ) 1 : push 2 2 : store 0 3 : load 0 4 : numop mult 5 : goto 1

Analysis specification � � H , ˆ ˆ L , ˆ Each instruction impose constraints on S . Example nil ⊑ ˆ b ⊤ ⊑ ˆ S ( m , 0 ) L ( m , 0 ) 0 : push 1 � push (ˆ 1 , ˆ S ( m , 0 )) ⊑ ˆ ˆ L ( m , 0 ) ⊑ ˆ S ( m , 1 ) L ( m , 1 ) 1 : push 2 � push (ˆ 2 , ˆ S ( m , 1 )) ⊑ ˆ ˆ L ( m , 1 ) ⊑ ˆ S ( m , 2 ) L ( m , 2 ) 2 : store 0 pop (ˆ S ( m , 2 )) ⊑ ˆ L ( m , 2 )[ 0 �→ c ˆ top (ˆ S ( m , 2 ))] ⊑ ˆ d S ( m , 3 ) L ( m , 3 ) 3 : load 0 � push (ˆ L ( m , 3 )[ 0 ] , ˆ S ( m , 3 )) ⊑ ˆ L ( m , 3 ) ⊑ ˆ ˆ S ( m , 4 ) L ( m , 4 ) 4 : numop mult ˆ L ( m , 4 ) ⊑ ˆ L ( m , 5 ) . . . 5 : goto 1 S ( m , 5 ) ⊑ ˆ ˆ ˆ L ( m , 5 ) ⊑ ˆ S ( m , 1 ) L ( m , 1 )

Analysis solution The smallest value which verifies all constraints. Example b nil [ 0 �→ ⊤ ; 1 �→ ⊤ ] 0 : push 1 < ˆ [ 0 �→ ˆ 2 > 1 ; 1 �→ ⊤ ] 1 : push 2 < ˆ 1 :: ˆ [ 0 �→ ˆ 1 ; 1 �→ ⊤ ] 2 > 2 : store 0 < ˆ [ 0 �→ ˆ 2 > 1 ; 1 �→ ⊤ ] 3 : load 0 < ˆ 1 :: ˆ [ 0 �→ ˆ 2 > 1 ; 1 �→ ⊤ ] 4 : numop mult < ˆ [ 0 �→ ˆ 2 > 1 ; 1 �→ ⊤ ] 5 : goto 1 ⊥ ⊥

Outline ◮ Motivation ◮ A Static Analysis for Carmel ◮ Building a certified static analyser ◮ Conclusion

Building a certified static analyser ◮ A puzzle with 8 pieces, ◮ Each piece interacts with its neighbors

Building a certified static analyser semantic domains ◮ Each semantic domain is modeled with a type ◮ Following exactly the definitions already seen in a previous slide

Building a certified static analyser semantic abstract domains domains ◮ Each semantic domain is in relation with an abstract domain ◮ an abstract domain is a lattice (formalization of lattices in Coq to follow...)

Building a certified static analyser semantic correctness abstract domains relations domains ◮ A relation ∼ between State and � State ◮ s ∼ � Σ interprets as “ � Σ is a correct approximation of s ” ◮ ∼ must be monotone : Σ 2 ∈ � ∀ s ∈ State , ∀ � Σ 1 , � State , if s ∼ � Σ 1 and � Σ 1 ⊑ � Σ 2 then s ∼ � Σ 2

Building a certified static analyser semantic correctness abstract domains relations domains semantic rules ◮ The transition relation · ⇒ · is defined using Coq inductive types ◮ Collecting semantics : ] = { s | ∃ s 0 an initial state , with s 0 ⇒ ∗ s } [ [ P ] We want to compute a correct approximation of [ [ P ] ]

Building a certified static analyser semantic correctness abstract domains relations domains analysis semantic specification rules ◮ we define a predicate P ⊢ � Σ which imposes a set of constraints on an abstract state � Σ

Extracting a Data Flow Analyser in Constructive Logic David - PowerPoint PPT Presentation

Extracting a Data Flow Analyser in Constructive Logic David Cachera, Thomas Jensen, David Pichardie and Vlad Rusu APPSEM04, Tallinn Static program analysis The goals of static program analysis To prove properties about the run-time

The I nCell Analyser 1 0 0 0 High throughput, high content screening The InCell Analyser 1000

RIPS RIPS A static source code analyser NDS Seminar for vulnerabilities in PHP scripts A

The Edge Handheld Lactate Analyser Rachel Hilton DipAVN(Medical) RVN The Edge Lactate

1 Methods of Extracting or Obtaining Essential Oils The most common method for extracting

THE VECTOR NETWORK ANALYSER A SERIOUS TOOL FOR THE TECHNICAL RADIO AMATEUR WHAT IS A VECTOR

The African Human Rights Case Law Analyser A Collection of decisions from the African Human Rights

Aims of Session Understand the concept of constructive alignment Identify the benefits

Constructive Mathematics in Constructive Set Theory Nicola Gambino University of Palermo MALOA

Extracting Gait Parameters Extracting Gait Parameters from Raw Data from Raw Data

Markov Logic Markov Logic Probability First-Order Logic Propositional Logic Markov Logic

Logic Synthesis Overview Design flow Principles of logic synthesis Logic

A simple and robust A simple and robust algorithm for extracting algorithm for extracting

Extracting Tables from PDFs Extracting Tables from PDFs Using Camelot and Excalibur to

Program Analysis Program Analysis Extracting information, in order to present Extracting

CKM 2006 CKM 2006 Extracting CKM phase from phase from Extracting CKM B K

Flow Visualization Overview: Flow Visualization (1) Introduction, overview Flow data Simulation

IC Project and Verifica7on Physical Verifica7on Joachim

sr r trt

Analysis of a door locking system Introduction Signals Hardware Software Rmi Audebert

Preallocating Resources for Distributed Memory based FPGA Debug Robert Hale & Brad Hutchings

C Execution Time Analysis Jan Gustafsson, Docent Mlardalen Real-Time Research Center (MRTC)

UDP Performance and PCI-X Activity of the Intel 10 Gigabit Ethernet Adapter on: HP rx2600 Dual

Dynamical Systems: Computability, Verification, Analysis Amaury Pouly Journe des nouveaux

A Ridiculously Brief Overview of Combinational Logic Design These slides provide a brief

Extracting a Data Flow Analyser in Constructive Logic David - PowerPoint PPT Presentation

Extracting a Data Flow Analyser in Constructive Logic David Cachera, Thomas Jensen, David Pichardie and Vlad Rusu APPSEM04, Tallinn Static program analysis The goals of static program analysis To prove properties about the run-time

The I nCell Analyser 1 0 0 0 High throughput, high content screening The InCell Analyser 1000

RIPS RIPS A static source code analyser NDS Seminar for vulnerabilities in PHP scripts A

The Edge Handheld Lactate Analyser Rachel Hilton DipAVN(Medical) RVN The Edge Lactate

1 Methods of Extracting or Obtaining Essential Oils The most common method for extracting

THE VECTOR NETWORK ANALYSER A SERIOUS TOOL FOR THE TECHNICAL RADIO AMATEUR WHAT IS A VECTOR

The African Human Rights Case Law Analyser A Collection of decisions from the African Human Rights

Aims of Session Understand the concept of constructive alignment Identify the benefits

Constructive Mathematics in Constructive Set Theory Nicola Gambino University of Palermo MALOA

Extracting Gait Parameters Extracting Gait Parameters from Raw Data from Raw Data

Markov Logic Markov Logic Probability First-Order Logic Propositional Logic Markov Logic

Logic Synthesis Overview Design flow Principles of logic synthesis Logic

A simple and robust A simple and robust algorithm for extracting algorithm for extracting

Extracting Tables from PDFs Extracting Tables from PDFs Using Camelot and Excalibur to

Program Analysis Program Analysis Extracting information, in order to present Extracting

CKM 2006 CKM 2006 Extracting CKM phase from phase from Extracting CKM B K

Flow Visualization Overview: Flow Visualization (1) Introduction, overview Flow data Simulation

IC Project and Verifica7on Physical Verifica7on Joachim

sr r trt

Analysis of a door locking system Introduction Signals Hardware Software Rmi Audebert

Preallocating Resources for Distributed Memory based FPGA Debug Robert Hale &amp; Brad Hutchings

C Execution Time Analysis Jan Gustafsson, Docent Mlardalen Real-Time Research Center (MRTC)

UDP Performance and PCI-X Activity of the Intel 10 Gigabit Ethernet Adapter on: HP rx2600 Dual

Dynamical Systems: Computability, Verification, Analysis Amaury Pouly Journe des nouveaux

A Ridiculously Brief Overview of Combinational Logic Design These slides provide a brief

Preallocating Resources for Distributed Memory based FPGA Debug Robert Hale & Brad Hutchings