

  1. A Method to Evaluate CFG Comparison Algorithms. Patrick P.F. Chan and Christian Collberg

  2. Research problem • Which CFG similarity algorithm is better? • If I come up with a new algorithm, how does it compare to the existing ones? • Is there a systematic way to compare CFG similarity algorithms?

  3. Research outcomes • A methodology to evaluate and compare CFG similarity algorithms • Comparison results of four CFG similarity algorithms • A survey of existing CFG similarity algorithms • A publicly available evaluation framework

  4. What is a CFG? • CFG stands for control-flow graph • A CFG represents all possible execution paths of a function • And thus, it encodes the function's behavior

  5. [Example CFG: Entry → a = input() → if a % 2 == 0 → True branch prints "even", False branch prints "odd" → Exit]
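The example CFG on slide 5 appears to come from a small program along the following lines; this is a reconstruction from the figure text, not code from the presentation.

```python
# Reconstruction of the program behind the example CFG on slide 5.
a = int(input())        # Entry block: read input
if a % 2 == 0:          # branch node: "if a % 2 == 0"
    print("even")       # True branch
else:
    print("odd")        # False branch
# Exit block
```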

  6. Why do we compare CFGs?

  7. Why do we compare CFGs? • Malware detection / classification: match a program's CFGs against the CFGs of known malware

  8. Why do we compare CFGs? • Software theft detection: how similar is a suspected pirated program to the original software?

  9. Why do we compare CFGs? • Grading programming assignments: how similar is a submission to the solution?

  10. Why do we compare CFGs? • Code clone detection: how similar are two code fragments?

  11. Why do we compare CFGs? • Detection of changes between different versions of a program

  12. Why do we compare CFGs? • Detection of changes between different versions of a program: match the nodes of the enhanced CFGs

  13. This leads to many algorithms to compare CFGs…

  14. Let’s use two existing algorithms to compare these two CFGs [Figure: CFG A with nodes 1-4 and CFG B with nodes 1-5]

  15. Algorithm 1 from Kruegel et al. • Extract subgraphs that have k nodes (k-subgraphs) from CFGs and match them

  16. [Figure: the k-node subgraphs extracted from CFG A and CFG B are compared pairwise] No match!
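To make the k-subgraph idea concrete, here is a minimal sketch; it is not code from the presentation nor Kruegel et al.'s actual fingerprinting scheme (which enumerates subgraphs via spanning trees and normalizes them). It enumerates connected k-node subgraphs of each CFG, reduces each to a crude canonical form, and checks whether the two CFGs share any of them. The edge lists at the bottom are purely hypothetical.

```python
from itertools import combinations, permutations

def connected_k_subgraphs(edges, k):
    """Enumerate node sets of size k that induce a (weakly) connected subgraph."""
    nodes = {u for e in edges for u in e}
    adj = {n: set() for n in nodes}
    for u, v in edges:
        adj[u].add(v)
        adj[v].add(u)              # treat edges as undirected for connectivity only
    subs = []
    for cand in combinations(sorted(nodes), k):
        cand_set = set(cand)
        seen, stack = {cand[0]}, [cand[0]]     # BFS inside the candidate set
        while stack:
            n = stack.pop()
            for m in (adj[n] & cand_set) - seen:
                seen.add(m)
                stack.append(m)
        if seen == cand_set:
            subs.append(cand)
    return subs

def canonical(edges, nodes):
    """Canonical form of the induced subgraph: lexicographically smallest
    relabeled edge set over all node orderings (fine for small k)."""
    edge_set = {(u, v) for u, v in edges if u in nodes and v in nodes}
    best = None
    for perm in permutations(nodes):
        relabel = {n: i for i, n in enumerate(perm)}
        form = tuple(sorted((relabel[u], relabel[v]) for u, v in edge_set))
        if best is None or form < best:
            best = form
    return best

def k_subgraph_match(edges_a, edges_b, k=3):
    """True if the two CFGs share at least one isomorphic k-node subgraph."""
    fps_a = {canonical(edges_a, s) for s in connected_k_subgraphs(edges_a, k)}
    fps_b = {canonical(edges_b, s) for s in connected_k_subgraphs(edges_b, k)}
    return bool(fps_a & fps_b)

# Hypothetical CFGs (not the ones from the slides):
cfg_a = [(1, 2), (2, 3), (2, 4)]
cfg_b = [(1, 2), (1, 3), (3, 4)]
print(k_subgraph_match(cfg_a, cfg_b, k=3))   # True: both contain a 3-node directed path
```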

  17. Algorithm 2 from Hu et al. • Approximates the minimum number of edit operations needed to transform one graph into another graph

  18. [Figure: cost matrix for CFG A vs. CFG B, with entries for the cost of matching nodes (e.g. node 1 of CFG A to node 1 of CFG B), the cost of deleting nodes of CFG A, the cost of deleting nodes of CFG B (e.g. node 4), and the cost of matching dummy nodes]

  19. [Figure: the resulting assignment between the nodes of CFG A and CFG B] Total cost = 5
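The standard way to compute such a matching is to build the (n+m) x (n+m) cost matrix and solve the assignment problem. The sketch below is not the presentation's code and not Hu et al.'s exact cost model; it uses SciPy's Hungarian-algorithm solver, and the node labels and unit costs at the bottom are made-up examples.

```python
import numpy as np
from scipy.optimize import linear_sum_assignment

def approx_edit_distance(nodes_a, nodes_b, sub_cost, del_cost=1.0, ins_cost=1.0):
    """Bipartite-assignment approximation of graph edit distance.

    Real-node substitutions fill the top-left block, deletions of CFG A nodes
    sit on the top-right diagonal, insertions of CFG B nodes on the bottom-left
    diagonal, and dummy-to-dummy matches in the bottom-right block are free.
    """
    n, m = len(nodes_a), len(nodes_b)
    INF = 1e9
    cost = np.full((n + m, n + m), INF)
    for i, a in enumerate(nodes_a):
        for j, b in enumerate(nodes_b):
            cost[i, j] = sub_cost(a, b)          # match node a to node b
    for i in range(n):
        cost[i, m + i] = del_cost                # delete node i of CFG A
    for j in range(m):
        cost[n + j, j] = ins_cost                # insert node j of CFG B
    cost[n:, m:] = 0.0                           # dummy nodes match for free
    rows, cols = linear_sum_assignment(cost)     # Hungarian algorithm
    return cost[rows, cols].sum()

# Hypothetical example: label mismatch costs 1, perfect match costs 0.
nodes_a = ["entry", "branch", "call", "exit"]
nodes_b = ["entry", "branch", "call", "exit", "loop"]
dist = approx_edit_distance(nodes_a, nodes_b,
                            sub_cost=lambda a, b: 0.0 if a == b else 1.0)
print(dist)   # 1.0: the extra "loop" node of CFG B must be inserted
```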

  20. And there are many other algorithms… • Algorithm from Vujošević-Janičić et al. iteratively builds a similarity matrix between the nodes of the two CFGs, based on the similarity of their neighbors • Algorithm from Sokolsky et al. models the control-flow graphs using Labeled Transition Systems (LTS)
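For flavor only, here is a heavily simplified sketch of the neighbor-similarity idea: node pairs start fully similar, and each iteration re-scores a pair by how well their successor sets can be matched. This is a generic illustration under assumed adjacency-dict inputs, not the actual algorithm of Vujošević-Janičić et al.

```python
def neighbor_similarity(adj_a, adj_b, iters=10):
    """Rough stand-in for an iterative node-similarity matrix (not the actual
    algorithm of Vujosevic-Janicic et al.): two nodes are similar if their
    neighborhoods are similar. adj_a / adj_b must map every node (including
    exit nodes with no successors) to the set of its successors."""
    nodes_a, nodes_b = list(adj_a), list(adj_b)
    sim = {(u, v): 1.0 for u in nodes_a for v in nodes_b}   # start fully similar
    for _ in range(iters):
        new = {}
        for u in nodes_a:
            for v in nodes_b:
                na, nb = adj_a[u], adj_b[v]
                if not na and not nb:
                    new[(u, v)] = 1.0            # two exit nodes: identical
                    continue
                # credit each successor of u with its best match among v's successors
                best = sum(max((sim[(x, y)] for y in nb), default=0.0) for x in na)
                new[(u, v)] = best / max(len(na), len(nb))
        sim = new
    return sim
```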

  21. But which one is the best?

  22. Evaluation of CFG similarity algorithms • Start by generating CFGs G1, G2, ..., Gi with increasing edit distances with respect to a seed CFG G0, i.e. ED(G0, Gi) = i • Use the algorithm under evaluation to rank the CFGs such that the higher the similarity score between Gi and G0 given by that algorithm, the higher Gi is ranked • Get a “goodness score” for the algorithm by comparing the ranking it produces to the ground truth ⟨G1, G2, G3, ...⟩, using rank-correlation measures such as sortedness or Pearson correlation
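A minimal sketch of this scoring step, assuming the algorithm under test is exposed as a `similarity(g, h)` function and using Pearson correlation as the goodness score (the sortedness measure mentioned on the slide is not shown); this is an illustration of the methodology, not the framework's own code.

```python
import numpy as np

def goodness_score(similarity, seed, variants):
    """Score one similarity algorithm on one test case.

    `variants` are G1, G2, ... ordered by ground-truth edit distance from
    `seed` (ED(G0, Gi) = i). The algorithm ranks them by similarity to the
    seed; that ranking is then correlated with the ground truth.
    """
    scores = [similarity(seed, g) for g in variants]
    # rank 1 = most similar according to the algorithm
    order = sorted(range(len(scores)), key=lambda i: -scores[i])
    algo_rank = [0] * len(scores)
    for rank, idx in enumerate(order, start=1):
        algo_rank[idx] = rank
    truth_rank = list(range(1, len(variants) + 1))   # G1 first, G2 second, ...
    return np.corrcoef(truth_rank, algo_rank)[0, 1]  # Pearson correlation
```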

  23. Example: the seed CFG G0

  24. Example: G1, G2, and G3 are generated from G0 with ED = 1, ED = 2, and ED = 3 respectively

  25. Example: the ground-truth ranking is ⟨G1, G2, G3⟩

  26. Example: algorithm A scores the generated CFGs against G0 as SimA(G1) = 0.4, SimA(G2) = 0.1, SimA(G3) = 0.8; ground-truth ranking: ⟨G1, G2, G3⟩

  27. Example: algorithm A's ranking is therefore ⟨G3, G1, G2⟩, versus the ground-truth ranking ⟨G1, G2, G3⟩

  28. Example: Pearson correlation between the ground-truth ranking ⟨G1, G2, G3⟩ and algorithm A's ranking ⟨G3, G1, G2⟩ = -0.5
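As a quick check (not part of the slides), the -0.5 can be reproduced directly from the two rank vectors:

```python
import numpy as np

# Ground truth <G1, G2, G3>: ranks of G1, G2, G3 are 1, 2, 3.
# Algorithm A  <G3, G1, G2>: ranks of G1, G2, G3 are 2, 3, 1.
truth_rank = [1, 2, 3]
algo_rank = [2, 3, 1]
print(np.corrcoef(truth_rank, algo_rank)[0, 1])   # -0.5, as on slide 28
```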

  29. Two questions remain… 1. What is the definition of the edit distance between two CFGs? 2. How do we generate CFGs such that they have increasing edit distances from the seed CFG G0?

  30. What is the definition of the edit distance between two CFGs? • The Graph Edit Distance is a function ED : (Gi, Gj) → ℕ that computes the smallest number of edit operations needed to transform Gi into Gj • There are four possible edit operations

  31. What is the definition of the edit distance between two CFGs? • Add a zero-degree node

  32. What is the definition of the edit distance between two CFGs? • Add an edge between two existing nodes

  33. What is the definition of the edit distance between two CFGs? • Delete an edge between two existing nodes

  34. What is the definition of the edit distance between two CFGs? • Delete a zero-degree node
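A minimal sketch of these four edit operations on a toy CFG representation (a set of nodes plus a set of directed edges); illustrative code, not the framework's implementation:

```python
class CFG:
    """Toy CFG: a node set plus a directed edge set."""
    def __init__(self, nodes=(), edges=()):
        self.nodes = set(nodes)
        self.edges = set(edges)

    def degree(self, n):
        return sum(1 for u, v in self.edges if u == n or v == n)

    def add_node(self, n):                 # add a zero-degree node
        assert n not in self.nodes
        self.nodes.add(n)

    def add_edge(self, u, v):              # add an edge between existing nodes
        assert u in self.nodes and v in self.nodes
        self.edges.add((u, v))

    def delete_edge(self, u, v):           # delete an edge between existing nodes
        self.edges.remove((u, v))

    def delete_node(self, n):              # delete a zero-degree node
        assert self.degree(n) == 0
        self.nodes.remove(n)
```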

  35. How do we generate CFGs such that they have increasing edit distances from the seed CFG G0? [Figure: a seed CFG G0 with nodes a, b, c, d]

  36. How do we generate CFGs such that they have increasing edit distances from the seed CFG G0? [Figure: applying Add Node, Delete Edge, and Add Edge to G0] For every possible edit operation that can be applied to G0, apply it and generate a new graph

  37. How do we generate CFGs such that they have increasing edit distances from the seed CFG G0? Do the same for the newly generated graphs to obtain the Edit Distance Graph (EDG)

  38. How do we generate CFGs such that they have increasing edit distances from the seed CFG G0? [Figure: the EDG expanded one more level] Randomly pick a CFG on each level; they become our G1, G2, G3, …
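A sketch of the EDG construction described on slides 36-38, assuming integer node labels and representing each graph as a pair (frozenset of nodes, frozenset of edges). It is deliberately naive: the real framework would also need to recognize isomorphic duplicates, which this simple set comparison does not.

```python
import random

def single_edits(nodes, edges):
    """All graphs one edit operation away from (nodes, edges)."""
    out = set()
    fresh = max(nodes, default=0) + 1                      # assumes integer labels
    out.add((nodes | {fresh}, edges))                      # add a zero-degree node
    for u in nodes:
        for v in nodes:
            if u != v and (u, v) not in edges:
                out.add((nodes, edges | {(u, v)}))         # add an edge
    for e in edges:
        out.add((nodes, edges - {e}))                      # delete an edge
    for n in nodes:
        if not any(n in e for e in edges):
            out.add((nodes - {n}, edges))                  # delete a zero-degree node
    return out

def edit_distance_graph(seed_nodes, seed_edges, levels):
    """Level i holds graphs reachable from the seed in i edit operations."""
    level = {(frozenset(seed_nodes), frozenset(seed_edges))}
    seen = set(level)
    edg = [level]
    for _ in range(levels):
        nxt = set()
        for nodes, edges in edg[-1]:
            for g in single_edits(nodes, edges):
                if g not in seen:
                    seen.add(g)
                    nxt.add(g)
        edg.append(nxt)
    return edg

def pick_test_case(edg):
    """Randomly pick one CFG per level: these become G1, G2, G3, ..."""
    return [random.choice(list(level)) for level in edg[1:]]

# Hypothetical seed graph with four nodes and a diamond of edges.
edg = edit_distance_graph({1, 2, 3, 4}, {(1, 2), (1, 3), (2, 4), (3, 4)}, levels=3)
g1, g2, g3 = pick_test_case(edg)
```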

  39. Implementation • Re-coded four CFG similarity algorithms in Python • Implemented the evaluation framework • Generated an EDG with five levels • Picked 100 test cases (each test case comprises five CFGs)

  40. Evaluation results

  41. Evaluation results

  42. Evaluation results

  43. Evaluation results

  44. Evaluation results “Goodness score” statistics of the four algorithms

  45. Evaluation results Time used by the four algorithms to finish 100 test cases

  46. Related work • An evaluation framework for text plagiarism detection • Generate artificial plagiarism cases • Shuffling, removing, inserting, or replacing words or short phrases at random

  47. Related work • An evaluation framework for code clone detection tools • Inject mutated code fragments into the code base

  48. Future work • Generate CFGs with instructions in the nodes • Editing instructions would lead to a huge EDG

  49. Try our framework • http://cfgsim.cs.arizona.edu/ • Evaluate existing algorithms • Compare your own algorithm with the others • Fine tune your algorithm

  50. Summary • A methodology to evaluate CFG similarity algorithms • Publicly available evaluation framework • Serves as a benchmark for users and researchers of CFG similarity algorithms

  51. Thank you!
