Domain-Specific Languages for Program Analysis Mark Hills OOPSLE - PowerPoint PPT Presentation

Domain-Specific Languages for Program Analysis Mark Hills OOPSLE 2015: Open and Original Problems in Software Language Engineering March 6, 2014 Montreal, Canada http://www.rascal-mpl.org 1

Overview • A Starting Example: DCFlow • Other Early-Stage Ideas • Summary extraction from documentation • Trace processing • Discussion 2

Say you need a control flow graph… entry entry 3 x true false 3 x := 3 x := 3 10 15 x false true y := 10 y := 15 15 10 y := 15 y := 10 exit exit 3

Building control flow graph extractors • First, define how to represent control flow graphs • Then, pick a language — hopefully we can reuse the first part for di ff erent languages, but maybe not… • Next, define the control flow rules, using your favorite language (such as Rascal, of course…) • Finally, define something that uses the graph — this makes sure the data structure is rich enough to be useful as well… 4

What if we want to work with another language? • May be able to reuse base CFG definition (but maybe not) • Cannot reuse flow definition (unless CFG def is the same and features have identical semantics — the flow rules are specific to the features being defined) • Cannot easily reuse analysis (since CFG definition and semantics di ff er) 5

  What if we want to work with another language? • May be able to reuse base CFG definition (but maybe not) • Cannot reuse flow definition (unless CFG def is the same and features have identical semantics — the flow rules are specific to the features being defined) • Cannot easily reuse analysis (since CFG definition and semantics di ff er)   So, we write the entire thing over again   (and again, and again…) 6

DCFlow: Declarative Control Flow • Declarative DSL for defining control flow rules • Generates Rascal code to build intraprocedural control flow graphs with reusable library of CFG concepts • Provides basic visualization to allow graphs to be rendered in GraphViz dot • Provides ignore mechanism to indicate which language constructs we are not trying to define • IDE provides basic checking to aid user (with more coming) 7

DCFlow Architecture DCFlow CFG Builder DCFlow Language-Specific Translator Modules Definition Functions (Rascal) (Rascal) (Rascal) Source Program DCFlow Libraries CFG Construction (Input Language) (Rascal) (Rascal) GraphViz CFG Visualization Control Flow Visualizations (Rascal) Graphs (Rascal) (GraphViz,dot) 8

    Building up an example: plus • What should plus do?   binaryOperation(Expr left, Expr right, plus()) 9

          Building up an example: plus • What should plus do?   binaryOperation(Expr left, Expr right, plus()) • Run left, then run right, then add them together   rule EXP::add = left --> right --> self; 10

        Building up an example: plus • What should plus do?   binaryOperation(Expr left, Expr right, plus()) • Run left, then run right, then add them together   rule EXP::add = left --> right --> self; • That’s it!   11

            Something more complex: while loops • What should while do?   \while(Expr cond, list[Stmt] body) 12

        Something more complex: while loops • What should while do?   \while(Expr cond, list[Stmt] body) • The exp is the first and last thing we should do • A footer is useful as a target for break and continue • We need a back-edge, and it would be nice to label others   13

        Something more complex: while loops • What should while do?   \while(Expr cond, list[Stmt] body) • The exp is the first and last thing we should do • A footer is useful as a target for break and continue • We need a back-edge, and it would be nice to label others   rule STATEMENT::whileStat = create(footer), ^exp -conditionTrue-> body -backedge-> exp, exp -conditionFalse-> $footer; 14

Design Decisions • Focus on abstract syntax trees (should   almost work on Rascal concrete syntax,   but there are some di ff erences) • Leverage reified types for generation and checking • Try to ensure added features are general — don’t want to add something just because PHP or Java needs it • Make sure generated code is understandable — it should look close to what you would write yourself 15

How about for other domains? • Idea 1: Program tracing • Internal DSL — goal is to build this as a library in Rascal • Allow filter functions to keep or discard events of interest • Use closures to support registration of handlers for specific events or event patterns • What we have now: rudimentary tracing for PHP programs using Rascal and xdebug (running over TCP sockets) 16

How about for other domains? • Idea 2: Summary extraction • Libraries make it harder to analyze code, we may not know what these libraries actually do • Extract function/procedure/method summaries from existing documentation — basic info such as signatures, types, maybe ability to attach more advanced info • No work on this yet, still deciding what makes sense — currently works for PHP by extracting very generic HTML representation and using Rascal to match over it 17

Related work • “Extensible intraprocedural flow analysis at the abstract syntax tree level”, Söderberg, Ekman, Hedin, Magnusson • Uses attribute grammars to represent control flow • Reference attributes represent edges • Collection attributes represent inverse relations (e.g., pred) • Higher-order attributes allow building new AST nodes (e.g., entry and exit)

Related work • Spoofax: NaBL, language for incremental type checking • DHAL and variants for data flow analysis • Related conceptually — use domain-specific languages for specific analysis-related tasks • Direct language support: Rascal, TXL, Spoofax, ASF+SDF , etc

Discussion 20

Discussion: Some possible topics… • What opportunities are there for creating DSLs for program analysis? Which parts of the process would be best for this? • Which is best: internal or external? What circumstances drive this? • Is this even a good idea? Why not just use Rascal (or something else, if you must…) 21

Which design decisions are important? • Focus on abstract syntax trees (should   almost work on Rascal concrete syntax,   but there are some di ff erences) • Leverage reified types for generation and checking • Try to ensure added features are general — don’t want to add something just because PHP or Java needs it • Make sure generated code is understandable — it should look close to what you would write yourself 22

Domain-Specific Languages for Program Analysis Mark Hills OOPSLE - PowerPoint PPT Presentation

Domain-Specific Languages for Program Analysis Mark Hills OOPSLE 2015: Open and Original Problems in Software Language Engineering March 6, 2014 Montreal, Canada http://www.rascal-mpl.org 1 Overview A Starting Example: DCFlow Other

Domain Specific Languages Domain Specific Languages in Erlang Dennis Byrne

hendren@cs.mcgill.ca COMP 520 Winter 2016 Domain-Specific Languages - OncoTime (2) Designing

Language engineering and Domain Specific Languages Perdita Stevens School of Informatics

Customizable Domain- Customizable Domain -Specific Computing Specific Computing Jason Cong

Domain-Specific Engineering of Domain-Specific Languages Rapha el Mannadiar and ,

Visual Domain Specific Languages for Actuarial Models: An Industrial Experience Report Workshop

DSL Engineering with Sven Efftinge - itemis.com DOMAIN-SPECIFIC LANGUAGE A Domain Specific

Using Domain Specific Languages for Software Process Modeling Daro Correal Rubby Casallas

Towards Model-Based Testing of Domain-Specific Modelling Languages J. Merilinna, Olli-Pekka

Demanding First-Class Equality for Domain Specific Aspect Languages Arik Hadas Dept. of

Toward Disposable Domain- Specific Aspect Languages Arik Hadas Dept. of Mathematics and Computer

(Domain-Specific) Modelling Language Engineering Hans Vangheluwe 5 September 2010, Lisboa,

Organization of DSLE part Tooling Domain Specific Language Domain Specific Language

Adding domain-specific constructs to Event B Adding domain-specific constructs to Event B for

Domain-specific front-end for virtual Domain-specific front-end for virtual system modeling

TDDE45 - Lecture 5: Domain-Specifjc Languages Martin Sjlund Department of Computer and

Karnatik Music- Svara, Gamaka, Phraseology, Raga Identity T. M. Krishna 12th July 2012

EDA222/DIT161 Real-Time Systems, Chalmers/GU, 2010/2011

Shape Dynamics of Point Vortices Tomoki Ohsawa April Fools Day, 2019 Tomoki Ohsawa

1. CDH and DDH One of the most important goals in Cryptography is to identify the exact complexity

Documents Transliterated Queries Transliterated Documents Native script Queries 5 teams, 25

Opera Productions Old and New 5. Grandest of the Grand Grand Opera Essentially a French

Marc Lachize-Rey Grenoble 2008 Outline I Historical elements II Elements of

From Prevalence to Vulnerability Implications of Climate Change on Health Policy in India Nitish