Compiler construction Martin Steffen March 13, 2017 Contents 1 - PDF document

Compiler construction Martin Steffen March 13, 2017 Contents 1 Abstract 1 1.1 Semantic analysis . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1 1.1.1 Intro . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1 1.1.2 Attribute grammars . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 4 1.1.3 Rest . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 22 2 Reference 23 1 Abstract Abstract This is the handout version of the slides. It contains basically the same content, only in a way which allows more compact printing. Sometimes, the overlays, which make sense in a presentation, are not fully rendered here. Besides the material of the slides, the handout versions may also contain additional remarks and background information which may or may not be helpful in getting the bigger picture. 1.1 Semantic analysis 1. 3. 2017 1.1.1 Intro Overview over the chapter 1 • semantic analysis in general • attribute grammars (AGs) • symbol tables (not today) • data types and type checking (not today) Where are we now? 1 The slides are originally from Birger Møller-Pedersen. 1

What do we get from the parser? • output of the parser: (abstract) syntax tree • often: in anticipation: nodes in the tree contain “space” to be filled out by SA • examples: – for expression nodes: types – for identifier/name nodes: reference or pointer to the declaration assign-expr subscript expr additive expr identifier identifier number number a index 2 4 assign-expr : ? subscript-expr additive-expr :int :int identifier :array of int identifier :int number :int number :int a :array of int index :int 4 :int 2 :int General remarks on semantic (or static) analysis 1. Rule of thumb Check everything which is possible before executing (run-time vs. compile-time), but cannot already done during lexing/parsing (syntactical vs. semantical analysis) 2. Rest: • Goal: fill out “semantic” info (typically in the AST) 2

• typically: – are all names declared ? (somewhere/uniquely/before use) – typing : ∗ is the declared type consistent with use ∗ types of (sub)-expression consistent with used operations • border between sematical vs. syntactic checking not always 100% clear – if a then ... : checked for syntax – if a + b then ... : semantical aspects as well? SA is nessessarily approximative • note: not all can (precisely) be checked at compile-time 2 – division by zero? – “array out of bounds” – “null pointer deref” (like r.a , if r is null) • but note also: exact type cannot be determined statically either 1. if x then 1 else "abc" • statically: ill-typed 3 • dynamically (“run-time type”): string or int , or run-time type error, if x turns out not to be a boolean, or if it’s null SA remains tricky 1. A dream 2. However • no standard description language • no standard “theory” (apart from the too general “context sensitive languages”) – part of SA may seem ad-hoc, more “art” than “engineering”, complex • but : well-established/well-founded (and decidedly non-ad-hoc) fields do exist – type systems , type checking – data-flow analysis . . . . • in general – semantic “rules” must be individually specified and implemented per language – rules: defined based on trees (for AST): often straightforward to implement – clean language design includes clean semantic rules 2 For fundamental reasons (cf. also Rice’s theorem). Note that approximative checking is doable, resp. that’s what the SA is doing anyhow. 3 Unless some fancy behind-the-scence type conversions are done by the language (the compiler). Perhaps print(if x then 1 else "abc") is accepted, and the integer 1 is implicitly converted to "1" . 3

1.1.2 Attribute grammars Attributes 1. Attribute • a “property” or characteristic feature of something • here: of language “constructs”. More specific in this chapter: • of syntactic elements, i.e., for non-terminal and terminal nodes in syntax trees 2. Static vs. dynamic • distinction between static and dynamic attributes • association attribute ↔ element: binding • static attributes: possible to determine at/determined at compile time • dynamic attributes: the others . . . Examples in our context • data type of a variable : static/dynamic • value of an expression: dynamic (but seldomly static as well) • location of a variable in memory: typically dynamic (but in old FORTRAN: static) • object-code : static (but also: dynamic loading possible) Attribute grammar in a nutshell • AG: general formalism to bind “attributes to trees” (where trees are given by a CFG) 4 • two potential ways to calculate “properties” of nodes in a tree: 1. “Synthesize” properties define/calculate prop’s bottom-up 2. “Inherit” properties define/calculate prop’s top-down 3. Rest • allows both at the same time 4. Attribute grammar CFG + attributes one grammar symbols + rules specifing for each production, how to determine attributes 5. Rest • evaluation of attributes: requires some thought, more complex if mixing bottom-up + top- down dependencies Example: evaluation of numerical expressions 1. Expression grammar (similar as seen before) exp → exp + term ∣ exp − term ∣ term term → term ∗ factor ∣ factor factor → ( exp ) ∣ number 2. Rest • goal now: evaluate a given expression, i.e., the syntax tree of an expression, resp: 3. more concrete goal Specify, in terms of the grammar, how expressions are evaluated 4. Ignore • grammar: describes the “format” or “shape” of (syntax) trees • syntax-directedness • value of (sub-)expressions: attribute here 5 4 Attributes in AG’s: static , obviously. 5 Stated earlier: values of syntactic entities are generally dynamic attributes and cannot therefore be treated by an AG. In this simplistic AG example, it’s statically doable (because no variables, no state-change etc.). 4

Expression evaluation: how to do if on one’s own? • simple problem, easy solvable without having heard of AGs • given an expression, in the form of a syntax tree • evaluation: – simple bottom-up calculation of values – the value of a compound expression (parent node) determined by the value of its subnodes – realizable, for example by a simple recursive procedure 6 1. Connection to AG’s • AGs: basically a formalism to specify things like that • however : general AGs will allow more complex calculations: – not just bottom up calculations like here but also – top-down , including both at the same time 7 Pseudo code for evaluation eval_exp ( e ) = case : : e equals PLUSnode − > return eval_exp ( e . l e f t ) + eval_term ( e . r i g h t ) : : e equals MINUSnode − > return eval_exp ( e . l e f t ) − eval_term ( e . r i g h t ) . . . end case productions/grammar rules semantic rules → exp 2 + term exp 1 . val = exp 2 . val + term . val 1 exp 1 → exp 2 − term exp 1 . val = exp 2 . val − term . val 2 exp 1 exp → term exp . val = term . val 3 term 1 → term 2 ∗ factor term 1 . val = term 2 . val ∗ factor . val 4 term → factor term . val = factor . val 5 factor → ( exp ) factor . val = exp . val 6 factor → number factor . val = number . val 7 AG for expression evaluation • specific for this example – only one attribute (for all nodes), in general: different ones possible – (related to that): only one semantic rule per production – as mentioned: rules here define values of attributes “bottom-up” only • note: subscripts on the symbols for disambiguation (where needed) Attributed parse tree 6 Resp. a number of mutually recursive procedures, one for factors, one for terms, etc. See the xnext slide 7 Top-down calculations will not be needed for the simple expression evaluation example. 5

First observations concerning the example AG • attributes – defined per grammar symbol (mainly non-terminals), but – they get their values “per node” – notation exp . val – to be precise: val is an attribute of non-terminal exp (among others), val in an expression-node in the tree is an instance of that attribute – instance not the same as the value ! Semantic rules • aka: attribution rule • fix for each symbol X : set of attributes 8 • attribute: intended as “fields” in the nodes of syntax trees • notation: X. a : attribute a of symbol X • but: attribute obtain values not per symbol, but per node in a tree (per instance) 1. Semantic rule for production X 0 → X 1 ...X n X i . a j = f ij ( X 0 . a 1 ,...,X 0 . a k 0 ,X 1 . a 1 ,...X 1 . a k 1 ,...,X n . a 1 ,...,X n . a k n ) (1) 2. Rest • X i on the left-hand side: not necessarily head symbol X 0 of the production • evaluation example: more restricted (to make the example simple) 8 Different symbols may share same attribute with the same name. Those may have different types but the type of an attribute per symbol is uniform. Cf. fields in classes (and objects). 6

Compiler construction Martin Steffen March 13, 2017 Contents 1 - PDF document

Compiler construction Martin Steffen March 13, 2017 Contents 1 Abstract 1 1.1 Semantic analysis . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1 1.1.1 Intro . . . . . . . . . . . . . . . . . . . .

Compiler Construction Chapter 11 1 Compiler Construction Compiler Construction A New Compiler

Compiler Construction Compiler Construction 1 / 111 Mayer Goldberg \ Ben-Gurion University

Compiler Construction November 21, 2018 Compiler Construction November 21, 2018 1 / 102 Mayer

Compiler Construction Compiler Construction 1 / 54 Mayer Goldberg \ Ben-Gurion University Tuesday

Compiler Construction Compiler Construction 1 / 193 Mayer Goldberg \ Ben-Gurion University Friday

Compiler Construction October 20, 2018 Compiler Construction October 20, 2018 1 / 115 Mayer

Compiler Construction Compiler Construction 1 / 177 Mayer Goldberg \ Ben-Gurion University

Compiler Construction Compiler Construction 1 / 87 Mayer Goldberg \ Ben-Gurion University

Compiler Construction Compiler Construction 1 / 88 Mayer Goldberg \ Ben-Gurion University Tuesday

Compiler Construction Compiler Construction 1 / 104 Mayer Goldberg \ Ben-Gurion University Friday

Compiler Construction Compiler Construction 1 / 104 Mayer Goldberg \ Ben-Gurion University Monday

Compiler Construction October 31, 2018 Compiler Construction October 31, 2018 1 / 175 Mayer

Compiler Construction Compiler Construction 1 / 114 Mayer Goldberg \ Ben-Gurion University

Compiler Construction Compiler Construction 1 / 112 Mayer Goldberg \ Ben-Gurion University

Compiler Construction Christian Rinderknecht 31 October 2008 1 Why study compiler construction?

Compiler Construction Lecture 19: Code Generation V (Compiler Backend) Winter Semester 2018/19

Lecture Slides for MAT-60556 PART III: First order logic: Formulas, Models, Tableaux Henri

Decision Procedures for Satisfiability and Validity in Propositional Logic Meghdad Ghari

Model Checking Tutorial 4 1. Let the set of atomic propositions be { a, b, c } . (a) Rewrite the

Lecture 9: More on predicate logic Prof. Julia Hockenmaier juliahmr@illinois.edu

Semantic Analysis with Attribute Grammars Part 1 Y.N. Srikant Department of Computer Science and

SEMANTIC ANALYSIS PRINCIPLES OF PROGRAMMING LANGUAGES Norbert Zeh Winter 2018 Dalhousie

Compiler Construction Lecture 13: Semantic Analysis II (Circularity Check) Thomas Noll Lehrstuhl

Syntax-Directed Translation ASU Textbook Chapter 5 Tsan-sheng Hsu tshsu@iis.sinica.edu.tw