Code Generation Wilhelm/Maurer: Compiler Design, Chapter 12 - PowerPoint PPT Presentation

Code Generation Code Generation – Wilhelm/Maurer: Compiler Design, Chapter 12 – Reinhard Wilhelm Universität des Saarlandes wilhelm@cs.uni-sb.de and Mooly Sagiv Tel Aviv University 11. Januar 2010

Code Generation “Standard” Structure source(text) ❄ lexical analysis(7) finite automata ❄ tokenized-program ❄ syntax analysis(8) pushdown automata ❄ syntax-tree ❄ semantic-analysis(9) attribute grammar evaluators ❄ decorated syntax-tree ❄ optimizations(10) abstract interpretation + transformations ❄ intermediate rep. ❄ code-generation(11, 12) tree automata + dynamic programming + · · · ❄ machine-program

Code Generation Code Generation Real machines (instead of abstract machines): ◮ Register machines, ◮ Limited resources (registers, memory), ◮ Fixed word size, ◮ Memory hierarchy, ◮ Intraprocessor parallelism.

Code Generation Architectural Classes: CISC vs. RISC CISC IBM 360, PDP11, VAX series, INTEL 80x86, Pentium, Motorola 680x0 ◮ A large number of addressing modes ◮ Computations on stores ◮ Few registers ◮ Different instruction lengths ◮ Different execution times for instructions ◮ Microprogrammed instruction sets RISC Alpha, MIPS, PowerPC, SPARC ◮ One instruction per cycle (with pipeline for load/stores) ◮ Load/Store architecture – Computations in registers (only) ◮ Many registers ◮ Few addressing modes ◮ Uniform lengths ◮ Hard-coded instruction sets ◮ Intra-processor parallelism: Pipeline, multiple units, Very Long Instruction Words (VLIW), Superscalarity, Speculation

Code Generation Phases in code generation Code Selection: selecting semantically equivalent sequences of machine instructions for programs, Register Allocation: exploiting the registers for storing values of variables and temporaries, Code Scheduling: reordering instruction sequences to exploit intraprocessor parallelism. Optimal register allocation and instruction scheduling are NP-hard.

Code Generation Phase Ordering Problem Partly contradictory optimization goals: Register allocation: minimize number of registers used = ⇒ reuse registers, Code Scheduling: exploit parallelism = ⇒ keep computations independent, no shared registers Issues: ◮ Software Complexity ◮ Result Quality ◮ Order in Serialization

Code Generation Challenges in real machines: CISC vs. RISC CISC IBM 360, PDP11, VAX series, INTEL 80x86, Motorola 680x0 ◮ A large number of addressing modes ◮ Computations on stores ◮ Few registers ◮ Different instruction lengths ◮ Different execution times for instructions ◮ Microprogrammed instruction sets RISC Alpha, MIPS, PowerPC, SPARC ◮ One instruction per cycle (with pipeline for load/stores) ◮ Load/Store architecture – Computations in registers (only) ◮ Many registers ◮ Few addressing modes ◮ Uniform lengths ◮ Hard-coded instruction sets ◮ Intra-processor parallelism: Pipeline, multiple units, Very Long Instruction Words (VLIW), Superscalarity, Speculation

Code Generation Example: x = y + z CISC/Vax addl3 4 ( fp ) , 6 ( fp ) , 8 ( fp ) RISC load r 1 , 4 ( fp ) load r 2 , 6 ( fp ) add r 1 , r 2 , r 3 store r 3 , 8 ( fp )

Code Generation The VLIW Architecture ◮ Several functional units, ◮ One instruction stream, ◮ Jump priority rule, ◮ FUs connected to register banks, ◮ Enough parallelism available? Main Memory ✻ ✻ ✻ ❄ ❄ ❄ Register set ✻ ✻ ✻ Control ❄ ❄ ❄ ❄ ❄ ❄ unit FU FU . . . FU ❄ ✻ ✻ ✻ Instruction ✲ store

Code Generation Instruction Pipeline Several instructions in different states of execution Potential structure: 1. instruction fetch and decode, 2. operand fetch, 3. instruction execution, 4. write back of the result into target register. cycle 1 2 3 4 5 6 7 Pipe- 1 B 1 B 2 B 3 B 4 line- 2 B 1 B 2 B 3 B 4 stage B 1 B 2 B 3 B 4 3 B 1 B 2 B 3 B 4 4

Code Generation Pipeline hazards ◮ Cache hazards: Instruction or operand not in cache, ◮ Data hazards: Needed operand not available, ◮ Structural hazards: Resource conflicts, ◮ Control hazards: (Conditional) jumps.

Code Generation Program Representations ◮ Abstract syntax tree: algebraic transformations, code generation for expression trees, ◮ Control Flow Graph: Program analysis (intraproc.) ◮ Call Graph: Program analysis (interproc.) ◮ Static Single Assignment: optimization, code generation ◮ Program Dependence Graph: instruction scheduling, parallelization ◮ Register Interference graph: register allocation

Code Generation Code Generation: Integrated Methods ◮ Integration of register allocation with instruction selection, ◮ Machine with interchangeable machine registers, ◮ Input: Expression trees ◮ Simple target machines. ◮ Two approaches: 1. Ershov[58], Sethi&Ullman[70]: unique decomposition of expression trees, 2. Aho&Johnson[76]: dynamic programming for more complex machine models.

Code Generation Contiguous evaluation (Sub-)expression can be evaluated into register: this register is needed to hold the result, memory cell: no register is needed to hold the result. Contiguous evaluation of an expression, thus, needs 0 or 1 registers while other (sub-)expressions are evaluated. Evaluate-into-memory-first strategy: evaluate subtrees into memory first. Contiguous evaluation + evaluate-into-memory-first define a normal form for code sequences. Theorem (Aho&Johnson[76]): Any optimal program using no more than r registers can be transformed into an optimal one in normal form using no more than r registers.

Code Generation Simple machine model, Ershov[58], Sethi&Ullman[70] ◮ r general purpose nterchangeable registers R 0 , . . . , R r − 1 , ◮ Two-address instructions R i := M [ V ] Load M [ V ] := R i Store R i := R i op M [ V ] Compute R i := R i op R j Two phases: 1. Computing register requirements, 2. Generating code, allocating registers and temporaries.

Code Generation Example Tree Source r := ( a + b ) − ( c − ( d + e )) Tree := r − + − a b c + d e

Code Generation Generated Code 2 Registers R 0 and R 1 Two possible code sequences: R 0 := M [ a ] R 0 := M [ c ] R 0 := R 0 + M [ b ] R 1 := M [ d ] R 1 := M [ d ] R 1 := R 1 + M [ e ] R 1 := R 1 + M [ e ] R 0 := R 0 − R 1 M [ t 1 ] := R 1 R 1 := M [ a ] R 1 := M [ c ] R 1 := R 1 + M [ b ] R 1 := R 1 − M [ t 1 ] R 1 := R 1 − R 0 R 0 := R 0 − R 1 M [ f ] := R 1 M [ f ] := R 0 stores result for c − ( d + 2 ) evaluates c − ( d + 2 ) first in a temporary (needs 2 registers) no register available saves one instruction

Code Generation The Algorithm op t 1 t 2 Principle: Given tree t for expression e 1 op e 2 t 1 needs r 1 registers, t 2 needs r 2 registers, r ≥ r 1 > r 2 : After evaluation of t 1 : r 1 − 1 registers freed, one holds the result, t 2 gets enough registers to evaluate, hence t can be evaluated in r 1 registers, r 1 = r 2 : t needs r 1 + 1 registers to evaluate, r 1 > r or r 2 > r : spill to temporary required.

Code Generation Labeling Phase ◮ Labels each node with its register needs, ◮ Bottom-up pass, ◮ Left leaves labeled with ’1’ have to be loaded into register, ◮ Right leaves labeled with ’0’ are used as operands, ◮ Inner nodes: � max ( r 1 , r 2 ) , if r 1 � = r 2 regneed ( op ( t 1 , t 2 )) = r 1 + 1 , if r 1 = r 2 where r 1 = regneed ( t 1 ) , r 2 = regneed ( t 2 )

Code Generation Example := 2 f − 2 + − 1 2 a b c + 1 1 0 1 d e 1 0

Code Generation Generation Phase Principle: ◮ Generates instruction Op for operator op in op ( t 1 , t 2 ) after generating code for t 1 and t 2 . ◮ Order of t 1 and t 2 depends on their register needs, ◮ The generated Op –instruction finds value of t 1 in register, ◮ RSTACK holds available registers, initially all registers, Before processing t : top(RSTACK) is determined as result register for t , After processing t : all registers available, but top(RSTACK) is result register for t . ◮ TSTACK holds available temporaries.

Code Generation Algorithm Gen_Opt_Code Algorithm RSTACK -Contents result register var RSTACK : stack of register ; var TSTACK : stack of address ; proc Gen_Code ( t : tree ) ; ( R ′ , R ′′ , . . . ) var R : register , T : address ; case t of ( leaf a , 1 ) : ( ∗ left leaf ∗ ) emit ( top ( RSTACK ) := a ) ; result in R ′ op (( t 1 , r 1 ) , ( leaf a , 0 )) : ( ∗ right leaf ∗ ) Gen_Code ( t 1 ) ; result in R ′ emit ( top ( RSTACK ) := top ( RSTACK ) Op a ) ;

Code Generation op (( t 1 , r 1 ) , ( t 2 , r 2 )) : ( R ′ , R ′′ , . . . ) cases r 1 < min ( r 2 , r ) : ( R ′ , R ′′ , . . . ) begin exchange ( RSTACK ) ; ( R ′′ , R ′ , . . . ) Gen_Code ( t 2 ) ; result in R ′′ ( R ′ , . . . ) R := pop ( RSTACK ) ; Gen_Code ( t 1 ) ; result in R ′ emit ( top ( RSTACK ) := top ( RSTACK ) Op R ) ; result in R ′ push ( RSTACK , R ) ; ( R ′′ , R ′ , . . . ) exchange ( RSTACK ) ; ( R ′ , R ′′ , . . . ) end ;

Code Generation r 1 ≥ r 2 ∧ r 2 < r : ( R ′ , R ′′ , . . . ) begin Gen_Code ( t 1 ) ; result in R ′ R := pop ( RSTACK ) ; ( R ′′ , . . . ) Gen_Code ( t 2 ) ; result in R ′′ emit ( R := R Op top ( RSTACK )) ; result in R ′ push ( RSTACK , R ) ; ( R ′ , R ′′ , . . . ) end ;

Code Generation Wilhelm/Maurer: Compiler Design, Chapter 12 - PowerPoint PPT Presentation

Code Generation Code Generation Wilhelm/Maurer: Compiler Design, Chapter 12 Reinhard Wilhelm Universitt des Saarlandes wilhelm@cs.uni-sb.de and Mooly Sagiv Tel Aviv University 11. Januar 2010 Code Generation Standard

Code Generation Machine code generation cs4713 1 Machine code generation machine Intermediate

Code Generation Chapter 9 1 Compiler Construction Code Generation Issues in Code Generation

Instruction Selection and Scheduling Machine code generation cs5363 1 Machine code generation

INF5110 Compiler Construction Spring 2016 1 / 98 Outline 1. Intermediate code generation

INF5110 Compiler Construction Spring 2017 1 / 97 Outline 1. Intermediate code generation

{Sequential Code} {Sequential Code} {Sequential Code} {Sequential Code} {Sequential Code}

Compilers Introduction to Code Generation Alex Aiken Code Generation We focus on generating

INF5110 Compiler Construction Code generation Spring 2016 1 / 123 Outline 1. Code

80% of Code Red 2 Code Red 2 re-re- Code Red 1 and Code Red 2 Code Red 2 re- cleaned up

Selection Sort Section 10.2 Code for Selection Sort (cont.) Code for an Array Sort Code for an

in practice source code source code javac scalac groovyc jrubyc 0xCAFEBABE byte code

Array Code Generation 1. Array code generation 2. Surprises in memory access 3. Lessons learned

Compiler Design and Construction Code Generation Pop Quiz/Review What options do we have for

CMSC 430 Introduction to Compilers Spring 2016 Code Generation Introduction Code generation

6. Code Generation 6.1 Overview 6.2 The MicroJava VM 6.3 Code Buffer 6.4 Operands 6.5

Code Generation OSU CSE 2 April 2015 BL Compiler Structure Code Tokenizer Parser Generator

MANINDER KAUR professormaninder@gmail.com Maninder Kaur www.eazynotes.com 1

Design of Control Path Debdeep Mukhopadhyay IIT Madras Hardwired Hardware GCD Processor An

Fundamental brain theory: the MAIN challenge to theoretical physics and mathematics of the brain

Overview & Organisation of the PICSEL Group Activities M.Winter, on behalf of J.Baudot &

Towards a Reconfigurable Bit-Serial/Bit-Parallel Vector Accelerator Using In-Situ

263-2810: Advanced Compiler Design 8.2 Scheduling for ILP processors Thomas R. Gross Computer

1 How gates work NOT ( ) NAND ( ) o Transistors can be made to

Concepts of programming languages Lecture 6 Wouter Swierstra Faculty of Science Information and

Code Generation Wilhelm/Maurer: Compiler Design, Chapter 12 - PowerPoint PPT Presentation

Code Generation Code Generation Wilhelm/Maurer: Compiler Design, Chapter 12 Reinhard Wilhelm Universitt des Saarlandes wilhelm@cs.uni-sb.de and Mooly Sagiv Tel Aviv University 11. Januar 2010 Code Generation Standard

Code Generation Machine code generation cs4713 1 Machine code generation machine Intermediate

Code Generation Chapter 9 1 Compiler Construction Code Generation Issues in Code Generation

Instruction Selection and Scheduling Machine code generation cs5363 1 Machine code generation

INF5110 Compiler Construction Spring 2016 1 / 98 Outline 1. Intermediate code generation

INF5110 Compiler Construction Spring 2017 1 / 97 Outline 1. Intermediate code generation

{Sequential Code} {Sequential Code} {Sequential Code} {Sequential Code} {Sequential Code}

Compilers Introduction to Code Generation Alex Aiken Code Generation We focus on generating

INF5110 Compiler Construction Code generation Spring 2016 1 / 123 Outline 1. Code

80% of Code Red 2 Code Red 2 re-re- Code Red 1 and Code Red 2 Code Red 2 re- cleaned up

Selection Sort Section 10.2 Code for Selection Sort (cont.) Code for an Array Sort Code for an

in practice source code source code javac scalac groovyc jrubyc 0xCAFEBABE byte code

Array Code Generation 1. Array code generation 2. Surprises in memory access 3. Lessons learned

Compiler Design and Construction Code Generation Pop Quiz/Review What options do we have for

CMSC 430 Introduction to Compilers Spring 2016 Code Generation Introduction Code generation

6. Code Generation 6.1 Overview 6.2 The MicroJava VM 6.3 Code Buffer 6.4 Operands 6.5

Code Generation OSU CSE 2 April 2015 BL Compiler Structure Code Tokenizer Parser Generator

MANINDER KAUR professormaninder@gmail.com Maninder Kaur www.eazynotes.com 1

Design of Control Path Debdeep Mukhopadhyay IIT Madras Hardwired Hardware GCD Processor An

Fundamental brain theory: the MAIN challenge to theoretical physics and mathematics of the brain

Overview &amp; Organisation of the PICSEL Group Activities M.Winter, on behalf of J.Baudot &amp;

Towards a Reconfigurable Bit-Serial/Bit-Parallel Vector Accelerator Using In-Situ

263-2810: Advanced Compiler Design 8.2 Scheduling for ILP processors Thomas R. Gross Computer

1 How gates work NOT ( ) NAND ( ) o Transistors can be made to

Concepts of programming languages Lecture 6 Wouter Swierstra Faculty of Science Information and

Overview & Organisation of the PICSEL Group Activities M.Winter, on behalf of J.Baudot &