Code Generation and Optimization ALSU Textbook Chapters 8.4, 8.5, - PowerPoint PPT Presentation

Code Generation and Optimization ALSU Textbook Chapters 8.4, 8.5, 8.7, 8.8, 9.1 Tsan-sheng Hsu tshsu@iis.sinica.edu.tw http://www.iis.sinica.edu.tw/~tshsu 1

Introduction For some compiler, the intermediate code is a pseudo code of a virtual machine. • Interpreter of the virtual machine is invoked to execute the intermediate code. • No machine-dependent code generation is needed. • Usually with great overhead. • Example: ⊲ Pascal: P-code for the virtual P machine. ⊲ JAVA: Byte code for the virtual JAVA machine. Motivation: • Statement by statement translation might generate redundant codes. • Locally improve the target code performance by examine a short sequence of target instructions (called a peephole ) and do optimization on this sequence. • Note: Complexity depends on the “window size.” Optimization. • Machine-dependent issues. • Machine-independent issues. Compiler notes #8, 20070622, Tsan-sheng Hsu 2

Machine-dependent issues (1/2) Input and output formats: • The formats of the intermediate code and the target program. Memory management: • Alignment, indirect addressing, paging, segment, . . . • Those you learned from your assembly language class. Instruction cost: • Special machine instructions to speed up execution. • Example: ⊲ Increment by 1. ⊲ Multiplying or dividing by 2. ⊲ Bit-wise manipulation. ⊲ Operators applied on a continuous block of memory space. • Pick a fastest instruction combination for a certain target machine. Compiler notes #8, 20070622, Tsan-sheng Hsu 3

Machine-dependent issues (2/2) Register allocation: in-between machine dependent and independent issues. • C language allows the user to management a pool of registers. • Some language leaves the task to compiler. • Idea: save mostly used intermediate result in a register. However, finding an optimal solution for using a limited set of registers is NP-hard. • Example: t := a + b load R0,a load R0,a load R1,b add R0,b add R0,R1 store R0,T store R0,T • Heuristic solutions: similar to the ones used for the swapping problem. Compiler notes #8, 20070622, Tsan-sheng Hsu 4

Machine-independent issues Techniques. • Analysis of dependence graphs. • Analysis of basic blocks and flow graphs. • Semantics-preserving transformations. • Algebraic transformations. Compiler notes #8, 20070622, Tsan-sheng Hsu 5

✂ � ✆ ☎ ✄ ✁ Dependence graphs Issues: • In an expression, assume its dependence graph is given. • We can evaluate this expression using any topological ordering. • There are many legal topological orderings. • Pick one to increase its efficiency. Example: order#1 reg# order#2 reg# E 0 E2 1 E6 1 E3 2 E5 2 E 1 E 2 E5 3 E4 1 E6 4 E3 2 E 3 E 4 E4 3 E1 1 E1 2 E2 2 E 5 E 6 E0 1 E0 1 On a machine with only 2 free registers, some of the intermediate results in order#1 must be stored in the temporary space. • STORE/LOAD takes time. Compiler notes #8, 20070622, Tsan-sheng Hsu 6

� ✁ Basic blocks and flow graphs Basic block : a sequence of code such that • jump statements, if any, are at the end of the sequence; • codes in other basic block can only jump to the beginning of this sequence, but not in the middle. • Example: ⊲ t 1 := a ∗ a ⊲ t 2 := a ∗ b ⊲ t 3 := 2 ∗ t 2 ⊲ goto outter • Single entry, single exit. Using a flow chart-like B 1 graph to represent a pro- Flow graph : gram where nodes are B 2 basic blocks and edges are flow of control. B 3 Compiler notes #8, 20070622, Tsan-sheng Hsu 7

How to find basic blocks How to find leaders , which are the first statements of basic blocks? • The first statement of a program is a leader. • For each conditional and unconditional goto, ⊲ its target is a leader; ⊲ its next statement is also a leader. Using leaders to partition the program into basic blocks. Ideas for optimization: • Two basic blocks are equivalent if they compute the same expression. • Use transformation techniques below to perform machine-independent optimization. Compiler notes #8, 20070622, Tsan-sheng Hsu 8

Finding basic blocks — examples Example: Three-address code for computing the dot product of two vectors a and b . ⊲ prod := 0 ⊲ i := 1 ⊲ loop: t 1 := 4 ∗ i ⊲ t 2 := a [ t 1 ] ⊲ t 3 := 4 ∗ i ⊲ t 4 := b [ t 3 ] ⊲ t 5 := t 2 ∗ t 4 ⊲ t 6 := prod + t 5 ⊲ prod := t 6 ⊲ t 7 := i + 1 ⊲ i := t 7 ⊲ if i ≤ 20 goto loop ⊲ · · · There are three blocks in the above example. Compiler notes #8, 20070622, Tsan-sheng Hsu 9

DAG representation of a basic block Inside a basic block: • Expressions can be expressed using a DAG that is similar to the idea of a dependence graph. • Graph might not be connected. Example: (1) t 1 := 4 ∗ i + (2) t 2 := a [ t 1 ] t5 (3) t 3 := 4 ∗ i t6 * (1) (4) t 4 := b [ t 3 ] t4 t2 i’ (5) t 5 := t 2 ∗ t 4 <= t7 [] [] (6) t 6 := prod + t 5 t3 prod’ (7) prod := t 6 + t1 * (8) t 7 := i + 1 20 prod a b i 4 1 (9) i := t 7 (10) if i ≤ 20 goto (1) Compiler notes #8, 20070622, Tsan-sheng Hsu 10

Semantics-preserving transformations (1/3) Techniques: using the information contained in the flow graph and DAG representation of basic blocks to do optimization. a := b+c a := b+c b := a−d b := a−d c := b+c c := b+c d := a−d d := b • Common sub-expression elimination. • Dead-code elimination: remove unreachable codes. • Remove redundant codes such as loads and stores. ⊲ MOV R 0 , a ⊲ MOV a, R 0 • Code motion. ⊲ Find loop-invariants inside a loop. ⊲ Obtain the values of loop-invariants outside the loop. ⊲ Example: t = limit - 2 while(i <= limit - 2) while (i <= t) ... ... • Renaming temporary variables: better usage of registers and avoiding using unneeded temporary variables. Compiler notes #8, 20070622, Tsan-sheng Hsu 11

Semantics-preserving transformations (2/3) More techniques: • Copy propagation: ⊲ De-reference a chain of variable copies. ⊲ Example: a = x; a = x; y = a; y = x; b = y; b = x; • Flow of control simplification: ⊲ De-reference a chain of goto’s. ⊲ Example: goto L1 goto L2 · · · · · · L1: goto L2 L1: goto L2 Compiler notes #8, 20070622, Tsan-sheng Hsu 12

Semantics-preserving transformations (3/3) Interchange of two independent adjacent statements, which might be useful in discovering the above transformations. • Same expressions that are too far away to store E 1 into a register. t1 := E1 t2 := const // swap t2 and tn ⊲ Example: ... tn := E1 • Note: The order of dependence cannot be altered after the exchange. t1 := E1 t2 := t1 + tn // canoot swap t2 and tn ⊲ Example: ... tn := E1 Compiler notes #8, 20070622, Tsan-sheng Hsu 13

Algebraic transformations Algebraic identities: • x + 0 ≡ 0 + x ≡ x • x − 0 ≡ x • x ∗ 1 ≡ 1 ∗ x ≡ x • x/ 1 ≡ x Reduction in strength: • x 2 ≡ x ∗ x • 2 . 0 ∗ x ≡ x + x • x/ 2 ≡ x ∗ 0 . 5 Constant folding: • 2 ∗ 3 . 14 ≡ 6 . 28 Standard representation for subexpression by commutativity and associativity: • n ∗ m ≡ m ∗ n . • b < a ≡ a > b . Compiler notes #8, 20070622, Tsan-sheng Hsu 14

✁ ✄ ✂ � ✆ ✄ ☎ ☎ ✆ ✂ ✁ � Correctness after optimization When side effects are expected, different evaluation orders may produce different results for expressions. LL E 0 E 0 LR E 1 E 1 E 2 E 2 E 3 E 4 E 3 E 4 E 5 E 6 E 5 E 6 • Assume E 5 is a procedure call with the side effect of changing some values in E 6 . • LL and LR parsing produce different results. Watch out precisions when doing algebraic transformations. • if ( x = 321 . 00000123456789 − 321 . 00000123456788) > 0 then · · · Need to make sure code before and after optimization produce the same result. Complications arise when debugger is involved. Compiler notes #8, 20070622, Tsan-sheng Hsu 15

Code Generation and Optimization ALSU Textbook Chapters 8.4, 8.5, - PowerPoint PPT Presentation

Code Generation and Optimization ALSU Textbook Chapters 8.4, 8.5, 8.7, 8.8, 9.1 Tsan-sheng Hsu tshsu@iis.sinica.edu.tw http://www.iis.sinica.edu.tw/~tshsu 1 Introduction For some compiler, the intermediate code is a pseudo code of a virtual

Code Generation Machine code generation cs4713 1 Machine code generation machine Intermediate

Code Generation Chapter 9 1 Compiler Construction Code Generation Issues in Code Generation

Instruction Selection and Scheduling Machine code generation cs5363 1 Machine code generation

INF5110 Compiler Construction Spring 2016 1 / 98 Outline 1. Intermediate code generation

INF5110 Compiler Construction Spring 2017 1 / 97 Outline 1. Intermediate code generation

15-780: Optimization J. Zico Kolter March 14-16, 2015 1 Outline Introduction to optimization

INF5110 Compiler Construction Code generation Spring 2016 1 / 123 Outline 1. Code

{Sequential Code} {Sequential Code} {Sequential Code} {Sequential Code} {Sequential Code}

Compilers Introduction to Code Generation Alex Aiken Code Generation We focus on generating

Code optimization in GCC S ebastian Pop Universit e Louis Pasteur Strasbourg FRANCE Code

Runtime Environments Where We Are Source Lexical Analysis Code Syntax Analysis Semantic

80% of Code Red 2 Code Red 2 re-re- Code Red 1 and Code Red 2 Code Red 2 re- cleaned up

Compiler Design and Construction Code Generation Pop Quiz/Review What options do we have for

Selection Sort Section 10.2 Code for Selection Sort (cont.) Code for an Array Sort Code for an

in practice source code source code javac scalac groovyc jrubyc 0xCAFEBABE byte code

Array Code Generation 1. Array code generation 2. Surprises in memory access 3. Lessons learned

Welcome to your VISTA Leader Orientation, also known as VLO! In this section youll start to

Measure data for AmeriCorps VISTA projects, and how you might support that process as a VISTA

FEDEX CORPORATION GLOBALLY 69.7 Billion FY19 AnnualRevenue > 15M Shipments each business day

Broadening the Housing Movement Webinar August 16, 2018 @OppStartsatHome National

Computer and Information Security Fall 2019 Denial of Service Attacks Tyler Bletsch Duke

Self-Directed Care Implementation Manual: A Comprehensive Mental Health Program Guide Judith A.

ACCELERATING IMPACT WITH GUARANTEES Another tool in the impact investing toolkit 2121 Crystal

Web Development: Youre (Still) Doing it Wrong Stefan Tilkov | innoQ | @stilkov Thursday 17

Code Generation and Optimization ALSU Textbook Chapters 8.4, 8.5, - PowerPoint PPT Presentation

Code Generation and Optimization ALSU Textbook Chapters 8.4, 8.5, 8.7, 8.8, 9.1 Tsan-sheng Hsu tshsu@iis.sinica.edu.tw http://www.iis.sinica.edu.tw/~tshsu 1 Introduction For some compiler, the intermediate code is a pseudo code of a virtual

Code Generation Machine code generation cs4713 1 Machine code generation machine Intermediate

Code Generation Chapter 9 1 Compiler Construction Code Generation Issues in Code Generation

Instruction Selection and Scheduling Machine code generation cs5363 1 Machine code generation

INF5110 Compiler Construction Spring 2016 1 / 98 Outline 1. Intermediate code generation

INF5110 Compiler Construction Spring 2017 1 / 97 Outline 1. Intermediate code generation

15-780: Optimization J. Zico Kolter March 14-16, 2015 1 Outline Introduction to optimization

INF5110 Compiler Construction Code generation Spring 2016 1 / 123 Outline 1. Code

{Sequential Code} {Sequential Code} {Sequential Code} {Sequential Code} {Sequential Code}

Compilers Introduction to Code Generation Alex Aiken Code Generation We focus on generating

Code optimization in GCC S ebastian Pop Universit e Louis Pasteur Strasbourg FRANCE Code

Runtime Environments Where We Are Source Lexical Analysis Code Syntax Analysis Semantic

80% of Code Red 2 Code Red 2 re-re- Code Red 1 and Code Red 2 Code Red 2 re- cleaned up

Compiler Design and Construction Code Generation Pop Quiz/Review What options do we have for

Selection Sort Section 10.2 Code for Selection Sort (cont.) Code for an Array Sort Code for an

in practice source code source code javac scalac groovyc jrubyc 0xCAFEBABE byte code

Array Code Generation 1. Array code generation 2. Surprises in memory access 3. Lessons learned

Welcome to your VISTA Leader Orientation, also known as VLO! In this section youll start to

Measure data for AmeriCorps VISTA projects, and how you might support that process as a VISTA

FEDEX CORPORATION GLOBALLY 69.7 Billion FY19 AnnualRevenue &gt; 15M Shipments each business day

Broadening the Housing Movement Webinar August 16, 2018 @OppStartsatHome National

Computer and Information Security Fall 2019 Denial of Service Attacks Tyler Bletsch Duke

Self-Directed Care Implementation Manual: A Comprehensive Mental Health Program Guide Judith A.

ACCELERATING IMPACT WITH GUARANTEES Another tool in the impact investing toolkit 2121 Crystal

Web Development: Youre (Still) Doing it Wrong Stefan Tilkov | innoQ | @stilkov Thursday 17

FEDEX CORPORATION GLOBALLY 69.7 Billion FY19 AnnualRevenue > 15M Shipments each business day