Alive: Provably Correct InstCombine Optimizations David Menendez - PowerPoint PPT Presentation

Alive: Provably Correct InstCombine Optimizations David Menendez John Regehr Santosh Nagarakatte University of Utah Rutgers University Nuno Lopes Microsoft Research

Can We Trust Compilers? • Any large software project will have bugs • LLVM is no exception • CSmith project found 203 bugs by random testing • InstCombine is especially buggy 2

Why Is InstCombine Buggy? • It’s huge: • over 20,000 lines of code • visitICmpInst alone is 924 lines • Complicated to write • LLVM Semantics are subtle • Hard to tell when the code is correct 16

For example…

{ Value *Op1C = Op1; BinaryOperator *BO = dyn_cast<BinaryOperator>(Op0); if (!BO || (BO->getOpcode() != Instruction::UDiv && BO->getOpcode() != Instruction::SDiv)) { Op1C = Op0; BO = dyn_cast<BinaryOperator>(Op1); } Value *Neg = dyn_castNegVal(Op1C); if (BO && BO->hasOneUse() && (BO->getOperand(1) == Op1C || BO->getOperand(1) == Neg) && (BO->getOpcode() == Instruction::UDiv || BO->getOpcode() == Instruction::SDiv)) { Value *Op0BO = BO->getOperand(0), *Op1BO = BO->getOperand(1); // If the division is exact, X % Y is zero, so we end up with X or -X. if (PossiblyExactOperator *SDiv = dyn_cast<PossiblyExactOperator>(BO)) if (SDiv->isExact()) { if (Op1BO == Op1C) return ReplaceInstUsesWith(I, Op0BO); return BinaryOperator::CreateNeg(Op0BO); } Value *Rem; if (BO->getOpcode() == Instruction::UDiv) Rem = Builder->CreateURem(Op0BO, Op1BO); else Rem = Builder->CreateSRem(Op0BO, Op1BO); Rem->takeName(BO); if (Op1BO == Op1C) return BinaryOperator::CreateSub(Op0BO, Rem); return BinaryOperator::CreateSub(Rem, Op0BO); } } 18

// (X / Y) * Y = X - (X % Y) // (X / Y) * -Y = (X % Y) - X { Value *Op1C = Op1; BinaryOperator *BO = dyn_cast<BinaryOperator>(Op0); if (!BO || Improve transition to next slide (BO->getOpcode() != Instruction::UDiv && BO->getOpcode() != Instruction::SDiv)) { Op1C = Op0; BO = dyn_cast<BinaryOperator>(Op1); } Value *Neg = dyn_castNegVal(Op1C); if (BO && BO->hasOneUse() && (BO->getOperand(1) == Op1C || BO->getOperand(1) == Neg) && (BO->getOpcode() == Instruction::UDiv || BO->getOpcode() == Instruction::SDiv)) { Value *Op0BO = BO->getOperand(0), *Op1BO = BO->getOperand(1); // If the division is exact, X % Y is zero, so we end up with X or -X. if (PossiblyExactOperator *SDiv = dyn_cast<PossiblyExactOperator>(BO)) if (SDiv->isExact()) { if (Op1BO == Op1C) return ReplaceInstUsesWith(I, Op0BO); return BinaryOperator::CreateNeg(Op0BO); } Value *Rem; if (BO->getOpcode() == Instruction::UDiv) Rem = Builder->CreateURem(Op0BO, Op1BO); else Rem = Builder->CreateSRem(Op0BO, Op1BO); Rem->takeName(BO); if (Op1BO == Op1C) return BinaryOperator::CreateSub(Op0BO, Rem); return BinaryOperator::CreateSub(Rem, Op0BO); } } 19

Flags can be confusing…

Is This Valid? %L = mul nsw i8 %A, %B %R = mul nsw i8 %B, %C %I = mul nsw i8 %L, %C %I = mul nsw i8 %A, %R Seemingly just (A × B) × C = A × (B × C) 21

Is This Valid? %A = -1, %B = 4, %C = 32 %L = mul nsw i8 %A, %B %R = mul nsw i8 %B, %C %I = mul nsw i8 %L, %C %I = mul nsw i8 %A, %R %L = -4 %I = -128 22

Is This Valid? %A = -1, %B = 4, %C = 32 %L = mul nsw i8 %A, %B %R = mul nsw i8 %B, %C %I = mul nsw i8 %L, %C %I = mul nsw i8 %A, %R %L = -4 %R = poison %I = -128 %I = poison 23

Is This Valid? %A = -1, %B = 4, %C = 32 %L = mul nsw i8 %A, %B %R = mul i8 %B, %C %I = mul nsw i8 %L, %C %I = mul nsw i8 %A, %R %L = -4 %R = -128 %I = -128 %I = poison 24

Is This Valid? %A = -1, %B = 4, %C = 32 %L = mul nsw i8 %A, %B %R = mul i8 %B, %C %I = mul nsw i8 %L, %C %I = mul i8 %A, %R %L = -4 %R = -128 %I = -128 %I = -128 25

…but flags are also essential

Flags Aid Optimization %C = mul i8 %A, %B %R = sdiv i8 %C, %B R = (A × B)÷B Is R = A? 27

Flags Aid Optimization %A = 100, %B = 100 %C = mul i8 %A, %B %R = sdiv i8 %C, %B %C = 10000 %R = 100 28

Flags Aid Optimization %A = 100, %B = 100 %C = mul i8 %A, %B %R = sdiv i8 %C, %B %C = 10000 %R = 100 Too big for i8! 29

Flags Aid Optimization %C = mul i8 %A, %B %R = sdiv i8 %C, %B We could do this if we knew that A × B fits in 8 bits 30

Flags Aid Optimization More context for why flags are helpful and where they are generated (C) %C = mul nsw i8 %A, %B %R = sdiv i8 %C, %B We could do this if we knew that A × B fits in 8 bits …which is just what NSW/NUW are for 31

Outline • Motivation • Introducing Alive • Language Overview • Automated Verification • Code Generation • Conclusion 32

Alive: Fast, Safe, Easy Slower here Name: sdiv exact %a = sdiv exact %x, %y %r = mul %a, %y • Write optimizations in LLVM-like => DSL %r = %x • Automatically verify correctness Name: sdiv inexact %a = sdiv %x, %y • Automatically generate C++ %r = mul %a, %y code => %b = srem %x, %y %r = sub %x, %b Partial translation 33

Prove Optimizations Correct Alive SMT Alive DSL Queries Analysis Alive handles all the tricky corners of LLVM’s IR 34

Automatic Code Generation Alive C++ Alive LLVM DSL Alive writes your InstCombine code for you 35

Why Alive? • Use of formal methods is not new • CompCert—Formally verified C compiler • Vellvm—Formal semantics for LLVM in Coq • Nuno Lopes described verifying InstCombine with SMT solvers last year 36

Lightweight Formal Methods Emphasize that Alive is easier than manual proofs • Automation • Use SMT to avoid manual proofs • Use Alive to avoid writing SMT queries • High-level specification language 37

More set up. Explain what this does. Additional slide before this? Alive Language Pre: C2 % (1<<C1) == 0 %s = shl nsw %A, C1 • 1<<C 1 divides C 2 %r = sdiv %s, C2 => • r s = (A<<C 1 )÷C 2 %r = sdiv %A, C2/(1<<C1) • r t = A÷(C 2 ÷(1<<C 1 )) 38

Alive Language Pre: C2 % (1<<C1) == 0 Precondition %s = shl nsw %A, C1 %r = sdiv %s, C2 Source => Target %r = sdiv %A, C2/(1<<C1) 39

Alive Language Pre: C2 % (1<<C1) == 0 %s = shl nsw %A, C1 %r = sdiv %s, C2 Constants => %r = sdiv %A, C2/(1<<C1) • Represent arbitrary immediate values • Expressions permitted in target and precondition 40

Predicates and Functions • Predicates can be used in precondition • May invoke heuristics in LLVM, e.g., WillNotOverflowSignedAdd • Functions extend constant language • Most apply only to constant values, e.g., umax • width(%x) returns the bit width of any value 41

Predicates and Functions Pre: C < 0 && isPowerOf2(abs(C)) %Op0 = add %Y, C1 %r = mul %Op0, C => %sub = sub -C1, %Y %r = mul %sub, abs(C) 42

Checking Correctness • SMT (“Satisfiability Modulo Theories”) • Generalizes SAT solving • Additional theories for integers, bit vectors, etc. • Undecidable in general • Efficient in practice • Z3, Boolector, CVC4, etc. 43

Type Checking • Translate type constraints to SMT • Binary operation arguments and answer have same type • Trunc result has fewer bits than argument, etc. • Find and test all solutions to constraints 44

Checking Correctness • Need to show that target refines source • Target’s behavior undefined only when source’s is • Target returns poison only when source does • For all other inputs, target and source yield same result 45

Checking Correctness • SMT finds satisfying instances • Phrase queries as negations: • “Find an input where the source is defined but the target is undefined” • Z3 either finds a counterexample, or shows that none exists 46

Checking Correctness • Translation uses Z3’s theory of bitvectors • Sized, 2s-complement arithmetic • undef uses theory of quantifiers • Optimization must hold for all possible values 47

Alive Language Pre: C2 % (1<<C1) == 0 %s = shl nsw %A, C1 %r = sdiv %s, C2 => %r = sdiv %A, C2/(1<<C1) 48

Raw SMT (declare-fun C1 () (_ BitVec 4)) (declare-fun C2 () (_ BitVec 4)) (declare-fun %A () (_ BitVec 4)) (assert (and (and (distinct C2 (_ bv0 4)) true) (or (and (distinct (bvshl %A C1) (_ bv8 4)) true) (and (distinct C2 (_ bv15 4)) true) ) (bvult C1 (_ bv4 4)) (= (bvashr (bvshl %A C1) C1) %A) (= (bvsrem C2 (bvshl (_ bv1 4) C1)) (_ bv0 4)) (and (distinct (bvsdiv (bvshl %A C1) C2) (bvsdiv %A (bvsdiv C2 (bvshl (_ bv1 4) C1))) ) true ) ) ) (check-sat) i4 Equality Check 49

Raw SMT (declare-fun C1 () (_ BitVec 4)) (declare-fun C2 () (_ BitVec 4)) (declare-fun %A () (_ BitVec 4)) (assert (and (and (distinct C2 (_ bv0 4)) true) (or (and (distinct (bvshl %A C1) (_ bv8 4)) true) Defined (and (distinct C2 (_ bv15 4)) true) ) (bvult C1 (_ bv4 4)) Non-poison (= (bvashr (bvshl %A C1) C1) %A) (= (bvsrem C2 (bvshl (_ bv1 4) C1)) (_ bv0 4)) Precondition (and (distinct Source (bvsdiv (bvshl %A C1) C2) (bvsdiv %A (bvsdiv C2 (bvshl (_ bv1 4) C1))) Target ) true ) ) ) (check-sat) i4 Equality Check 50

Alive: Provably Correct InstCombine Optimizations David Menendez - PowerPoint PPT Presentation

Alive: Provably Correct InstCombine Optimizations David Menendez John Regehr Santosh Nagarakatte University of Utah Rutgers University Nuno Lopes Microsoft Research Can We Trust Compilers? Any large software project will have bugs

Loop Optimizations Important because lots of execution Loop Optimizations Loop Optimizations

What is the B-method? Welcome to Provably Correct Software http://www.it.uu.se/

Provably Correct Development of Reconfigurable Hardware Designs via Equational Reasoning Ian

Is SUSY still alive? Felix Brmmer Felix Brmmer Is SUSY still alive? 1 / 13 Felix Brmmer

8. God is risen; we are alive #whodoyouthinkyouare? @simongharris @burlington_ips Hes alive

Provably secure hash functions - do we care? Krystian Matusiewicz Technical University of Denmark

Alive2 Verifying existing optimizations Nuno Lopes John Regehr Microsoft Research University

Lab 2 discussion Last Time Debugging Its a science use experiments to refine

Analysis and Optimizations Analysis and Optimizations Program Analysis Program Analysis

Concepts Introduced in Chapter 9 introduction to compiler optimizations basic blocks and

Staying Alive Animals need to eat to stay alive. Different animals eat different types of food.

WHAT IS LONDON NEEDS YOU ALIVE? London Needs You Alive is an anti- knife crime campaign run by the

The Knock Em Alive The Knock Em Alive Positively Powerful Positively Powerful

It's Alive !! Soil is our greatest resource It's alive, as opposed to dirt It's

Who Watches the Watchers: Toward Provably-correct Decision Diagram Code Yousra Lembachar , Ryan

On the synthesis of provably correct discrete controllers Jri Vain Dept. of Computer

Making AQM Work: Making AQM Work: Active Queue Management Active Queue Management

Outline I. CAT( ) and -hyperbolic II. Curvature conjecture III. Decidability issues IV.

If you are viewing this presentation in a PDF, please stop and open the slides in browser (slides

Lecture 5 Multiplication and Division 1 Multiplication More complicated than addition A

Chapt er 15: Numer ical St r engt h Reduct ion Keshab K. Parhi Sub-expression eliminat ion

Experiments on Union-Find Algorithms for the Disjoint-Set Data Structure Md. Mostofa Ali Patwary*

Exact Recovery for a Family of Community-Detection Generative Models Paolo Penna joint work

Note: PowerPoint and audio for the December 17, 2015 Webinar, Basic Manual: Organization,