PIPELINING: HAZARDS Mahdi Nazm Bojnordi Assistant Professor School - PowerPoint PPT Presentation

PIPELINING: HAZARDS Mahdi Nazm Bojnordi Assistant Professor School of Computing University of Utah CS/ECE 6810: Computer Architecture

Overview ¨ Announcement ¤ Homework 1 submission deadline: Jan. 30 th ¨ This lecture ¤ Impacts of pipelining on performance ¤ The MIPS five-stage pipeline ¤ Pipeline hazards n Structural hazards n Data hazards

Pipelining Technique ¨ Improving throughput at the expense of latency ¤ Delay: D = T + n δ ¤ Throughput: IPS = n/(T + n δ ) Combinational Logic D = Critical Path Delay = 30 IPS = Combinational Logic Combinational Logic D = IPS = Critical Path Delay = 15 Critical Path Delay = 15 D = Comb. Logic Comb. Logic Comb. Logic IPS = Delay = 10 Delay = 10 Delay = 10

Pipelining Technique ¨ Improving throughput at the expense of latency ¤ Delay: D = T + n δ ¤ Throughput: IPS = n/(T + n δ ) Combinational Logic D = 31 Critical Path Delay = 30 IPS = 1/31 Combinational Logic Combinational Logic D = 32 IPS = 2/32 Critical Path Delay = 15 Critical Path Delay = 15 D = 33 Comb. Logic Comb. Logic Comb. Logic IPS = 3/33 Delay = 10 Delay = 10 Delay = 10

Pipelining Latency vs. Throughput ¨ Theoretical delay and throughput models for perfect pipelining Delay (D) Throughput (IPS) 20 Relative Performance 15 10 5 0 0 50 100 150 200 Number of Pipeline Stages

Five Stage MIPS Pipeline

Simple Five Stage Pipeline ¨ A pipelined load-store architecture that processes up to one instruction per cycle Write Back PC Inst. Register Data ALU Memory File Memory Inst. Fetch Inst. Decode Execute Memory

Instruction Fetch ¨ Read an instruction from memory (I-Cache) ¤ Use the program counter (PC) to index into the I- Memory ¤ Compute NPC by incrementing current PC n What about branches? ¨ Update pipeline registers ¤ Write the instruction into the pipeline registers

Instruction Fetch clock Branch Target NPC = PC + 4 NPC clock PC + Why increment 4 by 4? Instruction Memory Pipeline Register

Instruction Fetch clock P3 Branch Target NPC = PC + 4 NPC clock PC + P2 Why increment 4 by 4? Instruction P1 Memory Critical Path = Max{P1, P2, P3} Pipeline Register

Instruction Decode ¨ Generate control signals for the opcode bits ¨ Read source operands from the register file (RF) ¤ Use the specifiers for indexing RF n How many read ports are required? ¨ Update pipeline registers ¤ Send the operand and immediate values to next stage ¤ Pass control signals and NPC to next stage

Instruction Decode target NPC NPC reg Register Instruction File reg ctrl decode Pipeline Pipeline Register Register

Execute Stage ¨ Perform ALU operation ¤ Compute the result of ALU n Operation type: control signals n First operand: contents of a register n Second operand: either a register or the immediate value ¤ Compute branch target n Target = NPC + immediate ¨ Update pipeline registers ¤ Control signals, branch target, ALU results, and destination

Execute Stage Target NPC + Res reg ALU reg reg ctrl ctrl Pipeline Pipeline Register Register

Memory Access ¨ Access data memory ¤ Load/store address: ALU outcome ¤ Control signals determine read or write access ¨ Update pipeline registers ¤ ALU results from execute ¤ Loaded data from D-Memory ¤ Destination register

Memory Access Target Res Res addr Dat reg Memory data data ctrl ctrl Pipeline Pipeline Register Register

Register Write Back ¨ Update register file ¤ Control signals determine if a register write is needed ¤ Only one write port is required n Write the ALU result to the destination register, or n Write the loaded data into the register file

Five Stage Pipeline ¨ Ideal pipeline: IPC=1 ¤ Is there enough resources to keep the pipeline stages busy all the time? Inst. Fetch Decode Execute Memory Writeback + + PC ALU Reg. Reg. 4 File File Mem Mem

Pipeline Hazards

Pipeline Hazards ¨ Structural hazards: multiple instructions compete for the same resource ¨ Data hazards: a dependent instruction cannot proceed because it needs a value that hasn’t been produced ¨ Control hazards: the next instruction cannot be fetched because the outcome of an earlier branch is unknown

Structural Hazards ¨ 1. Unified memory for instruction and data R1 ß Mem[R2] R3 ß Mem[R20] R6 ß R4-R5 R7 ß R1+R0

Structural Hazards ¨ 1. Unified memory for instruction and data R1 ß Mem[R2] R3 ß Mem[R20] R6 ß R4-R5 R7 ß R1+R0 Separate inst. and data memories.

Structural Hazards ¨ 1. Unified memory for instruction and data ¨ 2. Register file with shared read/write access ports R1 ß Mem[R2] R3 ß Mem[R20] R6 ß R4-R5 R7 ß R1+R0

Structural Hazards ¨ 1. Unified memory for instruction and data ¨ 2. Register file with shared read/write access ports R1 ß Mem[R2] R3 ß Mem[R20] R6 ß R4-R5 R7 ß R1+R0 Register access in half cycles.

Data Hazards ¨ True dependence: read-after-write (RAW) ¤ Consumer has to wait for producer Loading data from memory. R1 ß Mem[R2] R3 ß R1+R0 R4 ß R1-R3

Data Hazards ¨ True dependence: read-after-write (RAW) ¤ Consumer has to wait for producer Loaded data will be available two cycles later. R1 ß Mem[R2] R3 ß R1+R0 R4 ß R1-R3

Data Hazards ¨ True dependence: read-after-write (RAW) ¤ Consumer has to wait for producer Inserting two bubbles. R1 ß Mem[R2] Nothing Nothing R3 ß R1+R0 R4 ß R1-R3

Data Hazards ¨ True dependence: read-after-write (RAW) ¤ Consumer has to wait for producer Inserting single bubble + RF bypassing. R1 ß Mem[R2] Nothing R3 ß R1+R0 R4 ß R1-R3 Load delay slot. SW vs. HW management?

Data Hazards ¨ True dependence: read-after-write (RAW) ¤ Consumer has to wait for producer Using the result of an ALU instruction. R1 ß R2+R3 R5 ß R1+R0 R3 ß R1+R0 R4 ß R1-R3

Data Hazards ¨ True dependence: read-after-write (RAW) ¤ Consumer has to wait for producer Using the result of an ALU instruction. R1 ß R2+R3 R5 ß R1+R0 R3 ß R1+R0 R4 ß R1-R3 Forwarding ALU result.

Data Hazards ¨ True dependence: read-after-write (RAW) ¨ Anti dependence: write-after-read (WAR) ¤ Write must wait for earlier read R1 ß R2+R1 R2 ß R8+R9

Data Hazards ¨ True dependence: read-after-write (RAW) ¨ Anti dependence: write-after-read (WAR) ¤ Write must wait for earlier read R1 ß R2+R1 R2 ß R8+R9 No WAR hazards in 5-stage pipeline!

Data Hazards ¨ True dependence: read-after-write (RAW) ¨ Anti dependence: write-after-read (WAR) ¨ Output dependence: write-after-write (WAW) ¤ Old writes must not overwrite the younger write R1 ß R2+R3 R1 ß R8+R9

Data Hazards ¨ True dependence: read-after-write (RAW) ¨ Anti dependence: write-after-read (WAR) ¨ Output dependence: write-after-write (WAW) ¤ Old writes must not overwrite the younger write R1 ß R2+R3 R1 ß R8+R9 No WAW hazards in 5-stage pipeline!

Data Hazards ¨ Forwarding with additional hardware

Data Hazards ¨ How to detect and resolve data hazards ¤ Show all of the data hazards in the code below R1 ß Mem[R2] R2 ß R1+R0 R1 ß R1-R2 Mem[R3] ß R2

Data Hazards ¨ How to detect and resolve data hazards ¤ Show all of the data hazards in the code below R1 ß Mem[R2] WAR R2 ß R1+R0 WAW RAW R1 ß R1-R2 Mem[R3] ß R2

PIPELINING: HAZARDS Mahdi Nazm Bojnordi Assistant Professor School - PowerPoint PPT Presentation

PIPELINING: HAZARDS Mahdi Nazm Bojnordi Assistant Professor School of Computing University of Utah CS/ECE 6810: Computer Architecture Overview Announcement Homework 1 submission deadline: Jan. 30 th This lecture Impacts of

Pipelining Instruction Pipelining is the use of pipelining to allow more than one instruction to

CSCI341 Lecture 36, Pipelining & Hazards RECALL... RECALL... HAZARDS Data Hazards

Pipelining is Hazardous! Hazards are situations where pipelining does not work as elegantly as

Computer Systems Lecture 15 Pipelining and Hazards CS 230 - Spring 2020 3-1 Pipelining CS

The Challenge of Natural Hazards This PowerPoint will cover information on: Natural Hazards

Pipelining 1 Today Quiz Introduction to pipelining 2 Pipelining L L a a Logic

Appendix A Appendix A Pipelining: Basic and Intermediate Concepts p 1 Overview Basics of

Occupational Health Hazards PPT-SM-OCPHLTHHAZ 1 V.A.0.0 Occupational Health Hazards Three

Health Hazards in Construction Health Hazards Potential exposures to health hazards: Worker

Overview Basics of Pipelining Pipeline Hazards Appendix A Pipeline Implementation

Chapter 3: Pipelining and Parallel Processing Keshab K. Parhi Outline Introduction

Lecture 2 (I ): Lecture 2 (I ): Pipelining & Retiming Pipelining & Retiming

Hazards Introduction Pipelining up until now has been ideal In real life, though, we

Appendix A Pipelining: Basic and Intermediate C Concepts t 1 Overview Basics of

Unit 5: Pipelining Load-use stalling Pipelined multi-cycle operations Control hazards

1 What Limits Performance? Stalls (Data Hazards) Data hazards Code Instruction depends on

Heavy Elements and the Path to FRIB W. Loveland Oregon State University The Current Situation

Transformation based parallel programming Program parallelization techniques. 1. Program Mapping

Instruction Scheduling cs5363 1 Instruction scheduling Reordered Original Instruction code

Gauge-Invariant Gluon TMD from large- to small-x in the coordinate space I.O. Cherednikov 7th

From Intent to Action: Nudging Users Towards Secure Mobile Payments Peter Story , Daniel Smullen,

UMASS SYSTEM FY2009 CUTS FROM THE STATE START: $492M REDUCTION: $25M (=$36M) FY2010: CUT OF

Financial Projections Board of Education February 26, 2018 Mrs. Luann Kolstad Chief School

Staff Open Meeting 8 June 2020 Virtual meeting Professor Paul Layzell - Principal Introduction

PIPELINING: HAZARDS Mahdi Nazm Bojnordi Assistant Professor School - PowerPoint PPT Presentation

PIPELINING: HAZARDS Mahdi Nazm Bojnordi Assistant Professor School of Computing University of Utah CS/ECE 6810: Computer Architecture Overview Announcement Homework 1 submission deadline: Jan. 30 th This lecture Impacts of

Pipelining Instruction Pipelining is the use of pipelining to allow more than one instruction to

CSCI341 Lecture 36, Pipelining &amp; Hazards RECALL... RECALL... HAZARDS Data Hazards

Pipelining is Hazardous! Hazards are situations where pipelining does not work as elegantly as

Computer Systems Lecture 15 Pipelining and Hazards CS 230 - Spring 2020 3-1 Pipelining CS

The Challenge of Natural Hazards This PowerPoint will cover information on: Natural Hazards

Pipelining 1 Today Quiz Introduction to pipelining 2 Pipelining L L a a Logic

Appendix A Appendix A Pipelining: Basic and Intermediate Concepts p 1 Overview Basics of

Occupational Health Hazards PPT-SM-OCPHLTHHAZ 1 V.A.0.0 Occupational Health Hazards Three

Health Hazards in Construction Health Hazards Potential exposures to health hazards: Worker

Overview Basics of Pipelining Pipeline Hazards Appendix A Pipeline Implementation

Chapter 3: Pipelining and Parallel Processing Keshab K. Parhi Outline Introduction

Lecture 2 (I ): Lecture 2 (I ): Pipelining &amp; Retiming Pipelining &amp; Retiming

Hazards Introduction Pipelining up until now has been ideal In real life, though, we

Appendix A Pipelining: Basic and Intermediate C Concepts t 1 Overview Basics of

Unit 5: Pipelining Load-use stalling Pipelined multi-cycle operations Control hazards

1 What Limits Performance? Stalls (Data Hazards) Data hazards Code Instruction depends on

Heavy Elements and the Path to FRIB W. Loveland Oregon State University The Current Situation

Transformation based parallel programming Program parallelization techniques. 1. Program Mapping

Instruction Scheduling cs5363 1 Instruction scheduling Reordered Original Instruction code

Gauge-Invariant Gluon TMD from large- to small-x in the coordinate space I.O. Cherednikov 7th

From Intent to Action: Nudging Users Towards Secure Mobile Payments Peter Story , Daniel Smullen,

UMASS SYSTEM FY2009 CUTS FROM THE STATE START: $492M REDUCTION: $25M (=$36M) FY2010: CUT OF

Financial Projections Board of Education February 26, 2018 Mrs. Luann Kolstad Chief School

Staff Open Meeting 8 June 2020 Virtual meeting Professor Paul Layzell - Principal Introduction

CSCI341 Lecture 36, Pipelining & Hazards RECALL... RECALL... HAZARDS Data Hazards

Lecture 2 (I ): Lecture 2 (I ): Pipelining & Retiming Pipelining & Retiming