CS422 Computer Architecture Spring 2004 Lecture 33, 22 Apr 2004 - PowerPoint PPT Presentation

CS422 Computer Architecture Spring 2004 Lecture 33, 22 Apr 2004 Bhaskaran Raman Department of CSE IIT Kanpur http://web.cse.iitk.ac.in/~cs422/index.html

Lecture Outline ● Vector Processors ● ● Scribe for today?

Why Vector Processing ● Deep pipeline ==> more parallelism – But more dependences – Need to fetch and issue many instructions (Flynn bottleneck) ● Same issues with multiple-issue processor ● Operations on vectors: – No data dependences – No control hazards – Single instn. ==> instn. bandwidth reduced – Well defined memory access pattern

Basic Architecture ● Vector-register processors vs. memory- memory vector processor ● DLXV: vector extn. of DLX (vector-register) ● Components: – Vector registers (V0..V7), 64-element – Vector functional units: ● ADD/SUB, MUL, DIV, Integer, Logical ● Each is pipelined, can start a new opn. every cycle – Vector load/store unit: also pipelined – Scalar registers and scalar unit (like in DLX)

Some Vector Instructions ● ADDV V1, V2, V3 ● ADDSV V1, F0, V2 ● SUBV V1, V2, V3 ● SUBVS V1, V2, F0 ● SUBSV V1, F0, V2 ● Similar for MUL and DIV ● LV V1, R1 ● SV R1, V1

SAXPY/DAXPY Loop ● Y = aX + Y (caps ==> vector) LD F0, a LD F0, a ADDI R4, Rx, 512 LV V1, Rx Loop: LD F2, 0(Rx) MULTSV V2, F0, V1 MULTD F2, F0, F2 LV V3, Ry LD F4, 0(Ry) ADDV V4, V2, V3 ADDD F4, F2, F4 SV Ry, V4 SD 0(Ry), F4 Reduction in instn. bandwidth ADDI Rx, Rx, 8 Lesser pipeline interlocks ADDI Ry, Ry, 8 SUB R20, R4, Rx BNEZ R20, Loop

Estimating Execution Time ● Convoy: set of vector instructions which can begin execution in same cycle – Check for structural, data hazards ● For simplicity: convoy must complete before initiating next convoy ● Chime: time taken to execute one vector opn. ● Approximations: – Only one instn. can be initiated per cycle – Pipeline setup latency

Adding Flexibility ● Vector-length register (VLR), Maximum vector length (MVL) – MOVI2S VLR, R1 – MOVS2I R1, VLR ● Vector longer than MVL ==> use strip-mining ● Vector stride: – LVWS V1, (R1, R2) – SVWS (R1, R2), V1 ● Memory-bank conflicts?

Enhancing Vector Performance ● Chaining: data-forwarding ● Conditional execution: – Vector Mask Register – Some related instructions ● SNEV V1, V2 ● SGTSV F0, V1 ● CVM ● Sparse matrices: scatter-gather – LVI V1, (R1+V2) – SVI (R1+V2), V1

CS422 Computer Architecture Spring 2004 Lecture 33, 22 Apr 2004 - PowerPoint PPT Presentation

CS422 Computer Architecture Spring 2004 Lecture 33, 22 Apr 2004 Bhaskaran Raman Department of CSE IIT Kanpur http://web.cse.iitk.ac.in/~cs422/index.html Lecture Outline Vector Processors Scribe for today? Why Vector Processing

CS422 Computer Architecture Spring 2004 Lecture 04, 06 Jan 2004 Bhaskaran Raman Department of

CS422 Computer Architecture Spring 2004 Lecture 23, 26 Mar 2004 Bhaskaran Raman Department of

CS422 Computer Architecture Spring 2004 Lecture 18, 26 Feb 2004 Bhaskaran Raman Department of

CS422 Computer Architecture Spring 2004 Lecture 13, 17 Feb 2004 Bhaskaran Raman Department of

CS422 Computer Architecture Spring 2004 Lecture 15, 20 Feb 2004 Bhaskaran Raman Department of

CS422 Computer Architecture Spring 2004 Lecture 05, 06 Jan 2004 Bhaskaran Raman Department of

CS422 Computer Architecture Spring 2004 Lecture 02, 01 Jan 2004 Bhaskaran Raman Department of

Theory of Computation Textbook The Nature of Computation by Cristopher Moore and (CS

User Interface Design and Programming - CS422 Luc Renambot renambot@uic.edu Yiwen Sun

An Agent Architecture An Agent Architecture An Agent Architecture An Agent Architecture for

Architecture: Culture and Space Architecture: Culture and Space Architecture: Culture and Space

CSE 675.02: three aspects of computer design: instruction set architecture, Introduction to

ICS 233 ICS 233 ICS 233 ICS 233 Computer Architecture & Computer Architecture &

Introduction to Software Architecture Reid Holmes Architecture Architecture is: All

CMS Strip Readout Architecture for SLHC OUTLINE brief review of LHC strip readout architecture p

A New Golden Age for 1. Software advances can inspire architecture Computer Architecture:

Multicast Source Notification of Interest Protocol (MSNIP) draft-ietf-idmr-msnip-01 Bill Fenner

INSTITUTE of ATOMIC PHYSICS Magurele-Bucharest Gamma Laser Controlled by High External Fields M

Neutrino mass constraint from CMB and its degeneracy with other cosmological parameters Kazuhide

Internal dissipation and heat leaks in nanoscale heat devices Luis A. Correa QUANTUM T E

Resonant Excitation of Envelope Modes as an Emittance Diagnostic in High-Intensity Circular

AmI Taxonomy AmI Taxonomy Network Characteristics of the technologies allowing devices to

ESI Workshop on Higher Spin Gravity Based on: hep-th/1006.4788 [Ammon, Gutperle, Kraus, EP]

Software Trigger Board Reader Progress Report Giovanna Lehmann Miotto, Alessandro Thea 1 R

Explore More Topics

Sambuz

Useful Links

Newsletter

Mail Us

CS422 Computer Architecture Spring 2004 Lecture 33, 22 Apr 2004 - PowerPoint PPT Presentation

CS422 Computer Architecture Spring 2004 Lecture 33, 22 Apr 2004 Bhaskaran Raman Department of CSE IIT Kanpur http://web.cse.iitk.ac.in/~cs422/index.html Lecture Outline Vector Processors Scribe for today? Why Vector Processing

CS422 Computer Architecture Spring 2004 Lecture 04, 06 Jan 2004 Bhaskaran Raman Department of

CS422 Computer Architecture Spring 2004 Lecture 23, 26 Mar 2004 Bhaskaran Raman Department of

CS422 Computer Architecture Spring 2004 Lecture 18, 26 Feb 2004 Bhaskaran Raman Department of

CS422 Computer Architecture Spring 2004 Lecture 13, 17 Feb 2004 Bhaskaran Raman Department of

CS422 Computer Architecture Spring 2004 Lecture 15, 20 Feb 2004 Bhaskaran Raman Department of

CS422 Computer Architecture Spring 2004 Lecture 05, 06 Jan 2004 Bhaskaran Raman Department of

CS422 Computer Architecture Spring 2004 Lecture 02, 01 Jan 2004 Bhaskaran Raman Department of

Theory of Computation Textbook The Nature of Computation by Cristopher Moore and (CS

User Interface Design and Programming - CS422 Luc Renambot renambot@uic.edu Yiwen Sun

An Agent Architecture An Agent Architecture An Agent Architecture An Agent Architecture for

Architecture: Culture and Space Architecture: Culture and Space Architecture: Culture and Space

CSE 675.02: three aspects of computer design: instruction set architecture, Introduction to

ICS 233 ICS 233 ICS 233 ICS 233 Computer Architecture &amp; Computer Architecture &amp;

Introduction to Software Architecture Reid Holmes Architecture Architecture is: All

CMS Strip Readout Architecture for SLHC OUTLINE brief review of LHC strip readout architecture p

A New Golden Age for 1. Software advances can inspire architecture Computer Architecture:

Multicast Source Notification of Interest Protocol (MSNIP) draft-ietf-idmr-msnip-01 Bill Fenner

INSTITUTE of ATOMIC PHYSICS Magurele-Bucharest Gamma Laser Controlled by High External Fields M

Neutrino mass constraint from CMB and its degeneracy with other cosmological parameters Kazuhide

Internal dissipation and heat leaks in nanoscale heat devices Luis A. Correa QUANTUM T E

Resonant Excitation of Envelope Modes as an Emittance Diagnostic in High-Intensity Circular

AmI Taxonomy AmI Taxonomy Network Characteristics of the technologies allowing devices to

ESI Workshop on Higher Spin Gravity Based on: hep-th/1006.4788 [Ammon, Gutperle, Kraus, EP]

Software Trigger Board Reader Progress Report Giovanna Lehmann Miotto, Alessandro Thea 1 R

Explore More Topics

Sambuz

Useful Links

Newsletter

Mail Us

ICS 233 ICS 233 ICS 233 ICS 233 Computer Architecture & Computer Architecture &