Systems Interleaving + Parallelism Shankar Balachandran* Associate - - PowerPoint PPT Presentation

systems
SMART_READER_LITE
LIVE PREVIEW

Systems Interleaving + Parallelism Shankar Balachandran* Associate - - PowerPoint PPT Presentation

Spring 2015 Week 8 Module 45 Digital Circuits and Systems Interleaving + Parallelism Shankar Balachandran* Associate Professor, CSE Department Indian Institute of Technology Madras *Currently a Visiting Professor at IIT Bombay


slide-1
SLIDE 1

Shankar Balachandran* Associate Professor, CSE Department Indian Institute of Technology Madras

*Currently a Visiting Professor at IIT Bombay

Digital Circuits and Systems

Spring 2015 Week 8 Module 45

Interleaving + Parallelism

slide-2
SLIDE 2

Acknowledgements

 MIT’s Open Course Contents of 6.004

Pipelining Methodology 2

slide-3
SLIDE 3

Problem with Pipelining

 Slowest component becomes the bottleneck

Pipelining Methodology 3

slide-4
SLIDE 4

Pipelined Components

Pipelining Methodology 4

 Pipeline the slowest component  Here, throughput = 1  Replacing a slow combinational component with a k-

pipe version may increase clock frequency

 Must account for new pipeline stages in our plan

slide-5
SLIDE 5

Laundry Example

 Find a place with twice as many dryers as washers.

Pipelining Methodology 5

 Throughput = 1/30 loads/min.  Latency = 90 mins/load

slide-6
SLIDE 6

Back to Our Bottleneck

 C is the slowest  Either find a pipelined

version of C

 Or interleave multiple

copies of C

Pipelining Methodology 6

slide-7
SLIDE 7

Circuit Interleaving

 Copy the critical element  Alternate between copies to simulate pipelining of the

slowest component

Pipelining Methodology 7

slide-8
SLIDE 8

Interleaving

Pipelining Methodology 8

slide-9
SLIDE 9

Interleaving

Pipelining Methodology 9

 Clock period 0: X0 presented at input, propagates through

upper latch, C0

 Clock period 1: X1 presented at input, propagates through

lower latch, C1.

 C0(X0) propagates to register inputs.

 Clock period 2: X2 presented at input, propagates through

upper latch, C.

 C0(X0) loaded into register, appears at output.

Latency = 2 cycles

slide-10
SLIDE 10

N-Way Interleaving

Pipelining Methodology 10

slide-11
SLIDE 11

Combine Interleaving with Pipelining

 Shifts the bottleneck from C to F

Pipelining Methodology 11

slide-12
SLIDE 12

And a Little Parallelism

 Combine Interleaving

and pipelining with parallelism

 Throughput = 2/30 =

1/15 load/min

 Latency = 90 min

Pipelining Methodology 12

slide-13
SLIDE 13

Summary

 Latency (L) = time it takes for given input to arrive at

  • utput

 Throughput (T) = rate at each new outputs appear  For combinational circuits: L = tPD of circuit, T = 1/L  For K-pipelines (K > 0):

 always have register on output(s)  K registers on every path from input to output  Inputs available shortly after clock i, outputs available shortly after

clock (i+K)

 T = 1/(tPD,REG + tPD of slowest pipeline stage + tSETUP)  more throughput =>split slowest pipeline stage(s)  use replication/interleaving if no further splits possible

 L = K / T

 pipelined latency >= combinational latency

Pipelining Methodology 13

slide-14
SLIDE 14

End of Week 8: Module 45

Thank You

Pipelining Methodology 14