EE 457 Unit 2b Fast Adders (Carry-Lookahead Adder) 2 - PowerPoint PPT Presentation

1 EE 457 Unit 2b Fast Adders (Carry-Lookahead Adder)

2 Carry-Lookahead Adders FAST ADDERS

3 Ripple Carry Adder Critical Path • Critical Path = Longest possible delay path Assume t sum = 5 ns, t carry = 4 ns X Y X Y X Y X Y 16 ns 12 ns 8 ns 4 ns Co Ci Co Ci Co Ci Co Ci FA FA FA FA S S S S 17 ns 13 ns 9 ns 5 ns Critical Path

4 Ripple Carry Adders • Ripple-carry adders (RCA) are slow due to carry propagation – At least 2 levels of logic per full adder – Total delay for n-bit adder = n * T fa 6 5 4 3 2 1

5 Fast Adders • Recall that any logic function can be implemented as a 2-level implementation – SOP (AND-OR / NAND-NAND) implementation – POS (OR-AND / NOR-NOR) implementation • Rather than waiting for the previous carry, [C i+1 = f(X i ,Y i ,C i )] can we compute the carry as a function of just the inputs – C i+1 = f(X i ,X i-1 ,…X 0 ,Y i ,Y i-1 ,…Y 0 ) – This requires gates with many inputs which is infeasible in modern technologies above 4 or 5 inputs – But, we can try to use this idea of generating multiple carries at once by looking at many inputs

6 Fast Adders • To produce multiple carries in parallel, let us define some new signals for each column of addition that indicate information about the carry-out regardless of carry-in: – g i = Generate: This column will generate a carry-out whether or not the carry- in is ‘1’ g i is true when A i and B i is 1 => g i = A i • B i – p i = Propagate: This column will propagate a carry-in (if there is one) to the carry-out. p i is true when A i or B i is 1 => p i = A i + B i • Using these signals, we can define the carry-out (c i+1 ) as: c i+1 = g i + p i c i

7 Carry Lookahead Analogy • Consider the carry-chain like a long tube broken into segments. Each segment is controlled by a valve (propagate signal) and can insert a fluid into that segment (generate signal) • The carry-out of the diagram below will be true if g1 is true or p1 is true and g0 is true, or p1, p0 and c1 is true

8 Carry Lookahead Logic • Define each carry in terms of p i , g i and the initial carry-in (c 0 ) and not in terms of carry chain (intermediate carries: c1,c2,c3,…) • c1 = g 0 + p 0 c 0 • c2 = g 1 + p 1 c 1 = g 1 + p 1 g 0 + p 1 p 0 c 0 • c3 = … • c4 = …

9 4-Bit CLA • At this point we should probably stop as we have a 5-input gate in our equation • Let’s take our logic and build a 4 -bit carry lookahead adder (CLA) a0 c0 a3 a2 a1 b0 b3 b2 b1 Delay to produce s2 • Delay for pi,gi = 1 • Delay to produce c2 = 2 • Delay to produce s2 = 2 s0 s3 s2 s1 = 5 gates c4 p3 g3 c3 p2 g2 c2 p1 g1 c1 p0 g0 c0 (Compare to 8 gate delays for C4 P G RCA) Is S3 produced later than S2? Is C3 the last signal produced?

10 Carry Lookahead Adder • Use carry-lookahead logic to generate all the carries in one shot and then create the sum • Example 4-bit CLA shown below

11 Carry Lookahead Adder • Use carry-lookahead logic to generate all the carries in one shot and then 1 create the sum • Example 4-bit CLA shown below 3 3 3 3 2 5

12 16-Bit CLA • At this point we should probably stop as we have a 5-input gate in our equation A[11:8] A[15:12] B[11:8] A[7:4] A[3:0] B[15:12] B[7:4] B[3:0] C0 C8 C12 C4 PG PG PG PG 3 7 5 11 C16 S[15:12] S[11:8] S[7:4] S[3:0] 16-bit RCA Delay = 16*2 = 32 gate delays Delay of the above adder design = 3+2+2+4 = 11 gates Let us improve by looking ahead at a higher level to produce C16, C12, C8, C4 in parallel Define P and G as the overall Propagate and Generate signals for a set of 4 bits What’s the difference P = p3  p2  p1  p0 between the equation for G here and C4 on G = g3 + p3  g2 + p3  p2  g1 + p3  p2  p1  g0 the previous slides

13 16-bit CLA Closer Look • Each 4-bit CLA only propagates its overall carry-in if each of the 4 columns propagates: P0 = p3  p2  p1  p0 – P1 = p7  p6  p5  p4 – P2 = p11  p10  p9  p8 – P3 = p15  p14  p13  p12 – • Each 4-bit CLA generates a carry if any column generates and the more significant columns propagate G0 = g3 + (p3  g2) + (p3  p2  g1)+(p3  p2  p1  g0) – – … G3 = g15 + (p15  g14) + (p15  p14  g13)+(p15  p14  p13  g12) – • The higher order CLL logic (producing C4,C8,C12,C16) then is realized as: (C4) =>C1 = G0 + (P0  c0) – – … (C16) => C4 = G3 + (P3  G2) + (P3  P2  G1) +(P3  P2  P1  G0)+ (P3  P2  P1  P0  c0) – • These equations are exactly the same CLL logic we derived earlier

14 16-Bit CLA • Understanding 16- bit CLA hierarchy… C0 c15 G CLL CLL CLL CLL P G P G P G P G C8 C12 C4 p3 g3 c3 p2 g2 c2 p1 g1 c1 p0 g0 CLL c4 c0 P* G* C16 Delay = = 3 = Delay in producing Pi,Gi = 5 = Delay in producing Pi*,Gi* = 5 = Delay in producing C4,C8,C12,C16 = 7 = Delay in producing c15 = 9 = Delay in producing S15

15 64-Bit CLA • We can reuse the same CLL logic to build a 64-bit CLA C0 c63 s35 Pi,Gi C8 C4 C56 C52 C40 C36 C24 C20 C12 C60 C44 C28 G CLL CLL CLL CLL Pi*,Gi* P G P G P G P G C32 C48 C16 Pi**,Gi** p3 g3 c3 p2 g2 c2 p1 g1 c1 p0 g0 CLL c0 c4 P G = 3 = Delay in producing Pi,Gi = 13 = Delay in producing S63 = 5 = Delay in producing Pj*,Gj* Is the delay in producing s63 the same as in s35? = 7 = Delay in producing C48 = 5 = Delay in producing S2 = 9 = Delay in producing C60 = 4 = Delay in producing S0 = 11 = Delay in producing C63 = 13 = Delay in producing S63 = 13 Total Delay

16 Extrapolating CLA Logic Levels • In the above designs we’ve assumed 5 -input AND and OR gates are reasonable allowing us to group in blocks of 4 – Define b = blocking factor = number of carries produced in parallel • The greater the blocking factor the smaller the depth of logic (and vice-versa) • This leads us to reason that the delay of a CLA is O(log b n) • If we could only use 3- input gates we’d need a blocking factor of 2

17 Blocking factor of 2 13 • Each A box generates – p i = a i + b i 1 – g i = a i  b i 11 – s i = a i  b i 3 9 • Each B box 5 generates 7 – P i = p i  p i-1 – G i = g i +p i  g i-1 – c i+1 =G i + (P i  c i )

18 Credits • These slides were derived from Gandhi Puvvada’s EE 457 Class Notes

EE 457 Unit 2b Fast Adders (Carry-Lookahead Adder) 2 - PowerPoint PPT Presentation

1 EE 457 Unit 2b Fast Adders (Carry-Lookahead Adder) 2 Carry-Lookahead Adders FAST ADDERS 3 Ripple Carry Adder Critical Path Critical Path = Longest possible delay path Assume t sum = 5 ns, t carry = 4 ns X Y X Y X Y X Y 16 ns

457 Retirement Program 41-10390-29 2018/01/05 457 Retirement Program Things You Already Know

Credits These slides were derived from Gandhi Puvvadas EE 457 Class Notes EE 457 Unit 1

EE 457 Focus on CPU Design Microarchitecture EE 457 Unit 0 General Digital System

Deferred Compensation Plans 457(b) & 457(f) Presented By: Nonqualified Deferred Compensation

HOUSING PROJECT 1 UNIT 4 UNIT 1 UNIT 6 UNIT 5 UNIT 3 UNIT 2 Application of the Concept

EE 457 Unit 1 Overview of Digital System Design 1.2 Credits These slides were derived from

Unit Identifier Unit October 21, 2014 Unit Identifiers Unit Members Representing Name Email

Unit Title: Presentation Software Unit Level: 2 Unit Credit Value: 4 GLH: 30 LASER Unit

Caroline Van Wie AT&T Services Inc. T: 202.457.3053 AVP - Federal Regulatory 1120 20 th

EE 457 Unit 4 Computer System Performance 2 Motivation An individual user wants to:

EE 457 Unit 7a Cache and Memory Hierarchy 2 Memory Hierarchy & Caching Use several

EE 457 Unit 2 Fixed Point Systems and Arithmetic 2 Unsigned 2s Complement Sign and Zero

EE 457 Unit 6c Control Hazards 2 Control Hazards Control (branch) hazards are named such

EE 457 Unit 6b Data Hazards 2 Data Hazards Consider the data dependencies in the following

EE 457 Unit 2a Unsigned 2s Complement Sign and Zero Extension Fixed Point Systems and

EE 457 Unit 6a Basic Pipelining Techniques 2 Pipelining Introduction Consider a drink

Clocking & Timing Asynchronous Self Timed Design Self Timed Design Synchronous Circuit

Racket Values booleans: #t, #f numbers: The Pros of cons : integers: 42 , 0, -273

Objectives Principles of ERAS multiple small interventions effect big changes. What is

On the Potential of Proactive Domain Blacklisting Mrk Flegyhzi, Christian Kreibich and Vern

RSVP-TE Extensions for LSP Rerouting in NS2 D. Adami, C. Callegari, S. Giordano, M. Pagano

MDT Electronics ATLAS "Lite" Electronics Status ASD Lite STATUS 16,000 chips in house

Argon scintillation light triplet component decay time in presence of contaminant(s) and the Air

Search for Supersymmetry Karri Folan DiPetrillo, on behalf Karri Folan DiPetrillo, on behalf of

Explore More Topics

Sambuz

Useful Links

Newsletter

Mail Us

EE 457 Unit 2b Fast Adders (Carry-Lookahead Adder) 2 - PowerPoint PPT Presentation

1 EE 457 Unit 2b Fast Adders (Carry-Lookahead Adder) 2 Carry-Lookahead Adders FAST ADDERS 3 Ripple Carry Adder Critical Path Critical Path = Longest possible delay path Assume t sum = 5 ns, t carry = 4 ns X Y X Y X Y X Y 16 ns

457 Retirement Program 41-10390-29 2018/01/05 457 Retirement Program Things You Already Know

Credits These slides were derived from Gandhi Puvvadas EE 457 Class Notes EE 457 Unit 1

EE 457 Focus on CPU Design Microarchitecture EE 457 Unit 0 General Digital System

Deferred Compensation Plans 457(b) &amp; 457(f) Presented By: Nonqualified Deferred Compensation

HOUSING PROJECT 1 UNIT 4 UNIT 1 UNIT 6 UNIT 5 UNIT 3 UNIT 2 Application of the Concept

EE 457 Unit 1 Overview of Digital System Design 1.2 Credits These slides were derived from

Unit Identifier Unit October 21, 2014 Unit Identifiers Unit Members Representing Name Email

Unit Title: Presentation Software Unit Level: 2 Unit Credit Value: 4 GLH: 30 LASER Unit

Caroline Van Wie AT&amp;T Services Inc. T: 202.457.3053 AVP - Federal Regulatory 1120 20 th

EE 457 Unit 4 Computer System Performance 2 Motivation An individual user wants to:

EE 457 Unit 7a Cache and Memory Hierarchy 2 Memory Hierarchy &amp; Caching Use several

EE 457 Unit 2 Fixed Point Systems and Arithmetic 2 Unsigned 2s Complement Sign and Zero

EE 457 Unit 6c Control Hazards 2 Control Hazards Control (branch) hazards are named such

EE 457 Unit 6b Data Hazards 2 Data Hazards Consider the data dependencies in the following

EE 457 Unit 2a Unsigned 2s Complement Sign and Zero Extension Fixed Point Systems and

EE 457 Unit 6a Basic Pipelining Techniques 2 Pipelining Introduction Consider a drink

Clocking &amp; Timing Asynchronous Self Timed Design Self Timed Design Synchronous Circuit

Racket Values booleans: #t, #f numbers: The Pros of cons : integers: 42 , 0, -273

Objectives Principles of ERAS multiple small interventions effect big changes. What is

On the Potential of Proactive Domain Blacklisting Mrk Flegyhzi, Christian Kreibich and Vern

RSVP-TE Extensions for LSP Rerouting in NS2 D. Adami, C. Callegari, S. Giordano, M. Pagano

MDT Electronics ATLAS &quot;Lite&quot; Electronics Status ASD Lite STATUS 16,000 chips in house

Argon scintillation light triplet component decay time in presence of contaminant(s) and the Air

Search for Supersymmetry Karri Folan DiPetrillo, on behalf Karri Folan DiPetrillo, on behalf of

Explore More Topics

Sambuz

Useful Links

Newsletter

Mail Us

Deferred Compensation Plans 457(b) & 457(f) Presented By: Nonqualified Deferred Compensation

Caroline Van Wie AT&T Services Inc. T: 202.457.3053 AVP - Federal Regulatory 1120 20 th

EE 457 Unit 7a Cache and Memory Hierarchy 2 Memory Hierarchy & Caching Use several

Clocking & Timing Asynchronous Self Timed Design Self Timed Design Synchronous Circuit

MDT Electronics ATLAS "Lite" Electronics Status ASD Lite STATUS 16,000 chips in house