chapter 18 programmable dsps
play

Chapter 18: Programmable DSPs Keshab K. Parhi and Viktor Owall DSP - PowerPoint PPT Presentation

Chapter 18: Programmable DSPs Keshab K. Parhi and Viktor Owall DSP Applications DSP applications are often real time but with a wide variety of sample rates High rates Radar Video Medium rates Audio Speech Low rates


  1. Chapter 18: Programmable DSPs Keshab K. Parhi and Viktor Owall

  2. DSP Applications DSP applications are often real time but with a wide variety of sample rates • High rates – Radar – Video • Medium rates – Audio – Speech • Low rates – Weather – Finance Chap. 18 2

  3. ...with different demands on • numeric representation – float or fixed – and nmber of bits • Throughput/speed • Power/energy dissipation • Cost Chap. 18 3

  4. DSP features Fast Multiply/Accumulate (MAC) x(n) • FIR D D D • FFT h0 h1 h2 h3 • etc. y(n) • Multiple Access Memories • Specialized addressing modes • Specialized execution control (loops) • Specialized interfaces, e.g. AD/DA Chap. 18 4

  5. Addressing Modes • Implied addressing P=X*Y; operation sets location • Immediate data AX0=1234 • Memory direct R1=Mem[101] • Register direct sub R1, R2 • Register indirect A0=A0+ *R5 • Register indirect with increment/decrement A0=A0+ *R5++ A0=A0+ *R5-- Chap. 18 5

  6. Standard DSP Alternatives PCs or Workstations • Non-real time • low requirements General purpose microprocessors • slower for DSP applications • might be one µ proc. there anyway Custom • perfomance • low cost at volume • High development cost Chap. 18 6

  7. Standard Processors vs. Special Purpose Algorithm Special Standard Processor Processor Cores Purpose Domain Specific • High Calculation Capacity Processors • Programmable • Low Power etc. • Low Design cost • User defined Interface • Standard Interface • Variable Wordlength • Good supply of tools • Low Price at Volume Chap. 18 7

  8. Architectural Partitioning Local busses Processor Processor ASIC and Core Core Distributed Dist. memory MEM to decrease Main Main ASIC data transfers MEM MEM MIPS intensive Conflicting req. algorithms in • Throughput Flexibility by dedicated HW • Flexibility using programmable to increase • Power Consumption processor core throughput and • Time to market save power Chap. 18 8

  9. Fixed point DSP Motorola DSP56000x 24 24 X0 Operand X1 Y0 Registers • Usually DSP has single cycle Y1 multiplier, may be pipelined 24 24 • Double wordlength out 56 56 + guard bits ALU Shifter 56 24 A (56) • scaling B (56) 24 56 56 Shifter/ Accumulators • Altenative is mult Limiter with reduced 24 24 wordlength output, e.g. 24 Chap. 18 9

  10. Memory Structures, von Neuman Processor Core Addresss bus Data bus Memory Chap. 18 10

  11. Memory Structures, Harvard Original Harvard Processor Core • one data • one program Addresss bus 2 Data bus 2 Addresss bus 2 Data bus 2 Memory A Memory B Chap. 18 11

  12. TI Processors, high speed Chap. 18 12

  13. TI Processors, low power Chap. 18 13

  14. TI, C64 Chap. 18 14

  15. TI, C55 Chap. 18 15

  16. Processor Architectures SIMD – Single Instruction Multiple Data Program Processor Processor Processor Processor Chap. 18 16

  17. Processor Architectures MIMD – Multiple Instruction Multiple Data Program Program Program Program Processor Processor Processor Processor Chap. 18 17

  18. Processor Architectures VLIW – Very Long Instruction Words Control Unit VLIW Instruction Functional Functional Functional Functional Unit Unit Unit Unit Chap. 18 18

  19. Split Processors Functional units can be split into submodules, e.g. for images (8bits) TI320C80, 1 RISC 4 x 32bit DSP which can be split into 8bit modules Chap. 18 19

  20. Vector Processors Chap. 18 20

  21. Low Power MMAC Multiplier Multiple Accumulator V. Sundararajan and K.K. Parhi, "A Novel Multiply Multiple Accumulator Component for Low Power PDSP Design", Proc. of 2000 IEEE Int. Conf. on Acoustics, Speech and Signal Processing, Vol. 6, pp. 3247-3250, Istanbul, June 2000 Chap. 18 21

  22. Low Power MMAC Schedule 16-tap FIR 4 acc. MMAC Chap. 18 22

Recommend


More recommend