hardware accelerators
play

Hardware Accelerators Francesca Palumbo 1 , Claudio Rubattu 1,2 , - PowerPoint PPT Presentation

Exploiting Dataflows for Reconfigurable Hardware Accelerators Francesca Palumbo 1 , Claudio Rubattu 1,2 , Carlo Sau 3 , Tiziana Fanni 3 , Luigi Raffo 3 1 University of Sassari, PolComIng Information Engineering Group 2 University of Rennes,


  1. Exploiting Dataflows for Reconfigurable Hardware Accelerators Francesca Palumbo 1 , Claudio Rubattu 1,2 , Carlo Sau 3 , Tiziana Fanni 3 , Luigi Raffo 3 1 University of Sassari, PolComIng – Information Engineering Group 2 University of Rennes, INSA Group 3 University of Cagliari, Diee – Microelectronics and Bioengineering Group Rennes, 12-14 December 2017

  2. Who and Where UNIVERSITY OF SASSARI UNIVERSITY OF CAGLIARI

  3. Who and Where UNIVERSITY OF SASSARI UNIVERSITY OF CAGLIARI

  4. Outline • The origins of our dataflow to hardware studies: the RPCT Project – Context – Target Technologies – Project Development • The MDC tool – Approach – Baseline Functionality and Extensions • Contexts of application – Neural Signal Decoding – HEVC Interpolation Filters • Final Remarks

  5. Outline • The origins of our dataflow to hardware studies: the RPCT Project – Context – Target Technologies – Project Development • The MDC tool – Approach – Baseline Functionality and Extensions • Contexts of application – Neural Signal Decoding – HEVC Interpolation Filters • Final Remarks

  6. Modern Embedded Systems Embedded Systems ( real-time computing systems with a dedicated functionality) are pervasive ( 98% of computers are embedded) and may present sensing and actuating capabilities.

  7. Modern Embedded Systems Embedded Systems ( real-time computing systems with a dedicated functionality) are pervasive ( 98% of computers are embedded) and may present sensing and actuating capabilities. Complex Seamless Security Distrib. MPSoC Energy Safety Certif. functionalities. HMI Automotive x x x x x x x Aerospace x x x x x x x Colliding technical Healthcare x x x x x x x x requirements. Consumer x x x IDC - Design of Future ES

  8. Multimedia Domain HIGH PERFORMANCES real time, portability, long battery life UP-TO-DATE SOLUTIONS last audio/video codecs, file formats... MORE INTEGRATED FEATURES MP3, Camera, Video, GPS... MARKET DEMAND convenient form factor, affordable price, fashion

  9. Target & Technological Challenges • DATAFLOW MODEL OF COMPUTATION – Modularity and parallelism  EASIER INTEGRATION AND FAVOURED RE-USABILITY • COARSE-GRAINED RECONFIGURABILITY – Flexibility and resource sharing  MULTI-APPLICATION PORTABLE DEVICES The RPCT project (2012-2015) has been funded by Sardinian Regional Government (L.R. 7/2007, CRP-18324). http://sites.unica.it/rpct/

  10. Target & Technological Challenges • DATAFLOW MODEL OF COMPUTATION – Modularity and parallelism  EASIER INTEGRATION AND FAVOURED RE-USABILITY • COARSE-GRAINED RECONFIGURABILITY – Flexibility and resource sharing  MULTI-APPLICATION PORTABLE DEVICES Reconfigurable Platform Composer Tool Project Automated are fundamental to guarantee . Dealing with systems, in particular for , state of the art still lacks in providing a broadly accepted solution. The RPCT project (2012-2015) has been funded by Sardinian Regional Government (L.R. 7/2007, CRP-18324). http://sites.unica.it/rpct/

  11. Reasons for Coarser-Grain Flexibility Performance GP RECONF CPU GPU DSP FG CG ASIC

  12. Reasons for Coarser-Grain Flexibility Performance GP RECONF CPU GPU DSP FG CG ASIC • Coarse Grained (CG): – both in ASIC and FPGA Fine Grained Coarse Grained bit-level word-level – 1 clock cycle switching, with ☺  dedicated switching blocks. Flexibility  ☺ • Fine Grained (FG): Speed   – FPGA only Memory – switching requires a new bit- stream

  13. Framework Development 2010 2011 2012 2013 2014 2015 2016 Baseline tool specification: Multi-Dataflow Composer (MDC) tool MPEG-RVC Framework Integration: Orcc + MDC + Xronos + Turnus

  14. Framework Development 2010 2011 2012 2013 2014 2015 2016 Baseline tool specification: Multi-Dataflow Composer (MDC) tool MPEG-RVC Framework Integration: Orcc + MDC + Xronos + Turnus MDC: Structural Profiler MDC: Low-Power Extension MDC: Co-processor Generator

  15. Framework Evaluation 2010 2011 2012 2013 2014 2015 2016 Reconfigurable Image/Video Coding: JPEG e H.264 Adaptive Filtering: HEVC Encoding

  16. Framework Evaluation 2010 2011 2012 2013 2014 2015 2016 Reconfigurable Image/Video Coding: JPEG e H.264 Neural Signal Decoding Adaptive Filtering: HEVC Encoding Cryptograph ic Systems

  17. Outline • The origins of our dataflow to hardware studies: the RPCT Project – Context – Target Technologies – Project Development • The MDC tool – Approach – Baseline Functionality and Extensions • Contexts of application – Neural Signal Decoding – HEVC Interpolation Filters • Final Remarks

  18. Design Suite & Targeted Challenges Multi Dataflow Co-Processor Generator Composer Tool Structural Profiler Dynamic Power Manager MDC design suite http://sites.unica.it/rpct/

  19. Design Suite & Targeted Challenges Functional Complexity Time to Market: Design & Mapping Automation Multi Dataflow Co-Processor Generator Composer Tool Structural Profiler Dynamic Power Manager MDC design suite http://sites.unica.it/rpct/

  20. Design Suite & Targeted Challenges Functional Complexity Time to Market: Design & Mapping Automation Multi Dataflow Co-Processor Generator Composer Tool Constraint Driven Structural Profiler Optimisation Dynamic Power Manager MDC design suite http://sites.unica.it/rpct/

  21. Design Suite & Targeted Challenges Functional Complexity Time to Market: Design & Mapping Automation Multi Dataflow Co-Processor Generator Composer Tool Constraint Driven Structural Profiler Optimisation Dynamic Power Manager MDC design suite Power Efficiency http://sites.unica.it/rpct/

  22. Design Suite & Targeted Challenges Functional Complexity Time to Market: Fast Integration Design & Mapping and Prototyping Automation Multi Dataflow Co-Processor Generator Composer Tool Constraint Driven Structural Profiler Optimisation Dynamic Power Manager MDC design suite Power Efficiency http://sites.unica.it/rpct/

  23. Baseline: Dataflow to HW coarse grained A A substrate C D C D 1:1 B B

  24. Baseline: Dataflow to HW coarse grained A A substrate C D C D 1:1 B B coarse grained A reconfigurable E substrate A C D SB B 2:1 D SB C A E D B

  25. MDC Front-End: Multi-Dataflow Generator α β γ A C D A E D F D B MDC front-end E 0 0 SB 0 1 2 SB A SB 0 α 1 1 0 0 1 SB 1 D β 0 0 0 C 1 2 B γ x x 1 F 1 shared multi-dataflow

  26. Datapath Merging Problem: Graph Model GRAPHS Gᵢ = (Vᵢ, Eᵢ) a₁₁ b₁₁ G₁ a₁₂ c₁₁ a₂₁ b₂₁ G₂ a₂₂ c₂₁ a₂₃

  27. Datapath Merging Problem: Graph Model GRAPHS LABELING πᵢ : V ᵢ  T Gᵢ = (Vᵢ, Eᵢ) a₁₁ b₁₁ π₁ G₁ a₁₁ A a₁₂ c₁₁ a₂₁ b₂₁ π₂ G₂ a₂₁ A a₂₂ c₂₁ a₂₃

  28. Datapath Merging Problem: Graph Model GRAPHS LABELING MAPPING πᵢ : V ᵢ  T μᵢ(v) = u, Gᵢ = (Vᵢ, Eᵢ) e (vᵢ, vᵢ′) ϵ Eᵢ (v ϵ V ᵢ, u ϵ V)   a₁₁ b₁₁ e( μᵢ(vᵢ), μᵢ(vᵢ′)) ϵ E π₁ πᵢ(v) = π (u) G₁ a₁₁ A a₁₂ c₁₁ a₁₁ μ a₂₁ b₂₁  A π₂ G₂ a₂₁ A a₂₁ a₂₂ c₂₁ a₂₃

  29. Datapath Merging Problem: Graph Model GRAPHS LABELING MAPPING πᵢ : V ᵢ  T μᵢ(v) = u, Gᵢ = (Vᵢ, Eᵢ) e (vᵢ, vᵢ′) ϵ Eᵢ (v ϵ V ᵢ, u ϵ V)   a₁₁ b₁₁ e( μᵢ(vᵢ), μᵢ(vᵢ′)) ϵ E π₁ πᵢ(v) = π (u) G₁ a₁₁ A a₁₂ c₁₁ a₁₁ μ a₂₁ b₂₁  A π₂ G₂ a₂₁ A a₂₁ a₂₂ c₂₁ a₂₃ PROBLEM STATEMENT: find a Reconfigurable Graph G (V,E) with the minimum costs ( min|V| and min |E| )

  30. Datapath Merging Problem: Graph Model GRAPHS LABELING MAPPING πᵢ : V ᵢ  T μᵢ(v) = u, Gᵢ = (Vᵢ, Eᵢ) e (vᵢ, vᵢ′) ϵ Eᵢ (v ϵ V ᵢ, u ϵ V)   a₁₁ b₁₁ e( μᵢ(vᵢ), μᵢ(vᵢ′)) ϵ E π₁ πᵢ(v) = π (u) G₁ a₁₁ A a₁₂ c₁₁ a₁₁ μ a₂₁ b₂₁  A π₂ G₂ a₂₁ A a₂₁ a₂₂ c₂₁ a₂₃ PROBLEM STATEMENT: find a Reconfigurable Graph G (V,E) with the minimum costs ( min|V| and min |E| ) feasible solution: ꓯ T ϵ T , Vᵀ={ v : π (v) = T}  |V ᵀ| = max |V ᵢᵀ|, Vᵢᵀ={ v ᵢ : πᵢ(vᵢ) = T}

  31. Datapath Merging Problem: Graph Model GRAPHS LABELING MAPPING πᵢ : V ᵢ  T μᵢ(v) = u, Gᵢ = (Vᵢ, Eᵢ) e (vᵢ, vᵢ′) ϵ Eᵢ (v ϵ V ᵢ, u ϵ V)   a₁₁ b₁₁ e( μᵢ(vᵢ), μᵢ(vᵢ′)) ϵ E π₁ πᵢ(v) = π (u) G₁ a₁₁ A a₁₂ c₁₁ a₁₁ μ a₂₁ b₂₁  A π₂ G₂ a₂₁ A a₂₁ a₂₂ c₂₁ a₂₃ PROBLEM STATEMENT: find a Reconfigurable Graph G (V,E) with the minimum costs ( min|V| and min |E| ) feasible solution: ꓯ T ϵ T , Vᵀ={ v : π (v) = T}  |V ᵀ| = max |V ᵢᵀ|, Vᵢᵀ={ v ᵢ : πᵢ(vᵢ) = T} optimal solution: feasible solution with min|E|

Recommend


More recommend