energy efficient hpc
play

Energy-Efficient HPC A Tools Perspective February 2nd, 2016 - PowerPoint PPT Presentation

Mitglied der Helmholtz-Gemeinschaft Energy-Efficient HPC A Tools Perspective February 2nd, 2016 Michael Knobloch Short Intro to Performance Tools Trigger performance data Sampling Code instrumentation Recording performance data Profiling


  1. Mitglied der Helmholtz-Gemeinschaft Energy-Efficient HPC A Tools Perspective February 2nd, 2016 Michael Knobloch

  2. Short Intro to Performance Tools Trigger performance data Sampling Code instrumentation Recording performance data Profiling Tracing February 2nd, 2016 Michael Knobloch Slide 2

  3. Sampling February 2nd, 2016 Michael Knobloch Slide 3

  4. Instrumentation February 2nd, 2016 Michael Knobloch Slide 4

  5. Instrumentation Techniques February 2nd, 2016 Michael Knobloch Slide 5

  6. Instrumentation Critical Issues February 2nd, 2016 Michael Knobloch Slide 6

  7. Profiling February 2nd, 2016 Michael Knobloch Slide 7

  8. Tracing February 2nd, 2016 Michael Knobloch Slide 8

  9. Tracing details February 2nd, 2016 Michael Knobloch Slide 9

  10. Tracing vs. Profiling February 2nd, 2016 Michael Knobloch Slide 10

  11. Energy-Efficiency Tools Projects at JSC Score-E February 2nd, 2016 Michael Knobloch Slide 11

  12. eeClust – 2009 - 2012 eeClust - Energy-Efficient Cluster Computing Project partners: Uni Hamburg, TU Dresden, JSC, ParTec www.eeclust.de Goals: Identify phases of low resource utilization and turn hardware to lower power-states in such phases Integral point: Extension of performance analysis tools to analyse application power consumption and hardware utilization February 2nd, 2016 Michael Knobloch Slide 12

  13. MPI Busy-Waiting MPI Busy-Waiting Power consumption in phases of busy-waiting is very high due to constant CPU activity. February 2nd, 2016 Michael Knobloch Slide 13

  14. Scalasca Workflow Measurement library Distributed trace Instr. target application Parallel Wait-state Report wait-state report browser search February 2nd, 2016 Michael Knobloch Slide 14

  15. Scalasca Wait-State Detection processes A Send B Recv C Send D Recv Wait State time February 2nd, 2016 Michael Knobloch Slide 15

  16. Calculating Energy-Saving Potential Idle-Waiting ESP = max p ∈ PS (( t w ∗ A p 1 ) − ( t w − t T p , p 1 ) ∗ I p + E T p , p 1 ) Busy-Waiting ESP BW = max p ∈ PS (( t w ∗ A p 1 ) − ( t w − t T p , p 1 ) ∗ A p + E T p , p 1 ) PS – Set of power states t w – Waiting time A p – Active energy in P-State p t T p 1 , p 2 – Transition time I p – Idle energy in P-State p E T p , p 1 – Transition energy February 2nd, 2016 Michael Knobloch Slide 16

  17. Example: Energy Saving Potential February 2nd, 2016 Michael Knobloch Slide 17

  18. Example: Optimal P-State Detection February 2nd, 2016 Michael Knobloch Slide 18

  19. EIC – 2010 - . . . EIC - Exascale Innovation Center Project partners: IBM Germany R&D and JSC Goal: Co-Design for next-gen of Supercomputers One work-package on energy-efficiency Investigation of power consumption on Blue Gene Fine-grained power measurements on POWER7 Now in transition to PADC - POWER Acceleration and Design Center February 2nd, 2016 Michael Knobloch Slide 19

  20. Power Consumption Analysis on POWER7 Amester IBM Automated Measurement of Systems for Temperature and Energy Reporting software. Results were published at EnA-HPC 2013. Sensor name Units Time scale Description PWR1MS W Instantaneous Node power consumption PWR1MSP0 W Instantaneous Processor power consumption PWR1MSMEM0 W Instantaneous Memory power consumption PWR32MS W avg. over last 32 ms Node power consumption PWR32MSP0 W avg. over last 32 ms Pocessor power consumption PWR32MSMEM0 W avg. over last 32 ms Memory power consumption IPS32MS Mips Every 32 ms Instructions per second rate February 2nd, 2016 Michael Knobloch Slide 20

  21. Example: Component Level Power Measurement February 2nd, 2016 Michael Knobloch Slide 21

  22. Example: Counter Resolution Comparison February 2nd, 2016 Michael Knobloch Slide 22

  23. Amester on POWER8 200 150 Power ( W ) 100 50 0 20 40 60 80 100 Time ( s ) Disk IO Memory GPU Fan CPU February 2nd, 2016 Michael Knobloch Slide 23

  24. Score-E – 2013 - 2016 Score-E Main Tools Partners: JSC, TU Dresden, TU Munich Successor of SILC and LMAC Extension of Score-P measurement system (www.score-p.org) Common measurement system for Scalasca, Vampir, and Periscope Power and Energy measurements from different sources, e.g. RAPL, Xeon Phi/GPU power sensors, etc. Energy modelling from power consumption data Enable auto-tuning for energy efficiency New visualization based on application geometries February 2nd, 2016 Michael Knobloch Slide 24

  25. Lessons learned Metric discussion Still on the quest for the right metric Power vs. Energy Might require different analyses User motivation (still) hard Tools need Sensors that provide relevant information Power, energy, temperature, etc. At all relevant system levels Scalable APIs February 2nd, 2016 Michael Knobloch Slide 25

  26. Discussion: Road to Dark Silicon Challenges for tools How reliable is performance data from dynamic applications? How reliable is performance data on dynamic hardware? How to analyse dynamic applications? Requirements on tools ? February 2nd, 2016 Michael Knobloch Slide 26

Recommend


More recommend