vishal gupta georgia tech ripal nathuji microsoft research
play

Vishal Gupta* (Georgia Tech) Ripal Nathuji (Microsoft Research) * - PowerPoint PPT Presentation

Vishal Gupta* (Georgia Tech) Ripal Nathuji (Microsoft Research) * Work done during summer internship at Microsoft Research Different types of CPU cores CPU Cores P P P P P P P Symmetric Asymmetric multicore processor multicore


  1. Vishal Gupta* (Georgia Tech) Ripal Nathuji (Microsoft Research) * Work done during summer internship at Microsoft Research

  2. Different types of CPU cores CPU Cores P P P P P P P Symmetric Asymmetric multicore processor multicore processor SMP AMP

  3. Application P P SMP B P P A C P AMP B P P SMP Speedup! AMP T 2T 3T time

  4. • How good are AMPs as compared to SMPs? • Can datacenter applications save power using AMPs?

  5. Server λ datacenter (throughput) Others … S S S S Processor … S S S S . . . . . . . . . . . . … P P P S S S S P P P P Datacenter SMP AMP

  6. AMP SMP P < P ? datacenter datacenter • Constant work • Meet latency SLA

  7. Sequential • Energy Scaling execution Parallel • Parallel Speedup … execution

  8. Area equivalent Sequential application P P P SMP AMP

  9. t small t large P T AMP AMP P Slack T SMP SMP P time T SLA Smaller core = lesser power

  10. P P P P P P P P Parallel application P P P P P P P P P P P P P P P … P P P P P P SMP AMP

  11. … Sequential P P P P Phase P P P P P P P P P P P P P P P P P P P Small cores: Run on P P P P P P Bottleneck the fast core AMP SMP … Speedup = Higher throughput

  12. Latency SLA Server Arrival Rate λ Request Service Rate Queue µ M/M/1 Queuing Model 1 Avg. E [ T ] = Response Time µ − λ

  13. Parallel Speedup (PS) (refer to paper for ES) P P P P P P P P Parallel application P P P P P P P P P P P P P P P … P P P P P P SMP AMP Amdahl’s Law for Multicores

  14. Area = r r = Area(Big/Core) Area = 1 P P Perf = perf(r) P P P P P P P P P P P P P P P n = Chip area P P P P P P P P P P P P P P P P P P SMP SMP AMP n=16, r=1 n=16, r=4 n=16, r=4 f = fraction of computation that can be parallelized

  15. 1 µ SMP ( f , n , r ) = 1 − f f perf ( r ) + n r * perf ( r ) 1 µ AMP ( f , n , r ) = 1 − f f perf ( r ) + n − r Ref: Hill and Marty, Amdahl's law in the multicore era (IEEE Computer’08)

  16. peak = µ − 1 λ server T SLA Datacenter capacity = No. of servers * Server throughput SMP * λ server SMP λ datacenter = N server Constant Work AMP * λ server AMP λ datacenter = N server

  17. Datacenter power (P) = No. of servers * Server power SMP * P SMP SMP P = N server datacenter server AMP * P AMP AMP P = N server datacenter server

  18. Peak Power Server Power Consumption P(U) Idle Power CPU Utilization (U) Ref: The Case for Energy-Proportional Computing, Barroso & Hölzle, IEEE Computer 2007

  19. Server load distribution (W load ) Fraction of time CPU Utilization (U) ∑ P W load ( U )* P server ( U ) server =

  20. AMP SMP P < P ? datacenter datacenter

  21. Upto 52% power savings n = 64 60% Power savings of AMP 50% over SMP 40% r=32 30% r=16 20% r=8 10% r=4 0% 0 0.2 0.4 0.6 0.8 1 Fraction of work that can be parallelized (f)

  22. Upto 14% power savings 20% Power savings of AMP over 15% 10% 5% Application A Small core bias SMP 0% Application B 5% 10% 15% 20% 25% 30% 35% 40% 45% Uniform bias -5% Application C Large core bias -10% -15% -20% -25% Fraction of area sacrificed for small core

  23. • PS looks more promising that ES • Can we achieve these savings in reality?

  24. High (but not too high!) f High r 60% Power savings of AMP 50% (realistic r = 3) 40% over SMP r=32 30% r=16 20% r=8 10% r=4 0% 0 0.5 1 Fraction of work that can be parallelized (f)

  25. • Scalability : Amdahl’s law assumes unbounded scalability • Migration overhead : zero migration overhead • Perfect scheduling : oracle scheduler Actual savings are going to be lower

  26. • Potential for power savings in datacenters using AMPs • Parallel Speedup more promising than Energy Scaling • Practical considerations to realize full benefits Future work: Extend our analysis to functional asymmetry

Recommend


More recommend