embedded supercomputing
play

Embedded Supercomputing: (to combat the malaise that is big - PowerPoint PPT Presentation

Embedded Supercomputing: (to combat the malaise that is big sofuware) Radio Astronomy at the Limit Simon Ratclifge & Bruce Merry SKA South Africa JSNTB In the beginning.... more recently = Johannes Hevelius 60fu - 1673 (the Ewan


  1. Embedded Supercomputing: (to combat the malaise that is big sofuware) Radio Astronomy at the Limit Simon Ratclifge & Bruce Merry SKA South Africa JSNTB

  2. In the beginning....

  3. more recently = Johannes Hevelius 60fu - 1673 (the Ewan McTeagle of his day)

  4. everything we know so far http://xkcd.com/273/

  5. getting the full picture = + + = Jansky VLA, New Mexico

  6. Watt? 1 Jy = 10 -26 Wm -2 Hz -1 I cannae do it, captain, ye cannae change the laws of physics

  7. how much wood could a wood chuck, chuck 1.008 J 0.00000016 J 0.00000000000007 J

  8. “Know your enemy and know yourself” 10 5 Jy Sun @ 5 GHz 10 8 Jy GSM Phone @ 1km 0 Jy 'Smart' Phone @ 1m

  9. something a little bigger

  10. photon to image s Astronomical signal (EM wave) B . s Detect & amplify Digitise & delay Correlate X X X X X X Integrate Process Calibrate, grid, FFT SKY Image

  11. Will it blend ? X Andrew Cooper 1000000000000000000 B @ 1 bit per grain of standardised* sand * assumptions apply (10gpmm^3, 3kmx2kmx700m, only valid when calculated on paper napkin, just say no to assumptions)

  12. You sir, are a blaggard and a coward 1 exbibyte - 1 exabyte 38, 230 x

  13. Big Iron

  14. Medium Iron (and a fair bit of aluminium)

  15. In theory MeerKAT 64 V ij = M ij B ij G ij D ij E ij P ij T ij V ij IDEAL MAGIC IO / Cache / FLOPS / kW / $

  16. “My god it's full of data” 68 50 1 hour Gibps PiB @ = observation input data rate bufger

  17. 50 hour totals 1.1 Tibps bufger read rate 1.7 ExaFlop total FP operations 1.9 TB working memory

  18. easy…..

  19. whither Mr. Fusion ? 20 kW available power

  20. rosetta stone (edition 2010)

  21. ...shake your windows, and rattle your walls TK1: 327 GFLOPs / 12.7 GiBps

  22. measurement equation (redux) baseline direction V ( u,v,w ) = ∫ A ( l,m,w ) I ( l,m ) e − 2 π i [ u l + v m ] d l d m input efgects image Fourier transform

  23. convolutional gridding V ( u,v,w ) = ∫ A ( l,m,w ) I ( l,m ) e − 2 π i [ ul+vm ] dldm

  24. Working Model 8 Hours, 64 Antennas, Single Channel, 4k Image

  25. I cannae push it any faster, Captain!

  26. Trade-ofg between gridding and FFT costs

  27. Anatomy of a modern radio telescope

  28. Graphs all the way down until you hit the turtles...

  29. How embarrassing is your parallelism ?

  30. ignore the logo, it's clearly an internal design.

  31. pre cambrian cooling inc. 100 Nodes 50 TFLOPS SoC Carrier SSD 0.4 TB RAM backplane 50 TB SSD 40 Gbps Eth Thermal Compound 0.7 kW Hive How big ? peltier exchange Ground Loop FERRO

  32. deep fried

  33. The build TEGRA X1 TESLA K40 1056 Nodes 50 Servers Tegra X1 2 x Tesla K40 4 GB RAM 2 x E5-2660v3 512 GB SSD 6 x 2TB SATA 64 GB RAM 22 Switches 3 Switches 2 x 10 GbE SFP+ 4 x 40 GbE QSFP 48 x 1 GbE 36 x 10 GbE SFP+ 11 Pods 3 Racks 15M Ground Loop Just a rack 50L Mineral Oil

  34. Super green ? Super green. TEGRA X1 TESLA K40 $350 kilo $1,056 kilo $310k Hardware $816k Hardware $40k Infrastructure $58k Infrastructure 12.4 kW 57.5 kW 11 .9 kW Hardware 44.3 kW Hardware 0.5 kW Cooling 13.2 kW Cooling

Recommend


More recommend