

  1. Large-Scale, Low-Cost Parallel Computers Applied to Reflector Antenna Analysis
     Daniel S. Katz, Tom Cwik
     {Daniel.S.Katz, cwik}@jpl.nasa.gov

  2. Physical Optics Application
     • DSN antenna: 34 meter main reflector
     • MIRO antenna: 30 cm main reflector

  3. Physical Optics Algorithm
     1. Create mesh with N triangles on sub-reflector
     2. Compute N currents on sub-reflector due to feed horn (or read currents from file)
     3. Create mesh with M triangles on main reflector
     4. Compute M currents on main reflector due to currents on sub-reflector
     5. Compute antenna pattern due to currents on main reflector (or write currents to file)
     [Figure: feed horn illuminating the sub-reflector (faceted into N triangles) and the main reflector (faceted into M triangles)]
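     Background note (standard physical optics, not stated on the slide): steps 2 and 4 approximate the induced surface current on each illuminated triangle from the incident magnetic field,

         \vec{J}_s \approx 2\,\hat{n} \times \vec{H}_{\mathrm{inc}},

     where \hat{n} is the triangle's outward normal and \vec{H}_{\mathrm{inc}} is the field incident from the feed horn or the previous reflector; shadowed triangles carry zero current.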

  4. Microwave Instrument for the Rosetta Orbiter (MIRO)

  5. PO Analysis of MIRO

     190 GHz:
     Element           # triangles   Analysis time
     matching mirror   1,600         17 seconds
     turning mirror    1,600         57 seconds
     sub-reflector     6,400         1100 seconds
     main reflector    40,000        -

     564 GHz:
     Element           # triangles   Analysis time
     matching mirror   6,400         193 seconds
     polarizer         6,400         193 seconds
     turning mirror    6,400         445 seconds
     sub-reflector     22,500        5940 seconds
     main reflector    90,000        -

  6. Previous MIRO Analysis
     • Cray J90 timings:
       » 190 GHz: complete run (3 mirror pairs): 20 minutes
       » 564 GHz: complete run (4 mirror pairs): 120 minutes
     • A turnaround time of 2 hours is too long for effective design work.
     • Use parallel computing to decrease the time to obtain results.

  7. Beowulf System at JPL (Hyglac)
     • 16 Pentium Pro PCs, each with a 2.5 Gbyte disk, 128 Mbyte memory, and a Fast Ethernet card.
     • Connected by a 100Base-T network through a 16-way crossbar switch.
     • Theoretical peak: 3.2 GFLOP/s
     • Sustained: 1.26 GFLOP/s

  8. Hyglac Cost
     • Hardware cost: $54,200 (as built, 9/96); $22,000 (estimate, 4/98)
       » 16 × (CPU, disk, memory, cables)
       » 1 × (16-way switch, monitor, keyboard, mouse)
     • Software cost: $600 (+ maintenance)
       » Absoft Fortran compilers (should be $900)
       » NAG F90 compiler ($600)
       » public-domain OS, compilers, tools, libraries

  9. Beowulf System at Caltech (Naegling)
     • ~120 Pentium Pro PCs, each with a 3 Gbyte disk, 128 Mbyte memory, and a Fast Ethernet card.
     • Connected by a 100Base-T network through two 80-way switches joined by a 4 Gbit/s link.
     • Theoretical peak: ~24 GFLOP/s
     • Sustained: 10.9 GFLOP/s

  10. Naegling Cost
      • Hardware cost: $190,000 (as built, 9/97); $154,000 (estimate, 4/98)
        » 120 × (CPU, disk, memory, cables)
        » 1 × (switch, front-end CPU, monitor, keyboard, mouse)
      • Software cost: $0 (+ maintenance)
        » Absoft Fortran compilers (should be $900)
        » public-domain OS, compilers, tools, libraries

  11. Performance Comparisons

                                          Hyglac   Naegling   T3D   T3E-600
      CPU Speed (MHz)                     200      200        150   300
      Peak Rate (MFLOP/s)                 200      200        300   600
      Memory (Mbyte)                      128      128        64    128
      Communication Latency (µs)          150      322        35    18
      Communication Throughput (Mbit/s)   66       78         225   1200

      (Communication results are for MPI code)

  12. Message-Passing Methodology (see the sketch below)
      • Receiver issues (non-blocking) receive calls:
        CALL MPI_IRECV(…)
      • Sender issues (blocking, synchronous) send calls:
        CALL MPI_SSEND(…)
      • Receiver issues (blocking) wait calls (to wait for receives to complete):
        CALL MPI_WAIT(…)
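      A minimal sketch of this receive-before-send pattern, assuming a two-process exchange of 100 double-precision values (illustrative program and variable names; this is not code from the original slides):

C     Run with at least 2 MPI ranks.  Rank 0 posts the receive first,
C     so rank 1's synchronous send can complete without buffering.
      PROGRAM EXCHANGE
      IMPLICIT NONE
      INCLUDE 'mpif.h'
      INTEGER IERR, RANK, REQ, STAT(MPI_STATUS_SIZE)
      DOUBLE PRECISION BUF(100)
      CALL MPI_INIT(IERR)
      CALL MPI_COMM_RANK(MPI_COMM_WORLD, RANK, IERR)
      IF (RANK .EQ. 0) THEN
C        Non-blocking receive posted before the matching send arrives.
         CALL MPI_IRECV(BUF, 100, MPI_DOUBLE_PRECISION, 1, 99,
     &                  MPI_COMM_WORLD, REQ, IERR)
C        ... other work can overlap the communication here ...
C        Blocking wait for the receive to complete.
         CALL MPI_WAIT(REQ, STAT, IERR)
      ELSE IF (RANK .EQ. 1) THEN
C        Illustrative payload.
         BUF(1) = 1.0D0
C        Synchronous send: returns only after the receive has started.
         CALL MPI_SSEND(BUF, 100, MPI_DOUBLE_PRECISION, 0, 99,
     &                  MPI_COMM_WORLD, IERR)
      END IF
      CALL MPI_FINALIZE(IERR)
      END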

  13. Parallelization of PO Algorithm
      • Distribute the (M) main reflector currents over all (P) processors
      • Store all (N) sub-reflector currents redundantly on all (P) processors
      • Creation of triangles is sequential, but computation of geometry information on triangles is parallel, so steps 1 and 3 are partially parallel
      • Computation of currents (steps 2, 4, and 5) is parallel, though communication is required in step 2 (MPI_Allgatherv) and step 5 (MPI_Reduce); a sketch of this distribution follows the slide
      • Timing:
        » Part I: Read input files, perform step 3
        » Part II: Perform steps 1, 2, and 4
        » Part III: Perform step 5 and write output files
      • Algorithm:
        1. Create mesh with N triangles on sub-reflector
        2. Compute N currents on sub-reflector due to feed horn (or read currents from file)
        3. Create mesh with M triangles on main reflector
        4. Compute M currents on main reflector due to currents on sub-reflector
        5. Compute antenna pattern due to currents on main reflector (or write currents to file)
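      A sketch of the step-2 communication pattern described above: each processor computes a contiguous block of the N sub-reflector currents, then MPI_ALLGATHERV replicates the full set redundantly on every processor. Array names, the value of N, and the P <= 128 bound are assumptions for the example; this is illustrative, not the authors' code.

C     Block-distribute work on N currents, then gather redundantly.
      PROGRAM GATHER
      IMPLICIT NONE
      INCLUDE 'mpif.h'
      INTEGER N
      PARAMETER (N = 4900)
      INTEGER P, RANK, IERR, I, NLOC
      INTEGER COUNTS(0:127), DISPLS(0:127)
      DOUBLE COMPLEX CURLOC(N), CURALL(N)
      CALL MPI_INIT(IERR)
      CALL MPI_COMM_SIZE(MPI_COMM_WORLD, P, IERR)
      CALL MPI_COMM_RANK(MPI_COMM_WORLD, RANK, IERR)
C     Block decomposition: the first MOD(N,P) ranks get one extra element.
      DO I = 0, P-1
         COUNTS(I) = N/P
         IF (I .LT. MOD(N,P)) COUNTS(I) = COUNTS(I) + 1
      END DO
      DISPLS(0) = 0
      DO I = 1, P-1
         DISPLS(I) = DISPLS(I-1) + COUNTS(I-1)
      END DO
      NLOC = COUNTS(RANK)
C     ... compute this rank's block of currents into CURLOC(1:NLOC) ...
      DO I = 1, NLOC
         CURLOC(I) = DCMPLX(0.0D0, 0.0D0)
      END DO
C     After this call, every rank holds all N currents in CURALL.
      CALL MPI_ALLGATHERV(CURLOC, NLOC, MPI_DOUBLE_COMPLEX,
     &     CURALL, COUNTS, DISPLS, MPI_DOUBLE_COMPLEX,
     &     MPI_COMM_WORLD, IERR)
      CALL MPI_FINALIZE(IERR)
      END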

  14. Physical Optics Results (Two Beowulf Compilers)

      Processors   Part I    Part II   Part III   Total
      1            0.0850    64.3      1.64       66.0
      4            0.0515    16.2      0.431      16.7
      16           0.0437    4.18      0.110      4.33
      Time (minutes) on Hyglac, using GNU (g77 -O2 -fno-automatic)

      Processors   Part I    Part II   Part III   Total
      1            0.0482    46.4      0.932      47.4
      4            0.0303    11.6      0.237      11.9
      16           0.0308    2.93      0.0652     3.03
      Time (minutes) on Hyglac, using Absoft (f77 -O -s)

      M = 40,000, N = 4,900
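      A worked scaling check (derived from the tables above, not stated on the slide): with the Absoft build, the 16-processor speedup is 47.4 / 3.03 ≈ 15.6, roughly 98% parallel efficiency; the GNU build gives 66.0 / 4.33 ≈ 15.2, about 95%.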

  15. Physical Optics Results (T3D Optimization)

      Change the main integral calculation from:

      CEJKR = (AJ*AK + 1./R)*CDEXP(-AJ*AKR)/R2

      to the equivalent form with explicit real sine/cosine calls (avoiding the complex exponential):

      CEJKR = DCMPLX(
     .        (R*AK*DSIN(AKR)+DCOS(AKR))/(R*R2),
     .        (R*AK*DCOS(AKR)+DSIN(AKR))/(R*R2))

      Processors   Part II (no opt.)   Part II (w/ opt.)   Part III (no opt.)   Part III (w/ opt.)
      1            85.8                48.7                1.90                 0.941
      4            19.8                12.2                0.354                0.240
      16           4.99                3.09                0.105                0.0749
      Time (minutes) on T3D, M = 40,000, N = 4,900

  16. Physical Optics Results

      Processors   Naegling   T3D    T3E-600
      4            95.5       102    35.1
      16           24.8       26.4   8.84
      64           7.02       7.57   2.30
      Time (minutes), M = 160,000, N = 10,000

      • Cray J90 time: about 2 hours

  17. Expected New Analysis Times for MIRO
      • Using Beowulf-class computers
        » Can run 190 GHz case (3 mirror pairs):
          - 16 processors: about 1 minute
          - 64 processors: less than 20 seconds
        » Can run 564 GHz case (4 mirror pairs):
          - 16 processors: about 25 minutes
          - 64 processors: about 7 minutes

  18. Conclusions
      • Beowulf-class computers can fit individual projects, such as MIRO, quite well
      • They can enable a project with a limited budget to reduce the time required to obtain results
      • Reflector antenna analysis using physical optics is well suited to these computers
