Daren Hasenkamp*, Alex Sim, Michael Wehner, Kesheng Wu Lawrence - PowerPoint PPT Presentation

Searching Tropical Storms on Cloud: A Large-Scale Climate Data Analysis Daren Hasenkamp*, Alex Sim, Michael Wehner, Kesheng Wu Lawrence Berkeley National Laboratory *University of California, Berkeley

Why Study Tropical Storms? Tropical storms are among the Climate change could increase most deadly natural the frequency of severe tropical phenomenon storms [Hurricane Katrina track] [Weather fatalities from weather.gov] 2 Wu - ISGC 2011

Predicting Tropical Storm Statistics q Motivations: v Validate climate models by verifying the tropical storm statistics v Predict future tropical storm statistics q Approach: v Simulate climate, gather statistics from simulation data v Compute statistics of tropical storms, not any individual storm q Case study: fvCAM (finite volume version of the Community Atmospheric Model) dataset (version 2.2) v 15 simulated years with 6 hour output v Mesh point resolution of 0.5 degree latitude by 0.625 degree longitude v Roughly 500 GB, 1000 netCDF files v Scientists will run this simulation for 100 simulated years with many different initial conditions, generating many terabytes of raw data 3 Wu - ISGC 2011

TSTORM code q TSTORM code used to track tropical storms v Based on the criteria established by Knutson, et al. from Geophysical Fluid Dynamical Library (GFDL), 2007 BAMS 88:10 1549-65 q Searches for high vorticity, local pressure drop, and warm core v A local relative vorticity maximum at 850 hPa exceeds 1.6*10 -4 s -1 . Vorticity is the curl of wind velocity, and s is time in seconds. v The surface pressure increases by at least 4 hPa from the storm center within a radius of 5 degrees. The closest local minimum in sea level pressure, within a distance of 2 degrees latitude or longitude from the vorticity maximum, is defined as the center of the storm. v The distance of the warm-core center from the storm center does not exceed 2 degrees. The temperature decreases by at least 0.8 degrees Celsius in all directions from the warm-core center within a distance of 5 degrees. The closest local maximum in temperature averaged between 300 and 500 hPa is defined as the center of the warm core. 4 Wu - ISGC 2011

Tropical Storm Tracks Sep, 1979 q Produced by TSTORMS using virtual machines on Sep, 1993 cloud computing facility 5 Wu - ISGC 2011

TSTORMS Code and Parallelization q TSTORMS v A single thread sequential program v Running on a single processor v Analysis of 500GB of simulation output can take several days v Need to analyze many petabytes, but can not wait for decades q Parallelization is needed v Running multiple TSTORMS processes, one for each time step q Challenges in traditional parallel processing v Need to rewrite the code with MPI v Port dependent software libraries and run-time systems q Cloud computing as an alternative v Using virtual machines to package existing analysis code, libraries and run-time systems, no need to rewrite code v Portable to many computing hardware 6 Wu - ISGC 2011

Three Different Approaches q Virtual machine on cloud computing v Eucalyptus VM submission q Virtual machine on grid computing v Pre-loaded VMware image q MPI parallel processing on cluster computing v Needed code re-write for MPI and local compilation 7 Wu - ISGC 2011

Virtual Machine Coordination q Difficulties in controlling virtual machines instance v Hard to control exactly how many virtual machines instances are launched. For example, a user requesting 40 instances might only receive 36. Not all cloud clusters share this property, but it was our experience during the tests. v Virtual machine instances launch at varying times: If a user makes a request for 20 VM instances, the first instance might start a half hour before the final. q MPI-based process coordination for data-driven parallelism comes easier. q Mechanisms investigated for VM coordination v Coordination through leader election v Coordination through external service 8 Wu - ISGC 2011

Coordination using Distributed Leader Election q Elect one VM instance as a leader at launch time v Track job status and coordinate VM instances v Maintain a synchronized queue of URLs to input files used by all VM instances q Advantage: v The job is self-contained v A user can launch many instances, and does not have to perform any further tasks, such as setting up a remote service q Disadvantage: v Static input URLs v All VMs must be able to talk to each other to elect a leader v Leader can be a single-point of failure 9 Wu - ISGC 2011

Analysis with virtual machines on cloud computing Client VM Result display Submission Climate Data Analysis Result Repository Data Repository Leader VM …. VM VM VM Workers NERSC LBNL ESG Gateway/DataNode Magellan Cloud Facility at ALCF/ANL & NERSC/LBNL

Coordination through a Remote Service q External analysis coordination service v Service maintains a synchronized queue of URLs to input files from which all other VM instances pull one URL at a time. v Advantage: Ø Easy setup Ø Dynamic coordination for multiple source repositories v Disadvantage: Ø Dependency on the remove service 11 Wu - ISGC 2011

Analysis with Virtual Machines on cloud computing Analysis Client Coordination Service VM (Synchronized Submission Queue of URLs) Result display Analysis Result Climate Data VM instances Data Repository Repository VM NERSC ESG Gateway/DataNode …. VM Climate Data NCAR Repository LBNL Climate Data LLNL Repository Magellan Cloud Facility at ALCF/ANL & NERSC/LBNL Climate Data ORNL Repository

Analysis with Virtual Machines on Grid computing Analysis Client Coordination Service VM Job (Synchronized Submission Queue of URLs) Result display Analysis Result Climate Data Pre-loaded VM instances Data Repository Repository VM NERSC ESG Gateway/DataNode …. VM Climate Data NCAR Repository LBNL Climate Data LLNL Repository Grid Laboratory of Wisconsin (GLOW) Univ. of Wisconsin Climate Data ORNL Open Science Grid (OSG) Repository

Analysis with MPI parallel processing on Clusters NERSC Client Job MPI Job Scheduler Submission Result display

Test setup q Magellan cloud and Carver cluster v Each node on each system contains dual quad-core Intel Nehalem 2.66GHz processors and 24GB RAM q GLOW v GLOW nodes we used utilized Xeon 2.66GHz and 3.2GHz processors, and had enough RAM for TSTORMS to execute without using virtual memory v Our VM on GLOW had compute resources comparable to, though not exactly the same as, instances on Magellan and processes on Carver. q Source data on GPFS at NERSC v Runs on Carver had somewhat of a speed advantage over VMs since data could be accessed through a local file system rather than needing to be sent across a network. v Disadvantage from virtualization overhead on VMs compared to Carver MPI processes. 15 Wu - ISGC 2011

Results (1) q Performance from VM-based analysis comparable to MPI- based analysis q In one test, Magellan VM-based analysis actually performed better than Carver MPI-based analysis v Analyzing our 500GB repository on Carver using 8 processes took 3 hours longer than on Magellan using 8 virtual machine instances (~12.5 vs. ~9.5 hours) q Using 30 VMs, analysis of the 500GB dataset in ~4.5 hours v Using a workstation with similar computational power, it can take several days; roughly 100 hours q Analysis in ~2 hours using 90 instances on GLOW v Conveniently short amount of time for a scientist to wait for analysis output, and it is comparable to analysis speed on Carver 16 Wu - ISGC 2011

Results (2) q Total analysis time as a function of number of instance or number of processes v On Carver, 2 * (the amount of processes)  ½ (total analysis time) v Using VMs on a cloud, this holds only approximately Ø Expected that VM instances can have different starting times, whereas processes in MPI start almost at the same time Ø Effects of shared network • Our VM runs somewhat faster late at night and on weekends, when there is less traffic on network resources. • The anomalous 8-instance test on Magellan was started on a Friday night, and competition for both network bandwidth and cloud nodes would have been relatively low. 17 Wu - ISGC 2011

Time v. Number of Processes

Conclusion q Test analysis took 5-7 days on a workstation to ~3 hours on 32 VMs on Cloud q Analysis performance on cloud computing is comparable to analysis performance on MPI-based batch computing v MPI jobs are more predictable in performance v Variability on Cloud jobs is larger Ø Successful number of VM initialization varies Ø Network performance for remote data access Ø Storage capacity and performance q Parallel virtualization v A viable paradigm for large-scale data analysis v Offers an attractive environment Ø analysis programs can be configured once and run anywhere with configurable, and potentially massive, levels of parallelism and efficiency, comparable to a traditional batch-based computing system 19 Wu - ISGC 2011

Daren Hasenkamp*, Alex Sim, Michael Wehner, Kesheng Wu Lawrence - PowerPoint PPT Presentation

Searching Tropical Storms on Cloud: A Large-Scale Climate Data Analysis Daren Hasenkamp, Alex Sim, Michael Wehner, Kesheng Wu Lawrence Berkeley National Laboratory University of California, Berkeley Why Study Tropical Storms? Tropical

Feature Selection and Clustering Jiwoo Bang, Chungyong Kim, Kesheng Wu, Alexander Sim, Suren

SIM Card Hacking What is it? Social engineering attack that tricks carrier reps to swap sim

Standard Implementation Methodology (SIM) High Level Overview of Enhancements for SIM v3.0 Helen

What is a dating sim? What is a dating sim? Cast of cute girls Tokimeki Memorial (1994) What

On Testing Superstring Theories with Gravitational Waves Jasper Hasenkamp (Hamburg U.) at 19th

SIM Position on the Future of the CIPM-MRA October, 2015 Svres, France SIM Vision:

Roo2ng SIM cards The SRLabs Team SRLabs Template v12 SIM

Making the Lanczos method work for electronic structure calculations Kesheng Wu Andrew Canning

Alex Zolciak Alex Zolciak 2D Alex Zolciak 3D Alex Zolciak Other https://vimeo.com/122535005

Update On Graduate Medical Education (GME) General Faculty Meeting January 28, 2014 Dr.

Teaming of Conventional Submarines and XLUUV W. H. Wehner 1 , Dr. C. Fruehling 2 , Dr. B. Lehmann 3

Quantum to Classical Randomness Extractors Mario Berta, Omar Fawzi, Stephanie Wehner - Full

Submarine platform automation enabler of an optimized crew concept H. Wehner 1 , Dr. M. Mohr 2

High-Performance Outlier Detection Algorithm for Finding Blob-Filaments in Plasma Lingfei Wu 1 ,

Flood Protection Overview Holding Back the Ohio Daren Thompson Flood Protection Manager

FILM MEASUREMENTS Timo Ikonen 1 , Antti Kulmala 2 , Christopher Boylan 1 , Viljo Petj 1 , Daren

Network OS OpenFlow Network OS: distributed system that creates a consistent, up-to-date network

Why splitting your focus could be good Alex Peattie CTO & co-founder, Peg alex@peg.co Mon

Structured Prediction Models via the Matrix-Tree Theorem Terry Koo maestro@csail.mit.edu Amir

Mean Field Limits for Ginzburg-Landau Vortices Sylvia Serfaty Universit P. et M. Curie Paris 6,

A Meta-Language for Hardware Testbench Michael Katelman and Jos e Meseguer University of

Lead Capture How a healthcare company increased demand for services 300% Jon Ciampi Daniel

The building X v on which G ( k v ) and G ( k v ) act, when v splits in . K :=

European Product Standards - update on status and changes with relevance to CCPs - Hans-Joachim

Daren Hasenkamp*, Alex Sim, Michael Wehner, Kesheng Wu Lawrence - PowerPoint PPT Presentation

Searching Tropical Storms on Cloud: A Large-Scale Climate Data Analysis Daren Hasenkamp*, Alex Sim, Michael Wehner, Kesheng Wu Lawrence Berkeley National Laboratory *University of California, Berkeley Why Study Tropical Storms? Tropical

Feature Selection and Clustering Jiwoo Bang, Chungyong Kim, Kesheng Wu, Alexander Sim, Suren

SIM Card Hacking What is it? Social engineering attack that tricks carrier reps to swap sim

Standard Implementation Methodology (SIM) High Level Overview of Enhancements for SIM v3.0 Helen

What is a dating sim? What is a dating sim? Cast of cute girls Tokimeki Memorial (1994) What

On Testing Superstring Theories with Gravitational Waves Jasper Hasenkamp (Hamburg U.) at 19th

SIM Position on the Future of the CIPM-MRA October, 2015 Svres, France SIM Vision:

Roo2ng SIM cards The SRLabs Team SRLabs Template v12 SIM

Making the Lanczos method work for electronic structure calculations Kesheng Wu Andrew Canning

Alex Zolciak Alex Zolciak 2D Alex Zolciak 3D Alex Zolciak Other https://vimeo.com/122535005

Update On Graduate Medical Education (GME) General Faculty Meeting January 28, 2014 Dr.

Teaming of Conventional Submarines and XLUUV W. H. Wehner 1 , Dr. C. Fruehling 2 , Dr. B. Lehmann 3

Quantum to Classical Randomness Extractors Mario Berta, Omar Fawzi, Stephanie Wehner - Full

Submarine platform automation enabler of an optimized crew concept H. Wehner 1 , Dr. M. Mohr 2

High-Performance Outlier Detection Algorithm for Finding Blob-Filaments in Plasma Lingfei Wu 1 ,

Flood Protection Overview Holding Back the Ohio Daren Thompson Flood Protection Manager

FILM MEASUREMENTS Timo Ikonen 1 , Antti Kulmala 2 , Christopher Boylan 1 , Viljo Petj 1 , Daren

Network OS OpenFlow Network OS: distributed system that creates a consistent, up-to-date network

Why splitting your focus could be good Alex Peattie CTO &amp; co-founder, Peg alex@peg.co Mon

Structured Prediction Models via the Matrix-Tree Theorem Terry Koo maestro@csail.mit.edu Amir

Mean Field Limits for Ginzburg-Landau Vortices Sylvia Serfaty Universit P. et M. Curie Paris 6,

A Meta-Language for Hardware Testbench Michael Katelman and Jos e Meseguer University of

Lead Capture How a healthcare company increased demand for services 300% Jon Ciampi Daniel

The building X v on which G ( k v ) and G ( k v ) act, when v splits in . K :=

European Product Standards - update on status and changes with relevance to CCPs - Hans-Joachim

Searching Tropical Storms on Cloud: A Large-Scale Climate Data Analysis Daren Hasenkamp, Alex Sim, Michael Wehner, Kesheng Wu Lawrence Berkeley National Laboratory University of California, Berkeley Why Study Tropical Storms? Tropical

Why splitting your focus could be good Alex Peattie CTO & co-founder, Peg alex@peg.co Mon