Running a SPRINT job on the HPC Wales platform 1 There are other - PowerPoint PPT Presentation

Running a SPRINT job on � the HPC Wales platform � 1

There are other parallel R packages � Building ¡Block ¡Approaches ¡ • Difficult to program • Bespoke implementation • Biostatistician needs to be a parallel programmer – Rmpi: wrapper around MPI – NWS and Sleigh: implement a shared memory system Task ¡Farm ¡Approaches ¡ • Require substantial changes to existing scripts • Cannot be used to solve some problems – Biopara: Execute R functions remotely via SSH – Parallel ‘ apply ’ commands, runs the same command on every element in a list – SNOW: allows a single expression to be executed on different data segments 2

R parallel package � • Parallel has been a standard part of R since R 2.14 – Built on multicore and snow packages • Apply – As ¡analogues ¡of ¡lapply ¡there ¡are ¡parLapply(cl, ¡x, ¡FUN, ¡...) ¡ – mclapply(X, ¡FUN, ¡..., ¡mc.cores) • Underlying: – Random number generator – Load balancing • Examples of use: – Bootstrapping, MCMC.

So a Need for… � Tool for easy access to HPC: • User friendly • Hide the complexity of accessing and programming HPC • Allow process and analyse large amounts of high throughput post genomic data using R for statistical computing. • Allow repeated, regular use by biostatisticians • Implement functions that are not embarrassingly parallel, e.g. partitioning around medoids, Pearson’s correlation.

What does SPRINT do? � – The rest of the R workflow doesn’t change R code R code compute SPRINT function R code R code 5

Performance - The Goal of SPRINT � Overcome ¡limitations ¡on ¡data ¡size ¡and ¡ analysis ¡time ¡by ¡providing ¡easy ¡access ¡to ¡ High ¡Performance ¡Computing ¡for ¡all ¡R ¡users ¡

SPRINT and Data Size � Overcome limitations on data size and analysis time by providing easy access to High Performance Computing for all R users For example, Pearson ’ s correlation, Input ¡Matrix ¡ ¡ Output ¡Matrix ¡ ¡ Serial ¡ ¡ Parallel ¡ pcor() Size ¡ Size ¡ Run ¡Time ¡ Run ¡Time ¡ • Enables processing of datasets where 11,000 ¡x ¡320 ¡ 0.9 ¡GB ¡ 4.76 ¡secs ¡ 63.18 secs 26.85 ¡MB ¡ the output does not fit in physical memory 22,000 ¡x ¡320 ¡ Insufficient ¡ ¡ 3.6 ¡GB ¡ 13.87 ¡secs ¡ • 53.7 ¡MB ¡ memory ¡ uses R ff package: memory-efficient 35,000 ¡x ¡320 ¡ storage of large data on disk and fast 9.12 ¡GB ¡ Crashed 36.64 ¡secs ¡ 85.44 ¡MB ¡ access functions (also available from 45,000 ¡x ¡320 ¡ 15.08 ¡GB ¡ Crashed 42.18 ¡secs ¡ CRAN). 109.86 ¡MB ¡ • ff objects can be created, stored, used Benchmark on HECToR - UK National Supercomputing Service on 256 cores. S. Petrou et al, dCSE NAG Report, www.r-sprint.org. and removed, almost like standard R RAM objects. • ff objects are perfect for reading the same data from many R processes.

SPRINT and Analysis Time � Overcome limitations on data size and analysis time by providing easy access to High Performance Computing for all R users Serial ¡Run ¡ Input ¡Matrix ¡ # ¡Permutations ¡ Parallel ¡Run ¡ Time ¡ Size ¡ ¡ Time ¡ (estimated) ¡ For ¡example, ¡permutation ¡testing, ¡ 36,612 ¡x ¡76 ¡ 500,000 ¡ 6 ¡hrs ¡ 73.18 ¡secs ¡ pmaxT() ¡ 36,612 ¡x ¡76 ¡ 1,000,000 ¡ 12 ¡hrs ¡ 146.64 ¡secs ¡ • Parallel ¡implementation ¡of ¡mt.maxT 36,612 ¡x ¡76 ¡ 2,000,000 ¡ 23 ¡hrs ¡ 290.22 ¡secs ¡ () ¡from ¡multtest ¡package ¡(available ¡ 73,224 ¡x ¡76 ¡ 500,000 ¡ 10 ¡hrs ¡ 148.46 ¡secs ¡ from ¡CRAN) ¡ 73,224 ¡x ¡76 ¡ 1,000,000 ¡ 20 ¡hrs ¡ 294.61 ¡secs ¡ 73,224 ¡x ¡76 ¡ 2,000,000 ¡ 39 ¡hrs ¡ 591.48 ¡secs ¡ Benchmark ¡on ¡HECToR ¡-‑ ¡UK ¡National ¡Supercomputing ¡Service ¡on ¡256 ¡cores. ¡ S. ¡Petrou ¡et ¡al, ¡HPDC ¡2010 ¡& ¡CCPE, ¡2011. ¡

SPRINT Data Size and Analysis Time � Overcome limitations on data size and analysis time by providing easy access to High Performance Computing for all R users Parallel ¡Run ¡ For ¡example, ¡clustering ¡with ¡partitioning ¡ Input ¡Data ¡ # ¡Clusters ¡ Serial ¡Run ¡Time ¡ Time ¡ ¡ Size ¡ ¡ Pam() ¡ around ¡medoids, ¡ppam() ¡ Ppam() ¡ • 2400 ¡ 12 ¡ 11.3 ¡secs ¡ 1.1 ¡secs ¡ Parallel ¡implementation ¡of ¡pam() ¡ 2400 ¡ 24 ¡ 52.5 ¡secs ¡ 2.2 ¡secs ¡ from ¡cluster ¡package ¡(available ¡from ¡ 4800 ¡ 12 ¡ 83.3 ¡secs ¡ 4.4 ¡secs ¡ CRAN) ¡ 4800 ¡ 24 ¡ 434.7 ¡secs ¡ 15.9 ¡secs ¡ • Optimisation ¡of ¡serial ¡version ¡ 10 ¡000 ¡ 12 ¡ 17 ¡mins ¡ 22.3 ¡secs ¡ through ¡memory ¡and ¡data ¡storage ¡ 10 ¡000 ¡ 24 ¡ 99 ¡mins ¡ 77.1 ¡secs ¡ Insufficient ¡ ¡ management ¡ 22 ¡374 ¡ 24 ¡ 270.5 ¡secs ¡ memory ¡ • Increased ¡capacity ¡by ¡using ¡external ¡ Benchmark on a shared memory cluster with 8 dual-core 2.6GHz AMD Opteron processors with 2GB of RAM per core . memory ¡(i.e. ¡ff ¡objects) ¡ M. ¡Piotrowski ¡et ¡al, ¡BILIS ¡2011. ¡

How you can use SPRINT � – Install ¡SPRINT ¡ – Modify ¡R ¡script ¡ – Execute ¡script ¡in ¡parallel ¡ ¡ – Execute ¡script ¡on ¡a ¡supercomputer ¡ ¡ ¡ 10

Example � library("sprint”) � my.matrix <- matrix(rnorm(500000,9,1.7), nrow=20000, ncol=25) � genecor <- cor( t(my.matrix) ) � � quit(save="no”) � 1 Dec 2011 11

Example � library("sprint”) � my.matrix <- matrix(rnorm(500000,9,1.7), nrow=20000, ncol=25) � genecor <- p cor( t(my.matrix) ) � pterminate() � quit(save="no”) � 1 Dec 2011 12

Run using MPI � sprint_script.R library("sprint”) � my.matrix <- matrix(rnorm(500000,9,1.7), nrow=20000, ncol=25) � genecor <- pcor( t(my.matrix) ) � pterminate() � quit(save="no”) � $ mpiexec -n 4 R -f sprint_script.R R -f sprint_script.R R -f sprint_script.R R -f sprint_script.R R -f sprint_script.R 13

Running on HPC Wales � 2 files: – R script calling SPRINT functions (sprint_script.R). – Job submission script (sub.q). – A request for time and processors on the supercomputer. – The commands needed to execute your script 14

Sub.q #!/bin/bash --login � # ! Edit number of processors to fit your job � #BSUB -n 8 � # ! Redirect stdout to the file filename � #BSUB -o sprint_test.o.%J � # ! Redirect sterr to the file filename � #BSUB -e sprint_test.e.%J � # ! Edit the job name to identify separate job � #BSUB -J sprint_test � # ! Edit time to fit your job � #BSUB -W 0:10 � � module purge � module load SPRINT � � mpirun -n 8 R --no-save --quiet -f sprint_test.R � � 15

Submit the job � Log in to hpc wales $ ¡ssh ¡username@login.hpcwales.co.uk ¡ Log in to one of the compute clusters ssh ¡ab-‑log-‑001 ¡ ¡ Create the 2 files described previously - sprint_script.R and sub.q Submit the job $ ¡bsub ¡ ¡< ¡ ¡sub.q ¡ ¡ The ¡job ¡will ¡then ¡join ¡a ¡queue ¡and ¡be ¡run ¡when ¡resources ¡become ¡available. ¡ 16

Check the results � To see if your job is waiting in the queue, running or finished, run: $ ¡qstat ¡-‑u ¡$USER ¡ ¡ If this returns ‘No matching job found’ then your job is finished and the output of running the code will be in a {filename}. o{job_number} file. more ¡sprint_test.o{job_number} ¡ ¡ Any error messages will be in a {filename}. e{job_number} file. ¡ more ¡sprint_test.e{job_number} ¡ 17

Running a SPRINT job on the HPC Wales platform 1 There are other - PowerPoint PPT Presentation

Running a SPRINT job on the HPC Wales platform 1 There are other parallel R packages Building Block Approaches Difficult to program Bespoke implementation Biostatistician needs to be a parallel programmer

Installing SPRINT Supercomputers SPRINT is already installed on HPC Wales and

HPC @ SAO S.G. Korzennik - SAO HPC Analyst hpc@cfa February 2013 SGK ( hpc@cfa ) HPC @ SAO

Blood Pressure Measurement in SPRINT Karen C. Johnson, MD, MPH, FAHA Vice Chair, SPRINT Steering

Uni.lu HPC School 2020 PS6: HPC Containers: Singularity Uni.lu High Performance Computing (HPC)

Red- -Light Running Light Running Red Red-Light Running 2 Traffic Signals Traffic Signals

Red- -Light Running Light Running Red Red-Light Running 2 Traffic Signals Traffic Signals

Child Welfare Digital Services Sprint Review Presentation Sprint Review of Iteration 4.2

How does the sprint planning look like? Working arrangements, guidelines & support Why an

UL HPC School 2017[bis] PS1: Getting Started on the UL HPC platform UL High Performance

UL HPC School 2017 PS1: Getting Started on the UL HPC platform UL High Performance Computing

The HPC Skill Tree A Brief Overview Kai Himstedt On Behalf of the HPC-CF Board BoF:

Points of Pride: What we have accomplished so far! Created Job Framework 24 Job Groups/Job

Google what can possibly go wrong with my Sprint?

Whats new in HPC? Gregory Bauer To keep up-to-date on HPC HPC Guru -

UL HPC School 2017 PS5: Advanced Scheduling with SLURM and OAR on UL HPC clusters UL High

Facilitating HPC job debugging through job scripts archival Andy Georges 2 February 2020 FOSDEM

1 Specifying Grammar with JavaCUP Abstract Syntax Tree for Memory Layout Example JavaCUP example

Machine-Level Programming II: Control CS140 Computer Organization and Assembly Slides Courtesy

ASP4100 Introduction to honours computing Introduction to modern Fortran (incl. Makefiles) 1

Code Generators for Mathematical Functions N. Brunie 1 , F. de Dinechin 2 , O. Kupriianova 3 , Ch.

Machine-Level Programming II: Control CSE 238/2038/2138: Systems Programming Instructor: Fatma

Examining moderated effects of additional adolescent substance use treatment: Structural nested

SBIRT in Family Planning Settings Moderator: Heather Gotham PhD Panelists: Clare Wheeler Laura

Overview Five findings from the Seattle Social Development Project (SSDP) and the

Sambuz

Useful Links

Newsletter

Mail Us

Running a SPRINT job on the HPC Wales platform 1 There are other - PowerPoint PPT Presentation

Running a SPRINT job on the HPC Wales platform 1 There are other parallel R packages Building Block Approaches Difficult to program Bespoke implementation Biostatistician needs to be a parallel programmer

Installing SPRINT Supercomputers SPRINT is already installed on HPC Wales and

HPC @ SAO S.G. Korzennik - SAO HPC Analyst hpc@cfa February 2013 SGK ( hpc@cfa ) HPC @ SAO

Blood Pressure Measurement in SPRINT Karen C. Johnson, MD, MPH, FAHA Vice Chair, SPRINT Steering

Uni.lu HPC School 2020 PS6: HPC Containers: Singularity Uni.lu High Performance Computing (HPC)

Red- -Light Running Light Running Red Red-Light Running 2 Traffic Signals Traffic Signals

Red- -Light Running Light Running Red Red-Light Running 2 Traffic Signals Traffic Signals

Child Welfare Digital Services Sprint Review Presentation Sprint Review of Iteration 4.2

How does the sprint planning look like? Working arrangements, guidelines &amp; support Why an

UL HPC School 2017[bis] PS1: Getting Started on the UL HPC platform UL High Performance

UL HPC School 2017 PS1: Getting Started on the UL HPC platform UL High Performance Computing

The HPC Skill Tree A Brief Overview Kai Himstedt On Behalf of the HPC-CF Board BoF:

Points of Pride: What we have accomplished so far! Created Job Framework 24 Job Groups/Job

Google what can possibly go wrong with my Sprint?

Whats new in HPC? Gregory Bauer To keep up-to-date on HPC HPC Guru -

UL HPC School 2017 PS5: Advanced Scheduling with SLURM and OAR on UL HPC clusters UL High

Facilitating HPC job debugging through job scripts archival Andy Georges 2 February 2020 FOSDEM

1 Specifying Grammar with JavaCUP Abstract Syntax Tree for Memory Layout Example JavaCUP example

Machine-Level Programming II: Control CS140 Computer Organization and Assembly Slides Courtesy

ASP4100 Introduction to honours computing Introduction to modern Fortran (incl. Makefiles) 1

Code Generators for Mathematical Functions N. Brunie 1 , F. de Dinechin 2 , O. Kupriianova 3 , Ch.

Machine-Level Programming II: Control CSE 238/2038/2138: Systems Programming Instructor: Fatma

Examining moderated effects of additional adolescent substance use treatment: Structural nested

SBIRT in Family Planning Settings Moderator: Heather Gotham PhD Panelists: Clare Wheeler Laura

Overview Five findings from the Seattle Social Development Project (SSDP) and the

Sambuz

Useful Links

Newsletter

Mail Us

How does the sprint planning look like? Working arrangements, guidelines & support Why an