Supercomputing Notes Focusing on Science and GPUs
A. Norman

Jan 27, 2024


GPU Impressions
• Common theme from all major GPU players' booths (Nvidia, AMD, Intel)
  – “Our specialized <language, libs, API> is what you should use”
  – “But if you don’t, you should use OpenMP; you’ll take a 10-20% performance hit on most standard code relative to hand-optimized algorithms”
  – Booths were all showing the same benchmarks
• Compiler booths are similar
  – Emphasize their support for OpenMP 4.x
  – All (but PGI) claim to have the best implementation*
  – Nvidia emphasizing pre-optimized libraries of standard algorithms for STL containers
*on whichever flavor of GPU they specifically support
OpenMP Training
• New spec 5.0 is out but…
  – Real progress is on distilling down to the “common core” and compiler support for 4.5
  – Essential directives and patterns that cover most scientific use cases
• OpenMP was touting this (passing out cheat sheets), talking up new book.
• Major initiative towards onboarding applications quickly
  – Compilers are better at optimizing common core directives (i.e. sensible default behaviors, less tuning)
    • https://www.openmp.org/resources/openmp-compilers-tools/
  – Tutorial was actually VERY good (joint with NERSC)
    • Easy to replicate
  – Low hanging fruit for some experiment code
    • GPU offloading a minimal extension to common core
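To make the “common core” idea concrete, here is a minimal sketch in C of the kind of loop those directives target: a single `parallel for` with a `reduction` clause. It is not taken from the tutorial; the function name `integrate_pi` and the choice of a midpoint-rule integral are assumptions made for illustration.

```c
#include <assert.h>

/* Hypothetical example (not from the talk): midpoint-rule estimate of the
 * integral of 4/(1+x^2) over [0,1], which equals pi. */
double integrate_pi(long num_steps)
{
    assert(num_steps > 0);
    const double step = 1.0 / (double)num_steps;
    double sum = 0.0;

    /* Common-core pattern: split the loop across threads and combine the
     * per-thread partial sums with a reduction. Without OpenMP enabled the
     * pragma is ignored and the loop simply runs serially. */
    #pragma omp parallel for reduction(+:sum)
    for (long i = 0; i < num_steps; ++i) {
        const double x = ((double)i + 0.5) * step;
        sum += 4.0 / (1.0 + x * x);
    }
    return sum * step;
}
```

With GCC or Clang this would typically be built with `-fopenmp`; calling `integrate_pi(1000000)` returns an approximation of pi whether or not the directive is active, which is part of what makes this style easy to adopt incrementally.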
OpenMP GPU Training
• Simplified offloading to target devices in the base part of the spec
  – Builds directly off common core directives
  – Can effectively swap out a single directive in most cases to go from OpenMP parallel to OpenMP GPU accelerated
  – Performance is “meh…” without tuning and memory model considerations
  – Example codes were getting 4-8x-ish boosts
  – Tuned examples get 20x
• Value is in portability and ease of migration
  – Very real possibility for our science codes that don’t lend themselves to hand optimization
  – Documentation and training materials are good
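The “swap out a single directive” point can be sketched in C as follows. This is an illustrative SAXPY loop, not code from the training; the function names `saxpy_host` and `saxpy_target` are hypothetical. The only difference between the two versions is the directive line, and the `map` clauses are where the memory model considerations first show up.

```c
#include <assert.h>
#include <stddef.h>

/* Hypothetical host version: plain OpenMP threading of y = a*x + y. */
void saxpy_host(int n, float a, const float *x, float *y)
{
    assert(x != NULL && y != NULL);
    #pragma omp parallel for
    for (int i = 0; i < n; ++i)
        y[i] = a * x[i] + y[i];
}

/* Hypothetical offloaded version: same loop, different directive.
 * map(to:) copies x to the device; map(tofrom:) copies y in and back out.
 * If no device is available, the region falls back to the host. */
void saxpy_target(int n, float a, const float *x, float *y)
{
    assert(x != NULL && y != NULL);
    #pragma omp target teams distribute parallel for \
        map(to: x[0:n]) map(tofrom: y[0:n])
    for (int i = 0; i < n; ++i)
        y[i] = a * x[i] + y[i];
}
```

Both functions should produce the same result for the same inputs. In a sketch like this every call pays for the host-device copies implied by the `map` clauses, which is one plausible reason untuned offload code underperforms; keeping data resident across several kernels (for example with a `target data` region) is the usual next tuning step.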
GPU Hackathon
• Connected with GPU Hackathon team
  – Learned more about what to expect and how to schedule a hackathon (this is in the context of our NESAP project)
  – For application porting they want:
    • 1-3 people to participate (coder, algorithm person, person for testing)
    • Start 4-6 weeks before actual hackathon
    • Need code to compile using Cray compiler
    • They want a kernel identified if possible, but are willing to work with more generalized code
Rescale
• Single API (and accounting!) for AWS, Google, Microsoft
• Can buy time through them or…
  – Bring your own allocations (specifically asked about Heidi use case of a Microsoft Educational allocation)
• Claim to have HARD CAPS and cutoffs on a per-group basis, linked to funding and administrative limits.
  – Want to see accounting interface
• This actually may be a viable path to avoid separate integration for each cloud system. Would want to see more.
IBM
• Was given the briefing (hard sell) on LSF batch
• Claim is that it can scale now.
• Lacks various accounting controls and monitoring
• Want us to use it with HEPCloud
• Want to do a more complete briefing for us
