MALT & NUMAPROF: Memory Profiling for HPC Applications
Sébastien Valat, FOSDEM 2019, HPC track
Origin of the tools
- PhD on memory management for HPC (at CEA/UVSQ): MALT
- Post-doc at Versailles: NUMAPROF, a side project during post-doc work
Motivation
- Many issues today:
- Huge memory space to manage (~TB of memory)
- Many more distinct allocations (75 M in 5 minutes)
- Multi-threading: 256 threads
- Hidden inside large (huge) C/C++/Fortran codes (~1M lines)
- Access: NUMA (Non-Uniform Memory Access)
- The memory wall!
Key point today
You need a good understanding of the memory behavior of your (HPC) application!
E.g.: a >1M-line C++ simulation, on 128 cores / 16 NUMA CPUs
[Chart: execution time (s), split into user/system/idle, comparing the allocators from my PhD (MPC/NUMA, MPC/UMA) with the available glibc, jemalloc and tcmalloc allocators; differences of 20% to 58% depending on the allocator]
The same applies to memory consumption, here on 12 cores
[Chart: physical memory (GB) for glibc, jemalloc and tcmalloc; up to a 2.5x difference]
Tool 1: MALT
- Memory management can have a huge impact
- A tool to track malloc calls
- Reports the measured properties onto annotated sources
- Same idea as valgrind/kcachegrind: annotated sources, annotated call graphs
- Plus non-additive metrics (for inclusive costs, e.g. lifetime)
- Plus time charts
- Plus distributions of properties (sizes, ...)
Web-based GUI
[Screenshot: inclusive/exclusive metric selector, per-line annotations, symbol list, call stacks reaching the selected site, details of the selected symbol or line]
Example of a time-based view
Tool 2: NUMAPROF
- Based on the MALT code, but focused on NUMA
- How to detect remote memory accesses
- Detects unsafe & uncontrolled memory binding
[Diagram: two NUMA nodes, each a CPU with its local RAM]
Some summary views
Still with source annotations, to understand the code
Successes in short
MALT:
- 20% CPU saving on my 32 000-line C++ code at CERN
- Improvements on 2 commercial simulation codes
- Profiled the 1.5-million-line CERN LHCb C++ code
NUMAPROF:
- 20% performance gain in 20 minutes on an 8 000-line simulation
- Detected a Linux kernel NUMA policy bug
- Confirmed NUMA correctness of a CERN PhD code
Questions?
- Both tools are available under the CeCILL-C license at http://memtt.github.io
- My research: http://svalat.github.io
Examples of success: MALT
- Reduced CPU usage by 30% on the CERN app I was developing (a C++11 mistake around for(auto & it : lst)); 32 000 C++ lines running on 500 servers
- Found too-large allocations in a PhD student's numerical simulation running on 500 cores, while developing the tool
- Found a realloc pattern in Fortran in an industrial R&D simulation code
- Found unexpected allocations generated by the GFortran compiler on another industrial R&D simulation code
- Successfully ran on the CERN LHCb 1.5M-line online analysis software
Examples of success: NUMAPROF
- 20% performance improvement in 20 minutes on an unfamiliar 8 000-line C++ simulation on Intel KNL
- Detected a Linux kernel bug in NUMA management in conjunction with Transparent Huge Pages (while developing the tool); it was detected at the same time, by other means, by Red Hat
- Confirmed NUMA correctness of a CERN/OpenLab PhD student's code on Intel KNL