MarFS Metadata Scaling
PDSW WIP Report 2016
David Bonnie, Hsing-Bung Chen, Gary Grider, Jeffrey Inman, Brett Kettering, William Vining
LA-UR-16-28615
Metadata scaling components
• Deploy one drMDS per file system as rank 1 on the first node
  – Makes new directories & broadcasts dir inodes to the fsMDSc's
• Deploy fsMDSc's on ¼ of the cores of each node in file system service
  – Each handles its sharded part of the distributed file metadata when broadcast commands are sent
• Deploy fsMDSp's on ¼ of the cores of each node in file system service
  – Each handles its sharded part of the distributed file metadata when commands are sent to a specific fsMDSp
• Deploy file system clients on ½ of the cores of each node in file system service
  – Execute file system operations, such as create (a rank-to-role sketch follows below)
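Concretely, the deployment above amounts to a rank-to-role mapping. A minimal sketch, assuming an MPI launch with one rank per core, ranks packed node by node, and 32 cores per node; the enum, pick_role, and the constants are illustrative, not MarFS source.

/* A minimal sketch (not MarFS source) of the rank-to-role split above,
 * assuming an MPI launch with one rank per core, packed node by node. */
#include <mpi.h>
#include <stdio.h>

enum role { ROLE_DRMDS, ROLE_FSMDSC, ROLE_FSMDSP, ROLE_CLIENT };

static enum role pick_role(int rank, int cores_per_node)
{
    if (rank == 1)                        /* one drMDS: rank 1 on the first node */
        return ROLE_DRMDS;
    int core = rank % cores_per_node;     /* this rank's position within its node */
    if (core < cores_per_node / 4)        /* first 1/4 of cores: fsMDSc */
        return ROLE_FSMDSC;
    if (core < cores_per_node / 2)        /* next 1/4 of cores: fsMDSp */
        return ROLE_FSMDSP;
    return ROLE_CLIENT;                   /* remaining 1/2 of cores: clients */
}

int main(int argc, char **argv)
{
    MPI_Init(&argc, &argv);
    int rank;
    MPI_Comm_rank(MPI_COMM_WORLD, &rank);
    const int cores_per_node = 32;        /* assumed node width */
    printf("rank %d -> role %d\n", rank, pick_role(rank, cores_per_node));
    MPI_Finalize();
    return 0;
}

Rank 1 is special-cased as the single drMDS; every other rank derives its role purely from its position within its node.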
File Creation Rate by Node

Nodes    Files Created/s    Linear Files Created/s
   64        10,268,752             10,268,752
  640        83,089,905            102,687,520
8,800       835,736,363          1,411,953,400

(Chart: total files created per second vs. number of nodes; the "Linear" column extrapolates the 64-node rate.)
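The "Linear Files Created/Sec" series is simply the 64-node rate extrapolated proportionally, so scaling efficiency falls out of the chart's own numbers:

\[
R_{\mathrm{linear}}(n) = R(64)\cdot\frac{n}{64}, \qquad
R_{\mathrm{linear}}(8{,}800) = 10{,}268{,}752 \times \frac{8{,}800}{64} = 1{,}411{,}953{,}400
\]

Measured creates reach roughly 81% of linear at 640 nodes (83,089,905 / 102,687,520) and roughly 59% of linear at 8,800 nodes (835,736,363 / 1,411,953,400).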
File Sequential Readdir Rate by Node

Nodes    Files Readdir'd-Sequential/s
   10        711,237
   20        691,802
   30        682,640
   40        690,245
   50        693,429

(Chart: total files sequentially readdir'd per second vs. number of nodes; the rate stays roughly flat.)
File Parallel Readdir Rate by Node

Nodes    Files Readdir'd-Parallel/s
   10         80,000,000
   20        160,000,000
   30        206,896,551
   40        250,000,000
   50        303,030,303

(Chart: total files readdir'd in parallel per second vs. number of nodes.)
Factor of X that Parallel Readdir Rate is Greater than Sequential

Nodes    Factor X (parallel over sequential)
   10        112.48
   20        231.28
   30        303.08
   40        362.19
   50        437.00
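Each factor is just the ratio of the two preceding charts at the same node count, e.g. at the endpoints:

\[
X(n) = \frac{R_{\mathrm{parallel}}(n)}{R_{\mathrm{sequential}}(n)}, \qquad
X(10) = \frac{80{,}000{,}000}{711{,}237} \approx 112.48, \qquad
X(50) = \frac{303{,}030{,}303}{693{,}429} \approx 437.00
\]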
Background Information
MARFS METADATA SCALING
MarFS Overview
• Provides near-POSIX over cloud-style erasure and objects
  – Yields reliable storage on inexpensive disk
  – Supports legacy apps' files/folders/ownership/etc.
• Store large data sets for weeks to months on PFS, 1 TB/s
• Store data forever in archive, 10s GB/s
• Store large data sets for months to year'ish on MarFS, 100s GB/s
  – Data set O(PB), aggregate data O(EB)
• Systems growing from O(M) cores / O(PB) memory to O(B) cores / O(10s PB) memory
  – Going to O(B) files per job in one directory and O(10s T) files per file system
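As a back-of-the-envelope illustration of what those tier bandwidths mean for an O(PB) data set (taking 100 GB/s and 10 GB/s as representative points in the "100s" and "10s" ranges above, an assumption rather than measured numbers):

\[
t = \frac{\text{size}}{\text{bandwidth}}: \qquad
\frac{1\ \mathrm{PB}}{1\ \mathrm{TB/s}} \approx 17\ \mathrm{min}, \qquad
\frac{1\ \mathrm{PB}}{100\ \mathrm{GB/s}} \approx 2.8\ \mathrm{h}, \qquad
\frac{1\ \mathrm{PB}}{10\ \mathrm{GB/s}} \approx 28\ \mathrm{h}
\]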
Here’s a picture of creating a directory
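The original figure did not survive extraction; in its place, a minimal sketch of the step it depicted, following the component list earlier: the drMDS mints the new directory's inode and broadcasts it to every fsMDSc. The dir_inode struct, the communicator layout (drMDS taken as rank 0 of a hypothetical MDS communicator), and all names are assumptions for illustration.

/* Illustrative only: the drMDS (taken here as rank 0 of an MDS
 * communicator) creates a directory and broadcasts its inode so every
 * fsMDSc can serve lookups in it. */
#include <mpi.h>
#include <string.h>

typedef struct {              /* hypothetical wire format for a dir inode */
    unsigned long ino;
    unsigned int  mode;
    char          name[256];
} dir_inode;

static void create_dir_and_broadcast(MPI_Comm mds_comm, const char *name,
                                     unsigned long new_ino, dir_inode *out)
{
    int rank;
    MPI_Comm_rank(mds_comm, &rank);
    if (rank == 0) {                          /* drMDS mints the inode */
        out->ino  = new_ino;
        out->mode = 040755;                   /* directory, rwxr-xr-x */
        strncpy(out->name, name, sizeof out->name - 1);
        out->name[sizeof out->name - 1] = '\0';
    }
    /* a single broadcast pushes the new dir inode to all fsMDSc ranks */
    MPI_Bcast(out, (int)sizeof *out, MPI_BYTE, 0, mds_comm);
}

int main(int argc, char **argv)
{
    MPI_Init(&argc, &argv);
    dir_inode d;
    create_dir_and_broadcast(MPI_COMM_WORLD, "newdir", 42, &d);
    /* every rank now holds the same dir inode */
    MPI_Finalize();
    return 0;
}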
Here’s a picture of creating files
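Likewise for file creation: per the component list, file-metadata commands go to a specific fsMDSp, which suggests clients pick a shard deterministically from the path. A sketch, where the FNV-1a hash and pick_fsmdsp are stand-ins (the deck does not show the real shard-selection scheme):

/* Illustrative only: route a file create to one fsMDSp shard by hashing
 * the path, keeping file metadata sharded across the servers. */
#include <stdint.h>
#include <stdio.h>

static uint64_t hash_path(const char *path)       /* FNV-1a; any even spread works */
{
    uint64_t h = 0xcbf29ce484222325ULL;           /* FNV-1a offset basis */
    for (; *path; ++path)
        h = (h ^ (unsigned char)*path) * 0x100000001b3ULL;  /* FNV-1a prime */
    return h;
}

static int pick_fsmdsp(const char *path, int num_fsmdsp)
{
    return (int)(hash_path(path) % (uint64_t)num_fsmdsp);
}

int main(void)
{
    /* the client would send its create RPC to this shard */
    printf("shard = %d\n", pick_fsmdsp("/big/dir/file.000017", 2200));
    return 0;
}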
Here’s a picture of sequential readdir
Here’s a picture of parallel readdir
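Taken together, the two readdir figures contrast a single client draining every metadata shard in turn with many readers each draining a disjoint slice of shards. A rough sketch of that difference, with fetch_shard_entries as a stubbed stand-in for an RPC to one fsMDS shard:

/* Illustrative contrast between the two readdir modes pictured above.
 * fetch_shard_entries() stands in for an RPC to one fsMDS shard. */
#include <stdio.h>

static long fetch_shard_entries(int shard)
{
    (void)shard;
    return 1000;                        /* stub: pretend each shard holds 1000 entries */
}

/* sequential: a single client drains every shard, one at a time */
static long readdir_sequential(int nshards)
{
    long total = 0;
    for (int s = 0; s < nshards; s++)
        total += fetch_shard_entries(s);
    return total;
}

/* parallel: reader r of nreaders drains only its own slice of shards,
 * so the aggregate rate grows with the number of readers */
static long readdir_parallel(int reader, int nreaders, int nshards)
{
    long total = 0;
    for (int s = reader; s < nshards; s += nreaders)
        total += fetch_shard_entries(s);
    return total;
}

int main(void)
{
    int nshards = 50, nreaders = 10;
    long par = 0;
    for (int r = 0; r < nreaders; r++)  /* in practice these run concurrently */
        par += readdir_parallel(r, nreaders, nshards);
    printf("sequential=%ld parallel-total=%ld\n",
           readdir_sequential(nshards), par);
    return 0;
}

In the sequential case the single reader is the bottleneck regardless of node count (hence the flat ~700 K/s curve), while the parallel case adds readers with nodes (hence the near-linear growth to ~303 M/s at 50 nodes).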