Center for Information Services and High Performance Computing (ZIH)

Advanced Data Placement via Ad-hoc File Systems at Extreme Scales (ADA-FS)

Michael Kluge, Wolfgang E. Nagel, André Brinkmann, Achim Streit, Sebastian Oeste, Marc-André Vef, Mehmet Soysal

PDSW-DISCS @ SC'16, Salt Lake City, 2016/11/24
Project Rationale: I/O Challenges at Exascale
– The I/O subsystem is the slowest part of an HPC machine to access
– Shared medium: no reliable bandwidth, no good transfer-time predictions
– Upcoming architectures feature "fat nodes" with intermediate local storage

Goal: optimize I/O
– Faster access by using the additional storage tiers
– Transparent solution for parallel applications
– Pre-stage inputs early, cache outputs
Proposed Solution

Ad-hoc overlay file system
– Separate overlay file system per application run
– Instantiated on the scheduled compute nodes
– Lives longer than the user's job

Central I/O planner
– Global planning of I/O, including stage-in/-out of data, for all parallel jobs
– Optimization of data placement in the ad-hoc file system (i.e., across its nodes)
– Integration with the system's batch scheduler

Application monitoring, resource discovery
– I/O behavior, machine-specific storage types, sizes, speeds, …
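A minimal sketch of the per-job lifecycle described above: an overlay file system is created for one application run, inputs are pre-staged before the job, outputs are staged back out afterwards (which is why the instance may outlive the job itself). All names here (`AdhocFS`, `stage_in`, `stage_out`) are illustrative assumptions, not part of any real ADA-FS API:

```python
# Hypothetical per-job ad-hoc file system lifecycle; plain dicts stand
# in for node-local storage and the parallel file system.

class AdhocFS:
    """Overlay file system instantiated on a job's compute nodes."""

    def __init__(self, job_id, nodes):
        self.job_id = job_id
        self.nodes = list(nodes)   # scheduled compute nodes
        self.mounted = False
        self.files = {}            # path -> data (node-local storage)

    def mount(self):
        self.mounted = True

    def stage_in(self, inputs):
        # Pre-stage inputs from the parallel FS before the job starts.
        for path, data in inputs.items():
            self.files[path] = data

    def stage_out(self, parallel_fs):
        # Copy outputs back after the job; the instance therefore
        # lives slightly longer than the job itself.
        parallel_fs.update(self.files)

    def teardown(self):
        self.mounted = False
        self.files.clear()


# One file system instance per application run:
pfs = {"/scratch/input.dat": b"payload"}
fs = AdhocFS(job_id=4711, nodes=["node01", "node02"])
fs.mount()
fs.stage_in(pfs)
fs.files["/scratch/output.dat"] = b"result"   # job writes node-locally
fs.stage_out(pfs)
fs.teardown()
```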
Ad-hoc Overlay File System

Research goals:
– Relax POSIX semantics based on access patterns
– No locking
– Distributed metadata
– Eventual consistency
– Make applications responsible for their I/O

Related work: GPFS, Lustre, BeeGFS, …; DeltaFS, BurstFS, …

Status:
– Design phase for scalable metadata and lock-free block storage
– Key-value stores for metadata: evaluation of different storage schemata
– Monitoring
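To illustrate the combination of distributed metadata, key-value storage, and no locking: if each path is hash-partitioned onto exactly one metadata server, updates never need cross-node coordination, and conflicting writers resolve last-writer-wins (eventual consistency). The names (`MetadataService`, `create`, `stat`) and the dict-backed stores are assumptions for this sketch, not the ADA-FS design:

```python
# Hash-partitioned file metadata across per-node key-value stores.
import hashlib

class MetadataService:
    def __init__(self, num_servers):
        # One plain dict stands in for each node-local key-value store.
        self.stores = [{} for _ in range(num_servers)]

    def _store_for(self, path):
        # Every path deterministically owned by one server: no locks needed.
        h = int(hashlib.md5(path.encode()).hexdigest(), 16)
        return self.stores[h % len(self.stores)]

    def create(self, path, size=0):
        # Last-writer-wins instead of locking: eventual consistency.
        self._store_for(path)[path] = {"size": size}

    def stat(self, path):
        return self._store_for(path).get(path)

md = MetadataService(num_servers=4)
md.create("/job42/out/rank0.dat", size=1024)
print(md.stat("/job42/out/rank0.dat"))   # {'size': 1024}
```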
Central I/O Planner

Research goals:
– Stage-in and stage-out of data
– Maybe even during job runtime
– Schedule I/O based on estimations from the running/planned jobs

Related work: current batch systems; data staging from Grid environments; workpool/workspace concepts; I/O scheduling and QoS approaches

Status:
– Prototype for a temporary file system based on BeeGFS
– Stage-in and stage-out based on parallel copy tools
– SLURM integration
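A toy example of scheduling I/O from estimations: given a job's planned start time, its input volume, and an estimated transfer bandwidth, the planner can compute the latest moment to begin stage-in so the data is on the nodes when the job starts. The function name, the safety factor, and all numbers are invented for illustration:

```python
# Back-of-the-envelope stage-in scheduling from bandwidth estimates.

def stage_in_start(job_start, input_bytes, est_bandwidth, safety=1.2):
    """Latest time (s) to begin staging `input_bytes` at `est_bandwidth` B/s."""
    transfer_time = input_bytes / est_bandwidth
    # Pad the estimate, since the shared medium gives no bandwidth guarantee.
    return job_start - safety * transfer_time

# Job scheduled at t = 3600 s, 50 GiB of input, ~1 GiB/s estimated:
t = stage_in_start(3600, 50 * 2**30, 2**30)
print(t)  # 3540.0
```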
Resource Discovery and Monitoring

Research goals:
– Collect available resources
– Monitor FS activities
– Provide the planner with estimations about I/O capabilities and current usage
– Learn I/O behavior for standard applications

Related work: OpenMPI; Likwid; many data collection tools; I/O pattern recognition

Status:
– Working prototype that discovers node and connection details
– Working on integration into the I/O planner
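A minimal sketch of what a node-local discovery agent could report to the I/O planner, using only the Python standard library. The dict layout and field names are assumptions for illustration, not the format used by the project's prototype:

```python
# Discover basic node details: host name, CPU count, and the capacity
# of the local storage behind a given path.
import os
import shutil
import socket

def discover_node(path="/tmp"):
    usage = shutil.disk_usage(path)   # total/used/free of the FS behind `path`
    return {
        "host": socket.gethostname(),
        "cpus": os.cpu_count(),
        "storage_path": path,
        "capacity_bytes": usage.total,
        "free_bytes": usage.free,
    }

info = discover_node()
print(info["host"], info["capacity_bytes"], info["free_bytes"])
```

In a real deployment such a report would be sent to the central planner, which combines it with monitored file-system activity to estimate current usage.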