National Data Service National effort to bring together infrastructure supporting the publication , discovery , and reuse of scientific data • Advance discovery by enabling open sharing of data • Increase collaboration within/across fields • Large-scale Data Service Interoperability • Distributed Storage & Computation • Spectrum of Services & Software • Incubator of Data Technologies, Projects, and Pilots
Publishing Scientific Data • Storage • Everything else!!! – The bytes are not enough on their own 00110100 00110010 – Metadata, curation tools, indexes, storage abstraction, replication, data transfer, authentication, access control, transformation, analysis, tools, computation, …
National Data Service Consortium • A commitment to addressing the “Big Data” challenges of the scientific community as well as the broader public • Making science more effective through the interoperability of NDS components • Creating broader impact through the interoperability of NDS components • Finding and filling gaps between current NDS components • Working within a broad governance model • Leveraging current Federal, State, Industry, and Institutional investments
National Data Service Consortium • Communities – Astronomy, Biology, Engineering, Geoscience, Information Science, Material Science, Medicine, Social Science • Universities, Libraries, Archives, and Publishers – CU Boulder, Harvard, Indiana, Johns Hopkins, Notre Dame, Purdue, UC San Diego, UIC, UIUC, U Michigan, ICPSR … – Nature, Science, APS, IEEE, PLOS, Elsevier, … • Computing and Data Centers/Cyberinfrastructure – ANL, NCSA, PSC, SDSC, TACC – Brown Dog, Data Excacell, DataONE, DFC, CyVerse, GABBs, IN-CORE, iRODS, Globus, SciServer, SEAD, Terra Populus, TERRA REF, Whole Tale, LSST, LIGO, ...
NDS Data DNS • Share your data without moving it • Contribute to reproducible science • Invite new analysis • Services for finding, indexing data NDS Labs Workbench • Incubate data technologies/projects • Experiment with tools, perfect stack • Run data science training environments • Promote your data tools!
CC* Coordinated campus-level cyberinfrastructure components of data, networking, and computing infrastructure across 200+ universities Big Data Hubs Stimulate regional grassroots partnerships focused on Big Data
Open Storage Network
Challenges • Broad landscape of capabilities – Overlapping, isolated, lacking interoperability • Binary viewpoints – e.g. all local or all cloud • A frequent desire to get complicated quickly • Long term preservation of data
http://www.nationaldataservice.org/ https://nationaldataservice.atlassian.net/wiki/
Recommend
More recommend