Sibylle Hermann, Dorothea Iglezakis, Anett Seeland Make it easy - integration of data description in the research process 11. June 2019 University of Stuttgart
Einführung – Anforderungen – Umsetzung – Zusammenfassung Beispiel: Direkte numerische Simulation einer turbulenten Grenzschichtströmung Vorbereitung Vorbereitung Art Gitter Randbedingung Integration Filterung Dämpfu x y z char. fest extrap. period. Verfahren ∆t Ordnung Breite St grob mittel fein grob mittel fein grob mittel fein DNS x x x o o o RK4 0,001 10 12 • Projekt RK4 0,001 10 12 x x x o o o RK4 0,001 10 12 x x x o o RK4 0,001 10 12 x x x o o o RK4 0,001 10 12 • Bestimmung x x x o o o x x x o o RK4 0,001 10 12 x x x o o o RK4 0,001 10 12 x x x o o o RK4 0,001 10 12 • Gitter (N=3 3 ) x x x o o RK4 0,001 10 12 RK4 0,001 10 12 x x x o o o RK4 0,001 10 12 x x x o o o • Randbedingung (N=3 3 ∙3) … … … RK4 0,001 10 12 x x x o o o … … … x x x o o o RK4 0,001 10 12 • Numer. Parameter (N=3 3 ∙3∙2 3 ) … … … x x x o o o RK4 0,001 10 12 … … … RK4 0,001 10 12 → O(10 3 ) Simulationen x x x o o o … … … RK4 0,001 10 24 x x x o o Universität Stuttgart 29.03.2019 11 Make it easy - integration of data description in the research processSibylle Hermann, Dorothea Iglezakis, Anett Seeland 1 / 13
What the User doesn’t like to do • Publish data because it is not yet common in engineering science • Spend time with documentation Make it easy - integration of data description in the research processSibylle Hermann, Dorothea Iglezakis, Anett Seeland 2 / 13
What the User Needs • Manage a lot of data • Find saved data easily • Browse data sets • Change data sets dynamically • Record metadata easily • Link results with simulaions • Link data sets from different simulations • Give controlled access Make it easy - integration of data description in the research processSibylle Hermann, Dorothea Iglezakis, Anett Seeland 3 / 13
Metadata What our users want to search for (apart from Author, Year) • Variables – measured and controlled • Parameters of the used method • Parameters of the observed system What our users want to document from their research process • Methods and workflows • Software and computing environments • Instruments • Parameters and assumptions Make it easy - integration of data description in the research processSibylle Hermann, Dorothea Iglezakis, Anett Seeland 4 / 13
EngMeta A Metadata Schema for Engineering Science Schembera & Iglezakis “The Genesis of EngMeta-A Metadata Model for Research Data in Computational Engineering”, In: Research Conference on Metadata and Semantics Research, p127–132, 2018, Springer. Make it easy - integration of data description in the research processSibylle Hermann, Dorothea Iglezakis, Anett Seeland 5 / 13
Local Data Management – Prerequisite for Open Data Idea Adding metadata to the data as early in the process and as easy as possible Approach Using a data repository primarily as metadata store and tools around it for smooth interaction Make it easy - integration of data description in the research processSibylle Hermann, Dorothea Iglezakis, Anett Seeland 6 / 13
DaRUS Data Repository of the University of Stuttgart Based on Dataverse • Open source research data repository software • Repository hosts multiple virtual archives called Dataverses Image: http://guides.dataverse.org/en/latest/user/dataverse-management.html , Access: 6/7/2019 Make it easy - integration of data description in the research processSibylle Hermann, Dorothea Iglezakis, Anett Seeland 7 / 13
Challenge I: Automation Ingest of (Meta)data Make it easy - integration of data description in the research processSibylle Hermann, Dorothea Iglezakis, Anett Seeland 8 / 13
Challenge II: Handling of Large Files Dataverse not designed for large files • Users experienced frozen UI and timeouts → Use REST API for files > 2 GB • Trade-off between timeout configuration and available threads → Introduce 2nd thread pool in Glassfish → Uploads around 100 GB possible Make it easy - integration of data description in the research processSibylle Hermann, Dorothea Iglezakis, Anett Seeland 9 / 13
Challenge II: Handling of Large Files • Currently under development • In planning • Connection of object storage to tape library • Extend Dataverse to support different storage classes (Download vs Provide-Buttons) Make it easy - integration of data description in the research processSibylle Hermann, Dorothea Iglezakis, Anett Seeland10 / 13
Outlook: Different Data Overview Needed Make it easy - integration of data description in the research processSibylle Hermann, Dorothea Iglezakis, Anett Seeland11 / 13
Summary • Starting early in the process means less effort at the end • To make it easy is still a challenge • Automation is a key requirement Make it easy - integration of data description in the research processSibylle Hermann, Dorothea Iglezakis, Anett Seeland12 / 13
Thank you! FoKUS fokus@izus.uni-stuttgart.de E-Mail: https://www.izus.uni-stuttgart.de/en/fokus/ URL: DaRUS URL.: https://www.izus.uni-stuttgart.de/en/fokus/darus/ Make it easy - integration of data description in the research processSibylle Hermann, Dorothea Iglezakis, Anett Seeland13 / 13
Recommend
More recommend