The implementation of national research data repository in South Africa. Mr Mbuyiselo Mqondisi Ndlovu Software Engineer OPEN REPOSITORIES 2019, June 10-13, 2019 | Hamburg, Germany
Content • Introduction • Issues to address • Current view of SA institutional data repositories • Provisioning of a national data repository – Architecture • Interaction with the storage • Metadata • Persistent Identifier • DOI – Data Deposit Tool Interaction • Integrated National Research Data Infrastructure 2
Introduction • Not all institutions have IT capability to provide digital storage for their research data. • Need to look at how all institutions can be supported by offering them a central repository system. • Provide competent and user friendly systems to help researchers interact with centralized data repository is essential. • This presentation covers applications and a storage facility implemented by DIRISA. 3
Issues to address Committed to address research data challenges: • Storage and back up • Sharing data • Preserving data • Discovering data • Data transfer • Re-usability 4
Current view of SA institutional data repositories • South Africa has 26 institutions • Own repositories and varied infrastructures • Digital content generated by faculty, staff and students • Currently not centralized 5
Provisioning of a national data repository – Architecture DIRISA cloud portal Passive data: Openstack Storage archival data & Virtualisation Cluster staging: VM access iRODS 8 PB 40 PB Active data: near real time interactive access Allocation of quotas per researcher. Reliable and persistent. 6
Interaction with the storage • Metalnx, iCommands (optional) • Designed to work alongside iRODS • Metalnx key features: – Collection management – Metadata extraction – Metadata management – Metadata templates 7
Metadata • Dublin core elements - 15 metadata fields: – Title – Subject – Description – Creator – Publisher – Contributor – Date – Type – Format – Identifier – Source – Language – Relation – Coverage – Rights 8
Persistent Identifier • Reference to a document, file, web page, or other object • Globally unique • Forms part of metadata fields • Invoked by metalnx through API call during upload process • The Anatomy of a Digital Object Identifier (DOI) Source: http://www.ands.org.au/online-services/doi-service/doi-policy-statement 9
DOI – Data Deposit Tool Interaction 10
Integrated National Research Data Infrastructure Tier 1 (National) DIRISA Tier 2 (Regional) Region 2 Region 1 Region 3 Tier 3 (Inst) Institution 1 Institution 2 • One central application • All nodes connected (Tier 1, Tier 2, and Tier 3) • Retrieve data from all institutional repositories via one central UI • Interoperability • Improved collaboration and sharing 11
12
Thank you
Recommend
More recommend