Enhancement of NOMAD Dataset Metadata and Features
Description:
We aim to improve the NOMAD dataset metadata to align with state-of-the-art repositories to enhance the discoverability and citation of datasets. Our goal is to incorporate a more comprehensive metadata schema similar to that used by Zenodo. Below are the proposed enhancements:
- Description: add a description field of the dataset.
- Authors: list the contributors
- Publisher/authors Information: Include a field to specify the dataset's publisher. This could be a research institution, funding agency, or the repository itself.
- Funding Details: Add metadata fields to record the funding information, such as grant numbers and funding agencies.
- Alternate Identifiers: Introduce fields for alternate identifiers such as Handle, ARK, PURL, ISSN, ISBN, etc., to allow for cross-referencing and improved linkage between related scholarly work.
- Related Works: Implement a section where users can link to related works, providing a way to interconnect datasets with publications, software, or other datasets.
- References: Add a field where users can list references that cite or are related to the dataset, enhancing the dataset's academic context and credibility.
- Visibility Settings: Include options to set the visibility of the dataset, such as 'Public' or 'Restricted', with the ability to apply embargos if necessary (currently set at upload level).
Impact:
The proposed metadata enhancements will hopefully provide the path for the following benefits:
- Improved Discoverability: Richer metadata will enable better indexing by search engines and databases, such as Google Scholar. We will need to work actively to get the datasets indexed in CrossRef, Scholar, ORCID and other places.
- Enhanced Citability: With standardized citation information, datasets can be more easily cited, improving the recognition and credit to researchers.
- Greater Interoperability: The inclusion of related works and alternate identifiers fosters greater integration with other research outputs and systems.
Further enhancements:
This is specifically dedicated to incentive uploaders. @jrudz This is related to the incentives discussion.
- Add search-app configuration for the dataset.
- Demonstrate citability of the datasets. The ultimate goal here would be that every dataset being part of a big-data paper would get cited, and the authors (NOMAD uploaders) would get those citations indexed in the ORCID and Google Scholar profiles.
- Direct shipment of the dataset to Zenodo too.
Resources
Some additional Zeonodo screenshots will come in the comments.
- CrossRef data-and-software-citation-deposit-guide
- CrosRef membership
- From EU survey about data repositories: @mscheidg