This guide explains how to register your research datasets in Pure. Registering the metadata for your datasets increases their visibility and ensures they are linked to your other research outputs. A research dataset is a collection of data that has been generated or collected during the course of your research. This can include numerical data, text, images, audio, or other forms of data.

Registering datasets is a high priority for EEMCS. However, this is an area of active development. This guide represents the best practice based on current possibilities. Key challenges we are working on include:

  • Developing a UT-wide workflow for dataset registration.
  • Investigating the capabilities and limitations of Pure for datasets.
  • Determining how the library can support this process.
  • Integrating dataset registration with the university's data and software policy.

Registering a Dataset entry in Pure

Once you have uploaded your dataset to a repository like 4TU.ResearchData, Zenodo, or Figshare, and you want to register its metadata in Pure, follow the guide below. Select the Dataset > Dataset category to add a new dataset entry.

Key Fields:

  • People: Add all creators of the dataset.
  • Data availability:
    • Publisher: Enter the name of the repository where the dataset is stored (e.g., '4TU.ResearchData').
    • DOI: Add the Digital Object Identifier for the dataset, which you received from the repository.
    • Readme file: Upload a copy of the README file. Proper documentation is essential for making your data FAIR.
    • Links: Leave this field empty. The data steward will add the direct link to the dataset in Areda after reviewing the record.
    • Date made available: Enter the date the dataset was published in the repository. Do not include a possible embargo period here.
  • Access to the dataset:
    • Open: If the dataset is openly accessible in a trusted repository.
    • Closed / Embargoed: If the data is encrypted or has a temporary non-disclosure period.
    • Restricted: If the data is not public and only accessible to group members.
  • Legal / ethical: Indicate if the dataset contains personal data or has other ethical considerations that restrict access.
  • Relations to other content: This is crucial. Link the dataset to the research outputs that use it and the projects that funded it.
  • Visibility: The general policy is that the description (metadata) of the dataset should be Public. The visibility of the data itself is controlled by the “Access to the dataset” setting.

TODO: Add screenshot: The “Data availability” section of the dataset template, with annotations for Publisher, DOI, Readme file, and Links, as described in QRC p.25.