===== 🗂️ Datasets ===== This guide explains how to register your research datasets in Pure. Registering the metadata for your datasets increases their visibility and ensures they are linked to your other research outputs. A research dataset is a collection of data that has been generated or collected during the course of your research. This can include numerical data, text, images, audio, or other forms of data. ===== Importance and Challenges at EEMCS ===== Registering datasets is a high priority for EEMCS. However, this is an area of active development. This guide represents the best practice based on current possibilities. Key challenges we are working on include: * Developing a UT-wide workflow for dataset registration. * Investigating the capabilities and limitations of Pure for datasets. * Determining how the library can support this process. * Integrating dataset registration with the university's data and software policy. ===== Registering a Dataset entry in Pure ===== Once you have uploaded your dataset to a repository like 4TU.ResearchData, Zenodo, or Figshare, and you want to register its metadata in Pure, follow the guide below. Select the Dataset > Dataset category to add a new dataset entry. **Key Fields:** * **People:** Add all creators of the dataset. * **Data availability:** * **Publisher:** Enter the name of the repository where the dataset is stored (e.g., '4TU.ResearchData'). * **DOI:** Add the Digital Object Identifier for the dataset, which you received from the repository. * **Readme file:** Upload a copy of the README file. Proper documentation is essential for making your data FAIR. * **Links:** **Leave this field empty.** The data steward will add the direct link to the dataset in Areda after reviewing the record. * **Date made available:** Enter the date the dataset was published in the repository. Do not include a possible embargo period here. * **Access to the dataset:** * **Open:** If the dataset is openly accessible in a trusted repository. * **Closed / Embargoed:** If the data is encrypted or has a temporary non-disclosure period. * **Restricted:** If the data is not public and only accessible to group members. * **Legal / ethical:** Indicate if the dataset contains personal data or has other ethical considerations that restrict access. * **Relations to other content:** **This is crucial.** Link the dataset to the research outputs that use it and the projects that funded it. * **Visibility:** The general policy is that the description (metadata) of the dataset should be **Public**. The visibility of the data itself is controlled by the "Access to the dataset" setting. //TODO: Add screenshot: The "Data availability" section of the dataset template, with annotations for Publisher, DOI, Readme file, and Links, as described in QRC p.25.//