DataCite Blog
  • Support
  • DataCite homepage

Towards richer metadata – perspectives from three DataCite projects

November 17, 2022November 17, 2022 Kelly Stathis, Xiaoli Chen, Cody Ross and Paul Vierkant
https://doi.org/10.5438/fz2e-fs29

Metadata

Metadata is at the heart of DOIs and open scholarly infrastructure. At DataCite, our metadata schema defines what metadata properties can be included through DOI registration. The schema currently includes just six required properties—identifier (the DOI), creator, title, publication year, publisher, and resource type—along with 14 recommended and optional properties. 

On the one hand, requiring only six metadata properties keeps the schema flexible and makes it easy to get started with DOI registration. At the same time, we want to encourage all DataCite Metadata Schema users to go beyond the mandatory properties and to share rich metadata that includes all available information about a given resource. This is especially important for metadata properties that are essential for discoverability—such as description and subject—and building connections between PIDs—including identifiers for related resources, people, and organizations. Keeping metadata up-to-date is also critical to ensure that the “persistent” part of persistent identifiers lives up to its full potential.

DataCite Connection Metadata. An infographic shows five bubbles that represent an entity and its persistent identifier: researchers with the ORCID iD; related research outputs with a DOI; research funders with the ROR ID and Crossref Funder ID; research organizations with the ROR ID. These four bubbles are floating around and are linked to the inner bubble of research output with DataCite DOIs.

We know it isn’t always easy to create, enhance, and maintain robust metadata. Among other challenges, there is a need to collaboratively define metadata best practices so that metadata creators, repository platform providers, and the open research community can create metadata with confidence. At DataCite, we are involved in several projects and partnerships that support rich metadata across different domains. This post is based on the November DataCite Open Hours, where we heard from team members involved in three projects/partnerships: the Implementing FAIR Workflows Project; the IGSN-DataCite Partnership; and the NFDI4Ing Seed Funds project.

FAIR Workflows

In the Implementing FAIR Workflows project, we look at metadata completeness from two perspectives:  1) capturing the metadata of outputs that traditionally fly under the radar by establishing new PID workflows; and 2) enriching the metadata of outputs that are already being shared by improving existing PID workflows. For the former, we focus on helping the researchers to identify interim outputs and build sharing practices around them, at the same time, supporting repository platforms and research tools in developing integration to streamline PID registration and metadata sharing and increase interoperability between technologies. For the latter, we emphasize the use of community-endorsed resources for metadata generation and maximize coverage of metadata submission to not only include core metadata, but also recommended and optional metadata in the curation process. Organizations can take action by creating comprehensive crosswalks between their local metadata and the DataCite metadata schema, and implementing workflows that capture and share connection metadata in a standardized format.

IGSN perspective

The IGSN ID is a globally unique and persistent identifier for all types of material samples from all disciplines, including sample aggregates, destroyed/discarded samples, and even sample collection sites. As with other research outputs, metadata plays a critical role in describing and connecting IGSN IDs to maximize discoverability and reuse. Under its partnership, IGSN e.V., and DataCite have established working groups to define best practices for material sample metadata in the DataCite Metadata Schema. Furthermore, we continue to work with disciplinary samples communities to better support their needs and reach consensus on metadata standardized within and among these communities. 

IGSN ID metadata in the DataCite Metadata Schema can be collected and enriched throughout every stage of the samples workflow, from planning and collection through to repository ingest and publication. Principal Investigators, analysts, curators, and repository managers are encouraged to enrich sample metadata throughout the workflow processes and over time, using metadata from field-based tools, analytical systems, and local samples databases. In contrast to many research outputs, the physical nature of samples means that they are often broken down into smaller and smaller pieces. Relationship metadata for IGSN IDs is thus valuable not only for unambiguously linking samples with related datasets, publications, researchers, institutions, and external metadata, but also vital for describing the linkages between parent samples and derived children.

NFDI4Ing

The strength of the DataCite Metadata Schema is its domain-agnostic character. As a global standard for describing research outputs and resources, the schema needs to be as general as possible but at the same time as specific as necessary. DataCite’s engagement in different research projects reflects our will to connect to research domains that are in need of domain specific metadata. One goal in the NFDI4Ing Seed Funds project is to map the DataCite Metadata Schema to other schemata used in the engineering sciences as well as schema.org. Beyond mapping, we seek best practices in linking our schema to domain-specific schemata in general.
The second goal of NFDI4Ing is to develop user stories around the upcoming resource type for instruments, using connection and relationship metadata. With relationship metadata, researchers and research organizations can find out which instrument was used to create a dataset. To visualize these relationships, extensive PID metadata – whether general or domain-specific – must be part of the PID graph. Similar to the  projects mentioned above, the importance of metadata within the NFDI4Ing Seed funds project cannot be overstated.

What’s next?

These three projects and partnerships are some of the work DataCite is doing to improve DOI metadata. In the coming months, you’ll hear more from us about how you can contribute to the next major version of the DataCite Metadata Schema (5.0), along with updates as we finalize our next minor version (4.5). We look forward to working with the DataCite community to support your efforts to collect and share richer metadata.

Photo of Kelly Stathis
Kelly Stathis
Blog posts
  • Kelly Stathis
    https://blog.datacite.org/author/kelly-stathisdatacite-org/
    Welcome our new DataCite Committee Members
  • Kelly Stathis
    https://blog.datacite.org/author/kelly-stathisdatacite-org/
    Full API support for DataCite Metadata Schema 4.4
  • Kelly Stathis
    https://blog.datacite.org/author/kelly-stathisdatacite-org/
    Updating our metadata schema suggestions process
  • Kelly Stathis
    https://blog.datacite.org/author/kelly-stathisdatacite-org/
    ConfIDent about PIDs: Using DataCite DOIs for Conferences
Xiaoli Chen
FAIR Workflows Project Lead at DataCite | Blog posts
  • Xiaoli Chen
    https://blog.datacite.org/author/xiaoli-chendatacite-org/
    Connected in Gothenburg: DataCite's first in-person Connect event
  • Xiaoli Chen
    https://blog.datacite.org/author/xiaoli-chendatacite-org/
    Mind the gap - what to expect when practicing FAIR
  • Xiaoli Chen
    https://blog.datacite.org/author/xiaoli-chendatacite-org/
    The pitfalls of traditional workflows - with a silver lining
  • Xiaoli Chen
    https://blog.datacite.org/author/xiaoli-chendatacite-org/
    Introducing Xiaoli and the FAIR Workflows Project
Cody Ross
Application Support Engineer | Blog posts
  • Cody Ross
    https://blog.datacite.org/author/cody-rossdatacite-org/
    IGSN ID Catalogs – Now it is Even Easier to Register IGSN IDs!
  • Cody Ross
    https://blog.datacite.org/author/cody-rossdatacite-org/
    Start registering IGSN IDs with DataCite now!
  • Cody Ross
    https://blog.datacite.org/author/cody-rossdatacite-org/
    Welcome Cody!
Paul Vierkant
Outreach Manager at DataCite | Blog posts
  • Paul Vierkant
    https://blog.datacite.org/author/pvierkantdatacite-org/
    Connected in Gothenburg: DataCite's first in-person Connect event
  • Paul Vierkant
    https://blog.datacite.org/author/pvierkantdatacite-org/
    Launch of the PID-network Project - Understanding Metadata Workflows
  • Paul Vierkant
    https://blog.datacite.org/author/pvierkantdatacite-org/
    DataCite Member Survey 2022
  • Paul Vierkant
    https://blog.datacite.org/author/pvierkantdatacite-org/
    Welcome our new DataCite Committee Members

Share this:

  • Click to share on Twitter (Opens in new window)
  • Click to share on Facebook (Opens in new window)
FAIR Workflows, IGSN, Metadata, NFDI4Ing, Projects.

© 2022 Kelly Stathis, Xiaoli Chen, Cody Ross and Paul Vierkant. Distributed under the terms of the Creative Commons Attribution license.


Post navigation

FAIR is everywhere
Investigating PIDs for organizations – ORCID DE 2 project successfully completed

Recent Posts

  • Connected in Gothenburg: DataCite’s first in-person Connect event
  • IGSN ID Catalogs – Now it is Even Easier to Register IGSN IDs!
  • DataCite Design System is ready to be worn.
  • DataCite launches Global Access Program with support from CZI
  • Launch of the PID-network Project – Understanding Metadata Workflows

Tags

Anniversary (3) API (3) Bibliometrics (2) Citation (8) Conference (2) Content negotiation (2) Crossref (10) CSV (4) Data-level metrics (9) Data citation (7) Discovery (2) Docker (3) DOI (18) Dublin core (2) Fabrica (4) FAIR (5) FORCE11 (2) FREYA (8) Github (2) Google (2) GraphQL (7) IGSN (5) Impactstory (2) Infrastructure (13) MDC (7) Members (13) Metadata (35) Open hours (2) ORCID (17) Organization identifiers (4) PIDapalooza (5) PID graph (9) Policy (2) RDA (8) Re3data (11) React (2) ROR (5) Schema.org (3) Search (3) Services (5) Software (2) Software citation (5) Staff (6) Strategy (2) THOR (13)

Archives

  • March 2023 (3)
  • February 2023 (2)
  • January 2023 (5)
  • December 2022 (4)
  • November 2022 (3)
  • October 2022 (5)
  • September 2022 (6)
  • August 2022 (3)
  • July 2022 (1)
  • June 2022 (3)
  • May 2022 (1)
  • April 2022 (1)
  • March 2022 (2)
  • February 2022 (3)
  • January 2022 (1)
  • December 2021 (2)
  • November 2021 (3)
  • October 2021 (5)
  • August 2021 (2)
  • July 2021 (2)
  • June 2021 (1)
  • May 2021 (2)
  • April 2021 (2)
  • March 2021 (2)
  • February 2021 (3)
  • January 2021 (3)
  • December 2020 (1)
  • November 2020 (2)
  • October 2020 (4)
  • September 2020 (4)
  • August 2020 (3)
  • July 2020 (3)
  • June 2020 (2)
  • May 2020 (3)
  • April 2020 (2)
  • March 2020 (2)
  • February 2020 (4)
  • January 2020 (4)
  • December 2019 (3)
  • November 2019 (3)
  • October 2019 (5)
  • September 2019 (3)
  • August 2019 (3)
  • July 2019 (3)
  • June 2019 (2)
  • May 2019 (5)
  • April 2019 (6)
  • March 2019 (2)
  • February 2019 (5)
  • January 2019 (1)
  • December 2018 (4)
  • November 2018 (3)
  • October 2018 (4)
  • September 2018 (4)
  • August 2018 (4)
  • June 2018 (4)
  • May 2018 (4)
  • April 2018 (1)
  • February 2018 (3)
  • January 2018 (1)
  • November 2017 (2)
  • October 2017 (2)
  • August 2017 (4)
  • July 2017 (1)
  • June 2017 (1)
  • May 2017 (2)
  • April 2017 (5)
  • March 2017 (2)
  • January 2017 (1)
  • December 2016 (4)
  • November 2016 (2)
  • October 2016 (5)
  • September 2016 (3)
  • August 2016 (1)
  • July 2016 (3)
  • June 2016 (1)
  • May 2016 (6)
  • April 2016 (5)
  • March 2016 (5)
  • February 2016 (2)
  • January 2016 (2)
  • December 2015 (3)
  • November 2015 (3)
  • October 2015 (8)
  • September 2015 (5)
  • August 2015 (6)

About

  • What we do
  • Governance
  • Members
  • Steering groups
  • Team
  • Job opportunities

Services

  • Create DOIs with Fabrica
  • Discover metadata with Commons
  • Integrate with APIs
  • Partner services

Resources

  • Metadata schema
  • Support
  • Fee model

Community

  • Members
  • Partners
  • Steering groups
  • Service providers
  • Roadmap
  • FAIR Workflows

Contact us

  • Imprint
  • Terms and conditions
  • Privacy policy
  • Mail
  • RSS Feed
  • Twitter
  • Mastodon
  • GitHub
  • YouTube
  • LinkedIn
We use cookies on our website. Some are technically necessary, others help us improve your user experience. You can decline non-essential cookies by selecting “Reject”. Please see our Privacy Policy for further information about our privacy practices and use of cookies.
RejectAccept
Manage consent

Privacy Overview

This website uses cookies to improve your experience while you navigate through the website. Out of these, the cookies that are categorized as necessary are stored on your browser as they are essential for the working of basic functionalities of the website. We also use third-party cookies that help us analyze and understand how you use this website. These cookies will be stored in your browser only with your consent. You also have the option to opt-out of these cookies. But opting out of some of these cookies may affect your browsing experience.
Necessary
Always Enabled
Necessary cookies are absolutely essential for the website to function properly. This category only includes cookies that ensures basic functionalities and security features of the website. These cookies do not store any personal information.
Non-necessary
Any cookies that may not be particularly necessary for the website to function and is used specifically to collect user personal data via analytics, ads, other embedded contents are termed as non-necessary cookies. It is mandatory to procure user consent prior to running these cookies on your website.
SAVE & ACCEPT