This year, metadata development is one of our key priorities and we’re making a start with the release of version 5.4.0 of our input schema with some long-awaited changes. This is the first in what will be a series of metadata schema updates.
What is in this update?
Publication typing for citations
This is fairly simple; we’ve added a ‘type’ attribute to the citations members supply. This means you can identify a journal article citation as a journal article, but more importantly, you can identify a dataset, software, blog post, or other citation that may not have an identifier assigned to it. This makes it easier for the many thousands of metadata users to connect these citations to identifiers. We know many publishers, particularly journal publishers, do collect this information already and will consider making this change to deposit citation types with their records.
Every year we release metadata for the full corpus of records registered with us, which can be downloaded for free in a single compressed file. This is one way in which we fulfil our mission to make metadata freely and widely available. By including the metadata of over 165 million research outputs from over 20,000 members worldwide and making them available in a standard format, we streamline access to metadata about scholarly objects such as journal articles, books, conference papers, preprints, research grants, standards, datasets, reports, blogs, and more.
Today, we’re delighted to let you know that Crossref members can now use ROR IDs to identify funders in any place where you currently use Funder IDs in your metadata. Funder IDs remain available, but this change allows publishers, service providers, and funders to streamline workflows and introduce efficiencies by using a single open identifier for both researcher affiliations and funding organizations.
As you probably know, the Research Organization Registry (ROR) is a global, community-led, carefully curated registry of open persistent identifiers for research organisations, including funding organisations. It’s a joint initiative led by the California Digital Library, Datacite and Crossref launched in 2019 that fulfills the long-standing need for an open organisation identifier.
We began our Global Equitable Membership (GEM) Program to provide greater membership equitability and accessibility to organizations in the world’s least economically advantaged countries. Eligibility for the program is based on a member’s country; our list of countries is predominantly based on the International Development Association (IDA). Eligible members pay no membership or content registration fees. The list undergoes periodic reviews, as countries may be added or removed over time as economic situations change.
It’s been a while, here’s a metadata update and request for feedback
In Spring 2023 we sent out a survey to our community with a goal of assessing what our priorities for metadata development should be - what projects are our community ready to support? Where is the greatest need? What are the roadblocks?
The intention was to help prioritize our metadata development work. There’s a lot we want to do, a lot our community needs from us, but we really want to make sure we’re focusing on the projects that will have the most immediate impact for now.
Several projects were proposed, based on community demand over time. All are projects we intend to support long-term.
Projects
The projects included in the survey were:
Alternate names - We proposed adding a repeatable ‘name’ element to allow for names that aren’t separated by given/family/surname.
Updates to funding data -this update will be released in the near future and includes:
Expand ROR support - Allow members to supply ROR ID instead of funder ID in funding data and grant records.
Include Grant DOIs in funding metadata.
Publication typing in citations - Support citation type in citation metadata (for example article, preprint, data, software, etc.).
Expand contributor role support - Allow multiple contributor roles to be provided per contributor and add support for external vocabularies (like CRediT)
Expand abstract support - We currently require all abstracts to be formatted using JATS. We will be adding new abstract formats, including BITS and ONIX (which have been requested), as well as a generic abstract format (non-JATS).
Statements - Add support for free text statements such as data availability, acknowledgments, funding, and conflict of interest.
Contributor identifiers - Accept contributor identifiers such as ISNI (in addition to ORCID, which is already supported).
There is a clear preference for publication types in citations and abstract markup, expanded support for multilingual metadata, followed by expanding contributor roles to support multiple roles and the CRediT taxonomy. The results have helped us prioritize our work and we’re advancing several projects soon based on our readiness to move forward.
First up is publication typing in citations and statements - we hope to be able to make this ready for registration in the coming months, but want to confirm a few things first, primarily the list of ‘types’ to apply to citations, so please review and comment: Metadata updates in need of feedback July 2024
We also have been discussing expansions to our support for preprints metadata with our Preprints Advisory Group and have a number of preprint-specific updates that will be rolled out in the coming months as well, including support for versions and status. These proposed changes are also available for comment.
And finally, we will be expanding support for contributor roles to include multiple roles per contributor, as well as adding support for the CRediT taxonomy. This update is yet to be scheduled but we do have the inputs and output planning done and welcome any comments on this as well.
We will also be continuing work on other projects highlighted in the survey that aren’t quite ready to go:
Multilingual metadata: Support for multilingual metadata in particular is very important and will require a fairly significant technical effort, so we want to be sure we get this right - at minimum we need to include repeatable fields flagged with language metadata for most items, there may be other considerations as well such as the scope of languages supported.
As we develop new metadata segments we’re keeping language metadata in mind, but I’d like to form a short-term working group to help shape this update - this group will be focused on the details of supporting multilingual metadata in our inputs and outputs, so conversations will be very XML and JSON heavy. If you are interested and available please contact pfeeney@crossref.org.
Abstract markup: we are currently in the research phase of this project but will be proposing updates and asking for input this fall. At the moment support for BITS and ONIX abstracts have been requested, as well as an agnostic format.
Expansion of name and contributor ID support: work is under way for this as well, and I should have inputs and outputs for feedback in the coming months.
We anticipate more developments and requests for feedback in the future as we still have other projects from the list above to get to. I’ve opened up a ‘Metadata Development’ section in our Community Forum to invite discussion and will be kicking off a renewed Metadata Interest Group in the fall.