This year, metadata development is one of our key priorities, and we're making a start with the release of version 5.4.0 of our input schema, which brings some long-awaited changes. This is the first in what will be a series of metadata schema updates.
What is in this update?
Publication typing for citations
This is fairly simple: we've added a "type" attribute to the citations members supply. This means you can identify a journal article citation as a journal article, but more importantly, you can identify a dataset, software, blog post, or other citation that may not have an identifier assigned to it. This makes it easier for the many thousands of metadata users to connect these citations to identifiers. We know many publishers, particularly journal publishers, do collect this information already and will consider making this change to deposit citation types with their records.
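As a rough illustration of what a typed citation could look like, here is a minimal Python sketch that builds a citation element with a type attribute. The `citation` element, its `key` attribute, and `unstructured_citation` follow the Crossref deposit schema, but the value `dataset` and the reference text are assumptions made for illustration; check the 5.4.0 schema for the exact list of allowed type values.

```python
import xml.etree.ElementTree as ET


def build_citation(key: str, citation_type: str, text: str) -> ET.Element:
    """Build a <citation> element that carries a type attribute."""
    citation = ET.Element("citation", {"key": key, "type": citation_type})
    unstructured = ET.SubElement(citation, "unstructured_citation")
    unstructured.text = text
    return citation


# "dataset" is an assumed type value; the reference text is a placeholder.
citation = build_citation(
    "ref1",
    "dataset",
    "Example Author (2024). Example dataset. Example Repository.",
)
xml_text = ET.tostring(citation, encoding="unicode")
print(xml_text)
```

The point of the sketch is only that the type travels with the citation itself, so a metadata user can tell a dataset reference from a journal article even when no identifier is present.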
Every year we release metadata for the full corpus of records registered with us, which can be downloaded for free in a single compressed file. This is one way in which we fulfil our mission to make metadata freely and widely available. By including the metadata of over 165 million research outputs from over 20,000 members worldwide and making them available in a standard format, we streamline access to metadata about scholarly objects such as journal articles, books, conference papers, preprints, research grants, standards, datasets, reports, blogs, and more.
Today, we're delighted to let you know that Crossref members can now use ROR IDs to identify funders in any place where you currently use Funder IDs in your metadata. Funder IDs remain available, but this change allows publishers, service providers, and funders to streamline workflows and introduce efficiencies by using a single open identifier for both researcher affiliations and funding organizations.
As you probably know, the Research Organization Registry (ROR) is a global, community-led, carefully curated registry of open persistent identifiers for research organisations, including funding organisations. It's a joint initiative led by the California Digital Library, DataCite, and Crossref, launched in 2019, that fulfills the long-standing need for an open organisation identifier.
We began our Global Equitable Membership (GEM) Program to provide greater membership equitability and accessibility to organizations in the world's least economically advantaged countries. Eligibility for the program is based on a member's country; our list of countries is predominantly based on the International Development Association (IDA). Eligible members pay no membership or content registration fees. The list undergoes periodic reviews, as countries may be added or removed over time as economic situations change.
ROR IDs and Affiliations of authors can now be tracked in Participation Reports! Check your own Participation Report to see how many of your publications have author affiliations and ROR IDs in Crossref metadata. If you deposit metadata via XML, see our guide on Affiliations and ROR for instructions on how to include affiliations and ROR IDs in your metadata.
Crossref encourages our members to include ROR IDs in metadata in order to help make research organization information clear and consistent as it is shared between systems. ROR IDs are essential to realize a rich and complete Research Nexus because they enable connections between research outputs and the organizations that support researchers.
"At Scholastica, we care about taking steps to enrich metadata - like adding ROR IDs, for example, on behalf of our customers, so they don't have to worry about the technical aspects of metadata collection or creation and can instead focus on maximizing the discovery benefits." - Cory Schires, Co-founder and Chief Technology Officer, Scholastica
"If we're talking about misconduct, then you might need to be able to contact the institution that the author is from. On an individual manuscript, it doesn't matter if there's no identifier - an address will do. But if you find some signal that is on manuscripts at scale, and you've got thousands of them, well, you need an identifier. You can't go through them and try and search for every single one of those institutions." - Adam Day, CEO, Clear Skies Ltd.
ROR IDs are specifically designed to be implemented in any system that captures institutional affiliations and to enable a richer networked research infrastructure. ROR IDs are interoperable with other organization identifiers, including GRID (which provided the seed data that ROR launched with), the Open Funder Registry, ISNI, and Wikidata. ROR data is available under a CC0 Public Domain waiver and can be accessed at no cost via a public API and a data dump.
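To give a feel for the public API mentioned above, here is a minimal Python sketch that builds a search URL and reads ROR IDs from the response. It assumes the v2 endpoint `https://api.ror.org/v2/organizations` with a `query` parameter and a response containing an `items` list whose entries carry an `id` field; the function names are hypothetical, and the ROR API documentation remains the authority on the request and response details.

```python
import json
from urllib.parse import urlencode
from urllib.request import urlopen

# Assumed endpoint; confirm against the current ROR API documentation.
ROR_API_BASE = "https://api.ror.org/v2/organizations"


def build_search_url(name: str) -> str:
    """Return a ROR search URL for a free-text organization name."""
    return f"{ROR_API_BASE}?{urlencode({'query': name})}"


def search_organizations(name: str) -> list[str]:
    """Return ROR IDs of matching organizations (performs a network call)."""
    with urlopen(build_search_url(name), timeout=10) as response:
        payload = json.load(response)
    # Assumes each item in "items" exposes its ROR ID under "id".
    return [item["id"] for item in payload.get("items", [])]


# Only the URL is built here; search_organizations() needs network access.
print(build_search_url("UC Berkeley"))
```

A submission system could call something like `search_organizations` behind a typeahead field and store the selected ID alongside the affiliation name.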
ROR is operated as a joint initiative by Crossref, DataCite, and the California Digital Library, and was launched with seed data from GRID in collaboration with Digital Science. These organizations have invested resources into building an open registry of research organization identifiers that can be embedded in scholarly infrastructure to effectively link research to organizations. ROR is not a membership organization (or an organization at all!) and charges no fees for use of the registry or the API. Read more about ROR's sustainability model.
Why ROR IDs are an important element of Crossref metadata
For a long time, Crossref only collected affiliation metadata as free-text strings, which made for ambiguity and incomplete data. An author affiliated with the University of California at Berkeley might give the name of the university in any of several common ways:
University of California, Berkeley
University of California at Berkeley
University of California Berkeley
UC Berkeley
Berkeley
And likely more...
While it isn't too difficult for a human to guess that "UC Berkeley," "University of California, Berkeley," and "University of California at Berkeley" are all referring to the same university, a machine interpreting this information wouldn't necessarily make the same inference. If you are trying to easily find all of the publications associated with UC Berkeley, you would need to run and reconcile multiple searches at best, or, at worst, miss some data completely.
This is where an organization identifier comes in: a single, unambiguous, standardized identifier that will always stay the same. For UC Berkeley, that would be https://ror.org/01an7q238.
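The difference can be sketched in a few lines of Python. The lookup table below is a hypothetical, hand-built mapping for illustration only (a real system would match against the ROR registry rather than a hard-coded dictionary); it shows how several spellings collapse to the single identifier given above.

```python
from typing import Optional

UC_BERKELEY_ROR = "https://ror.org/01an7q238"

# Hypothetical variant table. "Berkeley" on its own is left out on purpose,
# because it is too ambiguous to map safely without more context.
KNOWN_VARIANTS = {
    "university of california, berkeley": UC_BERKELEY_ROR,
    "university of california at berkeley": UC_BERKELEY_ROR,
    "university of california berkeley": UC_BERKELEY_ROR,
    "uc berkeley": UC_BERKELEY_ROR,
}


def resolve_affiliation(raw: str) -> Optional[str]:
    """Map a free-text affiliation to a ROR ID, or None if it is unknown."""
    key = " ".join(raw.lower().split())  # lowercase and collapse whitespace
    return KNOWN_VARIANTS.get(key)


for name in ["UC Berkeley", "University of California at  Berkeley", "Berkeley"]:
    print(name, "->", resolve_affiliation(name))
```

Once every record carries the identifier, a single query on that ID replaces the multiple searches described above.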
In 2019, Crossref members indicated that the ability to associate research outputs with organizations in a clean and consistent fashion was one of their most desired improvements to Crossref metadata. In January of 2022, therefore, Crossref added support for ROR IDs in its metadata schema and APIs. Since then, more and more Crossref members have been including ROR IDs in DOI metadata.
Publishers and service providers can implement ROR in their systems so that submitting authors and co-authors can easily choose their affiliation from a ROR-powered list instead of typing in free text. Authors themselves do not have to provide a ROR ID or even know that a ROR ID is being collected. This affiliation information can then be sent to Crossref alongside other publication information.
Demo of collecting ROR IDs in a typeahead field
If the submission system you use does not yet support ROR, or if you don't use a submission system, you'll still be able to provide ROR IDs in your Crossref metadata. ROR IDs can be added to JATS XML, and Crossref helper tools will start to support the deposit of ROR IDs. There's also an OpenRefine reconciler that can map your internal identifiers to ROR identifiers.
ROR IDs for affiliations stand to transform the usability of Crossref metadata. While it's crucial to have IDs for affiliations, it's equally important that the affiliation data can be easily used. The ROR dataset is CC0, so ROR IDs and associated affiliation data can be freely and openly used and reused without any restrictions.
The ROR IDs registered by members in their Crossref metadata are available via Crossref's open APIs so that they can be detected, analyzed, and reused by anyone interested in linking research outputs to research organizations. Examples include:
Institutions who want to monitor and measure their research output by the articles their researchers have published
Funders who want to be able to discover and track the research and researchers they have supported
Academic librarians who want to find all of the publications associated with their campus
Journals who want to know where authors are affiliated so they can determine eligibility for institutionally sponsored publishing agreements
The inclusion of ROR IDs in Crossref metadata will eventually help all these entities make all these connections much more easily.
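As an illustration of how such connections might be made in practice, the following Python sketch collects the ROR IDs attached to the authors of a single work record. It assumes the record is shaped like the JSON returned by the Crossref REST API, where each author has an `affiliation` list and each affiliation may carry an `id` list with `id` and `id-type` fields; the sample record is invented for the example, not real data.

```python
def extract_ror_ids(work: dict) -> set[str]:
    """Collect every ROR ID found in the author affiliations of a work."""
    ror_ids = set()
    for author in work.get("author", []):
        for affiliation in author.get("affiliation", []):
            for identifier in affiliation.get("id", []):
                if identifier.get("id-type") == "ROR":
                    ror_ids.add(identifier["id"])
    return ror_ids


# Invented sample record; the field layout is an assumption.
sample_work = {
    "author": [
        {
            "family": "Example",
            "affiliation": [
                {
                    "name": "University of California, Berkeley",
                    "id": [{"id": "https://ror.org/01an7q238", "id-type": "ROR"}],
                }
            ],
        },
        # An affiliation supplied as free text only, with no identifier.
        {"family": "Sample", "affiliation": [{"name": "UC Berkeley"}]},
    ]
}

print(extract_ror_ids(sample_work))
```

The second author in the sample shows the limitation of free-text affiliations: without an identifier, the record cannot be linked to the organization automatically.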
Get ready to ROR 🦁!
ROR is already working with publishers, funders and service providers who are integrating ROR in their systems, mapping their affiliation data to ROR IDs, and/or including ROR IDs in publication metadata. Libraries and institutional repositories are also beginning to build ROR into their systems and to send ROR IDs to Crossref in their metadata. See the growing list of active and in-progress ROR integrations for more stakeholders who are supporting ROR.
If you deposit metadata with Crossref via XML, see our guide on Affiliations and ROR for instructions on how to include author affiliations and ROR IDs.
For further information on how ROR IDs are supported in the Crossref metadata, you can take a look at this .xsd file (under the "institution" element) or at this journal article example XML. ROR also has some great help documentation for publishers and anyone else working with the ROR Registry.
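For a rough idea of what this looks like in a deposit, here is a minimal Python sketch that builds an author entry with an affiliation and a ROR ID. The element names (`person_name`, `affiliations`, `institution`, `institution_name`, `institution_id` with `type="ror"`) reflect our understanding of the Crossref schema and should be checked against the .xsd file; the author name and the `build_author` helper are placeholders for illustration.

```python
import xml.etree.ElementTree as ET


def build_author(given: str, surname: str, institution_name: str, ror_id: str) -> ET.Element:
    """Build a <person_name> element with one affiliation and its ROR ID."""
    person = ET.Element(
        "person_name", {"contributor_role": "author", "sequence": "first"}
    )
    ET.SubElement(person, "given_name").text = given
    ET.SubElement(person, "surname").text = surname
    affiliations = ET.SubElement(person, "affiliations")
    institution = ET.SubElement(affiliations, "institution")
    ET.SubElement(institution, "institution_name").text = institution_name
    ET.SubElement(institution, "institution_id", {"type": "ror"}).text = ror_id
    return person


# Placeholder author; the ROR ID is the UC Berkeley identifier cited earlier.
author = build_author(
    "Example",
    "Author",
    "University of California, Berkeley",
    "https://ror.org/01an7q238",
)
ET.indent(author)  # pretty-print; requires Python 3.9+
print(ET.tostring(author, encoding="unicode"))
```

Keeping the institution name next to the identifier means the record stays readable for humans while remaining unambiguous for machines.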