This year, metadata development is one of our key priorities and we’re making a start with the release of version 5.4.0 of our input schema with some long-awaited changes. This is the first in what will be a series of metadata schema updates.
What is in this update?
Publication typing for citations
This is fairly simple; we’ve added a ‘type’ attribute to the citations members supply. This means you can identify a journal article citation as a journal article, but more importantly, you can identify a dataset, software, blog post, or other citation that may not have an identifier assigned to it. This makes it easier for the many thousands of metadata users to connect these citations to identifiers. We know many publishers, particularly journal publishers, collect this information already, and we hope they will consider depositing citation types with their records.
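As a rough illustration of a typed citation, the sketch below builds one citation element with a ‘type’ attribute using Python's standard library. The element names (citation, unstructured_citation), the key attribute, the type value "dataset", and the placeholder citation text are assumptions for illustration only; check the 5.4.0 schema documentation for the exact names and permitted values.

```python
import xml.etree.ElementTree as ET


def build_citation(key, text, citation_type):
    """Return a citation element that carries a 'type' attribute.

    Element and attribute names are assumptions, not taken from the schema.
    """
    citation = ET.Element("citation", {"key": key, "type": citation_type})
    unstructured = ET.SubElement(citation, "unstructured_citation")
    unstructured.text = text
    return citation


# Placeholder reference text, not a real citation.
citation = build_citation(
    "ref1",
    "Example Author (2024). Example survey dataset.",
    "dataset",
)
xml_text = ET.tostring(citation, encoding="unicode")
print(xml_text)
```

The point of the attribute is that a consumer can read the citation's type directly instead of guessing it from the free text.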
Every year we release metadata for the full corpus of records registered with us, which can be downloaded for free in a single compressed file. This is one way in which we fulfil our mission to make metadata freely and widely available. By including the metadata of over 165 million research outputs from over 20,000 members worldwide and making them available in a standard format, we streamline access to metadata about scholarly objects such as journal articles, books, conference papers, preprints, research grants, standards, datasets, reports, blogs, and more.
Today, we’re delighted to let you know that Crossref members can now use ROR IDs to identify funders in any place where you currently use Funder IDs in your metadata. Funder IDs remain available, but this change allows publishers, service providers, and funders to streamline workflows and introduce efficiencies by using a single open identifier for both researcher affiliations and funding organizations.
As you probably know, the Research Organization Registry (ROR) is a global, community-led, carefully curated registry of open persistent identifiers for research organisations, including funding organisations. It’s a joint initiative led by the California Digital Library, DataCite, and Crossref, launched in 2019, that fulfils the long-standing need for an open organisation identifier.
We began our Global Equitable Membership (GEM) Program to provide greater membership equitability and accessibility to organizations in the world’s least economically advantaged countries. Eligibility for the program is based on a member’s country; our list of countries is predominantly based on the International Development Association (IDA). Eligible members pay no membership or content registration fees. The list undergoes periodic reviews, as countries may be added or removed over time as economic situations change.
The bibliographic (descriptive) metadata you send us is used to display citations, match DOIs to citations, and to enhance discovery services. It is essential that this metadata is clean, complete, and accurate.
Do:
provide all contributors, titles, dates, and identifiers associated with the item you are registering
make sure the contributors, titles, dates, and identifiers are accurate
update and correct metadata as needed
Do not:
supply titles, names, or other metadata in all caps, even if that is how you display and store them - it makes it difficult for others to use your metadata to format citations (and link to your content)
omit article identifiers, page numbers, or author names - omissions make your content harder, or even impossible, to discover
force metadata into fields that aren’t a good match (for example, putting subject keywords into a title) - it’s often better just to leave it out
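The all-caps rule above is easy to check before depositing. The following is a minimal sketch, assuming a record is held as a plain dictionary of field names to strings; the field names and the four-letter threshold (used to avoid flagging short acronyms) are hypothetical choices, and anything flagged still needs a human decision.

```python
def is_all_caps(value, min_letters=4):
    """True when the text has enough letters and none of them are lowercase."""
    letter_count = sum(ch.isalpha() for ch in value)
    # str.isupper() is True only if there is at least one cased character
    # and every cased character is uppercase.
    return letter_count >= min_letters and value.isupper()


def find_all_caps_fields(record):
    """Return the names of string fields that look like all-caps text."""
    return [
        name
        for name, value in record.items()
        if isinstance(value, str) and is_all_caps(value)
    ]


# Hypothetical record used only to exercise the check.
record = {
    "title": "A STUDY OF SOIL",
    "family_name": "Garcia",
    "first_page": "101",
}
flagged = find_all_caps_fields(record)
print(flagged)
```

A check like this only detects the problem. Converting all-caps text back to correct casing automatically is unreliable for names and acronyms, so correction is better done at the source.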
Titles
Your metadata should include the title used for the content when it was first published. For most types of content alternate titles and subtitles can be provided as well (see each record type markup guide for details). You are also able in most cases to provide titles in multiple languages (see translated and multi-language materials).
Do:
use subtitles - subtitles are supported in a distinct subtitle element, and allow an item to be discoverable using the main title, subtitle, or both combined.
supply alternate titles, abbreviated titles, and translated titles if you use them in citation recommendations.
use face markup and/or MathML in titles when it impacts the meaning of the text.
Do not:
include non-title metadata such as author, price, or volume numbers in a title field - this is a common error that significantly impacts discoverability and display.
supply titles in ALL CAPS - our metadata is often used for display and citation formatting.
Additional best practices may apply to the content you are registering; see the specific record type guides for details.
Contributors
Contributor metadata is expressed consistently across record types (excluding Grants), and includes contributor names, roles, identifiers, alternate names, and affiliation information. A contributor is a single person or a group of people/organization that has contributed in some way to the content being registered.
Do include:
correct names, so authors and other contributors can be matched to citations
a complete contributor list so that contributors can receive credit for their work, and to help make your content more discoverable
contributor role(s) - at least one for each contributor, but supply as many as apply
ORCID iDs, so that authors can be disambiguated and connected to the research they write and support
affiliations and ROR IDs, so that contributor institutions can be identified and research outputs can be traced by institution
Do not:
include suffixes such as Jr, Sr, IV in the family name field - use the suffix element
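Where family names arrive with a suffix already attached, the two can be separated before the metadata is built. The sketch below assumes a short, hypothetical list of suffixes and a suffix written after the name, optionally preceded by a comma; it is a heuristic and will not cover every naming convention.

```python
import re

# Hypothetical suffix list; extend it to match your own data.
SUFFIX_PATTERN = re.compile(
    r"^(?P<family>.+?),?\s+(?P<suffix>Jr|Sr|II|III|IV)\.?$"
)


def split_suffix(family_name):
    """Return (family_name, suffix), with suffix None when none is found."""
    cleaned = family_name.strip()
    match = SUFFIX_PATTERN.match(cleaned)
    if match is None:
        return cleaned, None
    return match.group("family"), match.group("suffix")


for raw in ["King Jr.", "Davis, III", "Garcia"]:
    print(raw, "->", split_suffix(raw))
```

The family name then goes in the family name field and the second value, when present, in the suffix element.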
Dates
Do:
supply the entire date whenever possible - for most dates supplied within our metadata we allow you to supply just a year, with month and day being optional, but we encourage you to supply full dates whenever possible, particularly for online content.
supply all relevant date types - for most items a publication date is required and other dates are optional, but we encourage you to supply all dates that apply to the content you are registering. This includes acceptance dates for most content, and approval and posted dates for others.
include the correct date at both the parent and child level (journal issue / article, book title / chapter)
include both online and print publication dates (if applicable)
Do not:
supply only the most recent publication date - this is inaccurate and may impact your registration fees, as back year rates are calculated based on the publication year provided in your registration metadata.
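To make the date guidance concrete, the sketch below builds separate online and print publication dates and checks whether each one is a full date. The element and attribute names (publication_date, media_type, month, day, year) and the example dates are assumptions for illustration; confirm the exact markup in the relevant record type markup guide.

```python
import xml.etree.ElementTree as ET


def build_publication_date(media_type, year, month=None, day=None):
    """Build a date element; month and day are optional, year is not."""
    date = ET.Element("publication_date", {"media_type": media_type})
    if month is not None:
        ET.SubElement(date, "month").text = f"{month:02d}"
    if day is not None:
        ET.SubElement(date, "day").text = f"{day:02d}"
    ET.SubElement(date, "year").text = str(year)
    return date


def is_full_date(date):
    """True when the element has year, month, and day children."""
    return all(date.find(part) is not None for part in ("year", "month", "day"))


# Hypothetical dates: a full online date and a year-only print date.
online = build_publication_date("online", 2023, month=5, day=17)
print_date = build_publication_date("print", 2023)

for date in (online, print_date):
    print(ET.tostring(date, encoding="unicode"), "full:", is_full_date(date))
```

Keeping each media type as its own date means the original publication date is preserved rather than overwritten when a later version appears.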
Page numbers and article identifiers
Correct page number and article identifier (aka e-location ID) metadata is essential for many discovery systems.
Do:
be careful with your pages - be sure each page element contains only the page number itself, not a range. Capture the first page in first_page, the last page in last_page, and any additional page information in other_pages. If your content has pages, first_page is essential.
if you use article numbers / e-location IDs, supply them as described in the markup guide
Do not:
include an entire page range in first_page - this is incorrect, and will throw off many matching processes (in Crossref and beyond) and cause your metadata to be displayed incorrectly wherever it is used.
include extraneous text in the page field - just the page please, no ‘page 1’ or ‘1st pg’
Guidance on constructing XML for article IDs and page ranges can be found in our Article ID Markup Guide.
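The page rules above can be applied to raw page strings before depositing. This is a minimal sketch, assuming ranges are written with a hyphen or en dash and that any extraneous text is a leading label such as "page" or "pp."; the function name and the handled labels are hypothetical, and other_pages is left out for brevity.

```python
import re

# Leading labels such as "page 1" or "pp. 101-110" (a hypothetical list).
LABEL_PATTERN = re.compile(r"^\s*(?:pages?|pgs?|pp?)(?:\.\s*|\s+)", re.IGNORECASE)


def split_pages(raw):
    """Split a raw page string into first_page and last_page values."""
    cleaned = LABEL_PATTERN.sub("", raw).strip()
    # Split once on a hyphen or en dash, ignoring surrounding spaces.
    parts = re.split(r"\s*[-\u2013]\s*", cleaned, maxsplit=1)
    first_page = parts[0]
    last_page = parts[1] if len(parts) > 1 else None
    return {"first_page": first_page, "last_page": last_page}


for raw in ["pp. 101-110", "page 1", "45\u201352"]:
    print(repr(raw), "->", split_pages(raw))
```

The sketch does not expand abbreviated ranges such as "101-10" or handle trailing labels such as "1st pg", so values like these still need review by hand.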