Identifiers - Crossref

January 2015 DOI Outage: Followup Report

Geoffrey Bilder – 2015 March 17

In Identifiers, Persistence, Handle, DataCite, DOIs

Background

On January 20th, 2015 the main DOI HTTP proxy at doi.org experienced a partial, rolling global outage. The system was never completely down, but for at least part of the subsequent 48 hours, up to 50% of DOI resolution traffic was effectively broken. This was true for almost all DOI registration agencies, including Crossref, DataCite and mEDRA.

At the time we kept people updated on what we knew via Twitter, mailing lists and our technical blog at CrossTech. We also promised that, once we’d done a thorough investigation, we’d report back. Well, we haven’t finished investigating all implications of the outage. There are substantial technical issues and substantial governance issues still to investigate. But last week we provided a preliminary report to the Crossref board on the basic technical issues, and we thought we’d share that publicly now.

Crossref’s DOI Event Tracker Pilot

Geoffrey Bilder – 2015 March 02

In Citation, R&D, Event Data, Identifiers, Linking

TL;DR

Crossref’s “DOI Event Tracker Pilot”- 11 million+ DOIs & 64 million+ events. You can play with it at: http://goo.gl/OxImJa

Tracking DOI Events

So have you been wondering what we’ve been doing since we posted about the experiments we were conducting using PLOS’s open source ALM code? A lot, it turns out. About a week after our post, we were contacted by a group of our members from OASPA who expressed an interest in working with the system. Apparently they were all about to conduct similar experiments using the ALM code, and they thought that it might be more efficient and interesting if they did so together using our installation. Yippee. Publishers working together. That’s what we’re all about.

Problems with dx.doi.org on January 20th 2015- what we know.

Geoffrey Bilder – 2015 January 21

In Handle, Identifiers, Persistence, DataCite

Hell’s teeth.

So today (January 20th, 2015) the DOI HTTP resolver at dx.doi.org started to fail intermittently around the world. The doi.org domain is managed by CNRI on behalf of the International DOI Foundation. This means that the problem affected all DOI registration agencies including Crossref, DataCite, mEDRA etc. This also means that more popularly known end-user services like FigShare and Zenodo were affected. The problem has been fixed, but the fix will take some time to propagate throughout the DNS system. You can monitor the progress here:

https://www.whatsmydns.net/#A/doi.org

Now for the embarrassing stuff…

♫ Researchers just wanna have funds ♫

Geoffrey Bilder – 2014 April 10

In ORCID, Crossmark, R&D, Open Funder Registry, Identifiers, Linked Data, Metadata

Summary

You can use a new Crossref API to query all sorts of interesting things about who funded the research behind the content Crossref members publish.
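As a rough sketch of what such a query could look like in Python: this excerpt does not name the endpoint, so the code assumes the present-day public Crossref REST API at `https://api.crossref.org/funders` with its `query` and `rows` parameters, and it assumes each returned item carries `id` and `name` fields. The helper names `funder_search_url` and `search_funders` are hypothetical.

```python
import json
import urllib.parse
import urllib.request

# Assumption: the current public Crossref REST API, not an address taken from the post.
API_BASE = "https://api.crossref.org"


def funder_search_url(name: str, rows: int = 5) -> str:
    """Build a funder search URL (hypothetical helper)."""
    query = urllib.parse.urlencode({"query": name, "rows": rows})
    return f"{API_BASE}/funders?{query}"


def search_funders(name: str, rows: int = 5) -> list[tuple[str, str]]:
    """Return (funder id, funder name) pairs. Requires network access.

    Assumes the JSON response keeps its results under message -> items.
    """
    with urllib.request.urlopen(funder_search_url(name, rows), timeout=30) as resp:
        payload = json.load(resp)
    return [(item["id"], item["name"]) for item in payload["message"]["items"]]


# Example (needs a network connection):
#   for funder_id, funder_name in search_funders("National Science Foundation"):
#       print(funder_id, funder_name)
```

Keeping the URL construction separate from the network call makes it easy to inspect exactly what would be sent before sending it.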

Background

Back in May 2013 we launched Crossref’s FundRef service. It can be summarized like this:

Crossref keeps and manages a canonical list of Funder Names (ephemeral) and associated identifiers (persistent).
We encourage our members (or anybody, really- the list is available under a CC-Zero license waiver) to use this list for collecting information on who funded the research behind the content that our members publish.
We then ask that our members deposit this data in their normal Crossref metadata deposits.

And that was cool.

DOIs unambiguously and persistently identify published, trustworthy, citable online scholarly literature. Right?

Geoffrey Bilder – 2013 September 20

In Identifiers, Interoperability, ORCID, Persistence, DataCite

The South Park movie, “Bigger, Longer & Uncut”, has a DOI:

a) http://dx.doi.org/10.5240/B1FA-0EEC-C316-3316-3A73-L

So does the pornographic movie, “Young Sex Crazed Nurses”:

b) http://dx.doi.org/10.5240/4CF3-57AB-2481-651D-D53D-Q

And the following DOI points to a fake article on a “Google-Based Alien Detector”:

c) http://dx.doi.org/10.6084/m9.figshare.93964

And the following DOI refers to an infamous fake article on literary theory:

d) http://dx.doi.org/10.2307/466856

This scholarly article discusses the entirely fictitious Australian “Drop Bear”:

DataCite supporting content negotiation

Geoffrey Bilder – 2011 October 10

In Data, DataCite, Identifiers, Linked Data, Standards

In April, Crossref began supporting content negotiation for its DOIs. At the time I cheekily called out DataCite to start supporting content negotiation as well.

Edward Zukowski (DataCite’s resident propeller-head) took up the challenge with gusto and, as of September 22nd, DataCite has also been supporting content negotiation for its DOIs. This means that one million more DOIs are now linked-data friendly. Congratulations to Ed and the rest of the team at DataCite.

We hope this is a trend. Back in June, Knowledge Exchange organized a seminar on Persistent Object Identifiers. One of the outcomes of the meeting was the “Den Haag Manifesto”, a document outlining five relatively simple steps that different persistent identifier systems could take in order to increase interoperability. Most of these steps involved adopting linked data principles, including support for content negotiation. We look forward to hearing about other persistent identifiers adopting these principles over the next year.

Content Negotiation for Crossref DOIs

Geoffrey Bilder – 2011 April 19

In DataCite, Identifiers, Linked Data, Metadata, Programming, Standards

So does anybody remember the posting DOIs and Linked Data: Some Concrete Proposals?

Well, we went with option “D.”

From now on, DOIs, expressed as HTTP URIs, can be used with content-negotiation.

Let’s get straight to the point. If you have curl installed, you can start playing with content-negotiation and Crossref DOIs right away:

curl -D - -L -H "Accept: application/rdf+xml" "http://dx.doi.org/10.1126/science.1157784"
curl -D - -L -H "Accept: text/turtle" "http://dx.doi.org/10.1126/science.1157784"
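
If you would rather script it, the same request can be sketched in Python with only the standard library. This is a minimal illustration rather than an official client: the helper names `build_request` and `fetch_metadata` are hypothetical, and the resolver address and media types are simply the ones from the curl commands above.

```python
import urllib.request

# Resolver address as used in the curl commands above.
DOI_RESOLVER = "http://dx.doi.org/"


def build_request(doi: str, media_type: str) -> urllib.request.Request:
    """Build a GET request that asks the resolver for `media_type` (hypothetical helper)."""
    return urllib.request.Request(DOI_RESOLVER + doi, headers={"Accept": media_type})


def fetch_metadata(doi: str, media_type: str = "text/turtle") -> str:
    """Resolve the DOI and return the response body. Requires network access.

    urlopen follows redirects by default, much like curl's -L flag.
    """
    with urllib.request.urlopen(build_request(doi, media_type), timeout=30) as resp:
        return resp.read().decode("utf-8", errors="replace")


# Example (needs a network connection):
#   print(fetch_metadata("10.1126/science.1157784", "application/rdf+xml"))
```

The only thing that changes between representations is the `Accept` header; the DOI URI stays the same, which is the whole point of content negotiation.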

XMP in RSC PDFs

admin – 2010 August 03

In Identifiers, PDF, XMP, InChI

Just a quick heads-up to say that we’ve had a go at incorporating InChIs and ontology terms into our PDFs with XMP. There isn’t a lot of room in an XMP packet so we’ve had to be a bit particular about what we include.

InChIs: the bigger the molecule the longer the InChI, so we’ve standardized on the fixed-length InChIKey. This doesn’t mean anything on its own, so we’ve gone the Semantic Web route of including an InChI resolver HTTP URI. Alternatively you can extract the InChIKeys with a regular expression.
Ontology terms: we’re using HTTP URIs again and pointing to either Open Biomedical Ontology URIs (biology, biomedicine; slashy) or RSC ontology terms (chemistry; hashy). Often the OBO URIs resolve to a specific web page, but for the moment the RSC URIs just point to a large OWL file. Slashy URIs are quite a bit more involved so we’ll have to see what the demand is like.

There’s only about 4K to play with, so it’s only ever going to be a best-of. More detailed article metadata has to go in either a sidecar file, as Tony has pointed out before, or ideally on the article landing page. The example files are here and I’ve posted something with a different slant on the RSC technical blog.
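
The regular-expression route mentioned above might look like the following Python sketch. It assumes the standard 27-character InChIKey layout (14 uppercase letters, a hyphen, 10 uppercase letters, a hyphen, then one uppercase letter). The sample packet text and the `resolver.example.org` address in it are made up for illustration and are not taken from the RSC files.

```python
import re

# Assumed layout: 14 letters, hyphen, 10 letters, hyphen, 1 letter (all uppercase).
INCHIKEY_RE = re.compile(r"\b[A-Z]{14}-[A-Z]{10}-[A-Z]\b")


def extract_inchikeys(xmp_text: str) -> list[str]:
    """Return the distinct InChIKeys found in the text, in order of first appearance."""
    # dict.fromkeys drops repeats while preserving insertion order.
    return list(dict.fromkeys(INCHIKEY_RE.findall(xmp_text)))


# Made-up fragment standing in for the contents of an XMP packet.
sample = (
    '<rdf:li rdf:resource="http://resolver.example.org/inchikey/'
    'BSYNRYMUTXBXSQ-UHFFFAOYSA-N"/>'
    '<rdf:li rdf:resource="http://resolver.example.org/inchikey/'
    'RYYVLZVUVIJVGH-UHFFFAOYSA-N"/>'
)

print(extract_inchikeys(sample))
```

Because the key has a fixed length and alphabet, the pattern works whether the key appears bare or embedded at the end of a resolver URI.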

DOIs and Linked Data: Some Concrete Proposals

Geoffrey Bilder – 2010 March 25

In Identifiers, Linked Data

Since last month’s threads (here, here, here and here) talking about the issues involved in making the DOI a first-class identifier for linked data applications, I’ve had the chance to actually sit down with some of the threads’ participants (Tony Hammond, Leigh Dodds, Norman Paskin) and we’ve been able to sketch out some possible scenarios for migrating the DOI into a linked data world.

I think that several of us were struck by how little actually needs to be done in order to fully address virtually all of the concerns that the linked data community has expressed about DOIs. Not only that- but in some of these scenarios we would put ourselves in a position to be able to semantically-enable over 40 million DOIs with what amounts to the flick of a switch.

Does a Crossref DOI identify a “work?”

Geoffrey Bilder – 2010 February 11

In Identifiers, Linked Data, Publishing

Tony’s recent thread on making DOIs play nicely in a linked data world has raised an issue I’ve meant to discuss here for some time- a lot of the thread is predicated on the idea that Crossref DOIs are applied at the abstract “work” level. Indeed, that is what it currently says in our guidelines. Unfortunately, this is a case where theory, practice and documentation all diverge.

When the Crossref linking system was developed it was focused primarily on facilitating persistent linking amongst journals and conference proceedings. The system was quickly adapted to handle books and more recently to handle working papers, technical reports, standards and “components”- a catchall term used to refer to everything from individual article images to database records.
