Blog

Joe Wass

Joe Wass was Head of the Software Development team. He spent his first five years at Crossref getting to know our broad community, with a special focus on finding citations in new places on the web, keeping tabs on the evolving activities of scholars round the world.

Mending Chestertonā€™s Fence: Open Source Decision-making

Joe Wass

Joe Wass ā€“ 2024 March 18

In EngineeringPOSI

When each line of code is written it is surrounded by a sea of context: who in the community this is for, what problem weā€™re trying to solve, what technical assumptions weā€™re making, what we already tried but didnā€™t work, how much coffee weā€™ve had today. All of these have an effect on the software we write.

By the time the next person looks at that code, some of that context will have evaporated. There may be helpful code comments, tests, and specifications to explain how it should behave. But they donā€™t explain the path not taken, and why we didnā€™t take it. Or those occasions where the facts changed, so we changed our mind.

Mending Chestertonā€™s Fence: Open Source Decision-making

Joe Wass

Joe Wass ā€“ 2024 March 18

In Engineering

When each line of code is written it is surrounded by a sea of context: who in the community this is for, what problem weā€™re trying to solve, what technical assumptions weā€™re making, what we already tried but didnā€™t work, how much coffee weā€™ve had today. All of these have an effect on the software we write.

By the time the next person looks at that code, some of that context will have evaporated. There may be helpful code comments, tests, and specifications to explain how it should behave. But they donā€™t explain the path not taken, and why we didnā€™t take it. Or those occasions where the facts changed, so we changed our mind.

Renewed Persistence

Joe Wass

Joe Wass ā€“ 2023 April 01

In Engineering

We believe in Persistent Identifiers. We believe in defence in depth. Today weā€™re excited to announce an upgrade to our data resilience strategy.

5Ā¼ inch floppy disk with Crossref logo

Defence in depth means layers of security and resilience, and that means layers of backups. For some years now, our last line of defence has been a reliable, tried-and-tested technology. One thatā€™s been around for a while. Yes, Iā€™m talking about the humble 5Ā¼ inch floppy disk.

Whatā€™s that DOI?

Joe Wass

Joe Wass ā€“ 2019 January 21

In Event DataPidapalooza

This is a long overdue followup to 2016ā€™s ā€œURLs and DOIs: a complicated relationshipā€. Like that post, this accompanies my talk at PIDapalooza, the festival of open persistent identifiers). I donā€™t think I need to give a spoiler warning when I tell you that itā€™s still complicated. But this post presents some vocabulary to describe exactly how complicated it is. Event Data has been up and running and collecting data for a couple of years now, but this post describes changes we made toward the end of 2018.

Hear this, real insight into the inner workings of Crossref

You want to hear more from us. We hear you.

Weā€™ve spent the past year building Crossref Event Data, and hope to launch very soon. Building a new piece of infrastructure from scratch has been an exciting project, and weā€™ve taken the opportunity to incorporate as much feedback from the community as possible. Weā€™d like to take a moment to share some of the suggestions we had, and how weā€™ve acted on them.

Bridging Identifiers at PIDapalooza

Hello from sunny Girona! Iā€™m heading to PIDapalooza, the Persistent Identifier festival, as it returns for its second year. Itā€™s all about to kick off.

One of the themes this year is ā€œbridging worldsā€: how to bring together different communities and the identifiers they use. Something I really enjoyed about PIDapalooza last year was the variety of people who came. We heard about some ā€œtraditionalā€ identifier systems (at least, it seems that way to us): DOIs for publications, DOIs for datasets, ORCIDs for researchers. But, gathered in Reykjavik, under dark Icelandic skies, I met oceanographic surveyors assigning DOIs to drilling equipment, heard stories of identifiers in Chinese milk production and consoled librarians trying navigate the identifier landscape.

Event Data as Underlying Altmetrics Infrastructure at the 4:AM Altmetrics Conference

Iā€™m here in Toronto and looking forward to a busy week. Maddy Watson and I are in town for the 4:AM Altmetrics Conference, as well as the altmetrics17 workshop and Hack-day. Iā€™ll be speaking at each, and for those of you who arenā€™t able to make it, Iā€™ve combined both presentations into a handy blog post, which follows on from my last one.

But first, nothing beats a good demo. Take a look at our live stream. This shows the Events passing through Crossref Event Data, live, as they happen. You may need to wait a few seconds before you see anything.

You do want to see how itā€™s made ā€” seeing what goes into altmetrics

Thereā€™s a saying about oil, something along the lines of ā€œyou really donā€™t want to see how itā€™s madeā€. And whilst Iā€™m reluctant to draw too many parallels between the petrochemical industry and scholarly publishing, there are some interesting comparisons to be drawn.

Oil starts its life deep underground as an amorphous sticky substance. Prospectors must identify oil fields, drill, extract the oil and refine it. It finds its way into things as diverse as aspirin, paint and hammocks. And as I lie in my hammock watching paint dry, Iā€™m curious to know how crude oil made its way into the aspirin that Iā€™ve taken for the headache brought on by the paint fumes. Whilst it would be better if I did know how these things were made, not knowing doesnā€™t impair the efficacy of my aspirin.

URLs and DOIs: a complicated relationship

As the linking hub for scholarly content, itā€™s our job to tame URLs and put in their place something better. Why? Most URLs suffer from link rot and can be created, deleted or changed at any time. And thatā€™s a problem if youā€™re trying to cite them.

Using AWS S3 as a large key-value store for Chronograph

One of the cool things about working in Crossref Labs is that interesting experiments come up from time to time. One experiment, entitled ā€œwhat happens if you plot DOI referral domains on a chart?ā€ turned into the Chronograph project. In case you missed it, Chronograph analyses our DOI resolution logs and shows how many times each DOI link was resolved per month, and also how many times a given domain referred traffic to DOI links per day.