Adam Buttrick

Adam has moved on from Crossref. Adam Buttrick is a librarian and developer based in Los Angeles, USA. He previously worked as a data developer for the Getty Conservation Institute, as an implementation manager for OCLC’s Metadata Services, and for the University of Michigan’s Art, Architecture, and Engineering Library. As metadata curation lead for the ROR project, Adam coordinates ongoing updates and improvements to the registry and works closely with ROR’s community curation advisory board.

Metadata matching: beyond correctness

Dominika Tkaczyk, Adam Buttrick – 2025 January 08

In MetadataLinkingMetadata MatchingData Science

In our previous entry, we explained that thorough evaluation is key to understanding a matching strategy’s performance. While evaluation is what allows us to assess the correctness of matching, choosing the best matching strategy is, unfortunately, not as simple as selecting the one that yields the best matches. Instead, these decisions usually depend on weighing multiple factors based on your particular circumstances. This is true not only for metadata matching, but for many technical choices that require navigating trade-offs. In this blog post, the last one in the metadata matching series, we outline a subjective set of criteria we would recommend you consider when making decisions about matching.

How good is your matching?

Dominika Tkaczyk, Adam Buttrick – 2024 November 06

In MetadataLinkingMetadata MatchingData Science

In our previous blog post in this series, we explained why no metadata matching strategy can return perfect results. Thankfully, however, this does not mean that it’s impossible to know anything about the quality of matching. Indeed, we can (and should!) measure how close (or far) we are from achieving perfection with our matching. Read on to learn how this can be done!

How about we start with a quiz? Imagine a database of scholarly metadata that needs to be enriched with identifiers, such as ORCIDs or ROR IDs. Hopefully, by this point in our series this is recognizable as a classic matching problem. In searching for a solution, you identify an externally-developed matching tool that makes one of the below claims. Which of the following would demonstrate satisfactory performance?

The myth of perfect metadata matching

Dominika Tkaczyk, Adam Buttrick – 2024 August 28

In MetadataLinkingMetadata MatchingData Science

In our previous instalments of the blog series about matching (see part 1 and part 2), we explained what metadata matching is, why it is important and described its basic terminology. In this entry, we will discuss a few common beliefs about metadata matching that are often encountered when interacting with users, developers, integrators, and other stakeholders. Spoiler alert: we are calling them myths because these beliefs are not true! Read on to learn why.

The anatomy of metadata matching

Dominika Tkaczyk, Adam Buttrick – 2024 June 27

In MetadataLinkingMetadata MatchingData Science

In our previous blog post about metadata matching, we discussed what it is and why we need it (tl;dr: to discover more relationships within the scholarly record). Here, we will describe some basic matching-related terminology and the components of a matching process. We will also pose some typical product questions to consider when developing or integrating matching solutions.

Basic terminology

Metadata matching is a high-level concept, with many different problems falling into this category. Indeed, no matter how much we like to focus on the similarities between different forms of matching, matching affiliation strings to ROR IDs or matching preprints to journal papers are still different in several important ways. At Crossref and ROR, we call these problems matching tasks.

Metadata matching 101: what is it and why do we need it?

Dominika Tkaczyk, Adam Buttrick – 2024 May 16

In MetadataLinkingMetadata MatchingData Science

At Crossref and ROR, we develop and run processes that match metadata at scale, creating relationships between millions of entities in the scholarly record. Over the last few years, we’ve spent a lot of time diving into details about metadata matching strategies, evaluation, and integration. It is quite possibly our favourite thing to talk and write about! But sometimes it is good to step back and look at the problem from a wider perspective. In this blog, the first one in a series about metadata matching, we will cover the very basics of matching: what it is, how we do it, and why we devote so much effort to this problem.

RSS feed

Recent blog posts

Why PID strategies need more than PIDs: our first position paper

2026 July 20

Schema 5.5 now available: adding CRediT, new record types for blogs and posters, and more

2026 July 09

Take part in UX Research at Crossref

2026 July 02

Building, refining, and connecting: summary of our May 2026 community update

2026 June 30

Get involved

Find a service

Documentation

About us

2026 July 20

Why PID strategies need more than PIDs: our first position paper

2026 July 09

Schema 5.5 now available: adding CRediT, new record types for blogs and posters, and more

2026 July 02

Take part in UX Research at Crossref

2026 June 30

Building, refining, and connecting: summary of our May 2026 community update

Blog

Adam Buttrick

Metadata matching: beyond correctness

How good is your matching?

The myth of perfect metadata matching

The anatomy of metadata matching

Basic terminology

Metadata matching 101: what is it and why do we need it?

Recent blog posts

Topics

Archives