Metadata Matching

Follow the money, or how to link grants to research outputs

Dominika Tkaczyk – 2022 March 22

In Grant Linking SystemLinkingR&DMetadata Matching

The ecosystem of scholarly metadata is filled with relationships between items of various types: a person authored a paper, a paper cites a book, a funder funded research. Those relationships are absolutely essential: an item without them is missing the most basic context about its structure, origin, and impact. No wonder that finding and exposing such relationships is considered very important by virtually all parties involved. Probably the most famous instance of this problem is finding citation links between research outputs. Lately, another instance has been drawing more and more attention: linking research outputs with grants used as their funding source. How can this be done and how many such links can we observe?

What if I told you that bibliographic references can be structured?

Dominika Tkaczyk – 2019 July 08

In LinkingCitationR&DReference MatchingMetadata Matching

Last year I spent several weeks studying how to automatically match unstructured references to DOIs (you can read about these experiments in my previous blog posts). But what about references that are not in the form of an unstructured string, but rather a structured collection of metadata fields? Are we matching them, and how? Let’s find out.

Reference matching: for real this time

Dominika Tkaczyk – 2018 December 18

In LinkingCitationR&DReference MatchingMetadata Matching

In my previous blog post, Matchmaker, matchmaker, make me a match, I compared four approaches for reference matching. The comparison was done using a dataset composed of automatically-generated reference strings. Now it’s time for the matching algorithms to face the real enemy: the unstructured reference strings deposited with Crossref by some members. Are the matching algorithms ready for this challenge? Which algorithm will prove worthy of becoming the guardian of the mighty citation network? Buckle up and enjoy our second matching battle!

Matchmaker, matchmaker, make me a match

Dominika Tkaczyk – 2018 November 12

In LinkingCitationR&DReference MatchingMetadata Matching

Matching (or resolving) bibliographic references to target records in the collection is a crucial algorithm in the Crossref ecosystem. Automatic reference matching lets us discover citation relations in large document collections, calculate citation counts, H-indexes, impact factors, etc. At Crossref, we currently use a matching approach based on reference string parsing. Some time ago we realized there is a much simpler approach. And now it is finally battle time: which of the two approaches is better?

RSS Feed

Get involved

Find a service

Documentation

About us

2026 April 01

Reflections from Bangkok

2026 March 31

Voices from Crossref Metadata Sprint in São Paulo

2026 March 30

DOI resolution and deposit outage on 17 March 2026

2026 March 24

Strengthening support for data citations and saying goodbye to Event Data

Blog

Follow the money, or how to link grants to research outputs

What if I told you that bibliographic references can be structured?

Reference matching: for real this time

Matchmaker, matchmaker, make me a match

Recent Posts

Categories

Archives