Dominika Tkaczyk

Dominika joined Crossref in 2018 as a Principal R&D Developer, where she focused on metadata matching research aimed at enriching the scholarly record through the discovery of new relationships. In 2024, she became Crossref’s Director of Data Science and established the Data Science team, with a mission to explore innovative ways of using data to support the scholarly community, enrich the Research Nexus with more metadata and relationships, and develop collaborations with like-minded community initiatives. Since 2025, Dominika has served as Director of Technology, leading a unified technology team that integrates infrastructure, software development, and data science functions. Dominika holds a PhD in Computer Science from the Polish Academy of Sciences. Prior to joining Crossref, she she was a researcher and a data scientist at the University of Warsaw, Poland, and a postdoctoral researcher at Trinity College Dublin, Ireland.

Piecing together the Research Nexus: uncovering relationships with open funding metadata

Rocío Gaudioso Pedraza, Dominika Tkaczyk – 2025 October 01

In CommunityFundingGrant Linking SystemMetadata Matching

The Crossref Grant Linking System (GLS) has been facilitating the registration, sharing and re-use of open funding metadata for six years now, and we have reached some important milestones recently! What started as an interest in identifying funders through the Open Funder Registry evolved to a more nuanced and comprehensive way to share and re-use open funding data systematically. That’s how, in collaboration with the funding community, the Crossref Grant Linking System was developed. Open funding metadata is fundamental for the transparency and integrity of the research endeavour, so we are happy to see them included in the Research Nexus.

Data Science @Crossref

Dominika Tkaczyk, Alex Bédard-Vallée, Jason Portenoy – 2025 July 07

In Data Science

To address the growing scale and complexity of scholarly data, we’ve launched a new data science function at Crossref. In April, we were excited to welcome our first data scientists, Jason Portenoy and Alex Bédard-Vallée, to the team. With their arrival, the Data Science team is now fully up and running. In this blog post, we’re sharing our vision and what’s ahead for data science at Crossref.

Meet six winners of the first ever Crossref Metadata Awards

Kornelia Korzec, Dominika Tkaczyk – 2025 May 07

In MetadataCommunity

Marking our 25th anniversary, we launch the Crossref Metadata Awards to emphasise our community’s role in stewarding and enriching the scholarly record.

We are pleased to recognise Noyam Publishers, GigaScience Press, eLife, American Society for Microbiology, and Universidad La Salle Arequipa Perú with the Crossref Metadata Excellence Awards, and Instituto Geologico y Minero de España wins the Crossref Metadata Enrichment Award. These inaugural awards highlight the leadership of members who show dedication to the best metadata practices.

Metadata matching: beyond correctness

Dominika Tkaczyk, Adam Buttrick – 2025 January 08

In MetadataLinkingMetadata MatchingData Science

https://doi.org/10.13003/axeer1ee

In our previous entry, we explained that thorough evaluation is key to understanding a matching strategy’s performance. While evaluation is what allows us to assess the correctness of matching, choosing the best matching strategy is, unfortunately, not as simple as selecting the one that yields the best matches. Instead, these decisions usually depend on weighing multiple factors based on your particular circumstances. This is true not only for metadata matching, but for many technical choices that require navigating trade-offs. In this blog post, the last one in the metadata matching series, we outline a subjective set of criteria we would recommend you consider when making decisions about matching.

How good is your matching?

Dominika Tkaczyk, Adam Buttrick – 2024 November 06

In MetadataLinkingMetadata MatchingData Science

https://doi.org/10.13003/ief7aibi

In our previous blog post in this series, we explained why no metadata matching strategy can return perfect results. Thankfully, however, this does not mean that it’s impossible to know anything about the quality of matching. Indeed, we can (and should!) measure how close (or far) we are from achieving perfection with our matching. Read on to learn how this can be done!

How about we start with a quiz? Imagine a database of scholarly metadata that needs to be enriched with identifiers, such as ORCIDs or ROR IDs. Hopefully, by this point in our series this is recognizable as a classic matching problem. In searching for a solution, you identify an externally-developed matching tool that makes one of the below claims. Which of the following would demonstrate satisfactory performance?

The myth of perfect metadata matching

Dominika Tkaczyk, Adam Buttrick – 2024 August 28

In MetadataLinkingMetadata MatchingData Science

https://doi.org/10.13003/pied3tho

In our previous instalments of the blog series about matching (see part 1 and part 2), we explained what metadata matching is, why it is important and described its basic terminology. In this entry, we will discuss a few common beliefs about metadata matching that are often encountered when interacting with users, developers, integrators, and other stakeholders. Spoiler alert: we are calling them myths because these beliefs are not true! Read on to learn why.

The anatomy of metadata matching

Dominika Tkaczyk, Adam Buttrick – 2024 June 27

In MetadataLinkingMetadata MatchingData Science

https://doi.org/10.13003/zie7reeg

In our previous blog post about metadata matching, we discussed what it is and why we need it (tl;dr: to discover more relationships within the scholarly record). Here, we will describe some basic matching-related terminology and the components of a matching process. We will also pose some typical product questions to consider when developing or integrating matching solutions.

Basic terminology

Metadata matching is a high-level concept, with many different problems falling into this category. Indeed, no matter how much we like to focus on the similarities between different forms of matching, matching affiliation strings to ROR IDs or matching preprints to journal papers are still different in several important ways. At Crossref and ROR, we call these problems matching tasks.

Metadata matching 101: what is it and why do we need it?

Dominika Tkaczyk, Adam Buttrick – 2024 May 16

In MetadataLinkingMetadata MatchingData Science

https://doi.org/10.13003/aewi1cai

At Crossref and ROR, we develop and run processes that match metadata at scale, creating relationships between millions of entities in the scholarly record. Over the last few years, we’ve spent a lot of time diving into details about metadata matching strategies, evaluation, and integration. It is quite possibly our favourite thing to talk and write about! But sometimes it is good to step back and look at the problem from a wider perspective. In this blog, the first one in a series about metadata matching, we will cover the very basics of matching: what it is, how we do it, and why we devote so much effort to this problem.

Discovering relationships between preprints and journal articles

Dominika Tkaczyk – 2023 December 07

In PreprintsLinkingMetadata Matching

In the scholarly communications environment, the evolution of a journal article can be traced by the relationships it has with its preprints. Those preprint–journal article relationships are an important component of the research nexus. Some of those relationships are provided by Crossref members (including publishers, universities, research groups, funders, etc.) when they deposit metadata with Crossref, but we know that a significant number of them are missing. To fill this gap, we developed a new automated strategy for discovering relationships between preprints and journal articles and applied it to all the preprints in the Crossref database. We made the resulting dataset, containing both publisher-asserted and automatically discovered relationships, publicly available for anyone to analyse.

The more the merrier, or how more registered grants means more relationships with outputs

Dominika Tkaczyk, Rachael Lammey, Ginny Hendricks – 2023 February 22

In Grant Linking SystemResearch FundersMetadata Matching

One of the main motivators for funders registering grants with Crossref is to simplify the process of research reporting with more automatic matching of research outputs to specific awards. In March 2022, we developed a simple approach for linking grants to research outputs and analysed how many such relationships could be established. In January 2023, we repeated this analysis to see how the situation changed within ten months. Interested? Read on!

RSS Feed

Get involved

Find a service

Documentation

About us

2025 December 03

Metadata in editorial workflows

Background

2025 November 26

Crossref members over the years: a journey through space and time

2025 November 19

Crossref at the Frankfurt Book Fair 2025

2025 November 06

The sunset is on the horizon for Metadata Manager. What's next?

Blog