Blog

What does the sample say?

At Crossref Labs, we often come across interesting research questions and try to answer them by analyzing our data. Depending on the nature of the experiment, processing over 100M records might be time-consuming or even impossible. In those dark moments we turn to sampling and statistical tools. But what can we infer from only a sample of the data?

Why Data Citation matters to publishers and data repositories

A couple of weeks ago we shared with you that data citation is here, and that you can start doing data citation today. But why would you want to? There are always so many priorities, why should this be at the top of the list?

Ten more days ’til Toronto

Our LIVE Annual Meeting is back in North America for the first time since 2015, and with just 10 days to go, there’s a lot going on in preparation. As you’d expect with a How good is your metadata? theme—the two-days will be entirely devoted to the subject of metadata—because it touches everything we do, and everything that publishers, hosting platforms, funders, researchers, and librarians do. Oh, and it’s actually super awesome too—and occasionally fun.

Crossref LIVE Brazil evoked vibrant Q&A session

There has been a steady increase in the growth of our membership in Latin America—and in Brazil in particular—over the past few years. We currently have more than 800 Brazil-based members; some as individual members, but most are sponsored by another organization. As part of our LIVE Local program Chuck Koscher and I traveled to meet some of these members in Goiânia and Fortaleza, where we co-hosted events with Associação Brasileira de Editores Científicos do Brasil (ABEC Brasil)—one of our largest Sponsors.

It’s not about the money, money, money.

But actually, sometimes it is about the money. As a not-for-profit membership organization that is obsessed with persistence, we have a duty to remain sustainable and manage our finances in a responsible way. Our annual audit is incredibly thorough, and our outside auditors and Board-based Audit committee consistently report that we’re in good shape. Our Membership & Fees committee regularly reviews both membership fees and Content Registration fees for a growing range of research outputs.

Good, better, best. Never let it rest.

Best practices seem to be having a moment. In the ten years since the Books Advisory Group first created a best practice guide for books, the community beyond Crossref has developed or updated at least 17 best practice resources, as collected here by the Metadata 2020 initiative. (Full disclosure: I co-chair its Best Practices group.)

Metadata Manager: Members, represent!

Over 100 Million unique scholarly works are distributed into systems across the research enterprise 24/7 via our APIs at a rate of around 633 Million queries a month. Crossref is broadcasting descriptions of these works (metadata) to all corners of the digital universe.

Using the Crossref REST API. Part 12 (with Europe PMC)

As part of our blog series highlighting some of the tools and services that use our API, we asked Michael Parkin—Data Scientist at the European Bioinformatics Institute—a few questions about how Europe PMC uses our metadata where preprints are concerned.

A wrap up of the Crossref blog series for SciELO

Crossref member SciELO (Scientific Electronic Library Online), based in Brazil, celebrated two decades of operation last week with a three-day event The SciELO 20 Years Conference.

Data citation: let’s do this

Data citation is seen as one of the most important ways to establish data as a first-class scientific output. At Crossref and DataCite we are seeing growth in journal articles and other record types citing data, and datasets making the link the other way. Our organizations are committed to working together to help realize the data citation community’s ambition, so we’re embarking on a dedicated effort to get things moving.