Crossref Event Data

Early Preview: This service is under development

This service is under development with a launch expected in early 2017. You are welcome to take a look around pre-launch to get a feel for what data the service may offer when it’s released. Please note this service is not yet live. We will greatly expand the variety of platforms from which we will collect data. As this is a system in active development, you may experience hiccups in the service.

Please contact us at eventdata@crossref.org with any questions or suggestions as we continue to work on the development.

What Event Data will offer

Discussion around scholarly research often takes place on the web outside of the formal literature; for example on blogs, sharing services and social media. This activity is made up of a number of individual 'events' such as a bookmark, a comment, a social share, or a link.

What’s needed is community infrastructure to collect, store, and provide this raw data for anyone to access.

Crossref Event Data will capture and record a wide variety of events related to content with a unique digital object identifier (DOI). The data will be openly available, portable, and auditable. Initially the service will not include scholarly citation events from citation index platforms.

Anyone, such as publishers, funders, authors, bibliometricians, and librarians, will be able to process, interpret and integrate Event Data as they wish, gaining a broad understanding of how scholarly content is shared and consumed.

What are 'Events' and how are they collected?

An event is any activity that occurs around a research object that has a DOI. Event types are varied depending on the source of the activity. Here are some event examples:

Here are the specific event types for some of the sources that will be included in Event Data:

Data source Collection method Refresh rate Data collected Event type
Crossref to DataCite Crossref Metadata API Crossref pushes Crossref - DataCite links into Event Data once per day Links to DataCite DOIs as cited in Crossref DOIs Every citation from a Crossref DOI to a DataCite DOI
DataCite to Crossref DataCite API DataCite pushes DataCite - Crossref links into DataCite Event Data once per day. Crossref then pulls these links into Event Data once per day. Links to Crossref DOIs as cited in DataCite DOIs Every citation from a DataCite DOI to a Crossref DOI
Wikipedia Full streaming data from Wikimedia Streaming (near real time) Edits to Wikipedia pages that mention a DOI directly, or edits that remove such a mention A 'references' event type for the reference from the Wikipedia page to the DOI. If the action field is 'delete' then this indicates that the reference was removed.
Mendeley Mendeley API Pulling data. Rate TBC, currently daily Counts of how many times a DOI has been bookmarked and liked The Mendeley Data Source provides the event type of bookmarks and likes. Each one has a date stamp and a count.
Twitter Full streaming data from Gnip Streaming (near real time) DOIs (or DOI domain URLs) mentioned in tweets Every tweet that mentions a DOI or article landing page that we can match to a DOI. The DOI is included in the event. If a landing page was used, this data will be available via the audit log.

Want to learn more?

You can find background on the development of this new service in the Event Data section of the Crossref blog, including the planned list of sources and updates on the development of the service.

For more comprehensive information, please take a look at the Crossref Event Data Technical User Guide.

Please note that specifications and documentation will continue to evolve as we progress through the technical development of this new service.

Please do get in touch if you have any questions or feedback as we continue to build the service, contact Madeleine at eventdata@crossref.org