Last week Jennifer Lin shared information on the Making Data Count (MDC) project on this blog. MDC is a project funded by the U.S. National Science Foundation (NSF) to design and develop metrics that track and measure data use – data-level metrics (DLM).
Funding for the 12 month project ends October 1st, with a no-cost extension until March 1st. MDC is a research project and has delivered some interesting questions and important results. One open question is whether and how to turn MDC research into a service that is not limited to the grant-funding period and possibly includes other datasets beyond those from the DataONE repository network. These important decisions require analysis and feedback from the broader community.
In order to better understand and analyze these questions DataCite has taken over hosting of the DLM service from PLOS and will provide this service until at least March 1st, when the MDC funding formally ends. The DLM service is now hosted by DataCite Labs and can be found at https://dlm.datacite.org. The following chart (directly from https://dlm.datacite.org/sources) gives an overview about the data we have collected so far:
Most users will not be using the API directly, but rather want to see the metrics data displayed together with the datasets they are interested in. About half of the about 140,000 datasets in the DLM service use DataCite DOIs and, as a first step, we have integrated DataCite DOIs into Labs Search. If your search results include datasets from one of the DataONE data centers that use DOIs (including Long Term Ecological Research Network, National Center for Ecological Analysis and Synthesis and Dryad Digital Repository) and we have found links for them (e.g. for 10.5061/dryad.f1cb2), we will display them:
Data from: rise of the machines
Let's dig into the Dryad example a bit more -- every Dryad data package is associated with a journal article (or other textual output) and the metadata deposited in DataCite links to that particular article. Dryad has provided
Is referenced by for this relationship:
Show "isReferencedBy" relationship
When you click on the
Is Cited By link you will find 6 citations, all of which are different from the article in the
Is referenced by link.
We included the names of the data sources (e.g. Europe PMC, PLOS, etc.) to distinguish DataCite metadata from external data pulled in from DLM. We are not currently deduplicating links if they are found in different sources - in this case two citations where found both via PLOS fulltext search and via Europe PMC API:
PLOS fulltext search and Europe PMC
Although citations in the scholarly literature are the most interesting links the DLM service can discover, DLM also searches other data sources such as the bookmarking service CiteULike, the ORCID registry of personal author identifiers, and Wikipedia (in this case for 10.5061/DRYAD.868SM:
Data from: ontogeny, morphology and taxonomy
We hope you find this information useful when you use Labs Search, but there still remains a lot of work to do, including:
relationType, e.g. be consistent with
Is referenced Byvs.
Is cited By,
As always with DataCite Labs projects, your feedback is greatly appreciated.
This blog post was originally published on the DataCite Blog.
Announcing the DataCite Profiles Service
DataCite Labs today is launching the DataCite Profiles service, a central place for users to sign in with DataCite, using their ORCID credentials.The first version of DataCite Profiles focusses on integration with ORCID via the Search & Link and Auto-Update services, ...
Announcing the new Member API
When we launched the new version of the OAI-PMH service in November (Hallett (2019)), and retired Solr (used by the old OAI-PMH service) in December, we completed the transition to Elasticsearch as our search index, and the REST API as our main API. ...
Powering the PID Graph: announcing the DataCite GraphQL API
Today DataCite launches a new API that powers the PID Graph, the graph formed by scholarly resources described by persistent identifiers (PIDs) and the connections between them. The API is powered by GraphQL, a widely adopted Open Source technology that enables queries of this graph, ...