Last week Jennifer Lin shared information on the Making Data Count (MDC) project on this blog. MDC is a project funded by the U.S. National Science Foundation (NSF) to design and develop metrics that track and measure data use – data-level metrics (DLM).
Funding for the 12-month project ends October 1st, with a no-cost extension until March 1st. MDC is a research project that has raised some interesting questions and delivered important results. One open question is whether and how to turn MDC research into a service that is not limited to the grant-funding period and that possibly includes datasets beyond those from the DataONE repository network. These important decisions require analysis and feedback from the broader community.
To better understand and analyze these questions, DataCite has taken over hosting of the DLM service from PLOS and will provide this service until at least March 1st, when the MDC funding formally ends. The DLM service is now hosted by DataCite Labs and can be found at https://dlm.datacite.org. The following chart (directly from https://dlm.datacite.org/sources) gives an overview of the data we have collected so far:
Most users will not be using the API directly, but rather want to see the metrics data displayed together with the datasets they are interested in. About half of the roughly 140,000 datasets in the DLM service use DataCite DOIs and, as a first step, we have integrated DataCite DOIs into Labs Search. If your search results include datasets from one of the DataONE data centers that use DOIs (including Long Term Ecological Research Network, National Center for Ecological Analysis and Synthesis, and Dryad Digital Repository) and we have found links for them (e.g. for 10.5061/dryad.f1cb2), we will display them:
Data from: rise of the machines
Let's dig into the Dryad example a bit more -- every Dryad data package is associated with a journal article (or other textual output) and the metadata deposited in DataCite links to that particular article. Dryad has provided "Is referenced by" for this relationship:
Show "isReferencedBy" relationship
When you click on the "Is Cited By" link you will find 6 citations, all of which are different from the article in the "Is referenced by" link.
We included the names of the data sources (e.g. Europe PMC, PLOS, etc.) to distinguish DataCite metadata from external data pulled in from DLM. We are not currently deduplicating links that are found in more than one source. In this case two citations were found both via PLOS fulltext search and via the Europe PMC API:
PLOS fulltext search and Europe PMC
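As a rough illustration of what deduplicating such links could look like, the following Python sketch merges citation records that point to the same DOI and keeps track of every source that reported them. The record layout (`doi` and `source` keys), the function name `merge_citations`, and the sample DOIs are assumptions made for this example; they do not reflect the DLM service's actual data model.

```python
def merge_citations(records):
    """Merge records that share a DOI, collecting the sources that found each one.

    Each record is assumed to be a dict with "doi" and "source" keys
    (a hypothetical layout, not the DLM service's actual format).
    """
    merged = {}  # dicts preserve insertion order in Python 3.7+
    for record in records:
        # DOIs are case-insensitive, so compare them in lower case.
        key = record["doi"].strip().lower()
        sources = merged.setdefault(key, [])
        if record["source"] not in sources:
            sources.append(record["source"])
    return [{"doi": doi, "sources": sources} for doi, sources in merged.items()]


# Made-up sample data: the same citation reported by two sources.
sample = [
    {"doi": "10.1234/EXAMPLE.1", "source": "PLOS fulltext search"},
    {"doi": "10.1234/example.1", "source": "Europe PMC"},
    {"doi": "10.1234/example.2", "source": "Europe PMC"},
]

result = merge_citations(sample)
```

Keeping the list of sources rather than dropping the duplicate would still let the interface show where each citation was found.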
Although citations in the scholarly literature are the most interesting links the DLM service can discover, DLM also searches other data sources such as the bookmarking service CiteULike, the ORCID registry of personal author identifiers, and Wikipedia (in this case for 10.5061/DRYAD.868SM):
Data from: ontogeny, morphology and taxonomy
We hope you find this information useful when you use Labs Search, but there still remains a lot of work to do, including:
relationType, e.g. be consistent with "Is referenced By" vs. "Is cited By",
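One possible way to approach consistent relation types is to map display labels onto the camel-case relationType values defined in the DataCite Metadata Schema, such as IsReferencedBy and IsCitedBy. The sketch below is only a minimal illustration: the helper name `canonical_relation_type` and the small lookup set are assumptions for this example, not part of Labs Search.

```python
# A small subset of relationType values from the DataCite Metadata Schema.
KNOWN_RELATION_TYPES = {"IsCitedBy", "Cites", "IsReferencedBy", "References"}

# Lookup from a lower-cased, whitespace-free label to the canonical value.
_LOOKUP = {value.lower(): value for value in KNOWN_RELATION_TYPES}


def canonical_relation_type(label):
    """Return the canonical relationType for a display label, or None if unknown."""
    # Remove all whitespace and ignore case before looking the label up.
    key = "".join(label.split()).lower()
    return _LOOKUP.get(key)
```

With this helper, labels such as "Is referenced By" and "isReferencedBy" would both resolve to IsReferencedBy, while an unrecognized label returns None so it can be flagged for review.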
As always with DataCite Labs projects, your feedback is greatly appreciated.
This blog post was originally published on the DataCite Blog.