New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Use case: Interlinking Data and Documents #10

rayi113 opened this Issue Jan 16, 2015 · 0 comments


None yet
2 participants

rayi113 commented Jan 16, 2015

Use Case: Interlinking Data and Documents

  • Contributors: Bonnie C. Carroll, Information International Associates, Inc. and CODATA Task Group on Data Citation. Based on input from Giri Palanisamy, ARM Data Center, Oak Ridge National Laboratory.

Goal and Summary

At DOE’s Atmospheric Radiation Measurement Program (ARM) data center at Oak Ridge National Laboratory they are making a concerted effort to ensure that authors, who use their data properly, attribute and cite the data sets they are using. There was an historical look at publications to ensure that those using ARM data had a citation to that data. They worked with Thompsons Reuters to make sure when data were cited that there could be an active link between the data set and the publication. Going forward with new publications, ARM always asks people who request their data to cite the data when they are used in articles. Further, ARM tells the requestor how to cite the data to make it easier and to ensure quality of the citation. The DOI (which is assigned to all ARM datasets) is the critical linkage between the data set and the scientific publication.

Why is it important and to whom?

It is important for many reasons to recognize well curated data in publications where the data are used and re-used :

  1. To ensure the scholarly record is maintained and the research and conclusions can be replicated.
  2. To give credit for the contribution made by the data and the data producer/curator to the research results.
  3. Underlying the credit is to provide evidence for the contributions of the data producer and curator to the advancement of science. This is an inherent part of the recognition and reward structure and motivation to do good data creating and curation – which is necessary for the advancement of good science.

Why hasn’t it been solved yet?

There are significant challenges that remain. All are cultural rather than technical.

  1. Convincing and ensuring all authors to always properly cite data sets they use.
  2. Convincing journal publishers to insist on data citations and to include them when provided in footnotes and bibliographies.
  3. Have data producers provide a proper citation with the data sets they share so there is easy and consistent reference to the data sets they want to cite.

Actionable Outcomes

Additional Information and Links

See this document based on conversations with Giri. Note: the facts are Giri’s; any interpretation is mine.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment