Incorporating original data and proper citations is essential for research integrity, reproducibility, and acknowledging intellectual work. Data should be treated with the same citation standards as journal articles, including author(s), date, title, publisher (repository), and a persistent identifier (DOI).
Key Principles for Incorporating Data
Locate and Reuse: Data can be found in repositories like Zenodo, DataCite, or institutional archives, often requiring you to check the license for usage restrictions.
Cite in Text: In-text citations should accompany any analysis or claim derived from the data.
Data Availability Statement: Journals often require a section outlining where the data is stored and how it can be accessed.
Version Control: Cite the specific version of the dataset used to ensure reproducibility.
Unpublished Data: If the data is not yet published, use the phrase “unpublished data” instead of a date.
Structure of a Data Citation
Following guidelines from sources like ICPSR and the Research Data Alliance, a standard citation includes:
Author(s): Creator(s) of the dataset.
Date: Year of publication.
Title: Title of the dataset (including version/edition).
Publisher/Repository: Where the data is archived (e.g., Figshare, Dryad).
Identifier: DOI, handle, or persistent URL.
Example Citation (APA Style)
Creator, A. A. (Year). Title of dataset (Version) [Data set]. Publisher. Persistent Identifier/DOI.
Benefits of Proper Data Citation
Credit & Tracking: Allows data creators to receive credit and track the impact of their work.
Transparency: Enables readers to verify results.
FAIR Principles: Increases the findability, accessibility, interoperability, and re-usability of data
Leave a Reply