Skip to article frontmatterSkip to article content

Citing your own Research Objects

When citing your own research objects, it is important to consider the type of research object you are referring to. Here, we provide explanations for citing data, physical samples, and software.

Citing Data

When sharing a dataset, use the assigned DOI (from the data repository) and add this to your data availability statement at the end of the paper (similar to the acknowledgement section). It is important to also cite your dataset in the references themselves, as only the citations in the reference section will contribute to citation counts. Data citation is important because it facilitates access, transparency and potentially reproducibility, reuse, and credit for researchers, and might be required when data is shared under a license that requires citation (for example, CC-BY). For more information check the chapter about Licensing.

Data citation also provides recognition and visibility for the repositories that share data.

You can find examples of these statements in the publishers’ (research data) author policies.

Data availability statement examples:

Using the Digital Object Identifier (DOI):

  • “The data that support the findings of this study are openly available in [repository name] at http://doi.org/[doi].”

If no DOI is issued:

  • “The data that support the findings of this study are openly available in [repository name] at [URL], reference number [reference number].”

When there is an embargo period you can reserve your DOI and still include a reference to the dataset in your paper:

  • “The data that support the findings will be available in [repository name] at [URL / DOI] following a [6 month] embargo from the date of publication to allow for the commercialisation of research findings.”

When data cannot be made available:

  • “Restrictions apply to the data that support the findings of this study. [Explain nature of restrictions, for example, if the data contains information that could compromise the privacy of research participants] Data are available upon reasonable request by contacting [name and contact details] and with permission of [third party name].”
  • “The data that support the findings of this study are available upon request. Access conditions and procedures can be found at [URL to restricted access repository such as EASY.]”

When code is shared:

  • “Data and code to reproduce the results shown in the paper can be obtained from The Turing Way (2023) at Zenodo (Community (2022)) and GitHub (https://github.com/the-turing-way/the-turing-way). We used R version 4.2.2 (use citation() to check the suggested citation) and the following R packages: ggplot2 (Wickham 2016), another example (and citation added to the references!).”

More Data Availability Statement examples:

You can find more examples on the Manchester’s Data Access Statements page, the AMS Data Availability Statement examples, or Nature’s Tips for writing a dazzling Data Availability Statement.

Citing Physical Samples

When sharing results related to physical samples (such as archaeological material, genomics samples), ideally a persistent identifier is assigned to track the samples and their associated data. As with data citation, you include a citation in the references and a more detailed explanation in the data availability statement. To learn more about how to cite physical samples and to check out examples, see the Scientific Author Guide for Publishing Open Research Using Physical Samples by Damerow et al., 2024.

Citing Software

A software citation has a lot of the same elements as a data citation, described above, and are described in more detail in the Software Citation Principles. Depending on the journal, you may also include a Software/Code Availability Statement. In such section, you can describe where the code developed during the research presented in the paper can be found, similar to how you would provide information about data availability. For an example you can look at the Data and Code Availability Statements - Sample text or Availability Statement Templates.

GitHub
GitLab

To make your code citable, you can use the integration between Zenodo and GitHub.

  • Create a file to tell people how to cite your software. Use this handy guide to format the file.

  • Link your GitHub account with a Zenodo account. This guide explains how.

  • You can tell Zenodo what information or metadata you want to include with your software by converting your CITATION.cff file to zenodo.json.

    pip install cffconvert
    cffconvert --validate
    cffconvert --format zenodo --outfile .zenodo.json
  • Add .zenodo.json to your repository.

  • On Zenodo, flip the switch to the ‘on’ position for the GitHub repository you want to release.

  • On GitHub, click the Create a new release button. Zenodo should automatically be notified and should make a snapshot copy of the current state of your repository (just one branch, without any history), and should also assign a persistent identifier (DOI) to that snapshot.

  • Use the DOI in any citations of your software and tell any collaborators and users to do the same!

References
  1. Community, T. T. W. (2022). The Turing Way: A handbook for reproducible, ethical and collaborative research. Zenodo. 10.5281/ZENODO.3233853
  2. Damerow, J., Raia, N., Stanley, V., Byers, N., Choe, S., Edmunds, R., Kunze, J., Lehnert, K., McIntyre-Redden, M., Mungall, C., O’Ryan, D., Parker, C., Plomp, E., Richard, S., Vieglais, D., Wood-Charlson, E., Cluster, E. P. S. C., & Thomer, A. (2024). A Scientific Author Guide for Publishing Open Research Using Physical Samples. 10.6084/m9.figshare.24669057.v1