The FAIR Principles - The Turing Way

The FAIR guiding principles for scientific data management and stewardship Wilkinson et al., 2016 were developed as guidelines to improve the Findability, Accessibility, Interoperability and Reusability of digital assets; all of which support research reproducibility. The FAIR principles play an important role in making your data available to others for reuse.

It is much easier to make data FAIR if you plan to do this from the beginning of your research project. You can plan for this in your Data Management Plan (DMP) (see points 4 and 5 of the Data Management Plan chapter).

Even though the FAIR principles have been defined to allow machines to find and use digital objects automatically, they improve the reusability of data by humans as well. The capacity of computational systems to find, access, interoperate, and reuse data, with none or minimal human intervention, is essential in today’s data-driven era, where humans increasingly rely on computational support to deal with data as a result of the increase in volume, velocity and variety.

This chapter provides an abstract and broad view of what the FAIR principles are. How to put the FAIR principles into practise is discussed in other sub chapters ( Data Organisation in Spreadsheets, Documentation and Metadata and Sharing and Archiving Data). You can also use the Wellcome Getting Started Guide or the How To FAIR website to find out more about the FAIR principles and how to get started.

Image in green and grey scale showing a winding, climbing pathway made of jigsaw pieces representing the FAIR principles, with stick figures continuing to build the pathway at the top. In the top left hand corner, a highlight bubble shows a signposted pathway with a location marker labelled persistent as a visual representation of findable. In the top right hand corner a highlight bubble shows a key unlocking a padlock with the text meaningful interaction as a visual representation of accessible. In the bottom left hand corner a highlight bubble shows sharing between two computers as a visual representation of interoperable. In the bottom right hand corner a highlight bubble shows a completed puzzle with the text full disclosure as a visual representation of reusable. — Figure 1:*The Turing Way* project illustration by Scriberia. Used under a CC-BY 4.0 licence. DOI: The Turing Way Community & Scriberia (2024).

Theory¶

In brief, FAIR data should be:

Findable: The first step in (re)using data is to find it! Descriptive metadata (information about the data such as keywords) is essential.

Accessible: Once the user finds the data and software they need to know how to access it. Data could be openly available but it is also possible that authentication and authorisation procedures are necessary.

Interoperable: Data needs to be integrated with other data and interoperate with applications or workflows.

Reusable: Data should be well-described so that they can be used, combined, and extended in different settings.

You can find a more detailed overview of the FAIR principles by GO FAIR of what the FAIR principles recommend. You can also read A FAIRy tale for an understandable explanation of each principle.

Making data ‘FAIR’ is not the same as making it ‘open’. Accessible means that there is a procedure in place to access the data. Data should be as open as possible, and as closed as necessary.

It is also important to say that the FAIR principles are aspirational: they do not strictly define how to achieve a state of FAIRness, but rather describe a continuum of features, attributes, and behaviours that will move a digital resource closer to that goal.

The FAIR principles are also applied to software (see [LGK+20]and [HCH+20]). Watch a ten minute video on FAIR software for a short explanation.

FAIR principles and environmental sustainability¶

“FAIR practices can result in highly efficient code implementations, reduce the need to retrain models, and reduce unnecessary data generation/storage, thus reducing the overall carbon footprint. As a result, green computing and FAIR practices may boht stimulate innovation and reduce financial costs.” - Lannelongue et al., 2023

FAIR principles and accessibility¶

The Accessible in FAIR is not equal to ensuring that your research objects are accessibles to all users. For this, the term “actually accessible” has been coined by Colon et al., 2023 to refer to data that is “easy to locate, obtain, interpret, use, share, and analyze for everybody, including disabled people.”

Community involvement¶

Various online resources are provided for people who are working in the life sciences, to guide them in ensuring FAIRness in their data, providing them with tools and advice for good data management at various stages of their work. Two prominent ones include:

Under the FAIR Cookbook, several resources are offering guidance and assistance in FAIR data management. The FAIR Cookbook is designed to serve a variety of audience types and involved in different stages of data management life cycle. The FAIR Cookbook is developed and maintained by life sciences professionals, both in the academia and industry sectors, including members of the ELIXIR community.
Under ELIXIR Research Data Management Kit (RDMkit), resources are provided for life scientists to guide them in better management of their research data in adhering to the FAIR Principles. It is an attempt to help researchers work at different capacities, both in individual and collaborative workspaces. The RDMkit is open for suggestions from anyone, as long as they abide by the contributor responsibilities.

Many groups and organisations are working to define guidance and tools to help researchers and other stakeholders (like librarians, funders, publishers, and trainers) make data more FAIR. There are two global initiatives that act as umbrella organisations and reference points for many discipline-specific efforts, including the ones listed above: GOFAIR and the Research Data Alliance (RDA).

Under GOFAIR, there are many Implementation Networks (INs) committed to implementing the FAIR principles.
Under the RDA, there are several groups tackling different aspects relevant to the RDM life cycle. Among these, one group, the FAIR Data Maturity Model Working Group is reviewing existing efforts, building on them to define a standard set of common assessment criteria for the evaluation of FAIRness.

More information¶

Deep dive into the FAIR principles by Dr. Maryann Martone (45 minute video)

References¶

Wilkinson, M. D., Dumontier, M., Aalbersberg, Ij. J., Appleton, G., Axton, M., Baak, A., Blomberg, N., Boiten, J.-W., da Silva Santos, L. B., Bourne, P. E., Bouwman, J., Brookes, A. J., Clark, T., Crosas, M., Dillo, I., Dumon, O., Edmunds, S., Evelo, C. T., Finkers, R., … Mons, B. (2016). The FAIR Guiding Principles for scientific data management and stewardship. Sci. Data, 3(160018), 1–9. 10.1038/sdata.2016.18
The Turing Way Community, & Scriberia. (2024). Illustrations from The Turing Way: Shared under CC-BY 4.0 for reuse. Zenodo. 10.5281/ZENODO.3332807
Hansen, K. K., Buss, M., & Haahr, L. S. (2018). A FAIRy tale. Zenodo. 10.5281/ZENODO.2248200
Lannelongue, L., Aronson, H.-E. G., Bateman, A., Birney, E., Caplan, T., Juckes, M., McEntyre, J., Morris, A. D., Reilly, G., & Inouye, M. (2023). GREENER principles for environmentally sustainable computational science. Nature Computational Science. 10.1038/s43588-023-00461-y
Colon, R., Goben, A., & Karcher, S. (2023). Actually Accessible Data: An Update and a Call to Action. Journal of Librarianship and Scholarly Communication. 10.31274/jlsc.15449

Pathways

Data Management Plan

Pathways

Personal data management