Working on Sensitive Data Projects

Working on Sensitive Data Projects

Prerequisites

Prerequisite

Importance

Skill Level

Notes

Research Data Management

Helpful

Beginner

Summary

In this chapter, we describe how you can work with sensitive data in a practical way. We cover working on projects that use trusted research environments (TREs) such as data safe havens, including writing code and version control in TREs.

There are also sub-chapters that describe steps that are good to think about at the start of your project, such as how to {ref}ensure that your sensitive files remain secure when using GitHub. We also show how you can {ref}safely share your code, even though it uses sensitive data, and draw attention to {ref}what you should do if you inadvertently expose sensitive data on GitHub.

Sub-chapters include:

  • Working with Trusted Research Environments

  • Keeping sensitive files secure

  • Sharing your Jupyter notebook

  • Removing sensitive data from Github

  • Further recommendations

Motivation

Working in sensitive data projects can be challenging and it often requires the development of a bespoke way of working for each project.

Researchers need to consider how they are going to work collaboratively using Trusted Research Environments and how they can enable reproducible research and publish as openly as possible. We must also consider how to keep our data secure and prevent unintentional sharing of data.