The University of Southampton Research Data Management Policy has a requirement that all significant Research Data should be held for a minimum of 10 years and may be longer where the data is actively used. Funders also have retention requirements and some research data will also be subject to legal requirements.
The Digital Curation Centre (DCC) has a summary of the requirements by the major funders. If your funder is not on this list please use the Sherpa Juliet service to check if your funder has any requirements for research data.
It may be tempting to keep everything but that has drawbacks as it can be more difficult to find the truly important material. It is also worth remembering that research data can be subject to Freedom of Information (FOI) requests.
You may have your own view on how long you need or want to retain data. This will be influenced by the discipline you are working in, the type of data created and whether further work or publications will be based on it. Factors that may influence retention include:
Data which you decide not to keep should be destroyed securely.
Further guidance is available:
It is not a requirement that all research data must be held within the University. Discipline specific repositories and funder requirements may mean that research data will be held elsewhere. You should consider what services you may require to meet the retention requirements applicable to your data
You can find discipline-specific data repositories for your subject via Re3data.org.
If you are funded, check your funder policy for recommended data repositories e.g. ESRC and NERC support dedicated data centres. Also consider whether any agreements with your collaborators include requirements for data deposit.
Regardless of where you deposit your data, you need to add a dataset record to our data catalogue (via Pure)
This is a growing and fast moving area. Some publishers are now requiring the deposit of supporting data with the article, while others require that a link to the data is provided. You will need to take this into account when considering how long you will need to retain the data and may influence your choice of storage location.
The largest share of costs for data are incurred in preparation and ingest to the selected storage service, as shown by the costing tool provided by the UK Data Archive. Extended data retention periods may have some additional costs that will impact your project directly or they may be covered by the full economic costing included in your proposal. Invariably data retention periods will outlive projects, so you may want to consider how this will be funded as part of your data management plan and/or in your proposal – check with your funder’s guidelines.
Over time costs will be incurred for storage, typically based on the volume of data stored for a given retention period, and for additional services, for example active data management such as reformatting to counter possible format obsolescence. The latter is now regarded as less of a problem for popular formats, but may need to be considered for specialised data formats.
Research data represents an investment not just from the funder and the University but also by the individual researcher. However, as part of the deposit process you will be asked to consider what should happen at the end of the retention period and who is responsible for carrying this out. Under the University policy the review process will be the responsibility of the lead PI’s Faculty. (see also Secure Destruction of data)
Funders may also require some or all data to be held. While they may not have prescribed retention periods, all UK funding councils have a ‘sharing’ policy, as do other significant funders, such as Cancer Research UK and the Wellcome Trust. This implies the need for data to be held in a safe and an accessible way. The Digital Curation Centre has a helpful overview of funders’ data policies
Council/Organisation |
Data Availability |
Retention |
Repository |
Exceptions/comments |
University of Southampton | In line with funders sharing policy | Min. rolling 10 years from date of last access | Institutional (ePrints Soton) or subject-based |
applies to signficant data; data catalogue record entry - ePrints Soton |
AHRC | Within 3 months of end of project | Min. 3 years from end of project; ADS longer |
Archaeology Data Service*, Institutional or subject-based |
*Contact ADS within 3 months of start of proposed research |
BBSRC | Timely; no later than publication of main findings; best practice | 10 years from end of project | ||
Cancer Research UK | Timely; no later than acceptance for publication of the main findings | Curated throughout its life-cycle for a min. of 5 years after end of project | Limited period of exclusive use for primary research | |
EPSRC | Metadata available within 12 months of generation; data - timely | 10 years from the date of last access | ||
ESRC | Within 3 months of end of project |
UK Data Archive; UK Data Service |
Limited defined period of exclusive use for primary research team The ESRC UK Data Service are responsible for ensuring long-term access to the data |
|
MRC | Timely | 10 years from generation (in original form) | ||
NERC | As soon as possible after the end of data collection | NERC Data Centre, where available and data appropriate |
Embargo allowed; normally max. 2 years Criteria to identify data sets with long-term value found in NERC's Data Value Checklist |
|
STFC | Within 6 months of relevant publication |
Min. 10 years after end of project for data that can be reproduced; "In perpetuity" for data that cannot be re-measured or reproduced. |
Data should be made publically available after a limited period, unless there are specific reasons why this should not happen |
|
Wellcome Trust | Timely; linked to publication | Min. 10 years |