It’s crucial to actively manage your research data once you start collecting it, to keep it safe from technical failures, maintain data integrity and to ensure you keep on top of legal and regulatory requirements.
Research tools: You should consider your research tools, i.e. software and hardware carefully, ensuring that all equipment or software used meets LSE’s minimum standards. The Digital Skills Lab can help with a range of specialist research tools that are supported by the School.
Storage: We recommend storing your research data in the LSE managed O365 environment, either in OneDrive (for personal projects) or in Teams or on a SharePoint site (for collaborative projects).
When working in the field, you may be required to use portable storage devices, eg, mobile phones, audio recorders, USB sticks. Where this is the case, ensure you encrypt data files on devices and transfer data to secure, managed, storage as soon as you are able.
Working with personal data: Personal data includes information such as occupation, sex, biometric data, audio/visual information e.g. interview recordings, social media data eg, handles. It’s important to understand if you are collecting this type of data, so you can fully comply with data protection legislation and ethical standards.
Most personal data in research is anonymised before further dissemination as shared datasets. To do this properly you must consider indirect as well as direct identifiers such as contextual clues, or where two bits of information, when taken together, can identify a participant. It’s also important to ensure you gain informed consent for additional use of participants’ data, including sharing anonymised data via a data repository.
Copyright: Where research data is newly created, the staff or student creating the data is the original copyright holder. When you are using third-party copyrighted material, you will need to ensure you have a legal basis for use, and any permissions for further data sharing. In many instances, data can be used and copied for non-commercial teaching or research purposes, or criticism or review, providing that the data source, distributor and the copyright holder are acknowledged.
When sharing your research data you should also think carefully about how you will license it – for more information on this see our open data page.
Documentation: Having accompanying documentation to your data can provide important contextual, methodological and technical information to help understand your data and how it’s been created.
As well as being useful to ensure data creation is consistent when working in a research team, it’s crucially important where you are planning on sharing your data after the project, to help users coming fresh to the data navigate and be able to reuse it. The UK Data Service have great guidance on data documentation at both study and data level.
Organisation: Working with data throughout a project can be messy, with multiple processes, versions, and possibly people involved, so properly organising your data can really help save time when working with your data. The UK Data Service has good examples of best practice.
Having clear file naming schema helps uniquely identify a file or set of files. File names should reflect the file content, for instance data type, collection date, version.
Equally, storing data in well-organised folder structures will help make finding data easier. Think about the hierarchy of folders and how you want to divide the data eg, by project phase, collection method etc.
Data validation and quality assurance: It’s important to have specific procedures embedded in your methods that act as quality controls to ensure data is high-quality and valid.