Skip to Main Content

Research Data Management (RDM): What is RDM?

This guide will serve as an introduction to the basic topics behind Research Data Management.

Banner

What is Data?

The Digital Curation Centre defines research data as "a reinterpretable representation of information in a formalized manner suitable for communication, interpretation, or processing."

Research data can be very diverse for different fields, but essentially if you are using something to answer your research question, it's data!

As you move into a new project it is important to consider the data that you will create, gather and use in the course of the project and make decisions about how you will manage your data.

Research Data Management Overview

Data can have a longer lifespan than that of the research project that creates or collects it. You may continue to work on your data after funding has ceased, follow-up projects may analyse or add to the data, and data may be re-used by other researchers. So making sure you are properly managing your data through the whole lifecycle of the data is increasingly relevant.

Many funders are now asking you to do this as part of their application process.  Considering options for data management at an early stage can help you make the right decisions at the right time about creating, storing and sharing your data. For example, you should make sure you know about your funders' expectations.

Types of Data

Much research data is created ‘new’ for a specific project as it is answering a novel question but it may also be research data from a previous project that has been transformed, adjusted or reinterpreted to fit the needs of the new project. Five data types commonly used are:

  1. Observational: data captured in real time that is usually unique and irreplaceable. For example, remote sensing data, survey data, field recordings, sample data
  2. Experimental: data captured from lab equipment that is often reproducible. For example, gene sequences, chromatograms, magnetic field data
  3. Models or simulation: data generated from test models where model and metadata may be more important than output data from the model. For example, climate models, economic models
  4. Derived or compiled: resulting from processing or combining ‘raw’ data. For example, text and data mining, compiled databases, 3D models
  5. Reference or canonical: a static or organic conglomeration or collection of datasets, probably published and curated. For example, gene sequence databanks, collection of letters or archive of historical images

Research Data Lifecycle

The Research Data Lifecycle is a way of looking at research data which incorporates every stage at which data may be handled in a research project. Considering each stage before embarking on research is a good way to ensure that you have thought through your work, and reviewing each stage regularly ensures that you are sticking to your plan and improves the efficiency of a project.

Examples of Research Data

Research data can be electronic or in hardcopy (e.g. paper) and it may include the following:

  • Documents (text, Word, PDF), spreadsheets
  • Laboratory notebooks, field notebooks, diaries
  • Questionnaire responses, transcripts, codebooks
  • Audiotapes, videotapes, photographs, films
  • Slides, artefacts, specimens, samples
  • Collection of digital objects acquired and generated during the process of research (including digitised archive material)
  • Database contents (video, audio, text, images)
  • Models, algorithms, scripts
  • Contents of an application (input, output, logfiles for analysis software, simulation software, schemas)
  • Methodologies, workflows and protocols
  • Contact

  • University of Warwick Library
    Gibbet Hill Road
    Coventry
    CV4 7AL
  • Telephone: +44 (0)24 76 522026
  • Email: library at warwick dot ac dot uk
  • More contact details