Reproducing research

The ability to verify results by re-doing experiments (observational, physical or computational) is fundamental to science. This page discusses how LABDRIVE supports this.

Last updated 2 years ago

A discussion of the reproducibility of research, referencing an NSF study, makes the point that one must distinguish between reproducibility, which is the ability to duplicate results using the same raw data (and procedures), and replicability, which is the ability of a study to duplicate results with newly collected data. Other, finer-grained definitions have also been proposed, particularly for medical and social science studies.

Use of Provenance in Reproducibility

As noted below, the Provenance Information of a Data Object is extremely important because it details how that Data Object was created, including the inputs, processes and parameters used.
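As an illustration only, the kind of detail described above (inputs, process and parameters) could be captured in a small machine-readable record. The sketch below assumes a plain JSON structure; the field names, the file name `raw_counts.csv` and the script name `calibrate.py` are hypothetical and are not a LABDRIVE schema or API.

```python
import hashlib
import json


def sha256_hex(data: bytes) -> str:
    """Return the SHA-256 digest of the given bytes as a hex string."""
    return hashlib.sha256(data).hexdigest()


def build_provenance_record(inputs: dict, process: str, parameters: dict) -> dict:
    """Describe how a Data Object was produced.

    The field names are hypothetical; they are not a LABDRIVE schema.
    """
    return {
        # Hash each input so a later run can confirm it used the same raw data.
        "inputs": {name: sha256_hex(content) for name, content in sorted(inputs.items())},
        "process": process,
        "parameters": dict(sorted(parameters.items())),
    }


record = build_provenance_record(
    inputs={"raw_counts.csv": b"1,2,3\n"},   # hypothetical raw data file
    process="calibrate.py",                  # hypothetical processing script
    parameters={"gain": 1.5, "offset": 0},
)
print(json.dumps(record, indent=2))
```

Recording a digest of each input, rather than only its name, lets someone repeating the work check that they really are starting from the same raw data.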

LABDRIVE support for Reproducibility

LABDRIVE supports reproducibility, defined as the ability to duplicate results using the same raw data and procedures, by being able to preserve:

  • the raw data - preserved as Data Objects, requiring little or no Representation Information, but adequate Provenance etc.

  • the procedures performed - preserved as text or as scripts, with additional information in the Provenance Information. Note that some of the Provenance Information may be held in the headers of the data files themselves, e.g. in FITS files.

  • the software used - preserved as described in Software as part of the RIN.
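To illustrate the point about FITS files in the list above: a FITS header is a sequence of 80-character ASCII cards, padded to a multiple of 2880 bytes and terminated by an `END` card, and `HISTORY` cards are commonly used to record processing steps. The following is a minimal sketch using only the Python standard library; the header content is invented for the demonstration, and in practice a dedicated library such as astropy would normally be used to read real files.

```python
CARD_LEN = 80      # every FITS header card is 80 ASCII characters
BLOCK_LEN = 2880   # headers are padded to a multiple of 2880 bytes


def make_header(cards):
    """Build a toy FITS header block from card strings (for the demo only)."""
    body = "".join(card.ljust(CARD_LEN) for card in cards + ["END"])
    padding = -len(body) % BLOCK_LEN
    return (body + " " * padding).encode("ascii")


def history_cards(header_bytes):
    """Return the text of each HISTORY card found before the END card."""
    text = header_bytes.decode("ascii")
    found = []
    for start in range(0, len(text), CARD_LEN):
        card = text[start:start + CARD_LEN]
        keyword = card[:8].rstrip()   # the keyword occupies columns 1-8
        if keyword == "END":
            break
        if keyword == "HISTORY":
            found.append(card[8:].strip())
    return found


# Invented header: the HISTORY text below is hypothetical example content.
header = make_header([
    "SIMPLE  =" + " " * 20 + "T",
    "BITPIX  =" + " " * 20 + "8",
    "NAXIS   =" + " " * 20 + "0",
    "HISTORY bias subtracted with hypothetical tool v1.0",
    "HISTORY flat-field corrected",
])
print(history_cards(header))
```

Provenance held in this way travels with the data file, so it is worth extracting and recording alongside any separately stored Provenance Information.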

Reproducibility of computer-based research using encapsulated complex software is discussed, in terms of the usability of such software, in:

  • Jupyter Notebooks as Other RepInfo

  • Docker and other containers as Other RepInfo

  • Virtual machines as Other RepInfo

  • Use of ReproZip

Of course, being able to preserve the procedures and software also allows LABDRIVE users to carry out replicability studies by collecting fresh raw data.

These complex software set-ups may be useful in Exploiting preserved information.
