LogoLogo
  • What is LABDRIVE
  • Concepts
    • Architecture and overview
    • Organize your content
    • OAIS and ISO 16363
      • Understanding OAIS and ISO 16363
      • LABDRIVE support for OAIS Conformance
      • Benefits of preserving research data
      • Planning for preservation
      • ISO 16363 certification guide
      • LABDRIVE support for FAIRness
  • Get started
    • Create a data container
    • Upload content
    • Download content
    • Introduction to metadata
    • Search
    • File versioning and recovery
    • Work with data containers
    • Functions
    • Storage mode transitions
    • Jupyter Notebooks
  • Configuration
    • Archive organization
    • Container templates
    • Configure metadata
    • Users and Permissions
    • Running on premises
  • DATA CURATION AND PRESERVATION
    • Introduction
    • Information Lifecycles
    • Collecting Information needed for Re-Use and Preservation
    • Planning and Using Additional Information in LABDRIVE
    • How to deal with Additional Information
      • Representation Information
      • Provenance Information
      • Context Information
      • Reference Information
      • Descriptive Information
      • Packaging Information
      • Definition of the Designated Community(ies)
      • Preservation Objectives
      • Transformational Information Properties
    • Preservation Activities
      • Adding Representation Information
        • Semantic Representation Information
        • Structural Representation Information
        • Other Representation Information
          • Software as part of the RIN
            • Preserving simple software
              • Jupyter Notebooks as Other RepInfo
            • Preserving complex software
              • Emulation/Virtualisation
                • Virtual machines as Other RepInfo
                • Docker and other containers as Other RepInfo
              • Use of ReproZip
      • Transforming the Digital Object
      • Handing over to another archive
    • Reproducing research
    • Exploiting preserved information
  • DEVELOPER'S GUIDE
    • Introduction
    • Functions
    • Scripting
    • API Extended documentation
  • COOKBOOK
    • LABDRIVE Functions gallery
    • AWS CLI with LABDRIVE
    • Using S3 Browser
    • Using FileZilla Pro
    • Getting your S3 bucket name
    • Getting your S3 storage credentials
    • Advanced API File Search
    • Tips for faster uploads
    • File naming recommendations
    • Configuring Azure SAML-based authentication
    • Exporting OAIS AIP Packages
  • File Browser
    • Supported formats for preview
    • Known issues and limitations
  • Changelog and Release Notes
Powered by GitBook
On this page
  • Information structural Representation Information
  • Formal structural Representation Information
  • EAST (Enhanced Ada SubseT)
  • Data Resource Broker (DRB)
  • Data Format Description Language (DFDL)
  • LABDRIVE Automated File Format Identification
  • Tools to create Structural Representation Information

Was this helpful?

  1. DATA CURATION AND PRESERVATION
  2. Preservation Activities
  3. Adding Representation Information

Structural Representation Information

This page describes a number of ways in which Structural Representation Information may be created.

PreviousSemantic Representation InformationNextOther Representation Information

Last updated 3 years ago

Was this helpful?

Information structural Representation Information

A Data Object may be described down to the bit level in a text document, by which we mean they are meant for human use rather than a computer being able to use it (this may change as computers using deep learning to truly understand the text).

As an example the Internet Engineering Task Force standards, on which much of the Internet is based, is documented in . For example the RFC describing ASCII format for Network Interchange is , which includes a table, constructed using text characters, to show how ASCII alphanumeric characters and a number of control characters should be encoded in bits.

Other RFCs include complex diagrams constructed using ASCII characters, for example .

Other text descriptions are for example PDF documents such as for , or indeed paper documents in the case of older instruments.

Formal structural Representation Information

Several formal description languages are available which allow the description of Data Objects down to the bit level in a way which can be used by computers.

EAST (Enhanced Ada SubseT)

This is a which allows data to be described. There is a support website for EAST Based Access Tools – the , and supported by the French Space Agency. An example complex structure for a communications package which can be described by EAST is shown below.

uses EAST to describe and provide access to and combine many types of data, and provides many examples.

Data Resource Broker (DRB)

is is an Open Source Java application programming interface for reading, writing and processing heterogeneous data.

Data Format Description Language (DFDL)

DFDL is a way of describing the data. It is not a data format. DFDL should be able to describe many data formats, including:

  • Textual and binary

  • Commercial record-oriented

  • Scientific and numeric

  • Modern and legacy

  • Industry standards

LABDRIVE Automated File Format Identification

The format identification allows the file to be displayed but does not normally identify the various components to which semantics may be attached.

Tools to create Structural Representation Information

is a modeling language from the Open Grid Forum that allows description of text, dense binary, and legacy data formats in a vendor- neutral declarative manner. DFDL is an extension to the XML Schema Description Language (XSD).

An open source interpreter is available as

DFDL is used to describe various scientific data files, particularly those used in Earth Observation, by ESA in the Standard Archive Format for Europe ().

On ingestion LABDRIVE uses a number of tools (see ) that enable the automatic identification of the file format of a particular file, typically by examining file signatures, usually the first few bytes of the file.

​The file formats are identified using identifiers, however PRONOM provides very little information about the format as illustrated in the following.

the , and

DROID

ASN.1 ()

DFDL
Apache Daffodil
SAFE
https://public.docs.libnova.com/labdrive/api/#/PRONOM
PRONOM
BEST toolkit
other tools
Apache Daffodil
https://www.nationalarchives.gov.uk/information-management/manage-information/preserving-digital-records/droid/
https://www.itu.int/en/ITU-T/asn1/Pages/asn1_project.aspx
RFCs
RFC20
RFC5755
SOHO
CCSDS and ISO standard language
BEST toolkit
other tools
CDPP (Plasma Physics Data Centre)
DRB