Archiving and Preserving Information Packages

These tools and services form the technical foundation of a trusted digital repository, helping data curators carry out responsible archiving and preservation strategies and provide access to their designated communities.

Access Platforms (5)
Tools to publish content and metadata to the web.
Backup and Storage Management (7)
Tools to coordinate responsible storage and preservation strategies.
Creating and Manipulating Metadata (7)
Enriching object descriptions and standardising records.
Emulation (2)
Re-creating obsolete software environments to access old formats.
File Format ID and Validation (3)
Identifying and validating digital files.
Metadata Harvest and Exposure (3)
Using OAI-PMH to share records across repositories.
Normalisation and Migration (4)
Transferring digital materials into preservation-friendly formats.
Persistent ID Assignment (4)
Creating unique identifiers for digital objects.
Repository Platforms (8)
Enabling deposit, preservation, and access to digital content.

XArch is an archive management system that allows one to create, populate, and query archives of multiple database versions.


Archivematica is a digital preservation system that automates the process of preparing digital objects for ingest into a repository, ingesting them into archival storage and providing access to the archived material. The process is monitored and controlled through a Web-based dashboard that co-ordinates a suite of micro-services. It primarily relies on normalisation as its preservation technique.


ArchivesSpace is an archives information management application for managing both physical and digital archival holdings.


CKAN is a catalogue and repository platform tailored for publishing datasets.

Colectica desktop applications

The Colectica platform includes a set of desktop applications for creating, viewing, and editing DDI metadata.


CONTENTdm is a digital collection management system and hosting service. 

Curator's Workbench

The Curator’s Workbench is a tool that automates and streamlines the process of preparing collections of digital materials for submission to a repository.


The D-Net Software Kit creates a network of repositories that share the infrastructure services necessary to process and provide access to digital content.


DataBank is a scalable, domain-agnostic data repository system designed specifically to manage and share research data in an institutional setting.


The DataCite Consortium provides a number of services to support efforts at increasing the ease and prevalence of data citation.


DataUp is a tool that assists researchers in reviewing, documenting, sharing and archiving their tabular data, especially Microsoft Excel spreadsheets.


DataVerse Network software allows organisations to host a storage and access system for research materials.


Dioscuri is a computer hardware emulator, specifically designed to be used as part of a digital preservation strategy.


DSpace is an institutional repository system which enables easy deposit, preservation, and access for all types of digital content. 


DuraCloud is a hosted service that provides a centralised interface for organisations interested in using cloud storage as part of their digital archiving and preservation programmes.


EPrints provides digital repository software that is intended to create a highly configurable web-based repository.   


EZID is a service provided by California Digital Library for creating and managing persistent, unique identifiers.


Fedora provides the back-end foundation for digital repository systems responsible for managing and preserving all types of digital content.


Figshare is an online, open access digital repository which enables users to upload and share research outputs of many different types, including qualitative datasets, media, presentations, posters, software code and figures. Figshare offers unlimited storage space for publicly shared content, as well as 20 GB of private storage per user.


FITS allows data curators to identify, validate, and extract technical metadata for the objects in their digital repository.


ICA-AtoM allows organisations to create standards-based descriptions of their archival holdings and subsequently publish them to the World Wide Web.


iRODS software creates virtual collections, allowing the user to interact with their stored data without needing to keep track of, or even have ultimate control over, the storage and computing facilities hosting the information.


JHOVE2 allows data curators to characterise the digital objects in their repositories using four methods: identifying formats, validating against specifications, extracting metadata, and assessing acceptability.

JSTOR/Harvard Object Validation Environment (JHOVE)

JHOVE allows data curators to verify the file formats of the digital objects in their repositories. 

KEEP Emulation Framework

The KEEP Emulation Framework (EF) allows users to view and interact with digital files that otherwise would require obsolete hardware and software.

LOCKSS (Lots of Copies Keeps Stuff Safe)

LOCKSS takes its name from the idea that a file is more likely to survive if multiple copies of it are stored in multiple places. The software allows libraries to create preserved digital collections out of materials that would otherwise be accessible only through a licensed academic subscription.
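The value of keeping multiple copies can be illustrated with a small sketch. This is not the LOCKSS protocol itself; it is a simplified illustration that assumes each site can return the bytes of its copy, and it uses hypothetical names (`audit_replicas`, `site_a`, and so on). It compares checksums across copies and flags any copy that disagrees with the majority.

```python
import hashlib
from collections import Counter


def sha256_hex(data: bytes) -> str:
    """Return the SHA-256 digest of the given bytes as a hex string."""
    return hashlib.sha256(data).hexdigest()


def audit_replicas(replicas: dict[str, bytes]) -> tuple[str, list[str]]:
    """Return the majority digest and the names of replicas that disagree with it.

    `replicas` maps a (hypothetical) site name to the bytes held at that site.
    """
    digests = {name: sha256_hex(content) for name, content in replicas.items()}
    # The most common digest is treated as the intact version.
    majority_digest, _ = Counter(digests.values()).most_common(1)[0]
    damaged = sorted(name for name, d in digests.items() if d != majority_digest)
    return majority_digest, damaged


# Three copies of the same item; the copy at site_c has been corrupted.
replicas = {
    "site_a": b"journal issue 42",
    "site_b": b"journal issue 42",
    "site_c": b"journal issue 4?",
}
majority, damaged = audit_replicas(replicas)
print(damaged)
```

With a single copy there is nothing to compare against, so silent damage can go unnoticed; with three or more, a damaged copy can be detected and, in principle, repaired from an intact one.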

MIXED (Migration to Intermediate XML for Electronic Data)

MIXED (Migration to Intermediate XML for Electronic Data) is a web service that converts tabular data files such as spreadsheets and databases to the Standard Data Format for Preservation (SDFP), a supplier-independent XML format.
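As a rough illustration of migrating tabular data to an intermediate XML form, the sketch below converts CSV text into a generic XML structure. It does not use the MIXED service or the SDFP schema; the element names (`table`, `row`, `cell`) and the function `table_to_xml` are assumptions made for this example only.

```python
import csv
import io
import xml.etree.ElementTree as ET


def table_to_xml(csv_text: str) -> str:
    """Convert CSV text (first row = column names) into a simple XML string.

    The element and attribute names are illustrative, not SDFP.
    """
    reader = csv.reader(io.StringIO(csv_text))
    header = next(reader)
    root = ET.Element("table")
    for row in reader:
        row_el = ET.SubElement(root, "row")
        for name, value in zip(header, row):
            # Record the column name alongside each value so the XML is self-describing.
            cell = ET.SubElement(row_el, "cell", {"column": name})
            cell.text = value
    return ET.tostring(root, encoding="unicode")


xml_text = table_to_xml("id,name\n1,Ada\n2,Grace\n")
print(xml_text)
```

The point of an intermediate format like this is that the data no longer depends on the spreadsheet or database software that produced it.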


The Nesstar suite is an online publishing platform for organisations wishing to share datasets both internally and with the wider web.


OpenDataForge is a collection of tools for transforming data, with an emphasis on using open data formats and metadata standards.

PREMIS in METS Toolbox

The PREMIS in METS Toolbox is a browser-based application which supports the implementation of PREMIS in the METS container format.

ReDBox / Mint

ReDBox and Mint are two complementary applications designed to create, store, and provide access to research metadata.


RightField is an open-source tool for adding ontology term selection to Excel spreadsheets. RightField uses a 'Template Creator' to create semantically aware Excel spreadsheet templates. These Excel templates are then reused by scientists to collect and annotate their data, without any need to understand, or even be aware of, RightField or the ontologies used. The project’s slogan is “Semantic data annotation by stealth.”


SafeArchive is a policy-driven auditing tool for monitoring and managing content replicated across a LOCKSS (Lots Of Copies Keeps Stuff Safe) network.


WebCite is an on-demand web archiving service that takes snapshots of Internet-accessible digital objects at the request of users, storing the data on WebCite's own servers and assigning unique identifiers to those instances of the material.

Xena Software

Xena is a digital preservation tool that first identifies a digital object’s file format and then, for certain formats, converts the object into an open format.
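The identify-then-convert workflow can be sketched in a few lines. This is not Xena's implementation: the signature table covers only three well-known formats, and the conversion policy (`CONVERT_TO`) and function names are hypothetical choices made for illustration.

```python
from typing import Optional

# Leading byte signatures ("magic numbers") for a few well-known formats.
SIGNATURES = [
    (b"\x89PNG\r\n\x1a\n", "PNG"),
    (b"%PDF-", "PDF"),
    (b"GIF87a", "GIF"),
    (b"GIF89a", "GIF"),
]

# Hypothetical normalisation policy: which formats get converted, and to what.
CONVERT_TO = {"GIF": "PNG"}


def identify(data: bytes) -> str:
    """Return a format name based on the leading bytes, or 'unknown'."""
    for magic, name in SIGNATURES:
        if data.startswith(magic):
            return name
    return "unknown"


def plan(data: bytes) -> tuple[str, Optional[str]]:
    """Return (identified format, target format or None if left as-is)."""
    fmt = identify(data)
    return fmt, CONVERT_TO.get(fmt)


print(plan(b"GIF89a" + b"\x00" * 10))
print(plan(b"%PDF-1.7\n"))
```

Only the planning step is shown here; an actual conversion would need a format-specific library for each source and target format.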