Software and Hardware

Tools

BioSharing

Functionality:  Catalogues. The web-based BioSharing catalogues aim to centralize bioscience data policies, reporting standards and links to other related portals. Their aims include: 1. Providing a “one-stop shop” for those seeking data sharing policy documents and information about the standards and technologies that support them. 2. Exposing core information on well-constituted, community-driven standardization efforts and linking to their standards, documentation, training material, news and contact points.

ShareGeo Open

ShareGeo Open is a data repository for open data. It holds many useful spatial datasets that users have deposited for others to download and re-use. This create – share – reuse philosophy is central to ShareGeo Open. All the data in the repository is open and can be re-used freely, making it an ideal place for students, researchers and teaching staff to find data. In addition to downloading data, users can upload data to the repository.  URL: www.sharegeo.ac.uk

WebCite

WebCite® is a project initiated by the Centre for Global eHealth Innovation at the University of Toronto, intended to digitally archive web material (web pages, PDF documents, and so on) that is cited in scholarly articles. The idea of WebCite is that authors of scholarly papers (as well as editors and publishers of scholarly work) are increasingly citing digital material which is in the public domain on the web, yet which is at risk of disappearing.

Cairo tools survey: a survey of tools applicable to the preparation of digital archives for ingest into a preservation repository

Cairo is a project funded under the 'Tools and Innovation' strand of the JISC's capital programme on Repositories and Preservation. Cairo will develop a tool for ingesting complex collections of born-digital materials, with basic descriptive, preservation and relationship metadata, into a preservation repository.

Automated Obsolescence Notification System (AONS) II

AONS (Automated Obsolescence Notification System) notifies repository managers about formats within digital resources in their repositories and alerts them to potential problems relevant to obsolescence and long term usage.

Collection Services Infrastructure (COSI) Framework

COSI is a web applications framework built to provide authentication (via either LDAP or the built-in service) and role-based access to modules of grouped functionality (or sub-applications). More information about COSI can be found at the APSR web site. COSI is built with PHP (5.2.1) and PostgreSQL (8.2.3), and has been tested running under Apache 2.2.4 on Red Hat Linux 9 and Mac OS X 10.4, but should also run on Windows (and possibly under Internet Information Server). PHP should be configured with pgsql, libxml, xsl, ldap, and openssl.

Field Helper

Field Helper is a desktop application that enables you to quickly view and categorise groups of related digital files and then submit the resulting package to a repository for long term preservation and access.

FEZ

Fez is an open source project to produce and maintain a highly flexible web interface to FEDORA for any Library or Institution to configure and publish or archive documents of any type sustainably.

Online Research Collections Australia (ORCA) Registry

The ORCA Registry software provides for a registry of collection level (and associated service, party and activity) metadata that is based on the ISO 2146 draft standard. More information about ORCA can be found at the APSR web site. The ORCA Registry software consists of a PostgreSQL database that is managed/utilised by a PHP module housed within an instance of the COSI framework.

Repository Interoperability Framework (RIFF) Submission Service Source Code

The RIFF Submission Service provides a service-oriented framework to support the packaging and routing of content and metadata from a source application to a target repository.

Arts and Humanities E-Science Support Centre (AHeSSC)

The Arts and Humanities e-Science Initiative is jointly funded by the AHRC and JISC. The Initiative aims to enable research practitioners to embed the advanced use of ICT in their research and teaching practices. It will also facilitate collaboration across traditional subject and discipline boundaries. The Arts and Humanities e-Science Support Centre (AHeSSC) forms a critical part of the AHRC-JISC initiative on e-Science in Arts and Humanities research.

eDAVID - Expertisecentrum DAVID

Expertisecentrum DAVID is a centre of research and knowledge on digital archiving.

JULIET

JULIET provides a summary of policies given by various research funders as part of their research grant awards. This information is accurate to the best of our knowledge, but should not be relied upon for legal advice. Information on the JULIET breakdown of these policies is available below. JULIET is a complement to the RoMEO service provided by SHERPA for authors and repository administrators, which lists summaries of publishers' copyright transfer agreements as they relate to archiving.

Web Curator Tool

The Web Curator Tool (WCT) is a tool for managing the selective web harvesting process. It is designed for use in libraries and other collecting organisations, and supports collection by non-technical users while still allowing complete control of the web harvesting process. The WCT Project is a collaborative effort by the National Library of New Zealand and the British Library, initiated by the International Internet Preservation Consortium.

Case Study: PerX Experience of Harvesting & Utilising Metadata from Oxford Journals

This brief case study illustrates some of the types of issues encountered by OAI-PMH service providers attempting to utilise third party metadata obtained via OAI-PMH.
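To make the harvesting step concrete, here is a minimal Python sketch of how a service provider might build an OAI-PMH `ListRecords` request and read titles from an `oai_dc` response. It is not taken from the PerX case study. The base URL, the sample response and the function names are hypothetical, and the record with no title is only an assumed example of the kind of gap a harvester may need to tolerate.

```python
import xml.etree.ElementTree as ET
from urllib.parse import urlencode

# Namespaces defined by OAI-PMH 2.0 and Dublin Core.
NS = {
    "oai": "http://www.openarchives.org/OAI/2.0/",
    "dc": "http://purl.org/dc/elements/1.1/",
}


def list_records_url(base_url, metadata_prefix="oai_dc"):
    """Build a ListRecords request URL for a repository's OAI-PMH endpoint."""
    query = urlencode({"verb": "ListRecords", "metadataPrefix": metadata_prefix})
    return base_url + "?" + query


def extract_titles(response_xml):
    """Return one title per record, or None where a record has no dc:title."""
    root = ET.fromstring(response_xml)
    titles = []
    for record in root.iterfind(".//oai:record", NS):
        title = record.find(".//dc:title", NS)
        has_text = title is not None and title.text
        titles.append(title.text.strip() if has_text else None)
    return titles


# Hypothetical response: the second record carries no title at all.
SAMPLE = """<OAI-PMH xmlns="http://www.openarchives.org/OAI/2.0/">
  <ListRecords>
    <record>
      <header><identifier>oai:example.org:1</identifier></header>
      <metadata>
        <oai_dc:dc xmlns:oai_dc="http://www.openarchives.org/OAI/2.0/oai_dc/"
                   xmlns:dc="http://purl.org/dc/elements/1.1/">
          <dc:title>  An example article </dc:title>
        </oai_dc:dc>
      </metadata>
    </record>
    <record>
      <header><identifier>oai:example.org:2</identifier></header>
      <metadata>
        <oai_dc:dc xmlns:oai_dc="http://www.openarchives.org/OAI/2.0/oai_dc/"
                   xmlns:dc="http://purl.org/dc/elements/1.1/"/>
      </metadata>
    </record>
  </ListRecords>
</OAI-PMH>"""

if __name__ == "__main__":
    print(list_records_url("https://example.org/oai"))
    print(extract_titles(SAMPLE))
```

Returning `None` for a missing title, rather than raising an error, lets the service provider decide later how to handle incomplete third-party records.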

Preservation Guide (for Audiovisual Media)

If you have audiovisual media, it needs maintenance - or you will lose it. This guide shows how to: conserve old formats; digitise for transfer to new formats; create digital file formats; use digital restoration; use mass storage; and provide electronic and web access. Functionality:  Wiki. Level of Expertise:  Introductory, but also has information of use to experienced professionals.

AV Digitisation and Storage Guide

A web site for archive owners and the public alike, which provides a first port of call for information on audiovisual storage. The site provides information and management tools on digital technology for the storage of film, video and audio content and associated metadata. The information covers the state of the art in storage technology, and includes forecasts of trends over the next twenty years. Functionality:  Tutorials and on-line tools supporting calculations of storage sizes and costs.

OJAX

OJAX provides a highly dynamic AJAX-based user interface to a federated search service for OAI-PMH compatible repository metadata. OJAX is simple, non-threatening but powerful.

CRiB (Conversion and Recommendation of Digital Object Formats)

The CRiB is a Service Oriented Architecture (SOA) designed to assist cultural heritage institutions in the implementation of migration-based preservation interventions. The CRiB system works by assessing the quality of distinct conversion applications or services to produce recommendations of optimal migration strategies. The recommendations produced by the system take into account the specific preservation requirements of each client institution.

RODA (Repositório de Objectos Digitais Autênticos)

The National Archive Institute of Portugal (IAN/TT) does not currently have the infrastructure needed to support the ingestion and management of digital objects produced by the public administration (PA). eGovernment initiatives require public bodies to base their activity on information and communication technologies in order to improve the efficiency, productivity and quality of their public services. In this scenario it is clear that the number of digital objects produced by these institutions will grow, and that their legal value and authenticity must be assured.

Preserv Project - Preservation Eprints Services

Preserv (now Preserv 2, since it is in its second phase of funding) is a JISC project investigating and developing infrastructural digital preservation services for institutional repositories. Project partners are Southampton University, The National Archives, The British Library and Oxford University. NOTE: This project ended in 2009. Functionality:  An ingest service based on the OAIS reference model for institutional archives built using EPrints software.

Protégé Ontology Editor

Protégé is a free, open source ontology editor and knowledge-base framework. The Protégé platform supports two main ways of modelling ontologies via the Protégé-Frames and Protégé-OWL editors. Protégé ontologies can be exported into a variety of formats including RDF(S), OWL, and XML Schema.

UVC for Images

The National Library of the Netherlands, Koninklijke Bibliotheek [KB], have enhanced their existing preservation services by working with IBM in developing the Universal Virtual Computer. The UVC works by archiving the programme along with a digital file in order that the file can be decoded. PDF files have been used as a testbed, and the project has moved on to investigate JPEG image files. The KB are interested in members of the community testing this programme. Functionality:  Long-term preservation of a specific file format.

GDFR

Academic institutions are beginning to create institutional digital repositories. The GDFR (Global Digital Format Registry) is an ongoing project set up by a collaboration of international digital library and archival communities to develop a sustainable, global registry of digital representation formats, which are described in a human-readable form. The GDFR would interoperate with the TOM tool. Functionality:  File format registry. Level of Expertise:  Knowledge of digital repositories.

Representation Information Repository

This DCC tool records the representation information (e.g. file format structural information as well as semantic metadata) necessary to be able to interpret an archived digital object, and in turn assists users to populate relevant technical preservation metadata fields for digital objects being preserved. Representation information is a term from the OAIS Reference Model, ISO 14721. Functionality:  Tool to create technical preservation metadata.

EdNA Online Metadata Toolset

This tool assists web page creators to embed standardised metadata into their pages. It will also prepare pages for harvesting and integrate different metadata schemas. Lists of the standards can be found on the EdNA website. NOTE: This project ended in September 2011. Functionality:  Metadata creator and editor. Level of Expertise:  Knowledge of metadata and some knowledge of Java applications.

erpaGuidance

This set of guidance papers includes advice on digital preservation issues such as Costing, Risk Management and Policy as well as more hands-on preservation issues such as Selection and Ingest. They provide a valuable source of information for any institution setting out on a preservation project. NOTE: This project ended in 2005. Functionality:  Preservation advice tool.

Records Management Capacity Assessment System

The International Records Management Trust has created a software tool to assist the preservation and management of records in the public sector. This tool can be applied across all public sector areas and disciplines and can identify strengths, weaknesses and risks in existing systems. Functionality:  Records management tool to assist long term preservation. Level of Expertise:  Knowledge of records management.

Xena Software

This open-source preservation software developed by the National Archives of Australia is free and openly available. Xena (XML Electronic Normalising of Archives) converts digital records into two forms for preservation: a base64-encoded version (using only printable ASCII characters) and an XML representation. Both are stored in a digital repository for long-term preservation. Functionality:  Electronic Records Management tool.
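As a rough illustration of the normalisation idea described above, the following Python sketch wraps the base64 encoding of a file's bytes in a small XML envelope and recovers the bytes again. It is not Xena's actual output format or API. The element names (`record`, `content`) and the function names are hypothetical and chosen only for this example.

```python
import base64
import xml.etree.ElementTree as ET


def normalise_to_xml(data: bytes, filename: str) -> str:
    """Wrap raw bytes in an XML envelope as base64 text (hypothetical layout)."""
    record = ET.Element("record", attrib={"filename": filename})
    content = ET.SubElement(record, "content", attrib={"encoding": "base64"})
    # base64 output uses only printable ASCII, so it is safe inside XML text.
    content.text = base64.b64encode(data).decode("ascii")
    return ET.tostring(record, encoding="unicode")


def recover_bytes(xml_text: str) -> bytes:
    """Decode the original bytes back out of the XML envelope."""
    record = ET.fromstring(xml_text)
    encoded = record.find("content").text or ""  # empty files have no text node
    return base64.b64decode(encoded)


if __name__ == "__main__":
    original = b"\x00\x01 binary record \xff"
    wrapped = normalise_to_xml(original, "example.bin")
    print(wrapped)
    print(recover_bytes(wrapped) == original)
```

The round trip shows why this kind of encoding suits long-term storage: the XML can be read as plain text, yet the original bitstream can be recovered exactly.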

XMP

XMP (eXtensible Metadata Platform) is a tool that assists the digital preservation workflow by allowing you to attach metadata to digital objects. It is an open-source product and freely available to the community. XMP is extensible in that it can also accommodate existing metadata schemas. This tool is particularly pertinent for digital imaging. Functionality:  Metadata creation tool. Level of Expertise:  Fairly high. Knowledge of C++ programming is needed to implement this tool.

DSpace

This software is open-source and free for anyone to use. DSpace is an institutional repository system that focuses on long-term preservation. Functionality:  Active preservation strategy. Level of Expertise:  Useful for anyone interested in digital preservation implementations, particularly in the HE/FE research domain.

Fedora

This digital repository service is open source and free to implement. The system architecture supports the storage of digital objects, version control, import and export of objects in XML as well as search and retrieval. Its most powerful feature is that it supports multiple representations of the same digital object, which can be local or remote, thus leading to rich information networks. Functionality:  Digital library/repository software.

EPrints

EPrints is a tool to manage the digital research output of scholarly institutions. It is free and open source and because it is OAI (Open Archives Initiative) compliant, other EPrints archives can also be searched. Its main aim is not long-term preservation, rather it enables libraries to store and retrieve scholarly output in a manner that may enable by-passing other conventional ways of publishing. Functionality:  Repository software.

LOCKSS

Lots Of Copies Keep Stuff Safe. LOCKSS is a freely available preservation service that works on the principle that by persistently caching multiple copies of web serials over multiple sites, the chances of a particular object being preserved are greatly increased. It is used by libraries to preserve their content over the long term. The software is cheap and easy to use, and any institution can get involved. Functionality:  Digital library tool and web page harvester.
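The value of keeping several copies can be sketched in a few lines of Python. The example below compares SHA-256 digests of the copies held at different sites and flags any copy that disagrees with the majority. This is a simplified illustration of the principle only, not the LOCKSS protocol or its software; the site names and function names are hypothetical.

```python
import hashlib
from collections import Counter
from typing import Dict, List


def digest(data: bytes) -> str:
    """Return the SHA-256 hex digest of one cached copy."""
    return hashlib.sha256(data).hexdigest()


def find_damaged_copies(copies: Dict[str, bytes]) -> List[str]:
    """Return the sites whose copy disagrees with the most common digest.

    Assumes the majority of copies are intact. On a tie, the digest
    seen first is treated as the reference.
    """
    if not copies:
        return []
    digests = {site: digest(data) for site, data in copies.items()}
    majority_digest, _count = Counter(digests.values()).most_common(1)[0]
    return sorted(site for site, d in digests.items() if d != majority_digest)


if __name__ == "__main__":
    copies = {
        "site_a": b"journal issue 1",
        "site_b": b"journal issue 1",
        "site_c": b"journal issue 1 (corrupted)",
    }
    print(find_damaged_copies(copies))
```

A damaged copy found this way could then be repaired from one of the sites that agree, which is why more independent copies improve the odds of survival.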

IMAPpreserve

Independent Media Arts Preservation, Inc. (IMAP) is a non-profit service, education, and advocacy organization committed to the preservation of non-commercial electronic media. IMAP has grown from a New York-based consortium of arts organizations and individuals to a national resource for preservation training, information, and advocacy. IMAP's core constituents include institutions, organizations, and individuals whose diverse media collections are underserved by existing preservation efforts.

ERPANET

ERPANET is one of the largest repositories of information on digital preservation. The various erpaProducts range from guidance documents on the costing of digital preservation and implementing policies (among others), to critiques and contextualisation of the important papers and materials produced so far. Case studies explore the experiences of various companies and institutions with digital preservation, and a large number of reports and papers offer access to the latest research in the field.

CORDRA (Content Object Repository Discovery and Registration/Resolution Architecture)

This tool can be used to create handles or IDs to assign to files. NOTE: To the best of our knowledge, this project is no longer active.  A cached version of the project website is available thanks to the Internet Archive.  Please let us know if you have more current information! Functionality:  ID generator. Level of Expertise:  High. Simple to execute, but knowledge of Data IDs and handles needed.

OceanStore

A global persistent store that will support millions of users at any given time. Functionality:  Durable, consistent storage for back-up purposes. Level of Expertise:  Basic knowledge of storage systems.

CASTOR

CASTOR (CERN Advanced STORage Manager). This software is free for academics, but only runs on Linux. CASTOR is a system for interfacing to mass storage, such as a tape robot. Functionality:  Mass Storage Interface. CASTOR provides a C API and a set of Unix commands to access data transparently, without needing to know whether it is on local or remote storage. Level of Expertise:  High.

Tei Publisher

This digital repository toolkit enables XML-based digital objects to be searched and retrieved according to any XML DTD. The tool is aimed at assisting repository administrators who have limited technical knowledge. NOTE: To the best of our knowledge, this project is no longer active.  Please let us know if you have more current information! Functionality:  XML-based digital repository for management of digital collections.

SRB

Storage Resource Broker, San Diego Supercomputer Centre. Free to academic users. NOTE: To the best of our knowledge, this project is no longer active.  Please let us know if you have more current information! Functionality:  Mass Storage Interface with metafile catalogue. The SDSC Storage Resource Broker is client-server middleware that provides a uniform interface for connecting to heterogeneous data resources over a network and accessing replicated data sets.

Metadata Extraction Tool

This has been developed by the National Library of New Zealand and leads the way in being one of the first tools of its kind to extract preservation metadata from the headers of a range of file formats. The metadata standard used is the NLNZ preservation metadata schema, but it can be configured to support other business processes. Automatic extraction is a key development for the digital preservation community.

JISC DIGITAL MEDIA

The Technical Advisory Service for Images is a JISC funded service. It provides advice and guidance to the Further and Higher Education community on the issues of: creating digital images (including raster, vector and animated formats); delivering digital images to users; using digital images to support teaching, learning and research; and managing both small and large scale digitisation projects.

Grainger Engineering Library Information Center - Digital Library Research Projects

A collection of digital library projects at the University of Illinois at Urbana-Champaign. These include search tools and resources for digital library development. NOTE: To the best of our knowledge, this project is no longer active.  Please let us know if you have more current information!

VIRLIB - Electronic Document Store (Universities of Antwerp)

The VIRLIB project focuses on developing a delivery service for electronic documents as an addition to Impala, the Belgian system for managing and transmitting interlibrary loan requests. In other words, the VIRLIB II system will enable the interlibrary loan department of any scientific library affiliated to the Impala network to deliver any requested article, in electronic format, directly to the workstation of the user who asked for it.

ICA (International Council on Archives)

The International Council on Archives (ICA) is dedicated to the advancement of archives worldwide. In pursuing the advancement of archives, ICA works for the protection and enhancement of the memory of the world. ICA is the professional organisation for the world archival community, dedicated to promoting the preservation, development, and use of the world's archival heritage. It brings together national archive administrations, professional associations of archivists, regional and local archives and archives of other organisations as well as individual archivists.

DSpace@Cambridge

Cambridge University Library, in association with the University Computing Service, has formulated a major project to provide the University with an institutional digital repository, "DSpace@Cambridge". This repository will provide a home for the increasing amount of material that is being digitised from the University Library's own printed and manuscript collections. It also has the ability to capture, index, store, disseminate and preserve digital materials created in any part of the University.

SKOS Core

SKOS stands for Simple Knowledge Organisation System. SKOS Core is a model used to describe 'concept' schemas, such as glossaries, thesauri and controlled vocabularies, and the relationships between them. Functionality:  Provides a framework for expressing knowledge organisation systems in a machine-understandable way. Level of Expertise:  Good understanding of the concept of ontologies and descriptive schemas, such as RDF.

Maastricht McLuhan Institute

The Maastricht McLuhan Institute (MMI), European Centre for Digital Culture, Knowledge Organisation and Learning Technology, was officially opened by Dr Eric McLuhan in November, 1998 and began its formal activities in January, 1999 at the Grote Gracht 82 in Maastricht. MMI is an initiative of the Universiteit Maastricht, the Hogeschool Maastricht, the Hogeschool Limburg, the Limburgs Universitair Centrum (Diepenbeek), the LIOF Industriebank N.V. and the Province of Limburg.

EPICUR - Enhancement of Persistent Identifier Services: Comprehensive Method for Unequivocal Resource Identification

Persistent identifiers are an essential condition for the effective management of digital resources and reliable access to electronic documents. Several persistent identifier services have already been established; however, there is a general demand for the active introduction of persistent identifiers, further development of the technical components of persistent identifier services, and the establishment of an organisational infrastructure for persistent identifiers.

Go-Geo

Go-Geo! is an online resource discovery tool which allows for the identification and retrieval of records describing the content, quality, condition and other characteristics of geospatial data that exist within UK tertiary education and beyond. The portal supports geospatial searching by interactive map, grid co-ordinates and place name, as well as the more traditional topic or keyword forms of searching.

JHOVE

JHOVE (JSTOR/Harvard Object Validation Environment), USA. Every object that is ingested into a digital archive needs to be identified and its file format needs to be verified. A data curator needs to be sure that the digital object entered into an archive is in the file format that it purports to be. Functionality:  File format verification tool. Level of Expertise:  High. Technical knowledge of applications and APIs (Application Programming Interfaces) is needed to implement the tool.
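The simplest form of the check described above can be sketched in Python by comparing a file's leading bytes against well-known format signatures. This is only an illustration of the identification idea; it does not use JHOVE or reproduce its API, and real validation inspects far more than the first few bytes. The function names and the small signature table are assumptions made for this example.

```python
from typing import Optional

# Leading byte signatures ("magic numbers") for a few common formats.
SIGNATURES = {
    "pdf": (b"%PDF-",),
    "jpeg": (b"\xff\xd8\xff",),
    "png": (b"\x89PNG\r\n\x1a\n",),
    "gif": (b"GIF87a", b"GIF89a"),
}


def identify(data: bytes) -> Optional[str]:
    """Return the format whose signature matches the start of the data."""
    for name, prefixes in SIGNATURES.items():
        if data.startswith(prefixes):  # startswith accepts a tuple of prefixes
            return name
    return None


def matches_claimed_format(data: bytes, claimed: str) -> bool:
    """Check that the data looks like the format it purports to be."""
    return identify(data) == claimed.lower()


if __name__ == "__main__":
    sample = b"%PDF-1.4\n% example only"
    print(identify(sample))
    print(matches_claimed_format(sample, "jpeg"))
```

A curator could run a check like this at ingest to catch files whose extension or declared type does not match their content, before handing them to a full validator.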

PRONOM

PRONOM is an online registry of technical information. It is a resource for anyone requiring impartial and definitive information about the file formats, software products and other technical components required to support long-term access to electronic records and other digital objects of cultural, historical or business value. Functionality:  File format specification database. Level of Expertise:  Knowledge of metadata schemas, in particular technical preservation metadata.

CAMiLEON (Creative Archiving at Michigan and Leeds: Emulating the Old on the New)

The CAMiLEON Project is developing and evaluating a range of technical strategies for the long term preservation of digital materials. User evaluation studies and a preservation cost analysis are providing answers as to when and where these strategies will be used. The project is a joint undertaking between the Universities of Michigan (USA) and Leeds (UK) and is funded by JISC and NSF. CAMiLEON stands for Creative Archiving at Michigan and Leeds: Emulating the Old on the New.

Software

BioSharing

Functionality:  Catalogues. The web-based BioSharing catalogues aim to centralize bioscience data policies, reporting standards and links to other related portals. Their aims include: 1. Providing a “one-stop shop” for those seeking data sharing policy documents and information about the standards and technologies that support them. 2. Exposing core information on well-constituted, community-driven standardization efforts and linking to their standards, documentation, training material, news and contact points.

ShareGeo Open

ShareGeo Open is a data repository for open data. It holds many useful spatial datasets that users have deposited for others to download and re-use. This create – share – reuse philosophy is central to ShareGeo Open. All the data in the repository is open and can be re-used freely, making it an ideal place for students, researchers and teaching staff to find data. In addition to downloading data, users can upload data to the repository.  URL: www.sharegeo.ac.uk

WebCite

WebCite® is a project initiated by the Centre for Global eHealth Innovation at the University of Toronto, intended to digitally archive web material (web pages, PDF documents, and so on) that is cited in scholarly articles. The idea of WebCite is that authors of scholarly papers (as well as editors and publishers of scholarly work) are increasingly citing digital material which is in the public domain on the web, yet which is at risk of disappearing.

OJAX

OJAX provides a highly dynamic AJAX-based user interface to a federated search service for OAI-PMH compatible repository metadata. OJAX is simple, non-threatening but powerful.

OSSWatch CDs

OSSWatch, the Jisc funded Open Source advisory service, have just released two new CDs to promote awareness of Open Source software. The first is Knoppix 4.02. Knoppix is a Linux LiveCD, which means that you put it in the CD-ROM drive of your PC, reboot, and you will have a usable Linux desktop environment that you can explore. The CD also contains running versions of the Moodle and Bodington VLEs. The second disc is TheOpenCD. TheOpenCD is a collection of open source software that runs on the Windows operating system. All of the programs are ready to install.

Media

Permanence Through Change: The Variable Media Approach

Jean Gagnon is Executive Director, The Daniel Langlois Foundation for Art, Science, and Technology, Montreal. Since its founding, the Daniel Langlois Foundation for Art, Science, and Technology has considered the preservation of electronic and digital artworks a pressing matter. But it took some years before we received any project demonstrating a truly innovative approach to this issue. When the Solomon R.

IMAPpreserve

Independent Media Arts Preservation, Inc. (IMAP) is a non-profit service, education, and advocacy organization committed to the preservation of non-commercial electronic media. IMAP has grown from a New York-based consortium of arts organizations and individuals to a national resource for preservation training, information, and advocacy. IMAP's core constituents include institutions, organizations, and individuals whose diverse media collections are underserved by existing preservation efforts.

Software development models

CRiB (Conversion and Recommendation of Digital Object Formats)

The CRiB is a Service Oriented Architecture (SOA) designed to assist cultural heritage institutions in the implementation of migration-based preservation interventions. The CRiB system works by assessing the quality of distinct conversion applications or services to produce recommendations of optimal migration strategies. The recommendations produced by the system take into account the specific preservation requirements of each client institution.

The DCC is funded by

Joint Information Systems Committee