IDCC11 Session 2B: Preservation

9 December, 2011
Marieke Guy

Parallel session 2B on preservation

The preservation strand presentations seemed to be only loosely connected to preservation, but all the institutions involved have taken open approaches to archiving:

CKAN: Enabling Collaboration around the Open Data Ecosytem, Irina Bolychevsky, The Open Knowledge Foundation

Irina Bolychevsky from the Open Knowledge Foundation, a not-for-profit organisation promoting open data, started the session by demoing some of the OKF’s initiatives.

Openspending.org aims to track as many government and corporate financial transactions as possible. It offers visualisation of UK government spending data. The main premise of the OKF is that data is valuable and openness is key. They have worked with many clients to help create open portals using CKAN open-source data portal software.

Data.gov.uk is one of their flagship clients. They want to help users solve the difficulty of finding datasets on official sites.

Another CKF offering is the community site datahub.org, which serves as a data store.

Areas OKF are looking into include collaboration, citation (bringing more value to the data itself) and curation.

Peer-reviewed open research data: pilot results, Marjan Grootveld, Data Archiving and Networked Services (DANS)

The Data Archiving and Networked Services (DANS) promotes sustained access to digital research data and holds "Open if possible, protected if necessary" as one element of its mission.

Majan described a recent pilot study carried at EASY, the electronic archive for (open) research data of DANS. A workflow was established allowing researchers who have downloaded data sets from the archive to review them.

Terms like quality were deliberately not defined and results showed basic ratings for various data sets which could provide an estimate of feasibility of ratings in the repository.

The next steps for the project include how to embed the structural data review process and present qualitative feedback and tags.

Managing risks in the preservation of research data with preservation networks, Esther Conway, Science and Technology Facilities Council

Esther Conway from the Science and Technology Facilities Council (STFC) explained how networked modelling strategies could be applied to preservation solutions and tailored to the needs of the community and responsive to their environment.

At STFC they have created a Preservation Network Model that is used to plan preservation activities according to different situations.

Esther explained that when you are archiving an ARP you need to record certain information such as location, and that the OAIS model can then be used. The ENSURE project looked at the cost and value of solutions that form the preservation lifecycle.

Esther asked “What do you value about your data?” as this will help you decide what solution you use.

The ENSURE project had raised questions about the preservation approaches chosen: risks, benefits, performance, costs, reuse, quality and trust.