How to run a data service
Members of the UK Data Archive team ran an very useful event over two days earlier this month to share their experiences of how to run a data service. They explained the processes they've developed and shared practical tips based on lessons learned. My event fee was generously funded through the DPC leadership programme - thanks!
Matthew Woollard explained the UKDA is a family of services including ESDS, the History Data Service, Survey Question Bank and Secure Data Service. They have an annual turnover of around £3 million and employ sixty-two staff, demonstrating how much can be achieved on comparatively little. Running services together clearly brings them great economies of scale and seems a useful model for HEIs to reflect on as they start to address demands for data management infrastructure - collaboration will be key.
Matthew suggested there are six main functions data centres need to cover:
- Pre-ingest / acquisition
- Ingest
- Data management / archival storage
- Preservation
- Access
- Administration
Staff working in each of these areas gave presentations and ran surgeries. Some key lessons are shared below and full event notes can be downloaded:
- Building trust is key. Sue Cadogan explained some researchers find it emotional to hand over their data so staff will invite them into the archive so they can see their data will be properly looked after.
- In the past, data tended to be refused for poor documentation but now it’s more likely IPR or ethics will prevent deposit. If licence and consent agreements don’t cover preservation and re-use, the archive can’t accept the data. They’ve done lots of outreach and training on this issue and have reduced refusals of qualitative data on grounds of insufficient consent from around 40% to 25%.
- Depositors often select the closed options on license forms - even when data can be shared. So now, there are only two options on the licence form as standard: for educational and commercial access; or for educational use only.
- UKDA produces various resources to supporting teaching and learning. There are thirty-five tailor-made sample survey datasets (subsets of large surveys that are easier for students to handle) and fifteen teaching datasets in the ESDS Nesstar Catalogue. They also have SPSS workbooks that focus on the skills students need in order to work with a real-world survey dataset.
- The Secure Data Service feels people are the weak link in security and that researcher ignorance is a far greater risk than technical threats. They provide compulsory training to try and combat this and have been careful to replicate researchers’ preferred working environment, as breaches occur most when systems are inconvenient.
The Incremental project ran a training course shortly after this event, pulling UKDA’s processes out in a case study to explain the principles researchers could adopt to try to keep their data accessible in the long-term. See slides 14-16 from the ‘How to Manage Your Data’ presentation.
- Home
- Digital curation
- About us
- News
- Events
- Resources
- Briefing Papers
- Introduction to Curation
- Annotation
- Appraisal and Selection
- Curating Emails
- Curating e-Science Data
- Curating Geospatial Data
- Data Accreditation
- Data Citation and Linking
- Data Protection
- Database Archiving
- Digital Repositories
- Freedom of Information
- Genre Classification
- Interoperability
- Persistent Identifiers
- Trust Through Self Audit
- Using OAIS for Curation
- Web 2.0
- What is Digital Curation?
- Making the Case for RDM
- Research Data Readiness
- Legal Watch Papers
- Standards Watch Papers
- Technology Watch Papers
- Introduction to Curation
- How-to Guides
- Curation Reference Manual
- Peer review
- Editorial Board
- Completed chapters
- Appraisal and Selection
- Archival Metadata
- Archiving Web Resources
- Curating Emails
- File Formats
- Investment in an Intangible Asset
- Learning Object Metadata
- Metadata
- Ontologies
- Open Source for Digital Curation
- Preservation Metadata
- Preservation Strategies
- Principles for Enabling Access to Engineering Design Information Through Life
- Chapters in production
- Curation Lifecycle Model
- Policy and legal
- Data Management Plans
- Tools
- Case studies
- Repository audit and assessment
- Standards
- Publications and presentations
- Roles
- Curation journals
- Informatics research
- External resources
- Briefing Papers
- Training
- Projects
- Community
