CKAN

CKAN (Comprehensive Knowledge Archive Network) is a data portal platform, that is, software for building a catalogue and repository for datasets. The system can store datasets, or simply hold metadata for datasets hosted externally. It provides discovery services such as keyword and map-based searching, and can harvest metadata from and syndicate its own metadata to other repositories.
Several providers offer hosted solutions, or the software may be downloaded and installed locally. For those taking the latter option, the CKAN team offers various support packages.
Provider
Open Knowledge Foundation.
Licensing and cost
CKAN is free and open source software, released under the GNU Affero GPL (General Public Licence) version 3.0.
The Open Knowledge Foundation offers hosting options on a monthly subscription basis, plus deployment and support contracts for organisations hosting their own installations. The costs are quoted on application, but several levels are offered depending on the scale of the installation.
Development Activity
CKAN version 2.2 was released in February 2014.
Development is ongoing at the time of writing.
Platform and interoperability
CKAN is most easily installed on a server running Ubuntu 12.04 LTS, for which pre-compiled packages are available. It may however be installed from source on many other platforms, since the technologies it uses – Python, PostgreSQL, Apache Solr, Jetty, Java – are cross-platform.
Filters have been written for harvesting metadata from other CKAN instances, CSW servers, WAFs (Web Accessible Folders), ArcGIS portals, Geoportal Servers, Z39.50 databases, DCAT (Data Catalog Vocabulary) RDF/XML sources, and various specific repositories. These can be used as templates for defining further filters. A CKAN instance can syndicate its metadata through CSW, RSS/Atom feeds, RDF (XML, N3), and CSV and JSON dumps.
An API is provided based on REST (Respresentational State Transfer) principles and JSON.
Functional notes
Data uploaded to CKAN can be public or private to a particular organisation. Uploaded data may be visualised using the built-in preview tools, and custom previewers may be added. Themes are available for changing the look and feel of the installation with relative ease.
Extensions are available for extending the functionality of CTAN in various ways, such as allowing users to comment on datasets, providing activity alerts on a per-dataset basis, and performing formal quality assurance checks.
Documentation and user support
The CKAN documentation website offers separate guides for users, administrators, installation maintainers, and developers. There are also a developer mailing list and a discussion mailing list, both of which show significant activity at the time of writing.
The Open Knowledge Foundation offers paid support in the form of consultancy, training, custom development, technical support (especially for upgrades) and priority/local bug fixes.
Usability
Day-to-day use of the system is through a straighforward web interface, while command-line tools are provided for technical maintenance tasks.
Expertise required
Little expertise is needed to use the system as an end user. Administering an installation is also relatively simple.
Installing, modifying and upgrading systems, however, requires some confidence in the technologies involved. Depending on the task involved that might involve familiarity with HTML, CSS, Python, or use of the command line.
Standards compliance
CKAN uses its own metadata scheme internally, but it may be extended with arbitrary metadata elements in key/value pairs. Support is included for geospatial data standards such as ISO 19139, UK GEMINI 2.1 and FGDC.
Influence and take-up
The CKAN website lists over 70 instances in use across the world, including national data portals for Australia, Austria, Canada, Italy, Norway, the Netherlands, Romania, Slovakia, the UK, and the USA.
- Home
- Digital curation
- About us
- News
- Events
- Resources
- Briefing Papers
- Introduction to Curation
- Annotation
- Appraisal and Selection
- Curating Emails
- Curating e-Science Data
- Curating Geospatial Data
- Data Accreditation
- Data Citation and Linking
- Data Protection
- Database Archiving
- Digital Repositories
- Freedom of Information
- Genre Classification
- Interoperability
- Persistent Identifiers
- Trust Through Self Assessment
- Using OAIS for Curation
- Web 2.0
- What is Digital Curation?
- Common Directions in Research Data Policy
- 5 Steps to Research Data Readiness
- Citizen Science
- Making the Case for RDM
- Legal Watch Papers
- Standards Watch Papers
- Technology Watch Papers
- Introduction to Curation
- How-to Guides & Checklists
- Appraise & Select Research Data for Curation
- Cite Datasets and Link to Publications
- Develop RDM Services
- Develop a DMP
- Discover Requirements
- Five Steps to Decide What Data to Keep
- Five Things You Need to Know About RDM and the Law
- License Research Data
- Track Data Impact with Metrics
- Using RISE
- Where to keep research data
- Write a Lay Summary
- Developing RDM Services
- Reviewing research data platform capabilities at CISER
- Using EPrints to Build a Repository for UEL
- Assigning DOIs at Bristol
- DMPs in the Arts and Humanities
- Improving RDM at Monash
- Improving Research Visibility
- Increasing Participation in Training
- RDM Training for Librarians
- RDM strategy: moving from plans to action
- Storing and Sharing Data in Hull
- Curation Lifecycle Model
- Curation Reference Manual
- Peer review
- Editorial Board
- Completed chapters
- Appraisal and Selection
- Archival Metadata
- Archiving Web Resources
- Automated Metadata Generation
- Curating Emails
- File Formats
- Investment in an Intangible Asset
- Learning Object Metadata
- Metadata
- Ontologies
- Open Source for Digital Curation
- Preservation Metadata
- Preservation Scenarios for Projects Producing Digital Resources
- Preservation Strategies
- Principles for Enabling Access to Engineering Design Information Through Life
- Scientific Metadata
- The Role of Microfilm in Digital Preservation
- Chapters in production
- Policy and legal
- Data Management Plans
- Tools
- Case studies
- Repository audit and assessment
- Standards
- Publications and presentations
- Roles
- Curation journals
- Informatics research
- External resources
- Online Store
- Briefing Papers
- Training
- Projects
- Community
- Tailored support