FITS
FITS allows data curators to identify, validate, and extract technical metadata for the objects in their digital repository. It does this by encorporating a range of mostly third-party open source tools, normalising and consolidating their output.
Provider
Harvard University Library Office for Information Systems
Licensing and cost
GNU Lesser GPL – free.
Development activity
FITS 0.8.3 was released in September 2014.
The tool was created to be used in Harvard’s Digital Repository Service, so is likely to be well maintained. Development activity can be tracked through the tool's GitHub source code repository.
Platform and interoperability
FITS is written in Java and is compatible with Java 1.6 or higher. It uses seven external tools – DROID, Exiftool, FFIdent, the File utility, JHOVE, the National Library of New Zealand Metadata Extractor, and Apache Tika – plus four tools developed by Harvard Library.
Instructions for installation and command line use are given for Windows, Mac OS X and '*nix' (i.e. Linux, BSD, Solaris, etc.).
As a command-line tool, it is relatively simple to incorporate FITS into automated workflows and larger applications. It also has a documented Java API so it can be incorporated into other applications directly.
Functional notes
FITS acts as a wrapper, invoking and managing the output from several other open source tools. Output from these tools are converted into a common format, compared to one another and consolidated into a single XML output file. Technical metadata is only output (and a part of the consolidation process) for tools that were able to identify the file. All other output is discarded.
Documentation and user support
Documentation exists in the form of a Quick Start guide, a User Manual and a Developer Manual. The Quick Start guide walks the user through using the tool, assuming no familiarity with the command line/terminal, while the manuals are progressively more technical.
The project actively uses the fits-users Google Group, which is active as of November 2014. Issues may be reported to the tool's issue tracker on GitHub.
Usability
FITS uses a command line interface; it is designed to be integrated into other software workflows, and so is aimed at those with application design experience.
Expertise required
Installation and use are relatively straightforward, though some may find the command-line interface intimidating. Interpreting the output requires familiarity with file format and metadata standards, and possibly additional tools.
Standards compliance
FITS can output in various XML formats, including AES Audio Object, DocumentMD, MIX, and TextMD.
Influence and take-up
The old FITS website (now deprecated) shows there were over 3800 downloads of version 0.6.2. The tool was designed for and is in use at the Harvard Digital Repository Service; it is also used in the Archivematica digital preservation system.
- Home
- Digital curation
- About us
- News
- Events
- Resources
- Briefing Papers
- Introduction to Curation
- Annotation
- Appraisal and Selection
- Curating Emails
- Curating e-Science Data
- Curating Geospatial Data
- Data Accreditation
- Data Citation and Linking
- Data Protection
- Database Archiving
- Digital Repositories
- Freedom of Information
- Genre Classification
- Interoperability
- Persistent Identifiers
- Trust Through Self Assessment
- Using OAIS for Curation
- Web 2.0
- What is Digital Curation?
- Common Directions in Research Data Policy
- 5 Steps to Research Data Readiness
- Citizen Science
- Making the Case for RDM
- Legal Watch Papers
- Standards Watch Papers
- Technology Watch Papers
- Introduction to Curation
- How-to Guides & Checklists
- Appraise & Select Research Data for Curation
- Cite Datasets and Link to Publications
- Develop RDM Services
- Develop a DMP
- Discover Requirements
- Five Steps to Decide What Data to Keep
- Five Things You Need to Know About RDM and the Law
- License Research Data
- Track Data Impact with Metrics
- Using RISE
- Where to keep research data
- Write a Lay Summary
- Developing RDM Services
- Reviewing research data platform capabilities at CISER
- Using EPrints to Build a Repository for UEL
- Assigning DOIs at Bristol
- DMPs in the Arts and Humanities
- Improving RDM at Monash
- Improving Research Visibility
- Increasing Participation in Training
- RDM Training for Librarians
- RDM strategy: moving from plans to action
- Storing and Sharing Data in Hull
- Curation Lifecycle Model
- Curation Reference Manual
- Peer review
- Editorial Board
- Completed chapters
- Appraisal and Selection
- Archival Metadata
- Archiving Web Resources
- Automated Metadata Generation
- Curating Emails
- File Formats
- Investment in an Intangible Asset
- Learning Object Metadata
- Metadata
- Ontologies
- Open Source for Digital Curation
- Preservation Metadata
- Preservation Scenarios for Projects Producing Digital Resources
- Preservation Strategies
- Principles for Enabling Access to Engineering Design Information Through Life
- Scientific Metadata
- The Role of Microfilm in Digital Preservation
- Chapters in production
- Policy and legal
- Data Management Plans
- Tools
- Case studies
- Repository audit and assessment
- Standards
- Publications and presentations
- Roles
- Curation journals
- Informatics research
- External resources
- Online Store
- Briefing Papers
- Training
- Projects
- Community
- Tailored support
