Web Archiving

Gathering collections of web content.
Tool Implementation Cost Platform Installation User interface API
Archive-It
  • Service
  • Subscription
  • Lin
  • Mac
  • Win
  • N/A
  • Web
  • Yes
CDL Web Archiving Service
  • Web Service
  • Subscription
  • N/A
  • Web
  • No
Heritrix
  • Download
  • Free
  • Lin
  • Mac
  • Win
  • Moderate
  • CL
  • Web
  • Yes
Netarchive Suite
  • Download
  • Free
  • Lin
  • Simple
  • GUI
  • Yes
Web Curator Tool
  • Download
  • Free
  • Lin
  • Win
  • Complex
  • GUI
  • Yes
WebCite
  • Service
  • Free
  • Lin
  • Mac
  • Win
  • N/A
  • Web
  • No

A subscription-based web archiving service from the Internet Archive.

The California Digital Library's Web Archiving Service allows libraries and insititutions to build archival collections of websites relevant to particular topics.

Heritrix is an open-source web crawler, allowing users to target websites they wish to include in a collection and to harvest an instance of each site.

The Netarchive Suite is a web archiving software package designed to plan, schedule and run web harvests of parts of the Internet.

The Web Curator Tool (WCT) is a workflow management application for selective web archiving. 

WebCite is an on-demand web archiving service that takes snapshots of Internet-accessible digital objects at the behest of users, storing the data on their own servers and assigning unique identifiers to those instances of the material.