Save KB websites to Wayback Machine

Data, stories and scripts for archiving KB-managed websites to the Internet Archive’s Wayback Machine.

Purpose

Some websites managed by the KB, national library of the Netherlands, have been discontinued over the past years. To preserve their content, eg. for Wikipedia sourcing and other cultural heritage purposes, the KB actively archives these websites into the Wayback Machine at web.archive.org.


Browse archived sites

See the overview of archived sites. This page also gives access to the datasets of pages (URLs) archived in the Wayback Machine, available as Excel, TXT or CSV files.

Screenshot of the overview page of all archived sites


Stories

Read the stories behind some of these archiving projects — narratives of how (parts of) KB websites were rescued from the digital memory hole, and the role AI assistants played along the way.


Scripts

The Python scripts used to archive websites into the Wayback Machine are documented in the wbm-archiver-scripts section. These include: