← Back to manuscripts.kb.nl

URL spreadsheet: manuscripts-urls-wbm-archived.xlsx

The master spreadsheet manuscripts-urls-wbm-archived.xlsx contains all URLs from manuscripts.kb.nl (Medieval Illuminated Manuscripts / Middeleeuwse Verluchte Handschriften) that were archived to the Wayback Machine in December 2025, before the site’s shutdown on 15 December 2025.

A second file, manuscripts-urls-spider-output.xlsx, contains the full spider crawl output (12,550 URLs) but has no archiving data. It is kept as a reference for the complete URL inventory.

Sheets overview

Sheet Rows Purpose
ALL_URLS 7,460 Union of all sheets below — one row per unique archived URL
show_manuscript 2,371 Manuscript detail pages
show_images_text 2,322 Image gallery pages with text descriptions
show_text 1,520 Text-only manuscript views
search_extended 806 Extended search result pages
search_literature 397 Literature search result pages
indexes 9 Browse index pages (shelfmark, author/title, place, language, etc.)
static_pages 8 Static pages (homepage, introduction, background, advanced search)
wiki_priority 61 All URLs linked from Dutch Wikipedia or Wikimedia Commons

Column schema

All sheets share the same column structure:

Column Description Example
URL Original manuscripts.kb.nl URL https://manuscripts.kb.nl/show/manuscript/10+A+11
Source Sheet/category this URL belongs to show_manuscript
Wiki_Source Wikipedia or Commons origin, if linked from Wikimedia Wikipedia NL, Commons
Wiki_Page_Title Title of the Wikipedia/Commons page that links to this URL Meester van het gebedenboek van Dresden
Wiki_Page_ID Page ID on the source wiki 2810762
WBM_Status Archiving result status OK
WBM_Job_ID Save Page Now (SPN2) job identifier spn2-abf176e5dee7...
WBM_URL_submission WBM URL returned by the SPN2 API at submission time https://web.archive.org/web/20251211014959/https://manuscripts.kb.nl/indexes/authortitle
WBM_Timestamp_submission Timestamp of the SPN2 submission 2025-12-11T01:49:59
WBM_URL_capture Actual capture URL found via CDX API lookup https://web.archive.org/web/20251211005650/https://manuscripts.kb.nl/indexes/authortitle
WBM_Timestamp_capture Timestamp of that CDX capture 2025-12-11T00:56:50
Error Error message, if the submission failed (blank when OK)

Sheet: ALL_URLS

Union of all per-type sheets, plus 27 URLs that appear only in wiki_priority (mostly /zoom/ and /search/ pages linked from Wikimedia but not covered by the main archiving run). This is the single sheet to query if you want to look up any manuscripts.kb.nl URL.


Sheet: show_manuscript

2,371 manuscript detail pages. Each URL follows the pattern /show/manuscript/{shelfmark} or /show/manuscript/{shelfmark}/page/{n}.

Example: https://manuscripts.kb.nl/show/manuscript/10+A+11/page/1


Sheet: show_images_text

2,322 image gallery pages with accompanying text. URL pattern: /show/images_text/{shelfmark} or /show/images_text/{shelfmark}/page/{n}.

Example: https://manuscripts.kb.nl/show/images_text/10+A+11


Sheet: show_text

1,520 text-only views of manuscripts. URL pattern: /show/text/{shelfmark} or /show/text/{shelfmark}/page/{n}.

Example: https://manuscripts.kb.nl/show/text/10+A+11


Sheet: search_extended

806 extended search result pages. URL pattern: /search/manuscript/extended/page/{n}/shelfmark/{shelfmark}.

Example: https://manuscripts.kb.nl/search/manuscript/extended/page/1/shelfmark/10+A+11


Sheet: search_literature

397 literature search pages, one per shelfmark. URL pattern: /search/literature/{shelfmark}.

Example: https://manuscripts.kb.nl/search/literature/10+A+11


Sheet: indexes

9 browse index pages covering all index types on the site.

URL path Index type
/indexes/authortitle Author / title
/indexes/haspart Has part
/indexes/iconclass Iconclass subject
/indexes/imagetype Image type
/indexes/language Language
/indexes/miniaturist Miniaturist
/indexes/place Place of origin
/indexes/shelfmark Shelfmark
/indexes/titleImage Title / image

Sheet: static_pages

8 static pages: the homepage, introduction, background, advanced search, and related entry points.


Sheet: wiki_priority

All 61 URLs that are linked from Dutch Wikipedia (nl.wikipedia.org) or Wikimedia Commons. These were archived as a priority before the main bulk run. 34 of these URLs also appear in other sheets (with wiki metadata added there too); 27 are unique to this sheet (mostly /zoom/ image viewer URLs and some /search/ pages with specific query parameters).

The Wiki_Source, Wiki_Page_Title, and Wiki_Page_ID columns are populated for all 61 rows in this sheet, and for the 34 overlapping rows in their respective type sheets.


Notes