URL spreadsheet: manuscripts-urls-wbm-archived.xlsx
The master spreadsheet manuscripts-urls-wbm-archived.xlsx contains all URLs from manuscripts.kb.nl (Medieval Illuminated Manuscripts / Middeleeuwse Verluchte Handschriften) that were archived to the Wayback Machine in December 2025, before the site’s shutdown on 15 December 2025.
A second file, manuscripts-urls-spider-output.xlsx, contains the full spider crawl output (12,550 URLs) but has no archiving data. It is kept as a reference for the complete URL inventory.
Sheets overview
| Sheet | Rows | Purpose |
|---|---|---|
ALL_URLS |
7,460 | Union of all sheets below — one row per unique archived URL |
show_manuscript |
2,371 | Manuscript detail pages |
show_images_text |
2,322 | Image gallery pages with text descriptions |
show_text |
1,520 | Text-only manuscript views |
search_extended |
806 | Extended search result pages |
search_literature |
397 | Literature search result pages |
indexes |
9 | Browse index pages (shelfmark, author/title, place, language, etc.) |
static_pages |
8 | Static pages (homepage, introduction, background, advanced search) |
wiki_priority |
61 | All URLs linked from Dutch Wikipedia or Wikimedia Commons |
Column schema
All sheets share the same column structure:
| Column | Description | Example |
|---|---|---|
URL |
Original manuscripts.kb.nl URL | https://manuscripts.kb.nl/show/manuscript/10+A+11 |
Source |
Sheet/category this URL belongs to | show_manuscript |
Wiki_Source |
Wikipedia or Commons origin, if linked from Wikimedia | Wikipedia NL, Commons |
Wiki_Page_Title |
Title of the Wikipedia/Commons page that links to this URL | Meester van het gebedenboek van Dresden |
Wiki_Page_ID |
Page ID on the source wiki | 2810762 |
WBM_Status |
Archiving result status | OK |
WBM_Job_ID |
Save Page Now (SPN2) job identifier | spn2-abf176e5dee7... |
WBM_URL_submission |
WBM URL returned by the SPN2 API at submission time | https://web.archive.org/web/20251211014959/https://manuscripts.kb.nl/indexes/authortitle |
WBM_Timestamp_submission |
Timestamp of the SPN2 submission | 2025-12-11T01:49:59 |
WBM_URL_capture |
Actual capture URL found via CDX API lookup | https://web.archive.org/web/20251211005650/https://manuscripts.kb.nl/indexes/authortitle |
WBM_Timestamp_capture |
Timestamp of that CDX capture | 2025-12-11T00:56:50 |
Error |
Error message, if the submission failed | (blank when OK) |
Sheet: ALL_URLS
Union of all per-type sheets, plus 27 URLs that appear only in wiki_priority (mostly /zoom/ and /search/ pages linked from Wikimedia but not covered by the main archiving run). This is the single sheet to query if you want to look up any manuscripts.kb.nl URL.
Sheet: show_manuscript
2,371 manuscript detail pages. Each URL follows the pattern /show/manuscript/{shelfmark} or /show/manuscript/{shelfmark}/page/{n}.
Example: https://manuscripts.kb.nl/show/manuscript/10+A+11/page/1
Sheet: show_images_text
2,322 image gallery pages with accompanying text. URL pattern: /show/images_text/{shelfmark} or /show/images_text/{shelfmark}/page/{n}.
Example: https://manuscripts.kb.nl/show/images_text/10+A+11
Sheet: show_text
1,520 text-only views of manuscripts. URL pattern: /show/text/{shelfmark} or /show/text/{shelfmark}/page/{n}.
Example: https://manuscripts.kb.nl/show/text/10+A+11
Sheet: search_extended
806 extended search result pages. URL pattern: /search/manuscript/extended/page/{n}/shelfmark/{shelfmark}.
Example: https://manuscripts.kb.nl/search/manuscript/extended/page/1/shelfmark/10+A+11
Sheet: search_literature
397 literature search pages, one per shelfmark. URL pattern: /search/literature/{shelfmark}.
Example: https://manuscripts.kb.nl/search/literature/10+A+11
Sheet: indexes
9 browse index pages covering all index types on the site.
| URL path | Index type |
|---|---|
/indexes/authortitle |
Author / title |
/indexes/haspart |
Has part |
/indexes/iconclass |
Iconclass subject |
/indexes/imagetype |
Image type |
/indexes/language |
Language |
/indexes/miniaturist |
Miniaturist |
/indexes/place |
Place of origin |
/indexes/shelfmark |
Shelfmark |
/indexes/titleImage |
Title / image |
Sheet: static_pages
8 static pages: the homepage, introduction, background, advanced search, and related entry points.
Sheet: wiki_priority
All 61 URLs that are linked from Dutch Wikipedia (nl.wikipedia.org) or Wikimedia Commons. These were archived as a priority before the main bulk run. 34 of these URLs also appear in other sheets (with wiki metadata added there too); 27 are unique to this sheet (mostly /zoom/ image viewer URLs and some /search/ pages with specific query parameters).
The Wiki_Source, Wiki_Page_Title, and Wiki_Page_ID columns are populated for all 61 rows in this sheet, and for the 34 overlapping rows in their respective type sheets.
Notes
WBM_URL_capturevs.WBM_URL_submission: the submission URL is what the Save Page Now (SPN2) API returned at the moment of archiving; the capture URL is the nearest snapshot the CDX API reports for that URL and timestamp. They usually differ slightly in timestamp because the CDX index records the actual crawl time, not the API response time.Wiki_Source/Wiki_Page_Title/Wiki_Page_IDare only populated for the ~57 URLs that are linked from Wikimedia projects. All other rows have these columns blank.- Archiving period: all submissions were made between 10-14 December 2025.
- Success rate: 100% of the 7,460 URLs have
WBM_Status = OK.