Python scripts to generate static navigation pages from collection list and insert Web Archives records using the Archive-It CDX
This project is no longer actively maintained, see describingWebArchives for the current project
There are three scripts here:
-
A sample example script for making requests from the Archive-It CDX
-
by default this requests http://www.albany.edu/history/course-descriptions.shtml from the www.albany.edu Archive-it collection 3308
To look for a different URL just change Line 3 that begins with "requestURL = ":
import requests
requestURL = "http://wayback.archive-it.org/3308/timemap/cdx?url=http://www.albany.edu/history/course-descriptions.shtml"
Set requestURL
as http://wayback.archive-it.org/[Collection#]/timemap/cdx?url=[URL]
with your own URL and collection number.
- A basic command line script for getting the number of captures and a date range from Archive-It URLS
Run in the command line as: python CDX.py
- You will be prompted for a URL and an Archive-It collection number
- An example of the script we are using to make static pages while updating Web Archives records from the Archive-It and Wayback CDX API
- collectionList.xslx is also included as a sample of the spreadsheet we are used to provide the data for this script