Python scripts to generate static navigation pages from collection list and insert Web Archives records using the Archive-It CDX

UAlbanyArchives/staticPages-webArchives

staticPages-webArchives

This project is no longer actively maintained; see describingWebArchives for the current project.

There are three scripts here:

basicSample.py

To look up a different URL, change line 3, which begins with "requestURL = ":

import requests

requestURL = "http://wayback.archive-it.org/3308/timemap/cdx?url=http://www.albany.edu/history/course-descriptions.shtml"

Set requestURL to http://wayback.archive-it.org/[Collection#]/timemap/cdx?url=[URL], substituting your own URL and collection number.
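As a rough illustration of the pattern above, the request URL can be assembled from its two parts. This is only a sketch: the function name `build_request_url` and the constant `ARCHIVE_IT_BASE` are made up for this example and do not appear in basicSample.py, and percent-encoding the target URL is an assumption about what the endpoint accepts.

```python
from urllib.parse import quote

# Base address taken from the request pattern shown in this README.
ARCHIVE_IT_BASE = "http://wayback.archive-it.org"


def build_request_url(collection, url):
    """Build a timemap CDX request URL (hypothetical helper, not part of the repo)."""
    # Assumption: characters other than ':' and '/' in the target URL are
    # percent-encoded so that a '?' or '&' inside it does not break the query.
    return f"{ARCHIVE_IT_BASE}/{collection}/timemap/cdx?url={quote(url, safe=':/')}"


# Collection number and URL from the sample in basicSample.py.
request_url = build_request_url(
    3308, "http://www.albany.edu/history/course-descriptions.shtml"
)
print(request_url)
```

For the sample values, the result is the same string that basicSample.py assigns to requestURL, so it can be used in its place.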

CDX.py

  • A basic command-line script for getting the number of captures and a date range from Archive-It URLs

Run in the command line as: python CDX.py

  • You will be prompted for a URL and an Archive-It collection number
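The capture count and date range could be derived from a CDX response along the following lines. This is a minimal sketch, not the code in CDX.py: it assumes each response line is whitespace-separated with a 14-digit timestamp (YYYYMMDDhhmmss) as its second field, and the function name `summarize_captures` and the sample lines are invented for illustration.

```python
from datetime import datetime


def summarize_captures(cdx_text):
    """Return (capture_count, first_capture, last_capture) for CDX text.

    Hypothetical helper; assumes the second field of each line is a
    14-digit timestamp.
    """
    timestamps = []
    for line in cdx_text.splitlines():
        fields = line.split()
        # Skip blank lines and anything without a 14-digit timestamp field.
        if len(fields) < 2 or len(fields[1]) != 14 or not fields[1].isdigit():
            continue
        timestamps.append(datetime.strptime(fields[1], "%Y%m%d%H%M%S"))
    if not timestamps:
        return 0, None, None
    return len(timestamps), min(timestamps), max(timestamps)


# Made-up sample lines in the assumed layout, not real capture records.
SAMPLE = """\
edu,albany)/history 20130514180302 http://www.albany.edu/history text/html 200 AAAA - - 1234 5678 example.warc.gz
edu,albany)/history 20150102030405 http://www.albany.edu/history text/html 200 BBBB - - 1234 5678 example.warc.gz
"""

count, first, last = summarize_captures(SAMPLE)
print(count, first.date(), last.date())  # prints: 2 2013-05-14 2015-01-02
```

Returning `None` for both dates when no captures are found lets the caller tell "never captured" apart from a single capture, where the first and last dates are equal.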

staticPages.py

  • An example of the script we are using to make static pages while updating Web Archives records from the Archive-It and Wayback CDX API
  • collectionList.xslx is also included as a sample of the spreadsheet we use to provide the data for this script
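The general idea of turning spreadsheet rows into a static navigation page might look like the sketch below. It is not taken from staticPages.py: the function `render_nav_page`, the row keys `name` and `href`, and the sample rows are all hypothetical, and the real columns of the spreadsheet may differ.

```python
from html import escape


def render_nav_page(title, rows):
    """Render a minimal static navigation page (hypothetical helper).

    Each row is a dict with assumed keys 'name' and 'href'.
    """
    items = "\n".join(
        # Escape both values so spreadsheet text cannot break the markup.
        f'    <li><a href="{escape(row["href"], quote=True)}">{escape(row["name"])}</a></li>'
        for row in rows
    )
    return (
        "<!DOCTYPE html>\n"
        "<html>\n"
        f"<head><title>{escape(title)}</title></head>\n"
        "<body>\n"
        "  <ul>\n"
        f"{items}\n"
        "  </ul>\n"
        "</body>\n"
        "</html>\n"
    )


# Made-up rows standing in for data read from the spreadsheet.
rows = [
    {"name": "History & Archives", "href": "history.html"},
    {"name": "Course Descriptions", "href": "courses.html"},
]
page = render_nav_page("Web Archives", rows)
print(page)
```

Keeping the rendering in a function that takes plain dicts means the spreadsheet-reading step and the CDX lookup can be swapped or tested separately.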

Wayback CDX API Documentation
