Python scripts to generate static navigation pages from collection list and insert Web Archives records using the Archive-It CDX
Python
Switch branches/tags
Nothing to show
Clone or download
Fetching latest commit…
Cannot retrieve the latest commit at this time.
Permalink
Failed to load latest commit information.
CDX.py
LICENSE
README.md
basicSample.py
collectionList.xlsx
staticPages.py

README.md

staticPages-webArchives

Python scripts to generate static navigation pages from collection list and insert Web Archives records using the Archive-It CDX

This project is no longer actively maintained, see describingWebArchives for the current project

There are three scripts here:

basicSample.py

To look for a different URL just change Line 3 that begins with "requestURL = ":

import requests

requestURL = "http://wayback.archive-it.org/3308/timemap/cdx?url=http://www.albany.edu/history/course-descriptions.shtml"

Set requestURL as http://wayback.archive-it.org/[Collection#]/timemap/cdx?url=[URL] with your own URL and collection number.

CDX.py

  • A basic command line script for getting the number of captures and a date range from Archive-It URLS

Run in the command line as: python CDX.py

  • You will be prompted for a URL and an Archive-It collection number

staticPages.py

  • An example of the script we are using to make static pages while updating Web Archives records from the Archive-It and Wayback CDX API
  • collectionList.xslx is also included as a sample of the spreadsheet we are used to provide the data for this script

Wayback CDX API Documentation