Skip to content

Latest commit

 

History

History
48 lines (30 loc) · 4.34 KB

index.md

File metadata and controls

48 lines (30 loc) · 4.34 KB

RecordSearch

This repository contains Jupyter notebooks to work with data from the National Archives of Australia's RecordSearch database.

RecordSearch is the online collection database of the National Archives of Australia. Based on the series system, RecordSearch provides rich, contextual information about series, items, agencies, and functions.

Unfortunately RecordSearch doesn't provide access to machine-readable data through an API, so we have to resort to screen scraping. The notebooks here make use of either the RecordSearch Data Scraper or the older RecordSearch Tools library to handle the scraping. I'm in the process of upgrading all the notebooks to use the newer scraper.

See the RecordSearch section of the GLAM Workbench for more details.

Notebook topics

Harvesting data

Analysing data

Useful tools

Data downloads

  • Summary data about all series in RecordSearch (15mb CSV) – contains basic descriptive information about all the series currently registered on RecordSearch (May 2021) as well as the total number of items described, digitised, and in each access category.
  • Recently digitised files (CSV) – containing details of files digitised between 25 February and 26 March 2021, for an ongoing record of digitised files see this repository which creates weekly snapsots.

Cite as

See the GLAM Workbench or Zenodo for up-to-date citation details.


This repository is part of the GLAM Workbench.
If you think this project is worthwhile, you might like to sponsor me on GitHub.