No description or website provided.
Python
Switch branches/tags
Nothing to show
Clone or download
Fetching latest commit…
Cannot retrieve the latest commit at this time.
Permalink
Failed to load latest commit information.
data
.gitignore
README.md
credentials_blank.py
functions.py
harvest_functions.py
series.py

README.md

NAA Series

Code for harvesting and analysing series-level data in the National Archives of Australia's RecordSearch dabatase.

See my research notebook for more information.

Series

Code for harvesting series level descriptions is in series.py.

See Harvesting all NAA series summaries in my research notebook.

Functions

Functions are used to model the activities of government. In the CRS system, functions are performed by agencies, and agencies create series. So by following the links between functions, agencies, and series, it should be possible to see how this model of government is reflected in the records described and digitised in RecordSearch.

For some context on the history and use of functions in the National Archives of Australia see 'Natural language searching and government thesauri' by Marian Hoy.

Although you can browse and search for agencies by function in RecordSearch, it's not clear what functions thesaurus is actually in use and how this affects search results. So before following the trail from functions to series, I first need to pull together some data about the functions themselves. The thesauruses created and used by the National Archives seem to have gone through four versions:

I've harvested the main terms from each of these versions:

The code used to harvest the functions is in functions.py.

AGIFT is made available by the National Archives of Australia under a CC-BY-NC-ND licence.