Code for harvesting and analysing series-level data in the National Archives of Australia's RecordSearch dabatase.
See my research notebook for more information.
Code for harvesting series level descriptions is in
See Harvesting all NAA series summaries in my research notebook.
Functions are used to model the activities of government. In the CRS system, functions are performed by agencies, and agencies create series. So by following the links between functions, agencies, and series, it should be possible to see how this model of government is reflected in the records described and digitised in RecordSearch.
For some context on the history and use of functions in the National Archives of Australia see 'Natural language searching and government thesauri' by Marian Hoy.
Although you can browse and search for agencies by function in RecordSearch, it's not clear what functions thesaurus is actually in use and how this affects search results. So before following the trail from functions to series, I first need to pull together some data about the functions themselves. The thesauruses created and used by the National Archives seem to have gone through four versions:
- The CRS Thesaurus
- AGIFT (Australian Governments' Interactive Functions Thesaurus) version 1 (1999)
- AGIFT version 2 (2005)
- AGIFT version 3 (2015)
I've harvested the main terms from each of these versions:
The code used to harvest the functions is in
AGIFT is made available by the National Archives of Australia under a CC-BY-NC-ND licence.