asgs2pgsql provides the Australian Bureau of Statistics (ABS) Australian Statistical Geography Standard (ASGS) in a PostgreSQL database using the PostGIS extension to store the geospatial data.
Scripts to generate GeoJSON and MBTiles outputs from the PostGIS data are included too.
The ASGS is published as a combination of non-spatial CSV files and spatial SHAPE files. Both of these files are used in this loader and combined together to form this (unofficial) ASGS PostgreSQL schema and flat GeoJSON/MBTiles.
You may be interested in the corresponding abs2pgsql loader scripts which will load the ABS 2011 Census of Population and Housing into PostgreSQL, making use of this ASGS schema to provide the geographic standard to those statistics.
About the ASGS
The ASGS consists of around 22 individual structures which are classified into ABS Structures and Non-ABS Structures as shown in this diagram.
The ASGS is broken down into Volumes 1-5.
- Volume 1 - 1270.0.55.001 consists of the Main Structure and Greater Capital City Statistical Areas (documentation)
- Volume 2 - 1270.0.55.002 consists of the Indigenous Structure (documentation)
- Volume 3 - 1270.0.55.003 consists of the Non-ABS Structures (documentation)
- Volume 4 - 1270.0.55.004 consists of the Significant Urban Areas and Section of State/Urban Centres and Localities Structure (SUA, SOS/UCL) (documentation)
- Volume 5 - 1270.0.55.005 consists of the Remoteness Structure (documentation)
- Correspondences - 1270.0.55.006 provides correspondences between the older ASGC and the newer ASGS documentation)
The ABS Structures (vol 1, 2, 4, 5) are usually updated for each Census and the Non-ABS Structures (vol 3) are updated annually as required (so not all volume 3 structures are updated each year)
|4||SUA, UCL, SOS|
*Keep in mind the Volume 3 stuctures are approximations indended for statistical purposes, not as a substitute for administrative boundaries. The asterix (*) indicates avaliable via the PSMA Administrative Boundaries and corresponding processing scripts at https://github.com/andrewharvey/psma-admin-bdys-data.
Primary key codes for the ASGS structures are generally made unique by concatenating the code with the code of the parent structure which it was built from. For example the S/T structure is built up from the SA4 structure. That is S/T's are built up from one or more SA4's.
This means that the unique code for SA4's is only unique within its S/T, so to obtain a globally unique code for that SA4 you need to prepend the S/T code.
So for example SA1's are unique with respect to their 11 digit code. However that 11 digit code is made up of S/T . SA4 . SA4 . SA3 . SA2 . SA1.
The ABS ASGS data is Copyright (c) Commonwealth of Australia and as per http://www.abs.gov.au/websitedbs/D3310114.nsf/Home/©+Copyright?opendocument it is released under the Creative Commons Attribution 2.5 Australia license.
All the files within this asgs2pgsql repository are released under the CC0 license by Andrew Harvey firstname.lastname@example.org. Although not required, I would prefer you give attribution and release derived works or modifications under the same CC0 license.
To the extent possible under law, the person who associated CC0 with this work has waived all copyright and related or neighboring rights to this work. http://creativecommons.org/publicdomain/zero/1.0/
Build from Source
Running these scripts is akin to building software from source. If you just want a copy of the database without needing to build it from source skip to the last section of this README.
make, gdal-bin (>= 1.7.0), libdbd-pg-perl, (postgis >= 2.1), libtext-csv-perl, libtext-csv-xs-perl, unzip, wget
The scripts assume you have a PostgreSQL database up and running. We leave authentication to this database your responsibility through the PostgreSQL environment variables (see #setting-up-the-database-environment).
The simplest authentication setup is to allow local unauthenticated access to your database. You can do this by adding the following example line to
user as you choose).
local database user trust
This allows the
database database to be accessed by the
user user without authentication via local socket connections.
Setting up the database environment
You need to set up and export some PG environment variables otherwise the PostgreSQL defaults will be used. For example,
export PGHOST=localhost # not necessary if localhost export PGDATABASE=database export PGUSER=user
Refer to the PostgreSQL documentation for details on the environment variables which you can set.
If you don't already have a database and database user set up, then on Debian you could:
sudo su - postgres createuser --no-createdb --no-createrole --superuser user createdb --owner=user database exit
Stage 1: Downloading the source ASGS data
The download parameters are hard configured within
01-download-asgs.sh. To avoid downloading more than you need you can comment out or delete files you don't need. the bottom part of
01-download-asgs.sh was created from
This should download and unzip the ASGS Volume 1-6 files for years 2011 and onwards.
Stage 2: Loading the ASGS data into the database schema
This stage assumes you have the 02-ASGS-UNZIP directory from stage 1. With this just run,
When you load geographic data into PostgreSQL using the PostGIS extension you must define the coordinate system of that data. The coordinate system you should use depends on what you are most likely to use the data for.
- default -> unprojected lat longs in the GDA94 datum
- web mapping -> EPSG:3857
- analysis and calculations based on area of the regions -> GDA94 / Australian Albers coordinate system (EPSG:3577)
To switch the coordinate system see the configuration section at the top of the
Stage 3: Materialised Pyramids of Generalised Geometries (Optional)
You can optionally produce materialised pyramid tables of the generalised geometries,
Producing a new db dump
Once everything has been loaded into PostgreSQL using these scripts you can create a PostgreSQL dump file using,
pg_dump --format plain --schema "asgs_2011" --no-owner | xz > asgs_2011.sql.xz pg_dump --format plain --schema "asgs_2015" --no-owner | xz > asgs_2015.sql.xz pg_dump --format plain --schema "asgs_2016" --no-owner | xz > asgs_2016.sql.xz
Loading an existing db dump
I host a copy of these files at http://tianjara.net/data/asgs2pgsql/. After setting your PG* environment variables and creating a database (with the PostGIS extensions loaded) as described above, you can load the database dumps using,
xzcat asgs_2011.sql.xz | psql -f - xzcat asgs_2015.sql.xz | psql -f - xzcat asgs_2016.sql.xz | psql -f -
- You may run into the issue described here.
How to use the resulting database
Some examples of scripts you can run to pull intelligence from the resulting asgs2pgsql schema are included in the examples directory.