This tool finds open datasets published by Australian councils in a number of pre-defined "topics", downloads them, combines them and uploads them to a Cloudant database. It also generates Mapbox vector tiles so that geospatial data can be previewed through the Aggregator Front End.
All web requests are by default cached into the `cache/` directory and indexed in `cache.json`.
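A minimal sketch of how a URL cache like this could work is shown below. It is illustrative only: the helper name `cachedFetch`, the use of `node-fetch`, and the hashed-filename scheme are assumptions, not the tool's actual code.

```js
// Illustrative sketch only: one way a cache/ directory indexed by cache.json could work.
// The helper name cachedFetch and the node-fetch dependency are assumptions.
const fs = require('fs');
const crypto = require('crypto');
const fetch = require('node-fetch');

const CACHE_DIR = 'cache';
const INDEX_FILE = 'cache.json';
const index = fs.existsSync(INDEX_FILE) ? JSON.parse(fs.readFileSync(INDEX_FILE, 'utf8')) : {};

async function cachedFetch(url) {
    // Use a hash of the URL as the cache filename, and record the mapping in cache.json.
    const key = crypto.createHash('md5').update(url).digest('hex');
    const file = `${CACHE_DIR}/${key}`;
    if (index[url] && fs.existsSync(file)) {
        return fs.readFileSync(file, 'utf8');
    }
    const body = await (await fetch(url)).text();
    fs.mkdirSync(CACHE_DIR, { recursive: true });
    fs.writeFileSync(file, body);
    index[url] = key;
    fs.writeFileSync(INDEX_FILE, JSON.stringify(index, null, 2));
    return body;
}
```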
```sh
node findDatasets.js --topics dog-walking-zones --cloudant
```
This does the following:
- Searches known data portals for datasets that match the criteria for the topic "dog-walking-zones".
- Downloads geospatial files for each dataset.
- Reprojects each file to EPSG:4326.
- Checks that the resulting geometry is sensible (points are really points, all locations are roughly within Australia, etc); a bounding-box sketch follows this list.
- Adds attributes to each feature, such as its source URL.
- Writes the combined GeoJSON file (e.g. a single file with a feature for each dog-walking zone).
- Upserts each feature to Cloudant; a sketch of this step also follows the list.
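The geometry sanity check could look something like the sketch below. The function name and the rough Australian bounding box are assumptions for illustration, not the tool's actual values.

```js
// Illustrative sketch: reject features whose coordinates fall well outside Australia.
// The bounding box and function name are assumptions, not the tool's actual logic.
const AUS_BOUNDS = { minLon: 112, maxLon: 154, minLat: -44, maxLat: -9 };

function roughlyInAustralia(feature) {
    let ok = true;
    // Walk every coordinate pair in the geometry, whatever its nesting depth.
    (function walk(coords) {
        if (typeof coords[0] === 'number') {
            const [lon, lat] = coords;
            if (lon < AUS_BOUNDS.minLon || lon > AUS_BOUNDS.maxLon ||
                lat < AUS_BOUNDS.minLat || lat > AUS_BOUNDS.maxLat) {
                ok = false;
            }
        } else {
            coords.forEach(walk);
        }
    })(feature.geometry.coordinates);
    return ok;
}
```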
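Upserting into Cloudant typically means fetching any existing document to reuse its `_rev` before writing. The sketch below uses the `nano` CouchDB client and a made-up database name and id scheme; these are assumptions about how the step could look, not necessarily how the tool implements it.

```js
// Illustrative sketch of a Cloudant/CouchDB upsert using the nano client.
// The database name, document id scheme and credentials are assumptions.
const nano = require('nano')(process.env.CLOUDANT_URL);
const db = nano.db.use('dog-walking-zones');

async function upsertFeature(id, feature) {
    const doc = { _id: id, ...feature };
    try {
        // If the document already exists, reuse its _rev so the write becomes an update.
        const existing = await db.get(id);
        doc._rev = existing._rev;
    } catch (err) {
        if (err.statusCode !== 404) throw err; // anything other than "not found" is a real error
    }
    return db.insert(doc);
}
```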
```sh
./make-mbtiles.sh dog-walking-zones
```
This does the following:
- Takes the generated combined GeoJSON file and uses Tippecanoe to generate an MBTiles file (a sketch follows this list).
- Uploads it to Mapbox.
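Expressed in Node rather than shell, the Tippecanoe step might boil down to something like the following. The flags, layer name and file paths are assumptions about what `make-mbtiles.sh` could do, not a transcription of the script.

```js
// Illustrative sketch: generate an MBTiles file from the combined GeoJSON with Tippecanoe.
// The flags, layer name and file paths are assumptions about what make-mbtiles.sh might do.
const { execFileSync } = require('child_process');

function makeMbtiles(topic) {
    execFileSync('tippecanoe', [
        '-o', `${topic}.mbtiles`,      // output file
        '-f',                          // overwrite any existing output
        '-zg',                         // let Tippecanoe guess an appropriate max zoom
        '--drop-densest-as-needed',    // thin out dense features at low zooms
        '-l', topic,                   // layer name inside the tileset
        `${topic}.geojson`
    ], { stdio: 'inherit' });
    // The resulting .mbtiles file is then uploaded to Mapbox (e.g. via the Mapbox Uploads API).
}
```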
Topics are defined in `topics.js` like this:
```js
// The key defines how the data will be accessed through the Cloudant API, and is also used by the Aggregator front end.
'garbage-collection-zones': {
    // How relevant datasets will be found, by supplying this search term to CKAN/Socrata. A simple string is fine, or,
    // for more complex needs, you can use CKAN's undocumented query language:
    searchTerm: '+title:"garbage collection" OR +title:"waste collection"',
    // If the title of a found dataset matches this regex, it will be rejected. Here, we want garbage collection zones,
    // not truck routes, bin locations etc.
    titleBlacklist: /bins|stats|trucks|routes/i,
    // If the title of a found dataset *doesn't* match this regex, it will be rejected.
    titleWhitelist: /waste|garbage|recycling|rubbish/i
},
```
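To make the two regex fields concrete, here is a sketch of how a found dataset's title might be tested against them. The helper name `titleAccepted` is hypothetical; the filtering rules simply restate the comments above.

```js
// Illustrative sketch: how a dataset title might be tested against a topic's regexes.
// The function name is an assumption; the rules follow the comments in topics.js above.
function titleAccepted(topic, title) {
    if (topic.titleBlacklist && topic.titleBlacklist.test(title)) return false;  // rejected by blacklist
    if (topic.titleWhitelist && !topic.titleWhitelist.test(title)) return false; // must match whitelist
    return true;
}

// e.g. titleAccepted(topics['garbage-collection-zones'], 'Waste collection zones') -> true
//      titleAccepted(topics['garbage-collection-zones'], 'Garbage truck routes')   -> false
```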
- Fetcher: fetches likely datasets and uploads them.