Skip to content

A project that download the dc_boundary from pervious years and convert it to machine readable format

Notifications You must be signed in to change notification settings

nandiheath/dc-boundary-parser

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

5 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

DC-Boundary-Parser

Run

This will download the pdf files from 2007 to 2019, and save them in raw/pdf The output will be at output

npm install

node run.js

Known Issues

  • year 2003 cannot be parsed. no return from the library
    • reason is pdfs in 2003 are encoded in big5, and the pdf library cannot decode it
  • some population/deviation cannot be parsed
  • some data cannot be parsed correctly
    • 2015R shatin

About

A project that download the dc_boundary from pervious years and convert it to machine readable format

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages