A collection of crawler project for Indonesia dataset. Data collected will be saved as separate csv files for each item type.
Crawler | Description | Website |
---|---|---|
bpjs | Healthcare facilities in Indonesia | https://faskes.bpjs-kesehatan.go.id/aplicares/ |
ekatalog | Procurement of goods and services for government institution | https://e-katalog.lkpp.go.id/ |
jobsid | Job vacancy | https://www.jobs.id/ |
kpu | 2019 general election result | https://pemilu2019.kpu.go.id |
master_bps | Indonesia administrative list with bridging code between Statistics Indonesia and Ministry of Internal Affairs | https://sig.bps.go.id/bridging-kode/index |
sirs | Hospitals data from Ministry of Health | https://sirs.kemkes.go.id |
pip install -r requirements.txt
cd <crawler_directory>
scrapy crawl <spider_name>