A Python + Scrapy bot that loads the list of IANA timezone names from the corresponding Wikipedia page and stores it in any machine-readable format supported by Scrapy.
It grabs the following fields:
- Standardized name
- Country code
- Latitude / longitude
- Timezone status (Canonical / Alias / Deprecated)
- UTC offset
- UTC offset for DST
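
As a rough illustration of what one scraped record might look like, the fields above can be modelled as a plain dataclass. This is only a sketch: the class name `TimezoneRecord`, the attribute names, and the sample values are hypothetical and are not taken from the project's code, which may name or type its fields differently.

```python
from dataclasses import dataclass, asdict
from typing import Optional


@dataclass
class TimezoneRecord:
    """One scraped row. All names here are hypothetical, for illustration only."""
    name: str                    # standardized name, e.g. "Europe/Berlin"
    country_code: Optional[str]  # may be missing for some entries
    latitude: Optional[float]
    longitude: Optional[float]
    status: str                  # "Canonical", "Alias" or "Deprecated"
    utc_offset: str              # e.g. "+01:00"
    utc_dst_offset: str          # e.g. "+02:00"


# Illustrative sample values, not real scraper output.
record = TimezoneRecord(
    name="Europe/Berlin",
    country_code="DE",
    latitude=52.5,
    longitude=13.3667,
    status="Canonical",
    utc_offset="+01:00",
    utc_dst_offset="+02:00",
)
print(asdict(record))
```

Keeping the offsets as strings is a deliberate choice in this sketch: it avoids committing to a numeric representation that the exported data may not use.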

To set up the project:
- Initialize virtual environment:
    python3 -m venv venv
- Activate virtual environment:
    source venv/bin/activate
- Install required packages:
    pip install -r requirements.txt
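
To run the spider, optionally exporting the result to a file (-o) and filtering by timezone status (-a status=...):
    scrapy crawl wiki [-o file.<json|csv|xml|...>] [-a status=<all|canonical|alias|deprecated>]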
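
Scrapy passes each `-a name=value` argument to the spider's constructor as a keyword argument, so the `status` option reaches the spider as a plain string. The filter it implies could be sketched as below. The helper `keep_row`, the `VALID_STATUSES` set, the `"all"` default, and the sample rows are assumptions made for illustration; the project's actual spider may apply the filter differently.

```python
# Hypothetical set of accepted filter values, taken from the usage line.
VALID_STATUSES = {"all", "canonical", "alias", "deprecated"}


def keep_row(row_status: str, wanted: str = "all") -> bool:
    """Return True if a row with `row_status` passes the `wanted` filter."""
    wanted = wanted.strip().lower()
    if wanted not in VALID_STATUSES:
        raise ValueError(f"unknown status filter: {wanted!r}")
    # "all" keeps every row; otherwise compare case-insensitively.
    return wanted == "all" or row_status.strip().lower() == wanted


# Illustrative rows as (name, status) pairs, not real scraper output.
rows = [("Europe/Berlin", "Canonical"), ("US/Pacific", "Alias")]
canonical = [name for name, status in rows if keep_row(status, "canonical")]
print(canonical)
```

Rejecting unknown values early, rather than silently returning nothing, makes a mistyped `-a status=...` easier to notice.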