Skip to content

MaxValue/KEM-Contacts

Repository files navigation

Web Crawler - KEM Contacts

Get all metadata about the Klima- und Energieregionen from https://www.klimaundenergiemodellregionen.at/modellregionen/liste-der-regionen/

Contents

Getting Started

These instructions will get you a copy of the project up and running on your local machine for development. See deployment for notes on how to deploy the project on a live system.

Prerequisites

What things you need to install the software and how to install them

I recommend you to use the setup_development.sh script by running

./setup_development.sh

but if you don't want to do that, here is the complete list of dependencies:

Deployment

Activate the environment

source venv/bin/activate

Change to the scrapy project

cd kem

Start the crawler

scrapy crawl getcontacts

After the crawler finishes, you'll want to export the data:

./export.py results.db kem getcontacts 1 KEM-Contacts_YYYY-MM-DD

where 1 is the job id and YYYY-MM-DD should be replaced by the date on which you crawled the website.

You will see the job id at the beginning of the log log.txt: Job ID is: XX.

Built With

Contributing

Please open an issue if you want to help or have questions.

Roadmap

Things I already plan to implement, but didn't have yet:

  • Change database scheme to be individual to crawler, make exporter therefore export specific table.

Versioning

We use SemVer for versioning. For the versions available, see the tags on this repository.

Authors

  • Max Fuxjäger - Initial work - MaxValue

License

This project is licensed under the MIT License - see the LICENSE.txt file for details.

Project History

This project was created because I (Max) was asked to crawl this website.

About

Get all metadata about the Klima- und Energieregionen from https://www.klimaundenergiemodellregionen.at/modellregionen/liste-der-regionen/

Resources

License

Stars

Watchers

Forks

Packages

 
 
 

Contributors