A collection of python scripts to scrape information about craft beers from various websites.
- Kompendium Piwa: https://kompendiumpiwa.pl/style-piwa/
This setup has been tested with Python 3.10.
beautifulsoup4
- parsing html contentrequests
,aiohttp
+asyncio
- sending (asynchronous) http requestsmkdocs-material
- documentation
Install dependencies. Make sure you have python
and pipenv
installed globally.
pipenv install
Run scraping scripts
python ./kompendium_piwa/main.py
- More supported websites
- Object oriented code instead of set of functions, code split into multiple files
- Documentation
- Unit tests
- CLI, main script in root directory
- CI/CD (unit tests + mkdocs deployment to GitHub Pages)
- Playwright support if some websites disallow web crawlers