Dockerized Python scraper using Selenium with Firefox web driver.
Simply using Python's requests library will not allow for JavaScript execution on a web page. Selenium takes web automation a step further by loading a full (headless) web browser to allow for JavaScript execution.
- Docker
- Python 3.9
- Jupyter Notebook
- Selenium
- Firefox
cd Arachnid
cp git-hooks/pre-commit .git/hooks/
chmod +x .git/hooks/pre-commit
...or all as one line:
cd Arachnid && cp git-hooks/pre-commit .git/hooks/ && chmod +x .git/hooks/pre-commit
docker-compose up
- Navigate to http://localhost:8888 to open Jupyter Notebook UI to create new Python notebook
- Import Selenium, and get started
Inside of Docker container, run:
black --check app/
to check black's changesblack app/
to make changes