This is a demo project to compare two web scrapping frameworks, Playwright and Selenium and using the new Pipelining tool Dagster
To install dependencies simply run:
pip install -r requirements.txt
Alternatively, to use virtual environments easily you can run:
pip install pipenv
And then:
pipenv install
To run with a particular configuration simply run python -m app {mode}
Available modes are:
- playwright
- selenium
- selenium_multi
- selenium_dagster
- playwright_dagster
To run the benchmark:
- Update your Neptune.AI key in the
.env
file - Run
python -m app.run_benchmark