My Dockerfile for crawler
- python3
- node.js
- requests httpx
- beautifulsoup4
- lxml
- lzstring
- pyquery
- node_vm2 (for execute javascript)
- ipython
- cchardet
- curl_cffi (bypass anti-bot through fingerprint)
- ujson
- numpy
- SQLAlchemy
docker pull b09780978/crawler
docker container run -idt -v {YOUR_CODE_DIR}:{DOCKER_DIR} --name crawler b09780978/crawler
docker container exec -it script /dev/null -c tmux
docker pull b09780978/crawler:deploy
docker container run -it -v {YOUR_CODE_DIR}:{DOCKER_DIR} --name crawler b09780978/crawler
Deploy version use python-alpine