Using python scrapy to crawl articles
- install python (version > 3)
you can use pyenv to control/install different version of python
- install scrapy
pip install scrapy
- install simplified Chinese and Traditional Chinese convertor
pip install opencc-python-reimplemented
pyenv exec scrapy runspider scraper.py -a starturl=https://tw.uukanshu.com/b/125477/ -a output=book.txt
- now only support tw.uukanshu.com article