news-crawler

A news crawler for Life-Long Learning LM

Supported websites

scrapy crawl <spider_name>

If you don't want to save data to the database, remove NewsCrawlerPGStoragePipeline from settings.py.
You can change the PostgreSQL settings through environment variables; see pipelines.py for details.
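
As a rough illustration of the two notes above, the sketch below shows one way connection settings could be read from environment variables, and one way an entry could be dropped from an ITEM_PIPELINES-style dict. The variable names (POSTGRES_HOST, POSTGRES_PORT, POSTGRES_DB, POSTGRES_USER, POSTGRES_PASSWORD), their defaults, and both helper functions are hypothetical and are not taken from this repository; the names actually used are defined in pipelines.py.

```python
import os


def load_pg_settings(env=os.environ):
    """Build a connection-settings dict from environment variables.

    All variable names and defaults here are assumptions made for
    illustration; check pipelines.py for the names the project reads.
    """
    return {
        "host": env.get("POSTGRES_HOST", "localhost"),
        "port": int(env.get("POSTGRES_PORT", "5432")),
        "dbname": env.get("POSTGRES_DB", "news"),
        "user": env.get("POSTGRES_USER", "postgres"),
        "password": env.get("POSTGRES_PASSWORD", ""),
    }


def without_pipeline(item_pipelines, class_name):
    """Return a copy of an ITEM_PIPELINES-style dict with every entry
    whose dotted path ends in `class_name` removed."""
    return {
        path: order
        for path, order in item_pipelines.items()
        if not path.endswith("." + class_name)
    }


if __name__ == "__main__":
    # Hypothetical pipeline mapping; the module path is an assumption.
    pipelines = {
        "news_crawler.pipelines.NewsCrawlerPGStoragePipeline": 300,
    }
    print(without_pipeline(pipelines, "NewsCrawlerPGStoragePipeline"))
    print(load_pg_settings())
```

In practice, deleting or commenting out the pipeline's line in the ITEM_PIPELINES setting has the same effect as the helper; the function is only there to make the idea explicit.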
