
LINEBlogScraper

Scraper for LINE BLOG in Scrapy.

Requirements

  • Python 3.5.1
  • Scrapy 1.4.0

How to run

Crawl https://lineblog.me/TARGET_BLOG (replace TARGET_BLOG with the name of the blog to scrape) and write the output to blog.json:

scrapy crawl lineblog_scraper -a start_url='https://lineblog.me/TARGET_BLOG' -o blog.json
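
Once the crawl finishes, the output can be inspected from Python. The following is a minimal sketch, assuming the command above produced blog.json as a single JSON array (the format Scrapy's JSON feed export writes). The helper names `load_items` and `field_names` are hypothetical, and no item field names are assumed, since this README does not list them.

```python
import json
from pathlib import Path


def load_items(path="blog.json"):
    """Load the items written by `scrapy crawl ... -o blog.json`."""
    with Path(path).open(encoding="utf-8") as f:
        items = json.load(f)
    if not isinstance(items, list):
        raise ValueError("expected a JSON array of items")
    return items


def field_names(items):
    """Collect the field names that appear across all items, sorted."""
    names = set()
    for item in items:
        names.update(item.keys())
    return sorted(names)


if __name__ == "__main__":
    items = load_items()
    print(f"{len(items)} items, fields: {field_names(items)}")
```

If blog.json already exists from an earlier run, Scrapy 1.4 appends to it rather than overwriting it, which leaves a file that is no longer valid JSON. Deleting the old file before crawling again avoids this.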

Downloading images

Images found while crawling are downloaded and stored in the following directory: LINEBlogScraper/images/full/
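
The full/ subdirectory matches the layout used by Scrapy's built-in ImagesPipeline, which stores original images under `<IMAGES_STORE>/full/`. Assuming this project relies on that pipeline (the README does not say so explicitly), the relevant settings would look roughly like the sketch below. The values shown are assumptions, not copied from the repository.

```python
# settings.py (sketch; assumed values, not taken from the repository)

# Enable Scrapy's built-in image pipeline.
ITEM_PIPELINES = {
    "scrapy.pipelines.images.ImagesPipeline": 1,
}

# Root directory for downloaded images. The pipeline writes originals to
# <IMAGES_STORE>/full/<SHA1 hash of the image URL>.jpg
IMAGES_STORE = "images"
```

Note that ImagesPipeline needs the Pillow package installed in addition to Scrapy.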