leVirve-arxiv/ptt-scrapy

ptt-scrapy

A Scrapy spider that collects information from PTT and stores it in a MongoDB database.

Requisites

  • Python 3
  • MongoDB

Usage

  • Make sure mongod is running on your system.

  • Run with scrapy command:

    scrapy crawl ptt

  • (Optional) Export the collected data to JSON, e.g.:

    mongoexport --db ptt --collection mobilecomm --out mobilecomm.json
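
Once the export exists, the file can be read back in Python. The following is a minimal sketch, not part of this repository: it assumes mongoexport was run as shown above without --jsonArray, so the output holds one JSON document per line. The helper name load_export is hypothetical, and the field names inside each document depend on the spider's item definition, which is not shown here.

```python
import json
from pathlib import Path
from typing import Iterator


def load_export(path: str) -> Iterator[dict]:
    """Yield one dict per non-empty line of a mongoexport output file.

    Hypothetical helper: assumes the default mongoexport layout of one
    JSON document per line (no --jsonArray).
    """
    with Path(path).open(encoding="utf-8") as fh:
        for line in fh:
            line = line.strip()
            if line:  # skip blank lines
                yield json.loads(line)


if __name__ == "__main__":
    # "mobilecomm.json" is the output file name used in the command above.
    export_path = Path("mobilecomm.json")
    if export_path.exists():
        docs = list(load_export(str(export_path)))
        print(f"loaded {len(docs)} documents")
        if docs:
            # Only the keys are printed, since the item fields are not
            # documented in this README.
            print(sorted(docs[0].keys()))
```

Note that mongoexport writes MongoDB Extended JSON, so the _id field typically appears as a nested object such as {"$oid": "..."} rather than a plain string.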

Acknowledgement
