leVirve/dcard-lumberjack

dcard-lumberjack

Lumberjack in Dcard ! 🌲

This project aims to dump the resources available on Dcard. It is under active development.

Requirements

  • Python 3
  • MongoDB
  • Redis

Usage

This project uses MongoDB as its data storage layer, and Redis as the broker and result backend for Celery. Make sure both services are running before you start.
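A quick way to confirm that both services are reachable before launching anything is a plain TCP check. The sketch below is not part of this project; it assumes MongoDB and Redis run on localhost with their default ports (27017 and 6379), and the names `service_is_up` and `check_services` are hypothetical helpers introduced only for illustration.

```python
import socket

# Assumption: both services run locally on their default ports.
SERVICES = {
    "MongoDB": ("localhost", 27017),
    "Redis": ("localhost", 6379),
}


def service_is_up(host: str, port: int, timeout: float = 1.0) -> bool:
    """Return True if a TCP connection to (host, port) can be opened."""
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Connection refused, timed out, or host not resolvable.
        return False


def check_services(services=SERVICES) -> dict:
    """Map each service name to whether its port accepts connections."""
    return {
        name: service_is_up(host, port)
        for name, (host, port) in services.items()
    }


if __name__ == "__main__":
    for name, ok in check_services().items():
        print(f"{name}: {'up' if ok else 'DOWN'}")
```

This only verifies that something is listening on the port; it does not authenticate or confirm that the listener is actually MongoDB or Redis.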

  • Define your own tasks in spider.py, then run it.
$ python spider.py
  • Start the Celery workers as shown below. On Windows, --pool=solo is required.
$ celery -A lumberjack worker [--pool=solo]

> dcard-lbj [forum_name] [strategies_name]

fetching metadata from <$forum>......
dumping the posts on <$forum>.....

Summary: 1500 posts in 60 sec.
