Skip to content

This prog is to fetch the ptt's content based on the board name like car

Notifications You must be signed in to change notification settings

paulyang0125/bbs-crawler-utility

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

9 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

bbs_crawler_utility

The main purpose of this prog is to help user to fetch the ptt content based on the board name like car

Command Usage:

*bbs_crawler_public.exe [OPTIONS] ptt's board_name index_number*

You can use bbs_crawler to figure out how many index page the PTT board is by fetch_index option and fetch the post page against each index by "fetch_page" plus the designated end of index to iterate from index 1 to yours designation.

Examples:

assuming you install python already.

  • run bbs_crawler_public.py with fetch_index plus ptt's board name like the following
**$python bbs_crawler_public.py -c fetch_index car**

or

  • run bbs_crawler_public.py with fetch_page plus the designated end of index number like the following - iterate from index 1 to index 6
**$python bbs_crawler_public.py -c fetch_page car 6** 

if you do not install python and use Windows OS, use bbs_crawler_public.exe instead of bbs_crawler_public.py to run the examples above

For more info, here is my blog - 抓取批踢踢 (Ptt) post文的crawler實作 in Chinese to explain what and how it works.

About

This prog is to fetch the ptt's content based on the board name like car

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Languages