Skip to content
/ spider Public
forked from simapple/spider

python爬虫 全球网址URL滚动提取

Notifications You must be signed in to change notification settings

sdfzy/spider

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

9 Commits
 
 
 
 
 
 

Repository files navigation

spider

python 爬虫

版本1 功能简述: 以hao123为入口页面,滚动爬取外链,收集网址,并记录网址上的内链和外链数目,记录title等信息

windows7 32位上测试,目前每24个小时,可收集数据为10万左右

About

python爬虫 全球网址URL滚动提取

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Python 100.0%