- A project for crawling job data from Lagou. It is deprecated and will not receive further updates ⊙﹏⊙b
- On multithreading, see a blog post I wrote: "Python one-line multiprocessing and multithreading (revised edition)"
- On fetching the data: some pages need BeautifulSoup to parse the HTML, while others require simulating a POST Ajax request... which feels rather split. See crawl.py and res.json.
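
The two retrieval styles mentioned above can be sketched roughly as follows. This is a minimal illustration, not the code in crawl.py: the endpoint `AJAX_URL`, the form fields `keyword` and `page`, the `results`/`title` JSON layout, and the `a.position` CSS selector are all hypothetical placeholders. The Ajax half uses only the standard library; the HTML half assumes the `bs4` package is installed.

```python
import json
from urllib.parse import urlencode
from urllib.request import Request

# Hypothetical endpoint, for illustration only (not the real Lagou URL).
AJAX_URL = "https://example.com/jobs/positionAjax.json"


def build_ajax_request(keyword, page):
    """Build a POST request that mimics a browser's Ajax form submission."""
    # Hypothetical form fields; passing `data` makes urllib issue a POST.
    form = urlencode({"keyword": keyword, "page": page}).encode("utf-8")
    headers = {"X-Requested-With": "XMLHttpRequest"}
    return Request(AJAX_URL, data=form, headers=headers)


def parse_ajax_response(body):
    """Pull job titles out of a JSON body (assumed layout: results -> title)."""
    payload = json.loads(body)
    return [item["title"] for item in payload["results"]]


def parse_html_titles(html):
    """Pull job titles out of a static HTML page with BeautifulSoup."""
    # Imported lazily so the Ajax half still runs without bs4 installed.
    from bs4 import BeautifulSoup

    soup = BeautifulSoup(html, "html.parser")
    # Hypothetical selector: anchors carrying the class "position".
    return [a.get_text(strip=True) for a in soup.select("a.position")]
```

To actually send the request, pass the object returned by `build_ajax_request` to `urllib.request.urlopen` and feed the decoded body to `parse_ajax_response`.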
Allianzcortex/lagou_crawler
crawl lagou data with multiprocessing
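
As a rough sketch of how a crawl can be spread over a worker pool, the example below uses `multiprocessing.dummy.Pool`, the thread-backed pool with the same interface as `multiprocessing.Pool`. It is an assumption about the general approach, not the repository's actual code; `fetch_page` and `crawl_pages` are hypothetical names, and `fetch_page` returns a fake record instead of making a network call.

```python
from multiprocessing.dummy import Pool as ThreadPool  # threads, same API as multiprocessing.Pool


def fetch_page(page_number):
    """Hypothetical stand-in for one crawl request; returns a fake record."""
    return {"page": page_number, "jobs": page_number * 15}


def crawl_pages(page_numbers, workers=4):
    """Fetch all pages concurrently and return results in input order."""
    with ThreadPool(workers) as pool:
        # map() hands the calls to the pool and preserves the input order.
        return pool.map(fetch_page, page_numbers)


if __name__ == "__main__":
    print(crawl_pages(range(1, 4)))
```

Switching the import to `from multiprocessing import Pool` runs the same `map` call in separate processes, which suits CPU-bound parsing better than I/O-bound fetching.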