Crawl results have gaps #41

Closed
Wenzhi-Ding opened this issue Oct 28, 2022 · 1 comment · Fixed by #42

Comments

@Wenzhi-Ding
Owner

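For the keyword 北陆药业 (Beilu Pharmaceutical), searching the range 2010-01-01-0 to 2016-04-27-10: up to page 49 the results are all dated around April 11, 2016, but at page 50 they suddenly jump to July 2015, so the results in between are never retrieved.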

@Wenzhi-Ding
Owner Author

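This is caused by Weibo's own search mechanism, so it is hard to prevent in advance. We can add a post-hoc remediation module instead: when a large jump is detected in the search progress, split the skipped interval into smaller intervals and search them again.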
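
A minimal sketch of what such a remediation step might look like, not the implementation merged for this issue. It assumes the crawler has the timestamps of the results in the order the search returned them (newest first), and that search bounds use the `YYYY-MM-DD-H` form seen above. The function names (`find_gaps`, `split_interval`, `requery_plan`) and the default thresholds are hypothetical.

```python
from datetime import datetime, timedelta


def fmt(dt):
    """Format a datetime like the bounds in this issue, e.g. 2016-04-27-10."""
    return f"{dt:%Y-%m-%d}-{dt.hour}"


def find_gaps(timestamps, max_gap):
    """Return (older, newer) pairs where consecutive results are more than
    max_gap apart. Assumes timestamps are ordered newest first."""
    gaps = []
    for newer, older in zip(timestamps, timestamps[1:]):
        if newer - older > max_gap:
            gaps.append((older, newer))
    return gaps


def split_interval(start, end, step):
    """Split [start, end] into consecutive pieces no longer than step."""
    pieces = []
    cursor = start
    while cursor < end:
        nxt = min(cursor + step, end)
        pieces.append((cursor, nxt))
        cursor = nxt
    return pieces


def requery_plan(timestamps, max_gap=timedelta(days=30), step=timedelta(days=7)):
    """List the (start, end) bound strings that should be searched again.
    The default thresholds are illustrative guesses, not tuned values."""
    plan = []
    for older, newer in find_gaps(timestamps, max_gap):
        for lo, hi in split_interval(older, newer, step):
            plan.append((fmt(lo), fmt(hi)))
    return plan


# Made-up timestamps mimicking the jump described above.
seen = [
    datetime(2016, 4, 11, 10),
    datetime(2016, 4, 11, 8),
    datetime(2015, 7, 20, 12),  # large jump from the previous result
    datetime(2015, 7, 20, 9),
]
for start, end in requery_plan(seen, step=timedelta(days=90)):
    print(start, end)
```

Each pair in the plan would then be passed back to the normal search routine as a new, narrower time range. A sub-interval that still shows a jump could be split again in the same way.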
