Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

v0.6.0版本的qax数据源存在bug #24

Closed
M09Ic opened this issue Apr 11, 2023 · 4 comments
Closed

v0.6.0版本的qax数据源存在bug #24

M09Ic opened this issue Apr 11, 2023 · 4 comments
Labels
enhancement New feature or request

Comments

@M09Ic
Copy link

M09Ic commented Apr 11, 2023

logs:

[WARN] 2023/04/12 00:06 [qianxin-ti]: invalid total size 0
[WARN] 2023/04/12 00:06 retrying as %!s(<nil>)
[WARN] 2023/04/12 00:06 [qianxin-ti]: invalid total size 0
[WARN] 2023/04/12 00:06 retrying as %!s(<nil>)
[INFO] 2023/04/12 00:06 [qianxin-ti]: parsing page 1

v0.4.0能正常获取, 但是会报错退出, 更新到v0.6.0后似乎qax的数据源获取失败

image

@zema1
Copy link
Owner

zema1 commented Apr 12, 2023

奇安信这个接口如果访问频率太快容易被 ban,这个 ban 的策略和 ip 有关,我在服务器上发现还是能正常抓取的,偶尔也会报错。
0.6.0 做了一些 retry,但如果 retry 3次还是失败了,就不再继续抓取了。

打算改进一下抓取策略,其实不必全部抓取,只需抓取前10页做个基础数据就可以,后面发版改下

@M09Ic
Copy link
Author

M09Ic commented Apr 12, 2023

感觉可以不用每个人重复抓取, 可以把sqlite数据库打包到代码中, 随着每个版本发布. 这样只需要监控第一页数据。

@zema1
Copy link
Owner

zema1 commented Apr 14, 2023

想了一下,内置的确实是一种不错改进方案。不过这种方案要求我能定期发版才行,比如一个月发一次版,那么内置的db数据可能就比较老了,还是要重新抓取。所以其实抓取个前三页应该是最快的,也好实现

@zema1 zema1 added the enhancement New feature or request label Apr 14, 2023
@zema1
Copy link
Owner

zema1 commented Apr 14, 2023

已修复,更新后如果还有问题可以新开 issue https://github.com/zema1/watchvuln/releases/tag/v0.7.0

@zema1 zema1 closed this as completed Apr 14, 2023
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request
Projects
None yet
Development

No branches or pull requests

2 participants