-
Notifications
You must be signed in to change notification settings - Fork 1
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
- Loading branch information
Showing
2 changed files
with
17 additions
and
1 deletion.
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,6 @@ | ||
1. 用 `puppeteer` 绕过网站复杂的异步请求 | ||
2. ex 本身只需要有效 cookie 就可以进入,不需要额外信息。而这个 cookie 看上去没有敏感信息,所以不需要提前登录 e 站获得 cookie,只需要模拟一个即可 | ||
3. 所以 cookie 直接写在配置文件里 | ||
4. 每个网页打开的最短间隔大约是 3 秒,但如果数量多就得调到 5 秒 | ||
5. 服务每次爬取 40 页,也就是 1000 个漫画。因为数量巨大,图片本身需要在浏览时加载,这里需要懒加载 | ||
6. 详情页下载时,会先获得所有地址再一次下完,中间有一张失败就会返工 |