Novel Scraper

这是一个使用 Go 语言编写的小说爬虫程序，使用 chromedp 实现网页内容爬取。

功能特点

使用无头浏览器（Headless Chrome）进行网页爬取
自动检测本地 Chrome 浏览器安装情况
支持自动获取下一章/页面链接
保存小说内容到本地文件

使用前提

安装 Go 语言环境（推荐 1.16 或更高版本）
安装 Chrome 浏览器

安装

git clone [your-repository-url]
cd novel-scraper
go mod download

使用方法

go run main.go <first-chapter-url>

例如：

go run main.go https://example.com/novel/chapter1

注意事项

爬虫的选择器（例如标题、正文、下一章链接的选择器）需要根据目标网站的具体结构进行调整
程序会在运行目录下创建章节文件（格式：chapter_001.txt, chapter_002.txt 等）
为了避免对目标网站造成压力，程序内置了 1 秒的请求间隔

自定义配置

要修改网页元素的选择器，请编辑 main.go 文件中的 scrapeChapter 函数：

chromedp.Text("h1", &chapter.Title),                // 修改标题选择器
chromedp.Text("div.content", &chapter.Content),     // 修改正文选择器
chromedp.AttributeValue("a.next-chapter", "href", &chapter.NextLink, nil), // 修改下一章链接选择器

许可证

MIT License

Name		Name	Last commit message	Last commit date
Latest commit History 10 Commits
.idea		.idea
.vscode		.vscode
bin		bin
configs		configs
internal		internal
merged		merged
progress		progress
.gitignore		.gitignore
README.md		README.md
go.mod		go.mod
go.sum		go.sum
main.go		main.go

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Novel Scraper

功能特点

使用前提

安装

使用方法

注意事项

自定义配置

许可证

About

Uh oh!

Releases

Packages

Languages

AndiHappy/go-script

Folders and files

Latest commit

History

Repository files navigation

Novel Scraper

功能特点

使用前提

安装

使用方法

注意事项

自定义配置

许可证

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages