Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[Bug] Web site content cannot be synchronized #2466

Closed
Andyguo5891 opened this issue Mar 3, 2025 · 2 comments
Closed

[Bug] Web site content cannot be synchronized #2466

Andyguo5891 opened this issue Mar 3, 2025 · 2 comments
Assignees

Comments

@Andyguo5891
Copy link

Contact Information

andyguo-19924537486

MaxKB Version

1.10.1

Problem Description

网络可通,
Web 根地址:https://my.org.mo/ 或者 https://my.org.mo/zh_tw/index.html
选择器为空,无法自动爬取网页内容

Steps to Reproduce

Image

The expected correct result

期望输入网页地址后,可自动爬取网页的所有内容

Related log output

Additional Information

No response

@shaohuzhang1 shaohuzhang1 changed the title [Bug] Web站点内容无法同步 [Bug] Web site content cannot be synchronized Mar 3, 2025
@Shenguobin0102
Copy link

你好,请使用https://my.org.mo/zh_tw 作为网站地址,添加知识库。你提供的其他两个地址在爬去时,链接会发生变化,不能作为根地址来使用。

@shaohuzhang1
Copy link
Contributor

Bot detected the issue body's language is not English, translate it automatically. 👯👭🏻🧑‍🤝‍🧑👫🧑🏿‍🤝‍🧑🏻👩🏾‍🤝‍👨🏿👬🏿


Hello, please use https://my.org.mo/zh_tw as the website address to add a knowledge base. The links will change when the other two addresses you provide are crawling and cannot be used as root addresses.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

4 participants