URL format issue with page-type pages #13

Closed
seozed opened this issue Jul 8, 2015 · 3 comments

Comments

seozed commented Jul 8, 2015

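If the normal URL for one of my pages is domain.com/about, the generated sitemap.xml exports it as domain.com/about/index.html, which causes problems for crawlers.
To a crawler, domain.com/about and domain.com/about/index.html are two different pages. It will fetch each one separately, which wastes crawl resources and leads to duplicate indexing.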

For reference, here is my sitemap: http://mazih.com/sitemap.xml

Member

leesei commented Mar 7, 2016

https://moz.com/learn/seo/canonicalization
https://support.google.com/webmasters/answer/139066?hl=en

I think we could use permalink as the canonical URL (in header and sitemap).
And I think the one without index.html is better.
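
To illustrate the idea, here is a minimal sketch of stripping a trailing index.html before URLs are written to the sitemap. The `canonicalUrl` helper and the sample URLs are hypothetical and are not part of hexo-generator-sitemap's API. It keeps the trailing slash, so whether to also drop that slash is a site-specific choice left open here.

```typescript
// Hypothetical helper for illustration only; not the plugin's actual code.
function canonicalUrl(url: string): string {
  // Turn ".../about/index.html" into ".../about/"; leave other URLs alone.
  return url.replace(/\/index\.html$/, "/");
}

// Sample URLs (assumed for this sketch).
const urls: string[] = [
  "http://domain.com/about/index.html",
  "http://domain.com/about/",
  "http://domain.com/post.html",
];

// Canonicalize, then drop duplicates so each page is listed once.
const unique: string[] = urls
  .map(canonicalUrl)
  .filter((u, i, all) => all.indexOf(u) === i);

console.log(unique);
```

With these inputs, the two /about/ variants collapse into a single entry, so a crawler would see one URL for that page.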

@ApsarasX

@seozed I modified the plugin's source code and fixed this problem:
hexo-generator-sitemap

Author

seozed commented May 22, 2018

great!
