Google does not read all sitemap index files #2554

Closed
michaeltorbert opened this issue Jun 4, 2019 · 7 comments

@michaeltorbert
Member

commented Jun 4, 2019

Reported here: https://wordpress.org/support/topic/sitemap-error-67

I usually set the sitemap to 1000 references per file, and it generates N files. I can see all the files… but Google does not read all of them, only the first ones…

All in One SEO Pack generates this sitemap index: https://www.lucianoblancato.it/sitemap.xml

with these entries:

URL Last Change
https://www.lucianoblancato.it/sitemap_addl.xml
https://www.lucianoblancato.it/sitemap_post.xml
https://www.lucianoblancato.it/sitemap_page.xml
https://www.lucianoblancato.it/sitemap_attachment_1.xml
https://www.lucianoblancato.it/sitemap_attachment_2.xml
https://www.lucianoblancato.it/sitemap_attachment_3.xml
https://www.lucianoblancato.it/sitemap_attachment_4.xml
https://www.lucianoblancato.it/sitemap_attachment_5.xml
https://www.lucianoblancato.it/sitemap_attachment_6.xml
https://www.lucianoblancato.it/sitemap_attachment_7.xml
https://www.lucianoblancato.it/sitemap_attachment_8.xml
https://www.lucianoblancato.it/sitemap_attachment_9.xml
https://www.lucianoblancato.it/sitemap_attachment_10.xml
https://www.lucianoblancato.it/sitemap_robo_gallery_table.xml
https://www.lucianoblancato.it/sitemap_archive.xml
https://www.lucianoblancato.it/sitemap_author.xml
https://www.lucianoblancato.it/sitemap_category.xml
https://www.lucianoblancato.it/sitemap_post_tag_1.xml
https://www.lucianoblancato.it/sitemap_post_tag_2.xml
https://www.lucianoblancato.it/sitemap_post_tag_3.xml
https://www.lucianoblancato.it/sitemap_post_tag_4.xml
https://www.lucianoblancato.it/sitemap_post_tag_5.xml
https://www.lucianoblancato.it/sitemap_post_tag_6.xml
https://www.lucianoblancato.it/sitemap_post_tag_7.xml
https://www.lucianoblancato.it/sitemap_post_tag_8.xml
https://www.lucianoblancato.it/sitemap_post_tag_9.xml
https://www.lucianoblancato.it/sitemap_post_tag_10.xml
https://www.lucianoblancato.it/sitemap_post_tag_11.xml
https://www.lucianoblancato.it/sitemap_post_tag_12.xml
https://www.lucianoblancato.it/sitemap_post_tag_13.xml
https://www.lucianoblancato.it/sitemap_post_tag_14.xml
https://www.lucianoblancato.it/sitemap_post_tag_15.xml
https://www.lucianoblancato.it/sitemap_post_tag_16.xml
https://www.lucianoblancato.it/sitemap_post_format.xml

Google Webmaster Tools reads only these:

Sitemap
https://www.lucianoblancato.it/sitemap_addl.xml
https://www.lucianoblancato.it/sitemap_archive.xml
https://www.lucianoblancato.it/sitemap_attachment_1.xml
https://www.lucianoblancato.it/sitemap_attachment_10.xml
https://www.lucianoblancato.it/sitemap_attachment_2.xml
https://www.lucianoblancato.it/sitemap_attachment_3.xml
https://www.lucianoblancato.it/sitemap_attachment_4.xml
https://www.lucianoblancato.it/sitemap_attachment_5.xml
https://www.lucianoblancato.it/sitemap_attachment_6.xml
https://www.lucianoblancato.it/sitemap_attachment_7.xml
https://www.lucianoblancato.it/sitemap_attachment_8.xml
https://www.lucianoblancato.it/sitemap_attachment_9.xml
https://www.lucianoblancato.it/sitemap_author.xml
https://www.lucianoblancato.it/sitemap_category.xml
https://www.lucianoblancato.it/sitemap_page.xml
https://www.lucianoblancato.it/sitemap_post.xml
https://www.lucianoblancato.it/sitemap_post_format.xml
https://www.lucianoblancato.it/sitemap_robo_gallery_table.xml
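To see exactly which child sitemaps are absent from the second list, the two lists can be compared as sets. This is only a minimal sketch: the file names are copied from the report above, and the `numbered` helper is a hypothetical convenience for writing out the numbered files, not part of the plugin.

```python
# File names copied from the two lists in the report above.
# `numbered` is a hypothetical helper used only to keep the lists short.
def numbered(prefix, count):
    """Return prefix_1.xml ... prefix_<count>.xml."""
    return [f"{prefix}_{i}.xml" for i in range(1, count + 1)]

# Entries listed in the sitemap index generated by the plugin.
in_index = (
    ["sitemap_addl.xml", "sitemap_post.xml", "sitemap_page.xml"]
    + numbered("sitemap_attachment", 10)
    + ["sitemap_robo_gallery_table.xml", "sitemap_archive.xml",
       "sitemap_author.xml", "sitemap_category.xml"]
    + numbered("sitemap_post_tag", 16)
    + ["sitemap_post_format.xml"]
)

# Entries the reporter says Google actually read.
read_by_google = (
    ["sitemap_addl.xml", "sitemap_archive.xml"]
    + numbered("sitemap_attachment", 10)
    + ["sitemap_author.xml", "sitemap_category.xml", "sitemap_page.xml",
       "sitemap_post.xml", "sitemap_post_format.xml",
       "sitemap_robo_gallery_table.xml"]
)

# Set difference: in the index, but not picked up by Google.
missing = sorted(set(in_index) - set(read_by_google))

print(len(in_index), len(read_by_google), len(missing))  # 34 18 16
for name in missing:
    print(name)
```

With the lists as quoted, all 16 missing entries are `sitemap_post_tag_N.xml` files.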

@michaeltorbert michaeltorbert added this to the 3.0.1 milestone Jun 4, 2019

@michaeltorbert

Member Author

commented Jun 4, 2019

@michaeltorbert

Member Author

commented Jun 4, 2019

@wpsmort @arnaudbroes Can anyone decipher what he's trying to say?

@michaeltorbert

Member Author

commented Jun 4, 2019

config 1 (with problem)

  1. I go to the configuration page and set the sitemap size to 1000.
  2. A sitemap index with child sitemaps is created.
  3. I submit the sitemap URL to Google.
  4. Google does not read all the child sitemaps, only some of them (usually 15, but I have 25 children).

For this reason Google does not index the whole site, only part of it.

config 2 (no problem)

  1. I go to the configuration page and set the sitemap size to 50,000.
  2. A sitemap index with child sitemaps is created.
  3. I submit the sitemap URL to Google.
  4. Google reads all the sitemaps (children included).

But with config 2 I have a heavy sitemap with 50,000 records… 🙁 and that is another problem…
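The effect of the size setting on the number of child sitemaps can be sketched as a ceiling division. This assumes the plugin splits each content type into files of at most the configured size, which the thread implies but does not confirm; the function name and the URL count of 9,500 are hypothetical.

```python
import math

def child_sitemap_count(total_urls: int, urls_per_sitemap: int) -> int:
    """Number of child sitemap files needed for one content type.

    Assumption: each file is filled up to urls_per_sitemap before a new
    one is started.
    """
    if urls_per_sitemap <= 0:
        raise ValueError("urls_per_sitemap must be positive")
    return math.ceil(total_urls / urls_per_sitemap)

# Hypothetical content type with 9,500 URLs.
print(child_sitemap_count(9_500, 1_000))   # 10 files with config 1
print(child_sitemap_count(9_500, 50_000))  # 1 file with config 2
```

Under that assumption, a smaller size means more child files in the index, and a larger size means fewer but heavier files.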

@wpsmort wpsmort self-assigned this Jun 4, 2019

@wpsmort

Member

commented Jun 4, 2019

I'm trying to reproduce this with my public site.

@wpsmort

Member

commented Jun 4, 2019

I've been able to reproduce this with my public site, which has a sitemap index with 26 sitemaps. Google only pulls 17 of them when I submit the sitemap. I can reproduce this with v2.12.1 and with an old version, 2.5, from last year, which means this is not new.

Google's docs state that they'll accept "up to 500 sitemap index files for each site" (https://support.google.com/webmasters/answer/75712).

I've tested with two other well-known plugins and was able to reproduce the problem with one of them. It seems to be a problem with the new Google Search Console Sitemaps report. I'm going to see if they pick up all the sitemaps in the index after 12 or 24 hours.
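For anyone repeating this test, counting the entries in the sitemap index gives the number to compare against what Search Console reports. The following is a rough sketch using Python's standard `xml.etree.ElementTree`. The `child_sitemaps` function and the `example.com` sample are illustrative assumptions; the namespace is the one defined by the sitemaps.org protocol.

```python
import xml.etree.ElementTree as ET

# Namespace used by sitemap and sitemap index files (sitemaps.org protocol).
SITEMAP_NS = {"sm": "http://www.sitemaps.org/schemas/sitemap/0.9"}

def child_sitemaps(index_xml: str) -> list:
    """Return the <loc> URL of every <sitemap> entry in a sitemap index."""
    root = ET.fromstring(index_xml)
    return [
        loc.text.strip()
        for loc in root.findall("sm:sitemap/sm:loc", SITEMAP_NS)
        if loc.text
    ]

# Hypothetical two-entry index, standing in for a real sitemap.xml.
SAMPLE_INDEX = """<sitemapindex xmlns="http://www.sitemaps.org/schemas/sitemap/0.9">
  <sitemap><loc>https://example.com/sitemap_post.xml</loc></sitemap>
  <sitemap><loc>https://example.com/sitemap_page.xml</loc></sitemap>
</sitemapindex>"""

urls = child_sitemaps(SAMPLE_INDEX)
print(len(urls))  # 2
for url in urls:
    print(url)
```

In practice the XML would be downloaded from the site's sitemap index URL first; that step is left out so the sketch runs without network access.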

@michaeltorbert michaeltorbert removed this from the 3.0.1 milestone Jun 4, 2019

@arnaudbroes arnaudbroes changed the title from "Sitemap issue in 3.0" to "Google does not read all sitemap index files" Jun 4, 2019

@michaeltorbert michaeltorbert added this to the 3.1 milestone Jun 4, 2019

@semperfiwebdesign semperfiwebdesign deleted a comment from daliasued Jun 5, 2019

@wpsmort

Member

commented Jun 5, 2019

I tested again with two sites and both worked fine this time. There's a chance that this was a transient issue with Google Search Console. All of the sitemaps in the index for each site were processed much faster today than when I tested yesterday.

@michaeltorbert

Member Author

commented Jun 10, 2019

Update: it looks as though Google has made an undocumented change, and "post" in the sitemap name does not appear to be supported any longer.
