-
Notifications
You must be signed in to change notification settings - Fork 1
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
obey robots.txt is not working #5
Comments
And also I got this in my typo3 error log: Mon, 21 Dec 2020 05:56:00 +0000 [ERROR] request="e909aaa824fb3" component="INM.InmGooglesitemap.Generators.SitemapGenerator": Extension inm_googlesitemap: Error Code: 5 --- Reason: Socket-stream timed out (timeout set to 5 sec). This error log which made site to show the 503 error and restarting the php-fpm service showed the site again. Please check this too |
Hi @notacoder-ui , |
Okay, well adding |
Hi @merzilla I updated the settings as you said and ran the cron job. Tue, 22 Dec 2020 05:05:01 +0000 [ERROR] request="08db8edc7ac5b" component="INM.InmGooglesitemap.Generators.SitemapGenerator": Extension inm_googlesitemap: Response Header not correct. Got HTTP Status Code 302 for URL https://www.xyz.de/mailto:%20%69n%66%6f%40%72eise%6cinie%2e%64e --- Complete Response Header: HTTP/1.1 302 Found |
Hi @merzilla I need to update settings like some links should not be indexed while generating a sitemap.xml file. Or obey robots.txt functionality is also fine for me so that I can set URLs there with disallow and that is not getting indexed while generating a new sitemap.xml |
Hi,
I have set a rule in robots.txt that
Disallow: /mailto:%20iasdf%66o%40r%65%69asdfdf%2ede
Disallow: /news-letter/unsub
And started the cron job to index but the job always indexed the above both urls.
How to skip some urls not getting indexed in the sitemap.
The text was updated successfully, but these errors were encountered: