-
-
Notifications
You must be signed in to change notification settings - Fork 107
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
cumulativefilter vs crawler #2157
Comments
The same would be the case for any crawler. So I guess it just doesn't make sense for any crawler to even crawl these url's which means they should all get |
makes sense. |
Actually neither |
The debug log should tell you where the URL was found. |
Can i get that live if i use the console? |
Commands are self-documenting :) Just run |
well - i'm stuck here... |
That's exactly what you want. This means it's not going to be requested and thus also not indexed. But maybe it's found elsewhere again? Try finding the entry where the URL is requested. You should see where it was found, then you have to fix it on this page as well. |
There is only one cumulativefilter-module and all links with cumulativefilter parameter get 303-redirected to the same page with isorc parameter. |
I have added |
I recently added a cumulativefilter on a shop-page and since then the crawler/search-indexer doesn't work properly anymore.
there are ~150 pages/products in total but the number of pages to be crawled grows into absurde amounts (57000+) and the crawler never catches up crawling.
I suspect the url-parameter - as with unpublished filter the crawler works as expected.
Filter-urls look like this:
foo.html?cumulativefilter=ODQ7YWRkO2ZpbHRlcl9hdHRyaWJ1dGVzOzQ1OQ==
this is then somehow translated into
foo.html?isorc=1728
Any Ideas how to fix this? Im not totally sure if adding a
rel="noindex"
would be appropiate.The text was updated successfully, but these errors were encountered: