-
Notifications
You must be signed in to change notification settings - Fork 73
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[BasicNormalizer] sorting the Query Parameters #246
Labels
Milestone
Comments
We sort query parameters when normalizing (in other projects), so yes, I think that's true. |
aecio
added a commit
to aecio/crawler-commons
that referenced
this issue
Jan 4, 2021
- Sort query parameters (fix crawler-commons#246) - Allows to (optionally) remove common irrelevant query parameters - Consistently encode query parameters with 'application/x-www-form-urlencoded'
aecio
added a commit
to aecio/crawler-commons
that referenced
this issue
Jan 4, 2021
- Sort query parameters (fix crawler-commons#246) - Allows to (optionally) remove common irrelevant query parameters - Consistently encode query parameters with 'application/x-www-form-urlencoded'
aecio
added a commit
to aecio/crawler-commons
that referenced
this issue
Jan 5, 2021
- Sort query parameters (fix crawler-commons#246) - Allows to (optionally) remove common irrelevant query parameters - Consistently encode query parameters with 'application/x-www-form-urlencoded'
sebastian-nagel
added a commit
that referenced
this issue
Sep 21, 2021
…oses #309 - rebase to master and squash commits - fix failing sitemaps unit tests with URL filtering using BasicURLNormalizer (sort query params in test sitemap) - CHANGES.txt: updated to follow style, added missing entry for preceding commit
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
When normalizing a URL should these two urls be normalized to the same url ?
http://shekhargulati.com?lang=en&article=fred
http://shekhargulati.com?article=fred&lang=en
This can be achieved by sorting the query params.
The above example was taken from UrlCleaner
It is an actual unit test there (one which we fail :-( ), the test name is: shouldSortQueryParameters
The text was updated successfully, but these errors were encountered: