-
Notifications
You must be signed in to change notification settings - Fork 2.6k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Remove all query parameters when extracting protocol #2996
Conversation
Beware of cases like: Maybe: first try to find an extension, and if none, try to remove the By the way, here is the list of URLs for errors of this type, with a '?' in the URL:
|
Hi @severo, I just saw your comment. Thank you. Finally I just swapped the 2 parsings: first I extract extension and then I remove query parameters. 😉 |
OK :) Maybe we should add some unit tests to ensure we improve the detection without regressions (it's Friday afternoon, I trust the unit tests more than my analysis of the code) |
Great! For the tests, I think we should also add some URLs in the form: |
Fix
_get_extraction_protocol
to remove all query parameters, like?raw=true
,?dl=1
,...