New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Unicode Link Extractor #2010
Comments
I think it worths fixing in Scrapy and/or in w3lib. There was a similar issue in past (#1352). Do you have an example of URL where this fails? |
|
I also know that erroneous strings such as:
will also fail. |
I believe we should fix it at It's a shame there's not explicit exception for wrong label lengths (we can test the exception string but that feels hacky)
|
I'm adding 1.1.1 milestone because this is a Scrapy 1.1 regression. |
When using the following to extract all of the links from a response:
On rare occasions, the following error is thrown:
I was able to find some information concerning the error from here.
My question is: What is the best way to handle this? Even if there is one bad link in the response, I'd want all of the other good links to be extracted.
The text was updated successfully, but these errors were encountered: