Genspider prepend http:// without checking it in the domain #3553
Comments
Hello everybody! I would like to contribute to Scrapy, specifically on this issue. I have already forked the repo and made the changes, but before opening a pull request I would like to discuss them and reach an agreement. I made the changes across 5 files:
Lastly, and maybe most importantly, I don't know how to test the code. It's my first time and I have no idea how to run it, so any help or guidance would be really appreciated. Thank you in advance and happy coding!!!! =D
@ambarmendez I would suggest you open a pull request already; discussions over actual code are usually easier. Also, make sure you check out previous pull requests (#3554, #3558). Feedback there, especially @kmike’s, could answer some of your points.
Thank you @Gallaecio! Let's see how it goes =D
Now URLs are supported as input: #4439. However, we still simply extract the domain from them, which means that the input protocol is not respected for
#3558 seems to aim to address the
Hi @Gallaecio, can I be assigned to work on this issue?
@msenior85 No need. Feel free to open a pull request and include
Hello, I saw the note about this issue in the documentation and decided to write a simple patch for it. PR #5808 |
genspider prepends http:// to the domain. But when I enter an address like https://example.com, it becomes http://https://example.com, which throws an error when I run scrapy crawl.
Instead, it should first check the domain it receives and then decide, based on that input, whether it needs to prepend http:// or nothing.
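To illustrate the check I have in mind, here is a minimal sketch. It is not Scrapy's actual genspider code: the function name `build_start_url` and the `default_scheme` parameter are made up for this example, and it only assumes Python's standard `urllib.parse.urlparse`.

```python
from urllib.parse import urlparse


def build_start_url(user_input: str, default_scheme: str = "http") -> str:
    """Hypothetical helper: add a scheme only when the input lacks one."""
    candidate = user_input.strip()
    # urlparse lowercases the scheme, so "HTTPS://..." is recognized too.
    if urlparse(candidate).scheme in ("http", "https"):
        # The input is already a full URL; keep its protocol as given.
        return candidate
    # Bare domain (for example "example.com"): prepend the default scheme.
    return f"{default_scheme}://{candidate}"


print(build_start_url("example.com"))
print(build_start_url("https://example.com"))
```

With a check like this, a bare domain still gets http:// prepended, while an input that already starts with http:// or https:// is left untouched instead of turning into http://https://example.com.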