Integrating Scurl into Scrapy #3332
base: master
# Conflicts:
#   scrapy/linkextractors/__init__.py
#   scrapy/linkextractors/lxmlhtml.py
#   scrapy/linkextractors/sgml.py
#   scrapy/utils/request.py
@nctl144 in general I think what you did is correct, but the build https://travis-ci.org/scrapy/scrapy/jobs/407810222 has a lot of failures. Maybe this is due to the small requirements issue that I left a comment on: #3332 (comment)
Hey @lopuhin, the build is green now haha. This PR is also ready for review!
Hey @nctl144 looks good! Before the merge, this might need tweaking the warning wording, change scurl URL to
@nctl144 could you please also provide benchmarking results on Python 2 and Python 3, to provide motivation for why scurl is useful?
I suppose it is not, since I tried installing the library without having Cython installed. I will remove it!
Hmm, do you have anything in mind for this? I am not sure how I should change that haha
Yeah I will work on that now :)
I think it would be better to remove the warning - sorry, it was my idea initially, but after talking to @kmike we decided that at the moment it would be better to go without it.
haha sure I will just remove it for now @lopuhin!
I just added Scurl to the installation doc in this PR @lopuhin. I will include the profiling result in the comment section soon! But please let me know if there's anything else that I should work on 😄
here is some speed check result (I don't have the profile to share yet since vmprof is not cooperating... using
Using
More information is in the GSoC final report, which can be found here. Lemme know what you think @lopuhin!
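For anyone who wants to run a comparable speed check themselves, here is a minimal timing sketch. It is not the benchmark used in this PR: the sample URLs, the `time_parser` helper and the repeat counts are made up for illustration, and the `scurl` import assumes the package exposes a drop-in `urlsplit` (the script falls back to the standard library only if it is not installed).

```python
import timeit
from urllib.parse import urlsplit as stdlib_urlsplit

try:
    # Assumption: scurl provides a urlsplit compatible with the stdlib one.
    from scurl import urlsplit as scurl_urlsplit
except ImportError:
    scurl_urlsplit = None

# Hypothetical sample; a real check should use a large, varied URL set.
SAMPLE_URLS = [
    "https://example.com/",
    "http://example.com/path/to/page?query=1&other=two#fragment",
    "https://user:pass@example.org:8080/a/b/c.html?x=y",
]


def time_parser(parse, urls, repeat=5, number=1000):
    """Return the best wall-clock time (seconds) for parsing all urls `number` times."""
    timer = timeit.Timer(lambda: [parse(u) for u in urls])
    return min(timer.repeat(repeat=repeat, number=number))


if __name__ == "__main__":
    print("stdlib urlsplit: %.4fs" % time_parser(stdlib_urlsplit, SAMPLE_URLS))
    if scurl_urlsplit is not None:
        print("scurl urlsplit:  %.4fs" % time_parser(scurl_urlsplit, SAMPLE_URLS))
    else:
        print("scurl is not installed, skipping")
```

Note that the standard library caches `urlsplit` results, so repeatedly parsing the same few URLs flatters the stdlib numbers; feeding each parser a large list of distinct URLs gives a fairer picture.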
Hey @lopuhin , I will continue the PR #3319 in here. I screwed it up somehow and I could not get the diff updated...