Avoid URL Encoding as an option #3542

maramsumanth · 2018-12-21T16:17:46Z

Fixes #833.
Case (i):-


In [4]: Request('http://google.com/"hello"',True)
Out[4]: <GET http://google.com/%22hello%22>

In [5]: Request('http://google.com/"hello"',False)
Out[5]: <GET http://google.com/"hello">

In [6]: Request('http://google.com/"hello"')
Out[6]: <GET http://google.com/%22hello%22>

In the above case I was able to add quote_path as an argument to the Request constructor. It defaults to True, which quotes the URL; passing False leaves the URL unquoted.
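To make the intended behaviour concrete, here is a minimal standalone sketch using only the standard library. The helper name encode_url is hypothetical and this is not the w3lib implementation; it only illustrates what quoting or not quoting the path means for the example URL above.

```python
from urllib.parse import quote, urlsplit, urlunsplit


def encode_url(url, quote_path=True):
    """Hypothetical helper: percent-encode the path only when quote_path is True."""
    parts = urlsplit(url)
    path = parts.path
    if quote_path:
        # Leave "/" and existing "%" escapes alone; encode characters such as '"'.
        path = quote(path, safe="/%")
    return urlunsplit((parts.scheme, parts.netloc, path, parts.query, parts.fragment))


print(encode_url('http://google.com/"hello"'))         # http://google.com/%22hello%22
print(encode_url('http://google.com/"hello"', False))  # http://google.com/"hello"
```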

Case (ii):-

I want to add an option to the command scrapy shell "http://google.com/\"hello\"" so that when it is set to True we get <GET http://google.com/%22hello%22>, and when it is set to False we get <GET http://google.com/"hello">.

I don't know how to add an option for that command (Case (ii)). Any suggestions?
Mentors, please guide me @kmike @cathalgarvey @lopuhin :)

(PS: I escaped each " around hello with \ in order to include it in the URL.)
At present I have opened a PR for Case (i); ideas for Case (ii) are welcome :)
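One possible shape for Case (ii), sketched with plain argparse rather than Scrapy's own command classes. The flag name --no-quote-path and the function build_parser are hypothetical and do not exist in Scrapy; the sketch only shows how a command-line switch could map onto a quote_path boolean that defaults to True.

```python
import argparse


def build_parser():
    """Build a toy parser with a hypothetical --no-quote-path switch."""
    parser = argparse.ArgumentParser(prog="shell-sketch")
    parser.add_argument("url", help="URL to fetch")
    # store_false makes quote_path default to True and flip to False when the flag is given.
    parser.add_argument(
        "--no-quote-path",
        dest="quote_path",
        action="store_false",
        help="do not percent-encode unsafe characters in the URL path",
    )
    return parser


args = build_parser().parse_args(['http://google.com/"hello"', "--no-quote-path"])
print(args.url, args.quote_path)
```

The parsed quote_path value would then be passed on to whatever builds the request.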

I opened a PR for w3lib to modify the safe_url_string method. Please take a look at scrapy/w3lib#119.

Thanks :)

maramsumanth · 2018-12-21T16:23:22Z

I can't get this working until scrapy/w3lib#119 is merged, because this commit relies on the modified safe_url_string method in w3lib.

elacuesta · 2019-01-14T15:48:51Z

scrapy/http/request/__init__.py

@@ -16,12 +16,13 @@

 class Request(object_ref):

-    def __init__(self, url, callback=None, method='GET', headers=None, body=None,
+    def __init__(self, url, quote_path=True, callback=None, method='GET', headers=None, body=None,


Currently the second positional argument is callback, so putting quote_path in its place would be backwards incompatible. I'd suggest moving it to the end of the argument list; I suspect that's why the tests are failing.
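A small illustration of the incompatibility, using simplified stand-in classes rather than the real Request (the class names and the reduced argument lists are made up for the example):

```python
class BreakingRequest:
    # quote_path inserted as the second positional argument, as in the diff above
    def __init__(self, url, quote_path=True, callback=None, method="GET"):
        self.url = url
        self.quote_path = quote_path
        self.callback = callback
        self.method = method


class CompatibleRequest:
    # quote_path appended at the end, so existing positional calls keep working
    def __init__(self, url, callback=None, method="GET", quote_path=True):
        self.url = url
        self.callback = callback
        self.method = method
        self.quote_path = quote_path


def parse(response):
    return None


# Existing code commonly passes the callback positionally.
broken = BreakingRequest("http://example.com", parse)
print(broken.callback)            # None: the callback was silently lost
print(broken.quote_path is parse) # True: it landed in quote_path instead

ok = CompatibleRequest("http://example.com", parse)
print(ok.callback is parse)       # True
```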

@elacuesta, I did that. The tests are failing because my PR scrapy/w3lib#119 hasn't been merged yet.

@maramsumanth it has been merged:

[MRG+1] If user doesn't want to encode unsafe characters. w3lib#119

felipeboffnunes · 2022-11-02T03:36:36Z

@Gallaecio I'm not really sure why the current CI is failing, but this seems pretty straightforward. Are the changes in this PR still worth merging, or has anything significant changed in master that would make them obsolete? If not, I am willing to do the sync and also add some unit tests, if it is worth it.

felipeboffnunes · 2022-11-02T03:42:31Z

From another angle, the integration looks seamless, so the unit tests at scrapy/w3lib#119 should suffice. Nevertheless, it may be worth adding a small doc update for the argument added to Request.__init__?

Gallaecio · 2022-11-02T07:12:32Z

I think it may be best not to try and address the issue at all, for the time being.

Update __init__.py

6eecc85

maramsumanth mentioned this pull request Dec 21, 2018

[MRG+1] If user doesn't want to encode unsafe characters. scrapy/w3lib#119

Merged

maramsumanth changed the title ~~Update __init__.py~~ Avoid URL Encoding as an option Dec 21, 2018

maramsumanth mentioned this pull request Dec 24, 2018

Prevent URL encoding option #833

Open

maramsumanth added 3 commits December 29, 2018 13:01

Update __init__.py

163f99c

Changed to quote_path

83b2460

Update __init__.py

7199d18

elacuesta reviewed Jan 14, 2019

Moved quote_path to end of argument list

04dba9b

Gallaecio mentioned this pull request Dec 16, 2020

Add dont_encode option in scrapy.http.Request class. #4917

Closed
