Added minimum viable download_url function. #29

Closed
fawkesley wants to merge 1 commit

Conversation

fawkesley
Contributor

Request for comment, don't merge yet :)

@fawkesley
Contributor Author

Comments on the interface are welcome - once we've got it, we're kind of stuck with it... @IanHopkinson @drj11 @scraperdragon

'OPTIONS': requests.options}


def download_url(url, method='GET', retry=False, cache_seconds=None, **kwargs):
Contributor

Prefer to avoid forwarding kwargs where possible. At the least, you should explicitly have all of the useful ones you can think of in the function signature. Otherwise what happens is that people build stuff on top of it and you end up looking five functions deep, each of which peels off a couple of arguments.

If you do forward kwargs, it's also good to specify some useful ones in the docstring, and preferably to include a reference to the inner function's documentation.
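
For concreteness, a sketch of what's being suggested: the commonly useful requests arguments spelled out in the signature, everything else still forwarded, and the docstring pointing at the requests documentation. The explicit parameters chosen here are illustrative, not taken from the PR itself.

import requests

METHODS = {'GET': requests.get,
           'POST': requests.post,
           'OPTIONS': requests.options}

def download_url(url, method='GET', retry=False, cache_seconds=None,
                 params=None, headers=None, timeout=None, **kwargs):
    """Fetch url with the given HTTP method.

    params, headers and timeout are passed straight through to
    requests, as is anything else in kwargs; see
    http://docs.python-requests.org/en/latest/api/ for what they mean.
    """
    # retry / cache_seconds handling is omitted from this sketch.
    return METHODS[method](url, params=params, headers=headers,
                           timeout=timeout, **kwargs)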

Contributor

In this case, isn't it about letting the caller use all of the keyword arguments from requests.get without us getting in the way? We're not in a position to predict, document, or know the keyword arguments to requests.get, and it would be annoying if the caller was unable to use one.

Contributor Author

Agree in principle, but the sole purpose of this function is to thinly wrap the requests module - people should look there (and not five modules down!) for documentation, really.

The last thing I want to do is diverge from the requests interface in the future (i.e. by not supporting a future argument).

Contributor

To be clear, I'm not against forwarding kwargs. The minimum is to have a link pointing directly at the place it's forwarded to. Slightly better is to also explicitly write out the useful ones which are forwarded. Even better is not to have kwargs.

I encountered this just the other day with dragon, where we had to go to something like five places in the code to find out what arguments were possible and what they meant, because none of the forwarders documented which arguments were sensible or where to find them.

@pwaller pwaller mentioned this pull request Aug 19, 2013
@drj11
Contributor

drj11 commented Aug 19, 2013

The most disappointing thing about the interface is that it's not just "requests".

I'd prefer an interface where I can go:

import scraperwiki.contrib.cache.auto

and magically all my requests are cached. Maybe there is a scraperwiki.contrib.cache.control I can call too.

And similarly for the retry functionality:

import scraperwiki.contrib.retry.auto

I like this because then I don't have to change my code, which is probably already using requests.get.
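
A rough sketch of how the import-with-side-effects idea could work (the scraperwiki.contrib module path is drj11's proposal rather than existing code, and requests_cache is just one possible way to do the patching):

# scraperwiki/contrib/cache/auto.py -- importing this module installs a
# global cache, so existing requests.get calls are served from cache
# without any code changes. expire_after is an arbitrary choice here.
import requests_cache
requests_cache.install_cache('scraperwiki_cache', expire_after=3600)

Calling code then stays exactly as it was:

import scraperwiki.contrib.cache.auto  # hypothetical, per the comment above
import requests

html = requests.get('http://example.com/').text  # cached on later runs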

@fawkesley
Contributor Author

It's a fair point, but I'm pretty selective about which requests I cache (see requests-cache/requests-cache#12).

I usually don't cache index pages, but do cache things they link to.
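
The selective version is straightforward with an explicit cached session rather than a global patch; a sketch (requests_cache.CachedSession is real, extract_links is a hypothetical helper):

import requests
import requests_cache

cached = requests_cache.CachedSession('pages')

# The index changes as new items appear, so always fetch it fresh...
index = requests.get('http://example.com/index.html')

# ...but the pages it links to are stable, so serve those from cache.
for link in extract_links(index.text):  # extract_links: hypothetical
    page = cached.get(link)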

The auto retry one is a good point, although I don't particularly want to be in the business of monkey-patching requests :(

I don't share your pain of having written requests.get everywhere - I've basically always wrapped it in a download_url function :)
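
A retry flavour that stays a thin, explicit wrapper (no monkey-patching) could look something like this; the number of tries and the backoff policy are invented for the sketch:

import time
import requests

def download_url(url, retry=False, **kwargs):
    tries = 3 if retry else 1  # 3 attempts is an arbitrary choice
    for attempt in range(tries):
        try:
            response = requests.get(url, **kwargs)
            response.raise_for_status()
            return response
        except requests.RequestException:
            if attempt == tries - 1:
                raise  # out of attempts; propagate the failure
            time.sleep(2 ** attempt)  # simple exponential backoff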

@pwaller
Contributor

pwaller commented Mar 24, 2014

wow, 7 months old. Seems like yesterday. Can we merge or close this already?

@scraperdragon
Contributor

A lot of this functionality seems to already be in https://github.com/scraperwiki/data-services-helpers/blob/master/dshelpers.py - predominantly because it's been 7 months.

Should we be focussing on one or the other?

@scraperdragon
Contributor

@paulfurley Reopen this if you feel strongly that it needs considering further.

@fawkesley
Contributor Author

Nice one - sorry that was untidy of me.

@fawkesley fawkesley deleted the download-url-contrib branch April 29, 2014 09:06