
streaming a response with iter_lines doesn't work #989

Closed
gdamjan opened this issue Dec 6, 2012 · 23 comments
Comments

@gdamjan

gdamjan commented Dec 6, 2012

Using the following simple server to simulate a streaming HTTP service, requests 0.14.2 doesn't stream the response: it waits until some amount of data has been received, or until the end of the response.

The client

import requests

url = 'http://localhost:8000/a/b/c'
# prefetch=False (the requests 0.x API) asks for the response body to be
# streamed rather than read up front.
req = requests.get(url, prefetch=False)

for line in req.iter_lines():
    print repr(line)

The server

from wsgiref.simple_server import make_server
import time

def simple_app(environ, start_response):
    status = '200 OK'
    headers = [('Content-type', 'text/plain; charset=utf-8')]

    start_response(status, headers)

    # Trickle out one short line per second so the response body arrives slowly.
    for i in range(10):
        yield "line %d\r\n" % i
        time.sleep(1)

httpd = make_server('', 8000, simple_app)
print("Serving on port 8000...")
httpd.serve_forever()
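
For readers on requests 1.x or later: the prefetch=False flag used above was replaced by stream=True. A roughly equivalent client for the same test server might look like the sketch below (added for reference, not part of the original report):

import requests

url = 'http://localhost:8000/a/b/c'
# stream=True is the requests 1.x+ replacement for the 0.x prefetch=False flag.
resp = requests.get(url, stream=True)

for line in resp.iter_lines():
    print(repr(line))

The chunk_size workaround discussed below applies to this form as well.
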
@Lukasa
Member

Lukasa commented Dec 6, 2012

Change your line:

for line in req.iter_lines():

to:

for line in req.iter_lines(chunk_size=10):

You'll find this works. =)

@sigmavirus24: IIRC, you worked on the streaming stuff last. Why is the default iter_lines chunk size so large (10240 bytes)? Is there a design decision I don't know about there?
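
For context on why the chunk size matters here: iter_lines pulls data through iter_content in chunk_size-byte reads, and the underlying file object generally blocks until that many bytes have arrived (or the connection closes) before anything is split into lines. With the 10240-byte default and a server emitting roughly 8 bytes per second, the first line cannot appear for a very long time. A simplified sketch of the line-splitting side (not the actual requests source; assumes a Python 3 bytes body):

def iter_lines_sketch(response, chunk_size=10240):
    """Rough model of Response.iter_lines, for illustration only."""
    pending = None
    for chunk in response.iter_content(chunk_size=chunk_size):
        if pending is not None:
            chunk = pending + chunk
        lines = chunk.splitlines()
        # Hold back a possibly incomplete trailing line until more data arrives.
        if chunk and not chunk.endswith((b'\r', b'\n')):
            pending = lines.pop() if lines else None
        else:
            pending = None
        for line in lines:
            yield line
    if pending is not None:
        yield pending
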

@gdamjan
Author

gdamjan commented Dec 6, 2012

Confirmed, chunk_size=10 does fix the issue.

I'll leave this issue open until it's decided what the default should be.

@sigmavirus24
Contributor

@Lukasa I think you have me mistaken for someone else, but if I remember correctly 1024 was considered because that's 1 kB. It isn't unreasonable; most web content is much larger than that, and it is configurable.

@sigmavirus24
Contributor

Also @gdamjan, it was already decided if I remember correctly, so you can close this if you feel your needs were met.

At this point, with the refactor coming up, we could change it to half the current size, since we can announce the breaking changes then. But that still wouldn't fix his problem. To phrase this the way @kennethreitz will see it: 90% of people's cases will be sufficiently met by this default, and probably 10% will be affected negatively. He likes to ignore that 10% if possible. (Paraphrasing from one of his talks.)

@slingamn
Contributor

slingamn commented Dec 6, 2012

Is this related to any of the issues discussed in #844?

@sigmavirus24
Contributor

@slingamn Yes, I believe the first item on your list is related.

@Lukasa
Member

Lukasa commented Dec 6, 2012

No, item 4 is the relevant one. =)

@Lukasa
Member

Lukasa commented Dec 6, 2012

10 kB is simply ludicrously large. If you load my website homepage, you'll get only slightly more than 10 kB of data in total. That includes CSS, images, JavaScript, the lot. To use that as the default line size is braindead.

@kennethreitz
Contributor

I agree, this seems foolishly large.

@kennethreitz
Contributor

iter_content's existence, however, is based on the concept of extremely large files.

@Lukasa
Member

Lukasa commented Dec 6, 2012

So leave iter_content as is, and just change the default chunk_size on iter_lines? Everybody wins?

@kennethreitz
Contributor

+1

@Lukasa
Member

Lukasa commented Dec 6, 2012

Got a preference for what default value to use? =)

@sigmavirus24
Contributor

Erp, my mistake; I thought this was all iter_content-related.

@sigmavirus24
Contributor

From #844 it seems like people liked 1024.

@Lukasa
Member

Lukasa commented Dec 6, 2012

It's certainly likely to be better than what we've got. It wouldn't have prevented this issue being raised, though.

@gdamjan
Author

gdamjan commented Dec 6, 2012

1024 is still too much, for example when streaming CouchDB's changes feed in continuous mode.
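
To illustrate that use case: a continuous _changes feed emits one small JSON document per line and can go quiet for long stretches, so any chunk size much larger than a typical line delays delivery. A hedged sketch of such a consumer (the URL, database name, and parameters are placeholders, and stream=True is the post-1.0 API, not what this thread was running):

import json
import requests

# Hypothetical CouchDB instance; nothing here comes from the original report.
url = 'http://localhost:5984/mydb/_changes'
params = {'feed': 'continuous', 'heartbeat': 10000}

resp = requests.get(url, params=params, stream=True)

# Keeping chunk_size no larger than a typical change line hands each change
# to the application as soon as it arrives instead of letting it sit in a
# 10 kB buffer.
for line in resp.iter_lines(chunk_size=64):
    if line:  # heartbeats arrive as empty lines
        change = json.loads(line)
        print(change.get('id'))
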

@sigmavirus24
Contributor

We can also do 512, 256, 128, 64, 32, 16, or 8. Only the last of which would have prevented this issue. Pick your poison. :P

@Lukasa
Member

Lukasa commented Dec 6, 2012

@sigmavirus24: Good point well made. =D

Nevertheless, I'd be inclined to go slightly smaller. 512 is tempting.

@gdamjan
Author

gdamjan commented Dec 6, 2012

What are the usage scenarios of iter_lines?
In my scenarios I don't see lines as long as 512 bytes; at most they are around 100 bytes.

@slingamn
Contributor

slingamn commented Dec 7, 2012

Oh, OK, I get the context now.

One thing I've been meaning to get around to: understanding _fileobject.readline() from Python's socket module. This is an API to wrap a socket and read lines from it like you would from a regular file. Here's a Gist of the code for convenience: https://gist.github.com/4231260

It looks like it reads a byte at a time in some cases and self._rbufsize bytes (default 8192) at a time in others. It also looks like it's doing something clever. Maybe we need something like this.
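
For reference, the pattern being described is a buffered readline: keep an internal byte buffer, refill it in reasonably large reads, and slice complete lines out of it. A minimal sketch of the idea (not the socket module's actual code, and not what requests ended up shipping):

class LineBuffer(object):
    """Minimal buffered line reader over any object with a read(n) method."""

    def __init__(self, raw, bufsize=8192):
        # Works well when read(n) may return *fewer* than n bytes as soon as
        # data is available (like socket.recv); it does not help if read(n)
        # insists on filling the whole buffer before returning.
        self.raw = raw
        self.bufsize = bufsize
        self.buf = b''

    def readline(self):
        while b'\n' not in self.buf:
            data = self.raw.read(self.bufsize)
            if not data:  # EOF: flush whatever is left
                line, self.buf = self.buf, b''
                return line
            self.buf += data
        line, sep, self.buf = self.buf.partition(b'\n')
        return line + sep
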

@sigmavirus24
Contributor

You mean buffer the stream so we can return the default sizes? I was thinking of this but I realized it isn't realistic for every situation.

@Lukasa
Member

Lukasa commented Jan 22, 2013

Resolved (finally!) by #1122.

Lukasa closed this as completed Jan 22, 2013