Follow redirect responses #60

bgreni · 2024-09-13T00:25:26Z

Fixes #48

Follow redirect URLs in the Mojo client implementation.

lightbug_http/sys/client.mojo

saviorand · 2024-09-13T16:24:30Z

Thanks, wanted to share some tips but it looks like you've been able to start just fine! Will review today and tomorrow

bgreni · 2024-09-13T17:24:55Z

Thanks, wanted to share some tips but it looks like you've been able to start just fine! Will review today and tomorrow

It's a very approachable codebase I had no issues at all really, well done!

saviorand · 2024-09-14T17:35:09Z

@bgreni looks great, I've just merged main since there was a new release today and fixed tests.
I'm trying out your current implementation with a couple examples, thinking to also write some new unit tests for this.
Right now if I replace the URI in client.mojo with http://httpbin.org/status/302 and run magic run mojo client.mojo it works (I get a 200). In theory the redirect path for this is:

http://httpbin.org/status/302
302 FOUND
/redirect/1
302 FOUND
/get
200 OK

One thing I noticed is the Content-Length and connection-close header values are off , e.g. according to redirect-checker the content-length for the last call should be 323 , but I'm getting 9593. This likely has nothing to do with your code and is a bug in Lightbug's (🥁) client, but I thought it would be interesting to fix it.

Another case I found where it breaks is if I use http://google.com as the URI in client.mojo . Then getaddrinfo seems to crash, likely an issue with how the strings are being handled when parsing the URI. Again, nothing to do with the follow-redirect functionality but we can try to debug to properly test follow-redirects as well.

If you are interested feel free to try and debug on your end, I am currently writing tests and can also help debugging if needed. Thanks again for this great contribution!

saviorand · 2024-09-15T10:13:26Z

lightbug_http/header.mojo

+struct StatusCode:
+    alias OK = 200
+    alias MOVED_PERMANENTLY = 301
+    alias FOUND = 302
+    alias TEMPORARY_REDIRECT = 307
+    alias PERMANENT_REDIRECT = 308
+    alias NOT_FOUND = 404


Potentially we could also handle 303 (See Other), 300 (Multiple Choices) and 304 (Not modified), although the logic for latter two might be more complicated. But can also leave it for now and implement in the future

https://developer.mozilla.org/en-US/docs/Web/HTTP/Redirections

bgreni · 2024-09-16T02:12:45Z

@saviorand Thank you for the review and testing effort, I'll look into those issues asap

bgreni · 2024-09-21T02:24:18Z

Ended up just reimplementing this due my own drastic changes. Also fixed an issue where the path query string was not being added to the request headerline.

The issue with Content-Length that you mentioned is still present, and when I tried google.com the location header is http://www.google.com and it seems to just redirect loop forever? (careful testing that google ip banned me for a few hours lol)

saviorand · 2024-09-21T14:22:28Z

@bgreni I think currently content-length is always zero in the HTTPResponse , at least when I'm making a request from the client. If I comment out this line it reads the content-length correctly, but not sure what happens in this case if the header is not present in the response

fn __init__(
        inout self,
        body_bytes: Bytes,
        headers: Headers = Headers(),
        status_code: Int = 200,
        status_text: String = "OK",
        protocol: String = strHttp11,
    ):
        self.headers = headers
        if HeaderKey.CONTENT_TYPE not in self.headers:
            self.headers[HeaderKey.CONTENT_TYPE] = "application/octet-stream"
        self.status_code = status_code
        self.status_text = status_text
        self.protocol = protocol
        self.body_raw = body_bytes
        self.set_connection_keep_alive()
        # self.set_content_length(len(body_bytes)) - comment this out, otherwise content-length = 0

I think it's set to 0 because of setting body to Bytes() here, and then it's not updated after parsing the actual body

fn from_bytes(owned b: Bytes) raises -> HTTPResponse:
        var reader = ByteReader(b^)

        var headers = Headers()
        var protocol: String
        var status_code: String
        var status_text: String

        try:
            protocol, status_code, status_text = headers.parse_raw(reader)
        except e:
            raise Error("Failed to parse response headers: " + e.__str__())

        var response = HTTPResponse(
            Bytes(),
            headers=headers,
            protocol=protocol,
            status_code=int(status_code),
            status_text=status_text,
        )

saviorand · 2024-09-24T08:39:04Z

@bgreni I wonder if the infinite loop with google has to do with an https redirect they have 🤔 although httpbin seems to work fine. Lightbug doesn't have TLS/HTTPS support yet

bgreni · 2024-09-25T04:44:18Z

I think it's set to 0 because of setting body to Bytes() here, and then it's not updated after parsing the actual body

Yes it would appear I forgot to include that logic on the response side...

bgreni · 2024-09-25T05:13:32Z

@saviorand Seems the loop was related to not updating the Host header after receiving a redirect specifying a new hostname. I end up getting a 200 but no content, which I assume could also be related to missing headers google requires?

saviorand · 2024-09-25T17:19:23Z

@bgreni when I print the buffer after reading from the connection like so:

 var bytes_recv = conn.read(new_buf)
        print("new_buf:", String(new_buf))

I see the body, so the issue has to be with how we're then adding it to HTTPResponse or reading it later down the line?

bgreni · 2024-09-25T18:26:12Z

@bgreni when I print the buffer after reading from the connection like so:
 var bytes_recv = conn.read(new_buf)
        print("new_buf:", String(new_buf))
I see the body, so the issue has to be with how we're then adding it to HTTPResponse or reading it later down the line?

Ah I see google is using the Transfer-Encoding: chunked header, which is currently not supported (I don't think it was before the refactor either?). Maybe we should create a separate ticket for that and leave that test case commented out for now?

Signed-off-by: Brian Grenier <grenierb96@gmail.com>

saviorand · 2024-09-25T20:58:08Z

@bgreni yup, that was not supported before the refactor as well. Let's merge 🔥

bgreni commented Sep 13, 2024

View reviewed changes

lightbug_http/sys/client.mojo Outdated Show resolved Hide resolved

bgreni force-pushed the follow-redirects branch from b0946c9 to 187a055 Compare September 13, 2024 16:10

bgreni marked this pull request as ready for review September 13, 2024 16:10

bgreni force-pushed the follow-redirects branch from 187a055 to 16288cf Compare September 13, 2024 17:23

saviorand reviewed Sep 15, 2024

View reviewed changes

bgreni force-pushed the follow-redirects branch from 7eadc6a to 6a4ca35 Compare September 21, 2024 02:20

bgreni force-pushed the follow-redirects branch from 6a4ca35 to 1ea5542 Compare September 21, 2024 03:26

bgreni force-pushed the follow-redirects branch from 1ea5542 to 3d2b60e Compare September 25, 2024 05:11

follow redirects

973c883

Signed-off-by: Brian Grenier <grenierb96@gmail.com>

bgreni force-pushed the follow-redirects branch from 3d2b60e to 973c883 Compare September 25, 2024 18:47

saviorand merged commit e695225 into Lightbug-HQ:main Sep 25, 2024

bgreni mentioned this pull request Sep 26, 2024

Cache persistent connections in client #65

Closed

Follow redirect responses #60

Follow redirect responses #60

Uh oh!

Conversation

bgreni commented Sep 13, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

saviorand commented Sep 13, 2024

Uh oh!

bgreni commented Sep 13, 2024

Uh oh!

saviorand commented Sep 14, 2024

Uh oh!

saviorand Sep 15, 2024

Choose a reason for hiding this comment

Uh oh!

bgreni commented Sep 16, 2024

Uh oh!

bgreni commented Sep 21, 2024

Uh oh!

saviorand commented Sep 21, 2024

Uh oh!

saviorand commented Sep 24, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

bgreni commented Sep 25, 2024

Uh oh!

bgreni commented Sep 25, 2024

Uh oh!

saviorand commented Sep 25, 2024

Uh oh!

bgreni commented Sep 25, 2024

Uh oh!

saviorand commented Sep 25, 2024

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

bgreni commented Sep 13, 2024 •

edited

Loading

saviorand commented Sep 24, 2024 •

edited

Loading