Can't get octet-stream response | net::ERR_ABORTED #2114

drizzle-mizzle · 2023-03-24T23:55:27Z

Description

When I try to GoToAsync on a page that responds with a application/octet-stream content type:

In headless mode, it throws up PuppeteerSharp.NavigationException: net::ERR_ABORTED.
In non-headless mode, it interprets response as a file that needs to be downloaded, throws up the same exception as in headless mode, but successfully downloads response as a binary file without extension to Downloads directory.

Minimal example reproducing the issue

using var browserFetcher = new BrowserFetcher();
await browserFetcher.DownloadAsync();
browser = await Puppeteer.LaunchAsync(new() { Headless = true} );

var page = await browser.NewPageAsync();
await page.SetRequestInterceptionAsync(true);

page.Request += (s, e) =>
{
    // sets POST method, adds some headers and binds serialized data
    var payload = CreateRequestPayload(HttpMethod.Post, data); 

    await e.Request.ContinueAsync(payload);
};

var response = await page.GoToAsync(url);
var content = await response.TextAsync();

Expected behavior:

As I know that in my particular case, this application/octet-stream response actually is just a text string without extension, I expect that var content will have this text data.
For example,

fetch(_same_request_).then((response) => response.text())

works absolutely fine, but sadly I can't use it because of a cloudflare protection. I've tested it in my normal browser, and it worked, but failed with EvaluateFunctionAsync.

Actual behavior:

PuppeteerSharp.NavigationException: net::ERR_ABORTED at _my_url_ at _my_url_
 ---> PuppeteerSharp.NavigationException: net::ERR_ABORTED at _my_url_
   at PuppeteerSharp.FrameManager.NavigateAsync(CDPSession client, String url, String referrer, String frameId) in C:\projects\puppeteer-sharp\lib\PuppeteerSharp\FrameManager.cs:line 197
   at PuppeteerSharp.FrameManager.NavigateFrameAsync(Frame frame, String url, NavigationOptions options) in C:\projects\puppeteer-sharp\lib\PuppeteerSharp\FrameManager.cs:line 79
   --- End of inner exception stack trace ---
   at PuppeteerSharp.FrameManager.NavigateFrameAsync(Frame frame, String url, NavigationOptions options) in C:\projects\puppeteer-sharp\lib\PuppeteerSharp\FrameManager.cs:line 89
Call finished

As I mentioned, in non-headless mode I actually still can get needed data, though I'll need to catch this exception and open it (data) as a file from my download directory. But I really need it to work in headless mode.

Versions

9.0.2 / net7.0

The text was updated successfully, but these errors were encountered:

drizzle-mizzle · 2023-03-25T00:05:57Z

I'm not sure if it's appropriate to mention the exact service that I'm trying to scrap bypassing the cloudflare, and how to reproduce the exact same request, but, if you'll need that data, just tell me.

kblok · 2023-03-27T13:16:42Z

net::ERR_ABORTED comes from the browser. I would try to implement the new headless mode, to see if that fixes it.

drizzle-mizzle · 2023-03-27T14:18:07Z

What do you mean by new headless mode?
(Or was it not addressed to me?)

kblok · 2023-03-27T14:45:30Z

@drizzle-mizzle this.

drizzle-mizzle · 2023-03-27T14:50:46Z

Ah, never knew about it. Thanks, I'll try it today.

kblok · 2023-03-27T14:53:18Z

You can try that out in puppeteer (node.js). We need to make a few changes to support this in .NET

amaitland · 2023-03-28T20:27:38Z

In headless mode, it throws up PuppeteerSharp.NavigationException: net::ERR_ABORTED.

For application/octet-stream then net::ERR_ABORTED is exactly what I'd expect. Chromium aborts displaying the page and triggers a download.

but successfully downloads response as a binary file without extension to Downloads directory.

You should be able to set the download path to achieve the same behaviour. Browser.setDownloadBehavior allows for specifying a folder for downloads.

Puppeteer itself doesn't yet support any of the download related methods/events

I would try to implement the new headless mode, to see if that fixes it.

I'd be very surprised if new headless changed the behaviour.

drizzle-mizzle · 2023-03-31T10:00:41Z

You should be able to set the download path to achieve the same behaviour.

In headless mode it just won't start download at all, so it's doesn't really matter.

drizzle-mizzle · 2023-03-31T12:13:23Z

Sorry, I was wrong. I've simply set setDownloadBehavior improperly. Now it works.

drizzle-mizzle · 2023-03-31T12:19:53Z

Well, my problem is solved on this rate. Thanks for the tip.

Though, I'm not sure if this issue should be closed or not. You say it's a Chromium problem in the first place, not a Puppeteer, but maybe at least it should be handled some other way? With more obvious exception type.

kblok closed this as completed Apr 2, 2023

drizzle-mizzle mentioned this issue Apr 7, 2023

1.1+ - Roadmap (bug fixes and more) realcoloride/node_characterai#17

Closed

14 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Can't get octet-stream response | net::ERR_ABORTED #2114

Can't get octet-stream response | net::ERR_ABORTED #2114

drizzle-mizzle commented Mar 24, 2023

drizzle-mizzle commented Mar 25, 2023

kblok commented Mar 27, 2023

drizzle-mizzle commented Mar 27, 2023 •

edited

Loading

kblok commented Mar 27, 2023

drizzle-mizzle commented Mar 27, 2023

kblok commented Mar 27, 2023

amaitland commented Mar 28, 2023

drizzle-mizzle commented Mar 31, 2023 •

edited

Loading

drizzle-mizzle commented Mar 31, 2023

drizzle-mizzle commented Mar 31, 2023

Can't get octet-stream response | net::ERR_ABORTED #2114

Can't get octet-stream response | net::ERR_ABORTED #2114

Comments

drizzle-mizzle commented Mar 24, 2023

Description

Minimal example reproducing the issue

Expected behavior:

Actual behavior:

Versions

drizzle-mizzle commented Mar 25, 2023

kblok commented Mar 27, 2023

drizzle-mizzle commented Mar 27, 2023 • edited Loading

kblok commented Mar 27, 2023

drizzle-mizzle commented Mar 27, 2023

kblok commented Mar 27, 2023

amaitland commented Mar 28, 2023

drizzle-mizzle commented Mar 31, 2023 • edited Loading

drizzle-mizzle commented Mar 31, 2023

drizzle-mizzle commented Mar 31, 2023

drizzle-mizzle commented Mar 27, 2023 •

edited

Loading

drizzle-mizzle commented Mar 31, 2023 •

edited

Loading