How to check if range-requests are in use? #8422

rogierlommers · 2017-05-18T12:58:35Z

We are having a hard time serving large PDFs to our customers with pdf.js. Some investigation taught us that "range requests" could fix this. We therefore tried to generate a FastWebView-enabled PDF with Ghostscript:

gs -q -dBATCH -dNOPAUSE -sDEVICE=pdfwrite -dCompatibilityLevel=1.5 -dFastWebView=true

The generated PDF is being served by Apache/2.4.18 which (correct me if I'm wrong) supports range requests.

Now how can I test that pdf.js actually uses range requests?

yurydelendik · 2017-05-18T13:09:03Z

> Now how can I test that pdf.js actually uses range requests?

There is no diagnostic information coming from the PDF.js core yet. However, the browser's network monitor should show 206 responses. If you don't see 206s for files larger than 128 KB, there is a problem with the server: inspect the request and response HTTP headers of the initial XHR.

Please note that some WebKit-based browsers (e.g. Safari) still have a defect with caching such requests, so we are disabling range requests for them.

Closing as answered. Please provide more concrete information or an example if you need a more detailed explanation. See also https://github.com/mozilla/pdf.js/wiki/Frequently-Asked-Questions#range
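
The header inspection suggested above can also be done outside the browser. The following is a minimal sketch, not part of PDF.js: the helper names `parse_content_range`, `describe_response`, and `probe` are hypothetical, and the assumption is simply that a range-capable server answers a `Range` request with `206` plus a `Content-Range` header, and advertises `Accept-Ranges: bytes` together with a `Content-Length` on a plain `200`.

```python
import re
import urllib.request
from typing import Mapping, Optional, Tuple


def parse_content_range(value: str) -> Optional[Tuple[int, int, Optional[int]]]:
    """Parse 'bytes 0-1023/5000' into (first, last, total); total is None for '*'."""
    match = re.fullmatch(r"bytes (\d+)-(\d+)/(\d+|\*)", value.strip())
    if match is None:
        return None
    total = None if match.group(3) == "*" else int(match.group(3))
    return int(match.group(1)), int(match.group(2)), total


def describe_response(status: int, headers: Mapping[str, str]) -> str:
    """Summarize what a status code and headers say about range support."""
    h = {name.lower(): value.strip() for name, value in headers.items()}
    if status == 206:
        parsed = parse_content_range(h.get("content-range", ""))
        if parsed is None:
            return "206 without a valid Content-Range header"
        first, last, total = parsed
        return f"partial content: bytes {first}-{last} of {total if total is not None else 'unknown'}"
    if status == 200:
        if h.get("accept-ranges", "").lower() != "bytes":
            return "full response; server does not advertise byte ranges"
        if "content-length" not in h:
            return "full response; Content-Length missing"
        return "full response; server advertises byte ranges"
    return f"unexpected status {status}"


def probe(url: str, last_byte: int = 1023) -> str:
    """Send one small Range request to `url` and describe the answer."""
    request = urllib.request.Request(url, headers={"Range": f"bytes=0-{last_byte}"})
    with urllib.request.urlopen(request) as response:
        return describe_response(response.status, dict(response.headers.items()))
```

Calling `probe("https://example.com/file.pdf")` (a placeholder URL) against your own server should report partial content if Apache honors the range; a "full response" answer points at the server or a proxy in front of it rather than at PDF.js.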

yurydelendik · 2017-05-18T13:14:35Z

Attaching screenshot for expected range request activity:

rogierlommers · 2017-05-19T13:10:14Z

I see 206 responses, but it looks like pdf.js still fetches the whole PDF. I'm using Chrome on macOS. This is a supported combination, right?

yurydelendik · 2017-05-19T13:17:20Z

> I see 206 responses, but it looks like pdf.js still fetches the whole PDF. I'm using Chrome on macOS. This is a supported combination, right?

Correct.

@rogierlommers PDF.js will attempt to load the entire PDF with the first XHR, and once the headers arrive it will abort that fetch. With local connections you might not see the abort, since the transfer is really fast; try it against a remote server. Also pay attention to caching: it's okay for content to be cached, but that means you might be receiving the entire PDF from the first XHR.

yurydelendik · 2017-05-19T13:31:07Z

(Assuming you guys are working on the same problem) See also #8425

yurydelendik · 2017-05-19T13:35:34Z

Note that the first 200 response is only 4.0 KB in length:

rogierlommers · 2017-05-22T05:46:12Z

Sorry for all my questions, but please have a look at the attached screenshot. As you can see, I get 206s, indicating that range requests are working fine. Right? But for some reason, Chrome is downloading the full PDF while I expect it to load only the first x bytes.

yurydelendik · 2017-05-22T12:29:46Z

> But for some reason, Chrome is downloading the full PDF while I expect it to load only the first x bytes.

I don't understand which file you mean and what you expected. Looking at the 200 response for 9789027673633.pdf, it downloaded 12.7 KB, and the next 206 response fetched 64.3 KB. So the first request did not download the full PDF, unless your file is only 12.7 KB, in which case the subsequent 206 requests/responses look fishy.

rogierlommers · 2017-05-22T12:45:06Z

Then we have a different understanding of this feature. My assumption was that:

- if a PDF is web-optimized,
- pdf.js only downloads the first x bytes
- until the user selects other pages of the document;
- then the bytes corresponding to those pages are downloaded.

Now my conclusion is that:

- pdf.js starts downloading a web-optimized PDF;
- once page 1 has been successfully downloaded, it starts rendering this page client-side
- and continues downloading the remaining bytes of the document (regardless of whether the user has selected/requested those pages).

yurydelendik · 2017-05-22T12:50:37Z

PDF.js has two other options, disableAutoFetch and disableStream. The former stops any further range-request downloading once enough data has been fetched; the latter disables streamed fetching in browsers capable of progressive download. See also #7937 and https://github.com/mozilla/pdf.js/wiki/Debugging-PDF.js#url-parameters
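
As an illustration of passing these options through the viewer URL, here is a small sketch that builds such a link. It assumes that the deployed viewer accepts `disableAutoFetch` and `disableStream` as `name=true` pairs in the URL hash, as the linked wiki page describes; depending on the PDF.js version this may require debugging options to be enabled. The helper `build_viewer_url` and the paths are hypothetical.

```python
from urllib.parse import quote


def build_viewer_url(viewer_path: str, pdf_url: str, **flags: bool) -> str:
    """Build 'viewer.html?file=<pdf>#flag=true&...' for the stock viewer."""
    # The PDF location goes into the 'file' query parameter, fully escaped.
    query = "file=" + quote(pdf_url, safe="")
    # Each boolean flag becomes 'name=true' or 'name=false' in the hash part.
    fragment = "&".join(
        f"{name}={'true' if value else 'false'}" for name, value in flags.items()
    )
    url = f"{viewer_path}?{query}"
    return f"{url}#{fragment}" if fragment else url


if __name__ == "__main__":
    print(build_viewer_url("/web/viewer.html", "/docs/large.pdf",
                           disableAutoFetch=True, disableStream=True))
```

With both flags set, the expectation from the comment above is that only the ranges needed for the pages actually viewed are requested.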

rogierlommers · 2017-05-23T19:43:07Z

Thanks; it all works fine now.

dlandis · 2017-10-18T18:03:39Z

Hi @yurydelendik ,

I'm having a similar issue and I had a question about your comment:

> Note that the first 200 response is only 4.0 KB in length:

So what should the response body be for the initial response with the 200 code, before pdf.js has made a range request? Can the body be empty, for example, as long as there is an Accept-Ranges: bytes response header? Will that trigger pdf.js to make a range request?

Thanks

yurydelendik · 2017-10-18T18:07:29Z

All HTTP responses need to be valid, so the first response must be sent in full until it is cancelled.

dlandis · 2017-10-18T19:31:24Z

@yurydelendik Thanks for your response.

I'm wondering whether performing range requests from the outset (maybe configurable via a param) was ever discussed?

I noticed the RFC says:

> A client MAY generate range requests without having received this header field for the resource involved.

This would potentially help the server (depending on how it was implemented) so it wouldn't have to load the whole document for that initial request.

And then the client, in those cases, wouldn't need to cancel that initial response and then switch to range requests. Wouldn't that be simpler?

Thanks
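
To make the "range requests from the outset" idea concrete, here is a minimal sketch of how a client could split a file into byte ranges itself. It is not how PDF.js works internally; the helper names are hypothetical, and the 65536-byte chunk size is an assumption chosen to be close to the roughly 64 KB 206 responses mentioned earlier in the thread.

```python
from typing import Iterator, Tuple


def chunk_ranges(total_size: int, chunk_size: int = 65536) -> Iterator[Tuple[int, int]]:
    """Yield inclusive (first_byte, last_byte) pairs covering total_size bytes."""
    if total_size < 0 or chunk_size <= 0:
        raise ValueError("total_size must be >= 0 and chunk_size must be > 0")
    start = 0
    while start < total_size:
        # HTTP byte ranges are inclusive, hence the -1 on the end offset.
        end = min(start + chunk_size, total_size) - 1
        yield start, end
        start = end + 1


def range_header(first_byte: int, last_byte: int) -> str:
    """Format the value of an HTTP Range request header."""
    return f"bytes={first_byte}-{last_byte}"


if __name__ == "__main__":
    for first, last in chunk_ranges(150_000):
        print(range_header(first, last))
```

Note that the client still needs the total size from somewhere; one option is to send `bytes=0-65535` first and read the total from the `Content-Range` header of the 206 response.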

yurydelendik · 2017-10-18T19:37:49Z

@dlandis sorry, I don't follow your reasoning. There is an option to override the default behavior: you can implement PDFDataRangeTransport using only HTTP range requests. IMHO it's not possible in the general case.

dlandis · 2017-10-18T19:55:29Z

> you can implement PDFDataRangeTransport with only HTTP range requests

@yurydelendik Thanks again, it sounds like that is what I need. I don't suppose there is an example?

yurydelendik · 2017-10-18T20:08:36Z

See e.g. tests at https://github.com/mozilla/pdf.js/blob/master/test/unit/api_spec.js#L1277

richiepriya · 2019-09-19T08:27:07Z

@yurydelendik I have an issue downloading a PDF using range requests. I am using a Spring Boot application to provide the download service. viewer.html makes a first request, which is cancelled because the service supports range requests, and then initiates a partial request, as expected. However, there are no further requests from the browser: there is just that one, whereas I expect it to keep requesting until the whole PDF is downloaded. Is there any special header that needs to be added to the response so that the browser sends all the requests to the service?

vedidinakar · 2022-08-17T08:33:21Z

We have a 200 MB PDF, and I want to display the first page as soon as it has been downloaded to the browser. Can you please suggest a way to achieve this?

yurydelendik closed this as completed May 18, 2017

yurydelendik mentioned this issue May 22, 2017

Can't enable progressive rendering (i.e. range-request download) #8425

Closed

eltiffster mentioned this issue Nov 27, 2019

PDFjs viewer doesn't use range requests UVicLibrary/Vault#35

Closed

stephanrauh mentioned this issue Aug 14, 2020

Hot to render large PDF files without lazy loading behavior stephanrauh/ngx-extended-pdf-viewer#436

Closed
