Don't return incomplete documents for HTTP.processDocuments #666

aurimasv · 2015-03-08T13:42:28Z

@adam3smith emailed me about an issue with Aleph translator. On https://aleph.univie.ac.at a ZU.processDocuments call to retrieve MARC metadata was returning an incomplete page (only a few META elements from the HEAD). The processDocuments call was accompanied by the following message in the Browser console:

The page was reloaded, because the character encoding declaration of the HTML document was not found when prescanning the first 1024 bytes of the file. The encoding declaration needs to be moved to be within the first 1024 bytes of the file.

So, what was happening was that the page was being loaded, the loading was stopped (firing a pageshow event, but not the load event), and Zotero was returning the incomplete document to translator (just past the META element with charset declaration).

This patch takes advantage to the load event not being fired in these cases to detect incomplete page loads. However, as commented here the same behavior is obtained for network errors (don't think these were being handled anyway), so we introduce some additional checks for those cases.

Probably doesn't work on Mac OS X

aurimasv added 3 commits March 8, 2015 06:06

Don't return incomplete documents for HTTP.processDocuments

6f0fb2d

Detect network errors in ZU.processDocuments

1bdaf2f

Close scraping progress window when closing browser

14b9cab

Probably doesn't work on Mac OS X

aurimasv mentioned this pull request Mar 17, 2015

Fix error handling when saving snapshots #679

Open

aurimasv referenced this pull request Jun 7, 2015

Some snapshot-related fixes

38eeab0

aurimasv mentioned this pull request Jun 11, 2015

Snapshots for some pages are incomplete #759

Closed

dstillman closed this Mar 3, 2021

dstillman deleted the branch zotero:4.0 March 3, 2021 08:51

dstillman reopened this Mar 3, 2021

dstillman deleted the branch zotero:4.0 April 5, 2023 07:34

dstillman closed this Apr 5, 2023

dstillman reopened this Apr 6, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Don't return incomplete documents for HTTP.processDocuments #666

Don't return incomplete documents for HTTP.processDocuments #666

aurimasv commented Mar 8, 2015

Don't return incomplete documents for HTTP.processDocuments #666

Are you sure you want to change the base?

Don't return incomplete documents for HTTP.processDocuments #666

Conversation

aurimasv commented Mar 8, 2015