@rrgerber, can you provide a reproduction repo including your config and, importantly, your lockfile? "request" is not a dependency of this project, so there is no reason it should be getting called.
paul@Pauls-MacBook-Pro node-website-scraper % npm ls request
website-scraper@5.2.0 /Users/paul/Documents/GitHub/node-website-scraper
└── (empty)
In addition to @phawxby's suggestion, you can try taking a look at the logs.
Please also note that callbacks have not been supported since version 5.0.0. Please use async/await or promises instead; see the usage examples.
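For reference, a promise-based call might look roughly like the sketch below. The output directory and the runScrape helper are made up for illustration and are not taken from the reporter's config. The sketch also assumes that scrape() resolves to an array of saved resources. The scrape function is passed in as an argument so the wrapper can be tried without network access.

```javascript
// Sketch only: promise-based invocation for website-scraper >= 5.0.0.
// The option values are illustrative assumptions, not the reporter's setup.
const options = {
  urls: ['https://www.isca-speech.org/archive'],
  directory: './isca-archive', // hypothetical path; should not exist before the run
};

// Hypothetical helper: awaits the scraper and reports success or failure
// instead of relying on a callback argument.
async function runScrape(scrape, opts) {
  try {
    const resources = await scrape(opts); // assumed to resolve to an array
    return { ok: true, count: resources.length };
  } catch (err) {
    return { ok: false, message: err.message };
  }
}

// Real usage (needs `npm install website-scraper`; the package is ESM-only,
// so it is loaded with a dynamic import here):
//   const { default: scrape } = await import('website-scraper');
//   console.log(await runScrape(scrape, options));
```

If the rejected promise carries the same RangeError, the message field should show it without the process dying, which may make it easier to see which code path is actually pulling in "request".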
Configuration
version: [result of npm ls website-scraper --depth 0 command]
`-- website-scraper@5.2.0
options: [provide your full options object]
Description
When trying to scrape the site https://www.isca-speech.org/archive, an error is thrown in the dependency "request" pretty early on, and the program dies.
Expected behavior: [What you expect to happen]
I expect at least the top-level html files saved to my filesystem.
Actual behavior: [What actually happens]
Here is what actually happens:
$ node --max-old-space-size=24000 main.js
/home/rich/node_modules/request/request.js:1147
response.body = strings.join('')
^
RangeError: Invalid string length
at Array.join (native)
at Request.<anonymous> (/home/rich/node_modules/request/request.js:1147:31)
at Request.emit (events.js:198:13)
at IncomingMessage.<anonymous> (/home/rich/node_modules/request/request.js:1083:12)
at Object.onceWrapper (events.js:286:20)
at IncomingMessage.emit (events.js:203:15)
at endReadableNT (_stream_readable.js:1129:12)
at process._tickCallback (internal/process/next_tick.js:63:19)
Additional Information
[Any additional information, configuration or data that might be necessary to reproduce the issue]
The behavior is at least repeatable on Windows 11.