PhantomJSFetcher returns -1 StatusCode #383
Comments
The cause is PhantomJS itself: for these URLs it never returns, and PhantomJSFetcher keeps waiting for it to finish.
OS: Linux (Deb) |
Are you using it only for HTML pages (best) or ALL pages? One thing you can try is setting up a short-enough timeout on PhantomJS. You can play with the
See if it makes any difference. Otherwise please share a URL that can help reproduce the problem if you can. |
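To illustrate the timeout suggestion above, here is a minimal sketch of how a wrapper could stop waiting on an external process that never exits. This is not the collector's actual implementation; the class `TimedCommand` and the method `finishesWithin` are hypothetical names, and the only real APIs used are the standard `ProcessBuilder` and `Process.waitFor(long, TimeUnit)`.

```java
import java.io.IOException;
import java.util.List;
import java.util.concurrent.TimeUnit;

// Hypothetical helper, not part of the collector: runs a command and
// gives up (killing the process) if it does not exit in time.
final class TimedCommand {

    /**
     * Returns true if the command exited within timeoutMillis,
     * false if it had to be killed because it was still running.
     */
    static boolean finishesWithin(List<String> command, long timeoutMillis)
            throws IOException, InterruptedException {
        Process process = new ProcessBuilder(command)
                .redirectErrorStream(true)
                // Discard output so a full pipe buffer cannot block the child.
                .redirectOutput(ProcessBuilder.Redirect.DISCARD)
                .start();

        // waitFor(timeout, unit) returns false when the timeout elapses first.
        if (!process.waitFor(timeoutMillis, TimeUnit.MILLISECONDS)) {
            process.destroyForcibly();
            process.waitFor(); // reap the killed process
            return false;
        }
        return true;
    }
}
```

With something like this in place, a hung PhantomJS run would surface as a timeout the caller can log and skip, rather than blocking the crawl thread indefinitely.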
Some pages return a 'fail' status even though page.content is collected properly (200 response code), so I modified phantom.js as follows:
I don't know why PhantomJS misbehaves on them. Anyway, it now works in that situation (content is crawled but PhantomJS returns 'fail'). I sent the URLs by email. Thank you.
|
Thanks! I have added your |
Done! It works well. I added '-1' to validStatusCode so that URLs returning '-1' together with a 200 response are handled. Thank you. |
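For readers wondering what adding '-1' to the valid status codes amounts to, here is a minimal sketch of the idea. It is an assumption about the general shape of such a check, not the collector's real code; `StatusPolicy`, `isAccepted`, and `isUsable` are hypothetical names.

```java
import java.util.Set;

// Hypothetical illustration of a "valid status codes" check.
final class StatusPolicy {

    /** A fetch is accepted when its status code is in the configured set. */
    static boolean isAccepted(int statusCode, Set<Integer> validStatusCodes) {
        return validStatusCodes.contains(statusCode);
    }

    /**
     * Stricter variant: also require that some content was actually
     * collected, so an accepted -1 with an empty body is still rejected.
     */
    static boolean isUsable(int statusCode, String content,
                            Set<Integer> validStatusCodes) {
        return isAccepted(statusCode, validStatusCodes)
                && content != null
                && !content.isEmpty();
    }
}
```

With only 200 in the set, a -1 result is rejected; with -1 added, it is accepted. Because -1 may also be reported for fetches that genuinely failed, a content check like `isUsable` is one possible safeguard.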
Great! Thanks for confirming. |
I found that PhantomJS returns a -1 status code for some URLs
(even though the connection succeeded).
I set a breakpoint in CmdGrabber to figure out the problem.
URLs returning a -1 status code produced no output, so the grabber couldn't process them.
My guess is that a JavaScript error on the page prevents 'phantom.js' from working.
I'll send the URLs by email if you can check them.
Thank you.