-
Notifications
You must be signed in to change notification settings - Fork 83
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
No content is retrieved, potential error at the readHTMLTable stage. #23
Comments
Looks like the problem is still there. I'm not sure if I can be more helpful but it may be a good idea to look into this. I've been able to reproduce the problem using a different scholar profile, and on a different computer. Not sure where exactly the problem resides but it would seem that when trying to read the table from the page using
|
I can confirm that I see this. See also http://stackoverflow.com/questions/33741372/google-server-gives-a-server-error-with-the-first-request-in-private-browsing-mo That SO post notes that repeating the request bypasses the issue. Doing:
worked for me the second time, while getURL never worked. |
I can confirm that this works indeed. if What does @jkeirstead recommend in terms of mending this issue? Should the functions be written so that a test for content retrieval is performed and if this fails a pull of the content using the method outlined above (twice) is performed and the rest of the function runs on the content of the object? Not sure how long this issue with google will remain, seems to have been quite some days already. But seems like a fix, that if judged needed I'd be happy to help fixing :) |
I visited here from Stackoverflow.com. Your R package is pretty nice and it seems we have the same issue from Google now on. I would like to discuss any ideas for solving it and share any things that is helpful. |
Thanks for raising this issue and posting the fix. I'm inclined to wait to see if Google fixes this since that's what the error message suggests. |
Makes sense. If it take too long and want the fixes implemented I'll be happy to help. |
Ok guys, great! Just noticed the bug looking at an empty citation history plot on my personal blog. Let's hope they fix this soon! |
"Fixed the issue by having cookies when it requests URLs." see http://stackoverflow.com/questions/33741372/google-server-gives-a-server-error-with-the-first-request-in-private-browsing-mo |
That makes sense. I think httr GET looks after the cookie state. Sent from my iPhone
|
Thanks @LechMadeyski. That does indeed seem to be the problem; will try to get a fix out shortly. |
This has now been fixed and the latest version is available on dev; a CRAN release should be out very soon. For those who are curious, the problem was that cookies have to be accepted in order to access the content. The package now performs a one-off check for a dummy URL and then maintains a persistent Curl handle for future queries. |
It appears the issue is back, or at least for me. I try to compile data from several colleagues (so multiple get_profile() queries) and I got randomly stuck with the Any ideas how to fix this, or any workaround? |
I also have the same issue. Does anyone know how to fix it?
|
As of today (3AM CET, as the earliest measured occurrence):
Error in tables[[1]] : subscript out of bounds
[1] year cites
<0 rows> (or 0-length row.names)
I suspect something has changed in the google API?
Just tried figuring out (but I'm not that skilled) however a pull of the XML content like so using RCurl
returns a bunch of source code but with an error message at the end:
I'll let you decide if this is worth closing this case. I imagine it is, but there may be more to it that someone more expert can check out.
The text was updated successfully, but these errors were encountered: