Problem with use of gutenberg_download function #8

Closed
joelcourtheyn opened this issue Feb 24, 2017 · 13 comments

Comments

@joelcourtheyn

I'm receiving the following error message when trying to download. Can someone give advice?

library(gutenbergr)
wuthering_heights <- gutenberg_download(768)
Determining mirror for Project Gutenberg from http://www.gutenberg.org/robot/harvest
Using mirror http://www.gutenberg.lib.md.us
Warning messages:
1: In .f(.x[[i]], ...) :
Could not download a book at http://www.gutenberg.lib.md.us/7/6/768/768.zip
2: Unknown column 'text'
3: In is.na(text) : is.na() applied to non-(list or vector) of type 'NULL'

@crogger62

Me too. I wonder if the host is down?

@frnz347

frnz347 commented Feb 24, 2017

I have the same problem :(

@dgrtwo
Collaborator

dgrtwo commented Feb 24, 2017

Unfortunately this Project Gutenberg mirror appears to be down today (e.g. see here).

The good news is that you can get around it by choosing another mirror from this page and giving it as the mirror argument. For example, please try:

wuthering_heights <- gutenberg_download(768, mirror = "http://mirrors.xmission.com/gutenberg/")

I'll try the original mirror again tomorrow, and if the problem continues I'll try contacting Project Gutenberg. (If that doesn't work I'll try to make an easier way to set backup mirrors in the software).
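Until backup mirrors are supported in the package itself, the workaround above can be wrapped in a small loop. This is only a minimal sketch: `download_with_fallback` is a hypothetical helper, not part of gutenbergr, and it assumes that a failed download shows up as a result with no rows or no `text` column (which is what the warnings in this thread suggest). The mirror URLs are the ones mentioned in this thread.

```r
# Hypothetical helper (not part of gutenbergr): try each mirror in turn and
# return the first result that actually contains text.
download_with_fallback <- function(id, mirrors,
                                   download = gutenbergr::gutenberg_download) {
  for (mirror in mirrors) {
    result <- tryCatch(
      suppressWarnings(download(id, mirror = mirror)),
      error = function(e) NULL
    )
    # Assumption: a failed download yields no rows or no `text` column.
    ok <- is.data.frame(result) && nrow(result) > 0 &&
      "text" %in% names(result)
    if (ok) {
      message("Downloaded book ", id, " from ", mirror)
      return(result)
    }
  }
  stop("Could not download book ", id, " from any of the supplied mirrors")
}

# Example call; it needs network access and gutenbergr, so it only runs
# in an interactive session.
if (interactive()) {
  mirrors <- c("http://mirrors.xmission.com/gutenberg/",
               "http://www.gutenberg.org/dirs/")
  wuthering_heights <- download_with_fallback(768, mirrors)
}
```

The `download` argument exists only so the loop can be tried with a stand-in function when no mirror is reachable; in normal use it can be left at its default.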

@frnz347

frnz347 commented Feb 24, 2017

@dgrtwo thanks, it worked for me

@joelcourtheyn
Author

@dgrtwo Thanks for your quick reply, it worked!
I had already guessed that something was wrong with the connection between my computer and the server.
Congratulations to you and your colleague Julia Silge on your new book, Text Mining with R!
I discovered quite a lot of things that were new to me, such as the very interesting gutenbergr package.

@dgrtwo
Collaborator

dgrtwo commented Mar 28, 2017

FYI, I've committed a change to dev (26f0639) that fixes this. Planning to submit to CRAN shortly.

I've emailed Project Gutenberg twice and they haven't replied.

dgrtwo closed this as completed Mar 28, 2017
@dgrtwo
Collaborator

dgrtwo commented Apr 17, 2017

I haven't yet submitted to CRAN but Project Gutenberg was quite helpful and changed the default mirror on the site. This should now work (including in the CRAN version)!

@vs6f8

vs6f8 commented Mar 29, 2019

This is still not working. I am getting the same error.

@crogger62

Both of these worked for me:
wuthering_heights <- gutenberg_download(768, mirror = "http://mirrors.xmission.com/gutenberg/")
and
wuthering_heights <- gutenberg_download(768, mirror = "http://mirrors.xmission.com/gutenberg/")

What error are you getting?

@vs6f8

vs6f8 commented Mar 29, 2019

Hi, I tried it again and am getting the same error. The error is as follows:
[screenshot of the error message; image not preserved]

@crogger62

What happens if you navigate to the URL in the red text: "http://mirrors.xmission.com/gutenberg//7/6/768/768.zip"? You should end up downloading 768.zip, which contains the text of Wuthering Heights.

Your second warning "Unknown or uninitialised column: 'text'." seems to be a problem with getting the data into the data frame (tbl_df). One of the columns is called 'text' and it looks like there is a problem getting it there.

I'd suggest making sure your packages are up to date. Make sure you're loading dplyr too:
library(dplyr)
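To check a mirror by hand, it can help to rebuild the zip URL from the book ID. The sketch below is inferred from the URL in the warning (768 becomes 7/6/768/768.zip) and is not taken from the gutenbergr source: `book_zip_url` is a hypothetical helper, and the rule for single-digit IDs is an assumption. It also strips the trailing slash from the mirror, which avoids the "//" seen in the warning.

```r
# Hypothetical helper: rebuild the zip URL that appears in the warning.
# Assumption: every digit of the ID except the last becomes a directory,
# followed by a directory named after the full ID.
book_zip_url <- function(id, mirror) {
  id_chr <- sprintf("%d", as.integer(id))
  digits <- strsplit(id_chr, "")[[1]]
  dirs <- digits[-length(digits)]
  # Assumption: single-digit IDs live under a "0" directory.
  if (length(dirs) == 0) dirs <- "0"
  # Drop trailing slashes so the result has no "//" after the mirror.
  base <- sub("/+$", "", mirror)
  paste(c(base, dirs, id_chr, paste0(id_chr, ".zip")), collapse = "/")
}

url <- book_zip_url(768, "http://mirrors.xmission.com/gutenberg/")
print(url)
# [1] "http://mirrors.xmission.com/gutenberg/7/6/768/768.zip"
```

If that URL opens in a browser but gutenberg_download still fails, the problem is more likely on the R side (package versions) than with the mirror.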

@ccpsilva

I have the same problem as the one reported by vs6f8 (comment of Mar 29, 2019). I have tried several repositories and still get the same error. Could someone who had this problem and managed to solve it give me a hand?

@lejarx

lejarx commented Jan 20, 2021

Thought this would still be useful: the default aleph mirror doesn't work.

Available mirrors are at
https://www.gutenberg.org/MIRRORS.ALL

I'm using "http://www.gutenberg.org/dirs/" and it works.


7 participants