Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Make aclImdb/alldata-id.txt availible to Doc2vec notebook can be run #488

Closed
cbonnett opened this issue Oct 18, 2015 · 5 comments
Closed

Comments

@cbonnett
Copy link

I can not seem to find the aclImdb/alldata-id.txt that is used in the Doc2vec.
Am I blind ? or Is the file not available ?

@cbonnett
Copy link
Author

Or a link to where one can get the file ?

@gojomo
Copy link
Collaborator

gojomo commented Oct 18, 2015

Cell # 1 of the notebook will fetch (and unpack) the data if not present at the expected location – see the line:

wget --quiet http://ai.stanford.edu/~amaas/data/sentiment/aclImdb_v1.tar.gz

(I've just checked, it's still available at that URL.)

If you don't have wget installed, but do have curl, you should be able to replace wget --quiet with curl -O.

@cbonnett
Copy link
Author

So I was blind ! Thanks for the clarification.

@rahul-ka
Copy link

Do you have the original tar file that you downloaded? I think the download has since been changed and the notebook isn't working as intended anymore.

@gojomo
Copy link
Collaborator

gojomo commented Jun 22, 2018

@rahul-ka What error are you getting? (It appears to me http://ai.stanford.edu/~amaas/data/sentiment/aclImdb_v1.tar.gz is still available.)

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants