-
Notifications
You must be signed in to change notification settings - Fork 58
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Add PanCanAtlas Data Download #75
Conversation
uncompressed md5sum is equivalent
README.md
Outdated
# Create and activate conda environment | ||
conda env create --quiet --force --file environment.yml | ||
source activate pancancer-classifier | ||
git clone git@github.com:greenelab/pancancer.git |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Is this sufficient to get the data from git-lfs?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I believe if the user has Git LFS installed, this command will download LFS assets... probably worth testing, and perhaps mentioning in README to make sure to have git-lfs installed. Alternatively you can add https://anaconda.org/conda-forge/git-lfs to the environment. (clone comes before env install, so that doesn't make sense)
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Yeah, I think that is how it would work too. I will add that note here as well
Adding instructions to download raw, versioned data. These raw files are not tracked, but the processed datasets used in Way et al. 2018 are added here.