Docker #1

Closed
rajewski opened this issue Aug 30, 2022 · 3 comments · Fixed by #5
@rajewski
Owner

Problem

All of the scripts here depend on the UC Riverside HPCC infrastructure for installed software such as MrBayes and GATK. In addition, they rely on a number of rather off-the-map R packages.

From a reproducibility standpoint, it would be nice to wrap all of this into a single Docker container with all dependencies pre-installed.

Solution

Add a Dockerfile to the repo that specifies how to build an image to house the analysis. The image should then be built and made publicly available on Docker Hub.

Additional considerations

Inside the scripts, file paths will need to be changed. Submission scripts will need to be either removed or reworked to call Docker instead. Removal is probably better, since most HPC environments require Singularity rather than Docker and the submission itself is likely to be very environment-specific (Slurm vs. PBS, etc.).
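
A minimal sketch of what "reworked to call Docker" could look like, assuming the data directory is bind-mounted at a fixed path inside the container so that the scripts no longer reference cluster-specific paths. The function name `docker_command`, the `/data` mount point, the image name, and the input file are all hypothetical and do not come from the repo.

```python
import shlex
from pathlib import Path


def docker_command(image, tool_args, host_dir, container_dir="/data"):
    """Build the argv list that runs tool_args inside image.

    host_dir is bind-mounted at container_dir and used as the working
    directory, so scripts can use paths relative to container_dir
    instead of paths tied to one cluster.
    """
    host = Path(host_dir).expanduser().resolve()
    return [
        "docker", "run", "--rm",          # remove the container when it exits
        "-v", f"{host}:{container_dir}",  # bind-mount the host data directory
        "-w", container_dir,              # run the tool from the mounted directory
        image,
        *tool_args,
    ]


if __name__ == "__main__":
    cmd = docker_command(
        "example/samtools:latest",            # hypothetical image name
        ["samtools", "index", "sample.bam"],  # hypothetical input file
        host_dir=".",
    )
    # Print the command rather than running it, so the sketch works without Docker.
    print(shlex.join(cmd))
```

Returning the argument list instead of executing it keeps the sketch independent of any scheduler; the same list could be passed to `subprocess.run` locally or adapted for Singularity on a cluster.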

@rajewski rajewski added the enhancement New feature or request label Aug 30, 2022
@rajewski rajewski self-assigned this Aug 30, 2022
@rajewski rajewski added this to the Publication milestone Aug 30, 2022
@rajewski
Owner Author

The main problems that I am running into are speedseq and whatshap. They both have some complex dependencies that stymie a simple Docker container.

@rajewski
Owner Author

rajewski commented Aug 31, 2022

I found a speedseq Docker image here, and I'm still making a whatshap image. I've also decided to transition from the monolithic container to several smaller ones:

  • bcftools
  • samtools
  • bwa
  • gatk
  • ffq and jq
  • speedseq
  • whatshap
  • R packages
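
One way the split into smaller containers could be wired into the scripts is a lookup from each executable to the image that provides it. This is only a sketch: every image name below is a hypothetical placeholder rather than a published tag, and mapping `Rscript` to the R packages image is an assumption.

```python
import shlex

# Hypothetical image names, one per container in the list above.
IMAGES = {
    "bcftools": "example/bcftools",
    "samtools": "example/samtools",
    "bwa": "example/bwa",
    "gatk": "example/gatk",
    "ffq": "example/ffq-jq",  # ffq and jq share one image
    "jq": "example/ffq-jq",
    "speedseq": "example/speedseq",
    "whatshap": "example/whatshap",
    "Rscript": "example/r-packages",  # assumed entry point for the R packages
}


def image_for(command):
    """Return the image that provides the executable named in command."""
    parts = shlex.split(command)
    if not parts:
        raise ValueError("empty command")
    executable = parts[0]
    if executable not in IMAGES:
        raise KeyError(f"no container image registered for {executable!r}")
    return IMAGES[executable]


if __name__ == "__main__":
    print(image_for("bwa mem ref.fa reads.fq"))  # hypothetical command line
```

Failing loudly on an unregistered tool makes it obvious when a script still depends on software that has no container yet.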

rajewski added a commit that referenced this issue Sep 1, 2022
@rajewski
Owner Author

rajewski commented Sep 1, 2022

ugggggh

  • plink
  • faststructure

Things I've sworn not to include because they would be so onerous to repeat and/or were not used in the final analysis:

  • RAxML-NG
  • MrBayes
  • jModelTest
  • PartitionFinder
