Created: 2022/08/10 15:03:33 Last modified: 2022/09/02 14:55:37
- Aim: This document describes how to run the pore-c-snakemake pipeline
- Prerequisite software: slurm v20.11.6, conda v4.13.0, mamba v0.24.0, GNU coreutils, git v1.8.3.1
- OS: ORAC (CentOS Linux) (ESR production network)
Run the bash script that downloads the guinea pig reference genome. See my script at ./scripts/04_pore_c_snakemake_pipeline_run/01_get_reference.sh and my conda env file at ./scripts/envs/conda.wget.1.21.3.yml
sbatch ./scripts/04_pore_c_snakemake_pipeline_run/01_get_reference.sh
Run the bash script that runs the pore-c pipeline on all samples. See my script at ./scripts/04_pore_c_snakemake_pipeline_run/02_pore_c_snakemake_pipeline_run.sh and my configuration files at ./config/04_pore_c_snakemake_pipeline_run/
sbatch ./scripts/04_pore_c_snakemake_pipeline_run/02_pore_c_snakemake_pipeline_run.sh
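The two submissions above could also be chained so that the second job only starts once the first has finished successfully. This is a minimal sketch, assuming the pipeline run needs the reference download to be complete; `submit_chain` is a hypothetical helper of mine and is not part of the scripts in this repo. It relies on the `--parsable` and `--dependency=afterok:` options of `sbatch`.

```shell
#!/usr/bin/env bash
# Hypothetical helper (not in the original scripts): submit each script in order,
# making every job wait for the previous one to finish with exit code 0.
submit_chain() {
    local prev_id="" job_id script
    for script in "$@"; do
        if [ -z "$prev_id" ]; then
            job_id=$(sbatch --parsable "$script") || return 1
        else
            job_id=$(sbatch --parsable --dependency=afterok:"$prev_id" "$script") || return 1
        fi
        # --parsable prints "jobid" or "jobid;cluster"; keep only the job ID
        prev_id=${job_id%%;*}
        echo "submitted $script as job $prev_id"
    done
}
```

Usage would be `submit_chain ./scripts/04_pore_c_snakemake_pipeline_run/01_get_reference.sh ./scripts/04_pore_c_snakemake_pipeline_run/02_pore_c_snakemake_pipeline_run.sh`. If the first job fails, Slurm leaves the second pending with an unsatisfied dependency instead of running it against a missing reference.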
Note: one of the pipeline steps failed with the following error:
Calculating norms for zoom FRAG_2
Calculating norms for zoom FRAG_1
Exception in thread "main" java.lang.OutOfMemoryError: Java heap space
at java.base/java.util.Arrays.copyOf(Arrays.java:3537)
at java.base/java.io.ByteArrayOutputStream.ensureCapacity(ByteArrayOutputStream.java:100)
at java.base/java.io.ByteArrayOutputStream.write(ByteArrayOutputStream.java:111)
at org.broad.igv.tdf.BufferedByteWriter.putLong(BufferedByteWriter.java:133)
at org.broad.igv.tdf.BufferedByteWriter.putDouble(BufferedByteWriter.java:115)
at juicebox.tools.utils.norm.NormVectorUpdater.putArrayValuesIntoBuffer(NormVectorUpdater.java:59)
at juicebox.tools.utils.norm.NormVectorUpdater.updateNormVectorIndexWithVector(NormVectorUpdater.java:46)
at juicebox.tools.utils.norm.NormalizationVectorUpdater.updateExpectedValueCalculationForChr(NormalizationVectorUpdater.java:198)
at juicebox.tools.utils.norm.NormalizationVectorUpdater.buildScale(NormalizationVectorUpdater.java:214)
at juicebox.tools.utils.norm.NormalizationVectorUpdater.updateHicFile(NormalizationVectorUpdater.java:126)
at juicebox.tools.clt.old.PreProcessing.run(PreProcessing.java:128)
at juicebox.tools.HiCTools.main(HiCTools.java:94)
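The JVM ran out of heap space while the juicebox tools were calculating normalisation vectors (see NormalizationVectorUpdater in the trace). I will resolve this later, because it affected only one file that I don't need right now.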
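One possible fix for the heap error, sketched here and untested on this pipeline, is to raise the JVM heap ceiling through the `JAVA_TOOL_OPTIONS` environment variable in the sbatch script before Snakemake starts. `heap_mb_from_slurm` is a hypothetical helper; the 80% headroom and the 16384 MB fallback are assumptions of mine. `SLURM_MEM_PER_NODE` is only set when the job requests memory with `--mem`.

```shell
#!/usr/bin/env bash
# Hypothetical helper: pick a heap size in MB at 80% of the Slurm memory
# allocation, leaving headroom for non-heap JVM memory and other processes.
heap_mb_from_slurm() {
    local mem_mb="${SLURM_MEM_PER_NODE:-16384}"   # assumed fallback when --mem was not given
    echo $(( mem_mb * 8 / 10 ))
}

# The JVM reads JAVA_TOOL_OPTIONS at startup, so child Java processes
# launched by the pipeline should inherit this heap limit.
export JAVA_TOOL_OPTIONS="-Xmx$(heap_mb_from_slurm)m"
```

If the pipeline rule that launches the Java tool passes its own `-Xmx` on the command line, that flag is likely to take precedence, in which case the limit would need changing in the pipeline configuration instead.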
The following reorganisation couldn't be done within the pipeline, so I do it manually here. It makes the results easier to work with when pushing to the GitHub repo.
mkdir -p ./results/04_pore_c_snakemake_pipeline_run/pipeline_run/
mv ./results/04_pore_c_snakemake_pipeline_run/Pore-C-Snakemake/results/* ./results/04_pore_c_snakemake_pipeline_run/pipeline_run/
mv ./results/04_pore_c_snakemake_pipeline_run/Pore-C-Snakemake/pipeline_dag.png ./results/04_pore_c_snakemake_pipeline_run/pipeline_run/
mv ./results/04_pore_c_snakemake_pipeline_run/Pore-C-Snakemake/pipeline_rulegraph.png ./results/04_pore_c_snakemake_pipeline_run/pipeline_run/
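The manual moves above can be wrapped in a small function that checks everything exists before touching anything, so a failed or partial pipeline run doesn't leave a half-populated directory. This is only a sketch; `collect_outputs` is a hypothetical name and the function is not part of the scripts in this repo.

```shell
#!/usr/bin/env bash
# Hypothetical helper: move the pipeline results and the two graph images
# from the Pore-C-Snakemake checkout ($1) into a flat output directory ($2).
collect_outputs() {
    local src="$1" dest="$2" item
    # Reuse the positional parameters as the list of things to move
    set -- "$src"/results/* "$src"/pipeline_dag.png "$src"/pipeline_rulegraph.png
    # First pass: refuse to do anything if an expected item is missing
    # (an empty results/ leaves the glob unexpanded, which also fails here)
    for item in "$@"; do
        if [ ! -e "$item" ]; then
            echo "missing: $item" >&2
            return 1
        fi
    done
    # Second pass: create the destination and move everything in one go
    mkdir -p "$dest" && mv -- "$@" "$dest"/
}
```

Usage would be `collect_outputs ./results/04_pore_c_snakemake_pipeline_run/Pore-C-Snakemake ./results/04_pore_c_snakemake_pipeline_run/pipeline_run`.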