Skip to content

SyntheticStrain change sacCer3 depths and tally simulation results#14

Merged
owlang merged 7 commits intomasterfrom
datasets
Oct 23, 2021
Merged

SyntheticStrain change sacCer3 depths and tally simulation results#14
owlang merged 7 commits intomasterfrom
datasets

Conversation

@owlang
Copy link
Copy Markdown
Contributor

@owlang owlang commented Oct 23, 2021

This pull request includes changes made to shift sacCer3 simulation depths (more shallow) and to parse, tally, and visualize the StrainID score results and performance (runtimes).

The scripts for running StrainID on the human simulations were stripped of a no longer accurate comment about how the script runs.
I needed to update the job/run_depth* and job/run_StrainID* PBS scripts with the new sacCer3 simulation depths (move from 1M, 2M, 3M, 4M, 5M to 10K, 50K, 100K, 500K, 1M, and 2M).

Helper files including the `depth_simulations.txt` file needed to be updated with the new depths and seeds, as well as the simulation script so that it could support 10K and 50K read depths.
Previous commit missing this straggling PBS script for running StrainID on CENPK  data at at depth of 100K.
The PBS script for generating the synthetic genomes to simulate off of is updated in this commit to point to the correct sacCer3 VCF files that contain ALL variants, and not just the subset of variants unique to the strain. The change in reference VCF files for the sacCer3 default run in a prior update precipitated the need for this update.
For each experiment (strain x depth), parse out the StrainID scores and runtimes using the tally PBS script and the two helper python scripts for parsing the results.
There were some typos leftover from a copy paste of the ENCODE processing scripts. This commit fixes the typos in the comment descriptions.
This commit includes a python script and an update to the tally job for generating jitter/strip plots of the simulation StrainID scores.

scripts/make_jitter.py
-seaborn library-based plot for showing spread of scores assigned to each strain for each simulation "experiment" (synthetic_strain x depth)
job/tally_results.sh
-PBS script updated with py script calls to generate figs
@owlang owlang merged commit 908adc3 into master Oct 23, 2021
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant