Skip to content

Updates responding to reviewer comments#21

Merged
owlang merged 11 commits intomasterfrom
revisions
Aug 21, 2023
Merged

Updates responding to reviewer comments#21
owlang merged 11 commits intomasterfrom
revisions

Conversation

@owlang
Copy link
Copy Markdown
Contributor

@owlang owlang commented Jul 19, 2023

Update code base with more scripts and simulations responding to reviewer comments

owlang added 11 commits May 14, 2023 11:36
ecoli db reference files updated to match aligner switch
For readability, the .gitignore file was reduced using wildcard characters
Add support for hg38 and expanded cell lines on StrainID

-add hg38_VCF db with similar cell lines to hg19_VCF
-add hg38_ENCODE db with ENCODE cell lines at least 250 variants big
-add scripts and refs for building full hg38_DepMap db
-update README to describe how Model.csv was downloaded.
-minor fix to generate_sacCer3_VariantDB.sh usage typo
-.gitignore updated appropriately
the presence of this script implies validation was performed on mm10. Removing now to avoid confusion for future users
add scripts to generate hg38 versions of the HELA and K562 datasets at 20M
missed some files in the reference DBs when I switched from bowtie2 to bwa for the aligner
create a second version 2 of the database with an expanded set of epitope sequences
-update README with more details about script execution
-add gzipped parsed text file results
-update gitignore with fixes to ENCODE StrainID directory typos
-change results parsing script to use pandas and include every Cell Line score instead of just the best one
update script for parsing and building violin plots from ENCODE StrainID results. Add option to filter by various assays and include README lines and some highlighted violin PNG figures
Our the ID scripts have long been refactored to use Python3 over Python2 but the usage was not fixed to reflect this. This commit fixes the usage statement
For Supplementary figure 2 and supplementary table 11, we pulled 100 ENCODE datasets and ran StrainID on them using the full hg38_DepMap reference to build at table and histogram of StrainID scores showing that "correct" strainid scores show good separation from "incorrect" ones.
@owlang owlang merged commit 7d4b912 into master Aug 21, 2023
@owlang owlang deleted the revisions branch August 28, 2023 12:13
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant