Feature to filter out refgene index source #61
…'s content - filtering out predicted-model accession numbers is now the caller's responsibility, for consistency
- refGene.txt and ncbiRefSeq.txt are in [GenePred format](https://genome.ucsc.edu/FAQ/FAQformat.html#format9), not an NCBI file format
Codecov Report
@@ Coverage Diff @@
## master #61 +/- ##
=======================================
Coverage 45.63% 45.64%
=======================================
Files 16 16
Lines 1970 1974 +4
Branches 60 60
=======================================
+ Hits 899 901 +2
- Misses 1011 1013 +2
Partials 60 60
@@ -58,24 +58,30 @@
(update m :cds-end-stat keyword)
(update m :exon-frames parse-exon-pos)))

(defn- load-ncbi-file
Since this function has already been released, I think it would be better to keep it and mark it as deprecated for backward compatibility.
@federkasten
Thank you for the review. That sounds good, but the function is defined as private, so any code using it from outside this namespace should be rewritten anyway. What do you think?
Oops. Sorry, I missed that this is a private function.
(when s
  (re-find #"^(NM|NR)_.+$" s)))

(defn- load-genepred-file
Very helpful in finding an appropriate name! 🙏
Problem
… `chr6_cox_hap2`, but we sometimes don't need the contigs' records at all.

Implementation
- Add `filter-fns` to `load-ref-genes` and `load-ref-seqs` to filter out unnecessary rows in these files.
- Rename `load-ncbi-file` to `load-genepred-file` because the format of these files is called GenePred (just refactoring).
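
For illustration, here is a minimal sketch of how a list of caller-supplied filter functions might be applied to parsed rows. It is not the code in this PR: `filter-rows`, `curated-accession?`, `primary-chromosome?`, the row keys `:name` and `:chr`, and the sample accessions are all hypothetical. Only the `NM`/`NR` regex is taken from the diff above.

```clojure
;; Hypothetical predicate built on the regex from the diff: true for
;; NM_/NR_ accessions, false for anything else (including nil).
(defn curated-accession? [s]
  (boolean (when s (re-find #"^(NM|NR)_.+$" s))))

;; Hypothetical predicate: true only for names such as chr1..chr22,
;; chrX, chrY, chrM; names like chr6_cox_hap2 are rejected.
(defn primary-chromosome? [chr]
  (boolean (when chr (re-matches #"chr([0-9]+|X|Y|M)" chr))))

;; Hypothetical helper: keeps the rows for which every function in
;; filter-fns returns a truthy value. With no filters, rows pass through.
(defn filter-rows [filter-fns rows]
  (if (seq filter-fns)
    (filter (apply every-pred filter-fns) rows)
    rows))

;; Made-up rows; the real parsed records may use different keys.
(def rows
  [{:name "NM_000001" :chr "chr6"}
   {:name "NM_000002" :chr "chr6_cox_hap2"}
   {:name "XM_000003" :chr "chr1"}])

(filter-rows [(comp curated-accession? :name)
              (comp primary-chromosome? :chr)]
             rows)
;; => ({:name "NM_000001", :chr "chr6"})
```

Passing the predicates in from the call site keeps the loader itself free of any policy about which rows matter, which matches the intent of leaving the filtering decision to the caller.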