-
Notifications
You must be signed in to change notification settings - Fork 2
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Fix the 3’ rule on a trailing sub-sequence #63
Conversation
Codecov Report
@@ Coverage Diff @@
## master #63 +/- ##
==========================================
- Coverage 45.64% 45.63% -0.02%
==========================================
Files 16 16
Lines 1974 1992 +18
Branches 60 63 +3
==========================================
+ Hits 901 909 +8
- Misses 1013 1020 +7
- Partials 60 63 +3
Help us with your feedback. Take ten seconds to tell us how you rate us. Have a feature suggestion? Share it here. |
4e176c5
to
f781731
Compare
f781731
to
e0106e7
Compare
"chr2" 26254257 "G" "GACT" '("NM_000183:c.4_6dupACT" | ||
"NM_001281512:c.4_6dupACT" | ||
"NM_001281513:c.-146_-144dupACT") ; cf. rs3839049 (+) | ||
"chr2" 47806844 "T" "TGATT" '("NM_000179:c.4068_4071dupGATT" |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Fix the VCF variant and HGVS for rs55740729.
"NM_001190274:c.*1272_*1275dupTCAA") ; cf. rs55740729 (+) | ||
"chr2" 26254257 "G" "GACT" '("NM_000183:c.5_7dupCTA" | ||
"NM_001281512:c.5_7dupCTA" | ||
"NM_001281513:c.-145_-143dupCTA") ; cf. rs3839049 (+) |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
c.5_7dup
and c.-145_-143dup
correspond to rs3839049.
"NM_001309861:c.-39+12749_-39+12784delCGACTCCGGGGCGGCCCCTGACGCCCCAGCCGATCC" | ||
"NM_001309883:c.1172_1207delCGACTCCGGGGCGGCCCCTGACGCCCCAGCCGATCC" | ||
"NM_001077490:c.1172_1207delCGACTCCGGGGCGGCCCCTGACGCCCCAGCCGATCC" | ||
"NM_080425:c.1359_1394delCGACTCCGGGGCGGCCCCTGACGCCCCAGCCGATCC") |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
migrated from #62, but the order is fixed.
@@ -235,6 +241,8 @@ | |||
;; prefer-deletion?, not actual example (+) | |||
"chr1" 47439008 "CCCGCAC" "C" {:prefer-deletion? false} '("p.P286_H287[3]") | |||
"chr1" 47439008 "CCCGCAC" "C" {:prefer-deletion? true} '("p.P292_H293del") | |||
"chr20" 58854572 "CCGCCCCAGCCGATCCCGACTCCGGGGCGGCCCCTGA" "C" {:prefer-deletion? true} | |||
'("p.P391_I402del" "p.S455_D466del") |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
migrated from #62, but missing p.P391_I402del
is added.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Thank you for the fix! LGTM👍🏻
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM 🙆♂️
I have fixed the missing consideration to the 3’ rule on a trailing sub-sequence.
The current 3’ rule of varity does not consider a trailing sub sequence. For example,
{:pos 1, :ref "C", :alt "CAT"}
insertion toCATA
sequence (result:CATATA
) is normalized to{:pos 3, :ref "T", :alt "TAT"}
. In this case, however,A
trails after the inserted bases which is a substring of the insertionsAT
, so{:pos 4, :ref "A", :alt "ATA"}
insertion toCATA
also returns the same result (CATATA
). The fixed 3’ rule in this PR can consider the trailing sub-sequence.This PR includes and passes #62 test cases.
I confirmed
lein test :all
passed, but the answer values of some existing test cases were changed along with this fix. You need to be careful about that.