You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Hello! I would like kindly inform that I have found some sequences in AntiRef that contain the string "nan". I believe this is due to how the sequences are reconstructed with pandas. The total number of sequences with the string "nan" are:
AntiRef100: 21,284
AntiRef90: 10,335
The code used to find these sequences was the following: grep 'nan' /home/ubuntu/MHK/antiref/<antiref_file> | grep -v '^>' | wc -l
The text was updated successfully, but these errors were encountered:
MiguelHK
changed the title
Sequences with missing nucleotides contain the string "nan"
Sequences with missing amino acids contain the string "nan"
Apr 18, 2024
MiguelHK
changed the title
Sequences with missing amino acids contain the string "nan"
Sequences with missing amino acids/incorrect ANARCI extraction contain the string "nan"
Apr 18, 2024
Hello! I would like kindly inform that I have found some sequences in AntiRef that contain the string "nan". I believe this is due to how the sequences are reconstructed with pandas. The total number of sequences with the string "nan" are:
The code used to find these sequences was the following:
grep 'nan' /home/ubuntu/MHK/antiref/<antiref_file> | grep -v '^>' | wc -l
The text was updated successfully, but these errors were encountered: