You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
The first line of the pdb file after is: MNIFEMLRIDEGLRLKIYKDTEGYYTIGIGHLLTKSPSLNAAKSELDKAIGRNTNGVITKDEAEKLFNQDVDAAVRGILRNAKLKPVYDSLDAVRRAALINMVFQMGETGVAGFTNSLRMIQQKRWDEWAVNMAKSRWYNQTPNRAKRVITTFRTGTWDAYK
however this doesn't correspond to the first line of the pdb.lookup which is 200l_A.
instead the first line through blast shows it belongs to 145l_A which is on this line of the lookup:
grep -i -a -n 145L_A pdb.lookup
159:158 145l_A 121
how is the ordering done so that the id's match?
The text was updated successfully, but these errors were encountered:
The lookup file points to a database key (first column of the .lookup file), which points to the .index (again first column).
In the index you can lookup the byte offset (second column) that points to the data file.
The data file is a special issue for the PDB, since we ship it as a clustered database. The full PDB data is split across two seperate files pdb_seq.0 and pdb_seq.1, the former contains only the cluster representatives and the latter all others.
I would recommend to do database manipulations with the various Foldseek/MMseqs2 commands.
executed: foldseek databases PDB pdb tmp
The first line of the pdb file after is: MNIFEMLRIDEGLRLKIYKDTEGYYTIGIGHLLTKSPSLNAAKSELDKAIGRNTNGVITKDEAEKLFNQDVDAAVRGILRNAKLKPVYDSLDAVRRAALINMVFQMGETGVAGFTNSLRMIQQKRWDEWAVNMAKSRWYNQTPNRAKRVITTFRTGTWDAYK
however this doesn't correspond to the first line of the pdb.lookup which is 200l_A.
instead the first line through blast shows it belongs to 145l_A which is on this line of the lookup:
grep -i -a -n 145L_A pdb.lookup
159:158 145l_A 121
how is the ordering done so that the id's match?
The text was updated successfully, but these errors were encountered: