Skip to content

Commit

Permalink
Add CATH50 #232
Browse files Browse the repository at this point in the history
  • Loading branch information
martin-steinegger committed Jan 26, 2024
1 parent bb09017 commit 6893dcc
Show file tree
Hide file tree
Showing 2 changed files with 17 additions and 1 deletion.
10 changes: 9 additions & 1 deletion data/structdatabases.sh
Original file line number Diff line number Diff line change
Expand Up @@ -150,9 +150,17 @@ case "${SELECTION}" in
push_back "${TMP_PATH}/pdb"
INPUT_TYPE="FOLDSEEK_DB"
;;
"CATH50")
if notExists "${TMP_PATH}/cath50.tar.gz"; then
downloadFile "https://foldseek.steineggerlab.workers.dev/cath50.tar.gz" "${TMP_PATH}/cath50.tar.gz"
downloadFile "https://foldseek.steineggerlab.workers.dev/cath50.version" "${TMP_PATH}/version"
fi
tar xvfz "${TMP_PATH}/cath50.tar.gz" -C "${TMP_PATH}"
push_back "${TMP_PATH}/cath50"
INPUT_TYPE="FOLDSEEK_DB"
;;
esac


if notExists "${OUTDB}.dbtype"; then
case "${INPUT_TYPE}" in
"FOLDSEEK_DB")
Expand Down
8 changes: 8 additions & 0 deletions src/FoldseekBase.cpp
Original file line number Diff line number Diff line change
Expand Up @@ -397,5 +397,13 @@ std::vector<DatabaseDownload> externalDownloads = {
"https://www.rcsb.org",
true, Parameters::DBTYPE_AMINO_ACIDS, structdatabases_sh, structdatabases_sh_len,
{}
},
{
"CATH50",
"CATH domain database (combined AlphaFold and PDB CATH clustered at 50% seq.id.).",
"Bordin et al. AlphaFold2 reveals commonalities and novelties in protein structure space for 21 model organisms. Communications Biology, 6, 160 (2023)",
"https://www.cath.info",
true, Parameters::DBTYPE_AMINO_ACIDS, structdatabases_sh, structdatabases_sh_len,
{}
}
};

0 comments on commit 6893dcc

Please sign in to comment.