Databases used in the mmseq2 search, local version #20
Comments
We are working on preparing the preprint and will make the databases available then. This should hopefully happen very soon.
Thanks a lot! Looking forward to reading your paper!
I am also interested in running ColabFold (MMseqs2 works great for me) on a local installation, or in some way that lets us call it programmatically for 10^4 to 10^5 molecules. Looking forward to a solution one way or another, and to reading the details behind it in a preprint.
Hello, thanks for all this work, ColabFold is just great!
We are so sorry for the delay. We have the database ready, but our FTP storage space is limited. We have asked our IT department to increase the quota. Once that is approved, we will upload the database along with scripts showing how to build and run it.
@martin-steinegger nothing to be sorry about, you are doing a fantastic job with this project! And thanks for the quick answer. Have you also thought about storing these datasets and the database in the cloud, e.g. in the AWS Open Dataset repository (and/or the equivalent on Google Cloud)?
@fstrozzi thank you! We would be happy to host our databases on the open dataset repository. But we were never successful when applying to Google or AWS.
We have uploaded the ColabFold databases at https://colabfold.mmseqs.com. You can find instructions on how to create MMseqs2 databases from these archives in the MMseqs2 wiki. We have also finished merging all the MMseqs2 changes back into the main repository (it should work starting from commit soedinglab/MMseqs2@f651879). We will make running everything easier as soon as possible, but you should already be able to get a local ColabFold installation running.
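For anyone scripting this, here is a rough sketch of what turning one downloaded archive into an indexed MMseqs2 database might look like. It only prints the commands (a dry run) so they can be checked against the MMseqs2 wiki first. Assumptions: the archive prefix `example_db` is hypothetical (take the real file names from https://colabfold.mmseqs.com), the archive is assumed to be a `.tar.gz` holding the TSV dump that `mmseqs tsv2exprofiledb` reads, and the `mmseqs` build is assumed to be recent enough to include that subcommand. The wiki remains the authoritative reference for the exact steps.

```python
import shlex
from pathlib import Path


def build_commands(prefix, workdir="."):
    """Return the command lines (as argument lists) for one archive.

    `prefix` is a hypothetical archive name without the .tar.gz suffix;
    nothing is executed here, the commands are only assembled.
    """
    work = Path(workdir)
    archive = work / f"{prefix}.tar.gz"
    db = work / f"{prefix}_db"
    tmp = work / "tmp"
    return [
        # 1. Unpack the downloaded archive (assumed to be a gzipped tarball).
        ["tar", "xzvf", str(archive), "-C", str(work)],
        # 2. Convert the unpacked TSV dump into an expandable profile database.
        ["mmseqs", "tsv2exprofiledb", str(work / prefix), str(db)],
        # 3. Precompute the index so that searches start quickly.
        ["mmseqs", "createindex", str(db), str(tmp)],
    ]


if __name__ == "__main__":
    # Dry run: print the commands instead of executing them.
    for cmd in build_commands("example_db"):
        print(shlex.join(cmd))
```

Once the printed commands look right for your setup, each argument list can be passed to `subprocess.run(cmd, check=True)` to execute it.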
Hello,
I would like to run the MSA-building step of the Colab notebook locally, using the exact same set of databases, so that I can compare the results with other databases.
Is it possible to get access to the set of databases the MMseqs2 server is using, as well as the MMseqs2 version and the specific command lines executed on the server?
In the slides you presented (awesome presentation!), you mentioned that you are using a database clustered at 30% sequence identity, built from SMAG, MGnify, BFD, and MetaEuk. Is a downloadable version of this master 30% sequence-identity database available anywhere?
Thanks a lot!