ProTrek: Navigating the Protein Universe through Tri-Modal Contrastive Learning
-
Updated
Mar 14, 2025 - Python
ProTrek: Navigating the Protein Universe through Tri-Modal Contrastive Learning
Predict protein folding structures using ColabFold. Gain a deeper understanding of protein folding prediction with AlphaFold2 and MMseqs2. Run the Jupyter notebook on UCloud, learn to interpret results, predict protein structures of interest. Technical requirements provided. Enhance your knowledge of protein folding and AlphaFold2's principles. Fam
protclust is a Python library for protein sequence analysis that integrates MMseqs2 for fast clustering and provides tools for creating robust machine learning datasets. It offers cluster-aware data splitting to prevent sequence similarity bias in model evaluation, along with comprehensive protein embedding capabilities for feature generation.
This repository contains a set of scripts and workflows designed to search for Respiratory Complex I (NADH Ubiquinone Oxidoreductase) subunits in prokaryotic genomes and proteomes.
Protein Clustering
Add a description, image, and links to the mmseqs2 topic page so that developers can more easily learn about it.
To associate your repository with the mmseqs2 topic, visit your repo's landing page and select "manage topics."