Protein Clustering
-
Updated
Sep 9, 2022 - Jupyter Notebook
Protein Clustering
Gene download from NCBI database, sequence alignment and clusterization, realization of different bioinformatics algorithms within ITMO 4 sem course
Dereplicate long sequences
MeShClust2: Application of alignment-free identity scores in clustering long DNA sequences
Cluster up to millions of peptide sequences on shared sequence motifs.
CD HIT cluster file parser
Bioinformatic Tools for analyzing targeted amplicon sequencing developed by Nicholas Hathaway of Bailey Lab
Code to reproduce the experiments and the proposed visualization from 'Data mining in the development of mHealth apps: assessing in-app navigation through Markov Chain analysis'
This repository contains all the source files required to run DeLUCS, a deep learning clustering algorithm for DNA sequences.
MeShClust: an intelligent tool for clustering DNA sequences
Multiple sequence alignment with top benchmark scores scalable to thousands of sequences. Generates replicate alignments, enabling assessment of downstream analyses such as trees and predicted structures.
MMseqs2: ultra fast and sensitive search and clustering suite
Add a description, image, and links to the sequence-clustering topic page so that developers can more easily learn about it.
To associate your repository with the sequence-clustering topic, visit your repo's landing page and select "manage topics."