Skip to content

PDB Plus LLM Contact Map embdiing for ML classification research work

Notifications You must be signed in to change notification settings

pchourasia1/PDB_Plus_LLM_Contact_Map

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

4 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

PDB_Plus_LLM_Contact_Map

PDB Plus LLM Contact Map embedding for ML classification research work

We propose a novel method for designing numerical embeddings in Euclidean space for proteins by leveraging 3D structure information, specifically employing the concept of contact maps. These embeddings are synergistically combined with features extracted from LLMs and traditional feature engineering techniques to enhance the performance of embeddings in supervised protein analysis. Experimental results on benchmark datasets, including PDB Bind and STCRDAB, demonstrate the superior performance of the proposed method for protein function prediction.

File Spike2Vec_PDB_Bind_3792_seq_k_3 is large so we are providing a raw data file with code to generate the embeddings.

About

PDB Plus LLM Contact Map embdiing for ML classification research work

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages