Skip to content

AyushTyagi2/EvoDist

Folders and files

NameName
Last commit message
Last commit date

Latest commit

Β 

History

4 Commits
Β 
Β 
Β 
Β 

Repository files navigation

Evolutionary Distance Analysis Using Codon Frequencies

πŸ“Œ Project Overview

This project explores evolutionary relationships between species based on codon usage frequencies using Machine Learning & Distance Metrics. The primary goal is to calculate evolutionary distances using Euclidean distance and analyze the correlation between species.

Additionally, a Random Forest model is trained to predict species classification based on codon frequency, and evolutionary trends are visualized.


πŸ”¬ Key Features

  • Preprocess Codon Usage Data 🧬
  • Compute Evolutionary Distances πŸ“
  • Apply Machine Learning (Random Forest Regression) πŸ€–
  • Feature Importance Analysis πŸ“Š
  • Visualize Species Relationships πŸ–ΌοΈ

πŸ“‚ Dataset

The project uses a Codon Usage Frequency dataset (codon_usage.csv), where:

  • Each row represents a species
  • Columns contain relative codon frequencies
  • SpeciesName is the label column

βš™οΈ Tech Stack

  • Python 🐍
  • Pandas, NumPy for data processing
  • Scipy for Euclidean distance computation
  • Scikit-learn for ML modeling
  • Matplotlib for visualization

πŸš€ Installation & Setup

# Clone the repository
git clone https://github.com/your-repo/evolutionary-distance
cd evolutionary-distance

# Install dependencies
install the necessary dependencies

πŸ“Š Results & Observations

  • Close evolutionary species have smaller distances (e.g., Rattus norvegicus & Mus musculus)
  • Machine Learning captures key codon importance in species classification
  • Visualization helps interpret relationships effectively

πŸ“Œ Future Work

  • πŸ—οΈ Improve the model with deep learning (CNNs for sequence data)
  • πŸ”¬ Explore other distance metrics (Manhattan, Mahalanobis)
  • 🧬 Extend analysis to larger genetic datasets

πŸ‘¨β€πŸ’» Author

Ayush Tyagi

πŸ“§ Reach me at Ayush04coder@gmail.com

πŸ› οΈ Contributions & PRs are welcome!

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors