PixelSergey/ProteinRisk

ProteinRisk

A protein risk assessment tool that analyzes amino acid sequences for harmful structural motifs and validates their claimed organism of origin.

Features

  • Organism Verification: Validates whether a protein sequence likely originates from the claimed organism
  • Harmful Motif Detection: Identifies dangerous structural patterns including:
    • HEXXH motifs (the zinc-binding signature found in many metalloproteases)
    • Pore-forming protein domains
  • Two-Stage Analysis:
    1. Organism authenticity check
    2. Structural risk assessment
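To make the HEXXH pattern concrete: it is a histidine, a glutamate, any two residues, then a second histidine. The sketch below locates it with a plain regular expression. This is only an illustration of the motif itself, not the project's detection method (the repository lists neural network models for that), and the function name `find_hexxh` is hypothetical.

```python
import re

# H, E, any two standard amino acids, H.
# The lookahead lets overlapping occurrences be reported too.
HEXXH = re.compile(r"(?=(HE[ACDEFGHIKLMNPQRSTVWY]{2}H))")


def find_hexxh(sequence: str) -> list[tuple[int, str]]:
    """Return (0-based start index, matched residues) for each HEXXH hit."""
    seq = sequence.upper()
    return [(m.start(), m.group(1)) for m in HEXXH.finditer(seq)]


print(find_hexxh("MKTHELGHALGLAHEYYH"))
# [(3, 'HELGH'), (13, 'HEYYH')]
```

A sequence-level match like this is a weak signal on its own, since the five-residue pattern can occur by chance in unrelated proteins.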

Quick Start

# Install dependencies
cd probe_code
pip install -r requirements.txt

# Run the web application
cd ..
flask --app webapp run

By default, the app will be available at http://localhost:5000.

How It Works

Submit a protein sequence and organism origin through the web interface. The system performs two validation stages:

  1. Stage 1: Verifies the sequence matches the organism's expected signature patterns
  2. Stage 2: Scans for known harmful protein structures

The system then returns a risk assessment with a confidence score for each stage.
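The two stages can be pictured as a small pipeline. The following is a minimal sketch under stated assumptions, not the project's actual code: the names `RiskReport` and `assess`, the 0.5 thresholds, and the rule for combining the two scores are all hypothetical, and the stage functions are passed in as placeholders rather than real models.

```python
from dataclasses import dataclass
from typing import Callable

# A stage takes (sequence, organism) and returns a confidence in [0, 1].
Stage = Callable[[str, str], float]


@dataclass
class RiskReport:
    origin_confidence: float  # stage 1: fit between sequence and claimed organism
    harm_confidence: float    # stage 2: evidence for a harmful structure
    flagged: bool


def assess(
    sequence: str,
    organism: str,
    origin_check: Stage,
    harm_check: Stage,
    origin_threshold: float = 0.5,  # assumed cut-off, not from the project
    harm_threshold: float = 0.5,    # assumed cut-off, not from the project
) -> RiskReport:
    """Run both stages and flag the submission if either one looks suspicious."""
    origin = origin_check(sequence, organism)
    harm = harm_check(sequence, organism)
    # Assumption: a doubtful origin or a likely harmful structure is enough to flag.
    flagged = origin < origin_threshold or harm >= harm_threshold
    return RiskReport(origin, harm, flagged)


# Placeholder stages standing in for the real checks.
report = assess(
    "MKTHELGH",
    "Escherichia coli",
    origin_check=lambda seq, org: 0.9,
    harm_check=lambda seq, org: 0.2,
)
print(report.flagged)  # False
```

Passing the stages in as callables keeps the sketch independent of how each check is actually implemented.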

Project Structure

  • webapp/: Flask web application with frontend and backend logic
  • probe_code/: ML models and protein analysis utilities
    • detector_probe1.py, detector_probe2.py: Neural network models for harm detection
    • metalloprotease_sequences.py, pore_forming_sequences.py: Training data and analysis
    • Pre-trained models: probe1_model.pt, probe2_model.pt

About

A protein risk assessment tool for the Varsity Hackathon 2026
