A protein risk assessment tool that analyzes amino acid sequences for harmful structural motifs and validates their claimed organism of origin.
- Organism Verification: Validates whether a protein sequence likely originates from the claimed organism
- Harmful Motif Detection: Identifies dangerous structural patterns including:
- HEXXH groups (metalloprotease motifs)
- Pore-forming protein domains
- Two-Stage Analysis:
- Organism authenticity check
- Structural risk assessment
# Install dependencies
cd probe_code
pip install -r requirements.txt
# Run the web application
cd ..
flask --app webapp runThe app will be available at http://localhost:5000
Submit a protein sequence and organism origin through the web interface. The system performs two validation stages:
- Stage 1: Verifies the sequence matches the organism's expected signature patterns
- Stage 2: Scans for known harmful protein structures
Returns a comprehensive risk assessment with confidence scores.
- webapp/: Flask web application with frontend and backend logic
- probe_code/: ML models and protein analysis utilities
detector_probe1.py,detector_probe2.py: Neural network models for harm detectionmetalloprotease_sequences.py,pore_forming_sequences.py: Training data and analysis- Pre-trained models:
probe1_model.pt,probe2_model.pt