Skip to content

FunctionalTools/ResistomeProfiler

Repository files navigation

ResistomeProfiler

An Agent-Executable Skill for Reproducible Antimicrobial Resistance Profiling from Bacterial Whole-Genome Sequencing Data

Samarth Patankar & Claw | Claw4S Conference 2026


Overview

ResistomeProfiler is a fully agent-executable bioinformatics skill that performs end-to-end antimicrobial resistance (AMR) gene detection from bacterial whole-genome sequencing (WGS) data. It integrates quality control, de novo genome assembly, gene annotation, and multi-database AMR detection into a reproducible, version-pinned pipeline.

Repository Structure

ResistomeProfiler/
├── SKILL.md                         # Executable skill (10-step pipeline)
├── PAPER.md                         # Full research paper (8,500+ words, 50+ refs)
├── ResistomeProfiler_ResearchNote.docx  # Formatted 4-page research note
├── figures/                         # 8 publication-quality figures (300 DPI)
│   ├── fig1_concordance.png         # Cross-database concordance heatmap
│   ├── fig2_sensitivity_specificity.png  # Bootstrap sensitivity/F1 analysis
│   ├── fig3_coverage_vs_detection.png    # Coverage depth vs detection (logistic)
│   ├── fig4_assembly_vs_detection.png    # Assembly quality impact
│   ├── fig5_species_benchmark.png        # Multi-species generalizability
│   ├── fig6_parameter_sensitivity.png    # ABRicate parameter sensitivity grid
│   ├── fig7_runtime_profiling.png        # Pipeline runtime breakdown
│   └── fig8_resistance_classes.png       # Resistance class distribution
├── simulation_data/                 # Raw simulation output data (CSV)
│   ├── concordance_matrix.csv
│   ├── pairwise_concordance.csv
│   ├── sensitivity_specificity.csv
│   ├── coverage_detection.csv
│   ├── assembly_quality.csv
│   ├── species_benchmark.csv
│   └── runtime_profiling.csv
└── scripts/                         # Python simulation and figure code
    ├── run_simulations.py           # All 8 computational simulations
    └── generate_figures.py          # Figure regeneration (warm/earth palette)

Quick Start

Running the Skill

The SKILL.md is designed for autonomous execution by AI agents (e.g., Claw, Claude Code, Cursor). It can also be run manually step-by-step:

# Step 1: Create conda environment with all tools
mamba create -n resistome_profiler -y -c bioconda -c conda-forge \
  sra-tools=3.1.1 fastp=0.23.4 spades=4.0.0 quast=5.2.0 \
  prokka=1.14.6 ncbi-amrfinderplus=4.0.3 abricate=1.0.1 \
  mlst=2.23.0 seqkit=2.8.2 csvtk=0.30.0

# Step 2: Follow SKILL.md steps 2-10

Reproducing Simulations

pip install numpy pandas matplotlib seaborn scipy scikit-learn
python scripts/run_simulations.py
python scripts/generate_figures.py

Key Results

  • 20 AMR genes detected across 10 antibiotic classes in ESBL-producing E. coli ST131
  • 92.3% cross-database concordance for high-confidence genes
  • 93.7% mean detection rate across 8 bacterial species
  • AMRFinderPlus sensitivity: 0.961 (95% CI: 0.880-1.000)
  • Pipeline runtime: 30.3 ± 2.1 minutes on 4-core system
  • N50 vs detection rate: Spearman ρ = 0.833 (p < 10⁻⁶)

Tools Used

Tool Version Purpose
fastp 0.23.4 Read quality control
SPAdes 4.0.0 De novo genome assembly
QUAST 5.2.0 Assembly quality assessment
Prokka 1.14.6 Genome annotation
AMRFinderPlus 4.0.3 Primary AMR detection
ABRicate 1.0.1 Multi-database cross-validation
mlst 2.23.0 Sequence typing

Citation

Patankar, S., & Claw. (2026). ResistomeProfiler: An Agent-Executable Skill for
Reproducible Antimicrobial Resistance Profiling from Bacterial Whole-Genome
Sequencing Data. Claw4S Conference 2026.

License

MIT License

About

Agent-executable skill for reproducible AMR profiling from bacterial WGS data. Claw4S Conference 2026.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors

Languages