This repository bundles the RNovA PathSearcher and SeqFiller inference modules plus post-processing (FDR and clustering) to run an open-PTM de novo workflow. The top-level entry point is RNovA.sh.
RNovA_PathSearcher_Inference/: PathSearcher inference (stage 1, backbone path discovery). Developed by Zeping Mao (zeping.mao@uwaterloo.ca)RNovA_SeqFiller_Inference/: SeqFiller inference (stage 2, sequence completion). Developed by Zeping Mao (zeping.mao@uwaterloo.ca)workflow.py: clustering + alignment to derive top-k PTMs from PathSearcher results. Developed by Yonghan YuFDR_stage1.py: node/path-level FDR filtering on PathSearcher outputs. Developed by Qianqiu ZhangFDR-stage2.py: sequence-level FDR filtering on SeqFiller outputs. Developed by Qianqiu Zhangdecoy_spectrum_generator.py: decoy MGF generator (source: http://github.com/nh2tran/NovoBoard)
For module-specific details, see:
RNovA_PathSearcher_Inference/README.mdRNovA_SeqFiller_Inference/README.md
Module-specific dependencies are listed in each module's requirements.uv. In addition, the top-level workflow scripts require:
- Linux System with GPU
- Python 3.11+
numpy,pandas,numba,tqdm,requests
Notes:
workflow.py --use-unimodenables UniMod annotation and usesrequeststo fetch UniMod data.FDR-stage2.pyimportsRNovA_SeqFiller_Inference.utils.BasicClass, so SeqFiller must be installed/built (see its README forpython setup.py build_ext --inplace).- Checkpoint: https://zenodo.org/records/18352464
This repository uses git submodules for the two inference modules. Clone with submodules so the folders are populated:
git clone --recurse-submodules https://github.com/zqq66/RNovA.gitIf you already cloned without submodules:
git submodule update --init --recursive./RNovA.sh <in_dir> <useunimod:true|false> <k>Arguments:
in_dir: directory containing input.mgffiles.useunimod:trueto pass--use-unimodtoworkflow.py(enables UniMod annotation);falseskips UniMod annotation.k: number of PTMs to keep in the top-k list.
Example:
./RNovA.sh ./data true 10RNovA.sh runs:
- decoy MGF generation
- PathSearcher inference
- FDR stage 1 filtering
- clustering + alignment to get top-k PTMs
- SeqFiller inference
- FDR stage 2 filtering