Generate reusable JSON job bundles from BioGRID interactor exports and seed-sequence bait protein inputs.
- A config file that lists your bait proteins and job-set definitions.
- An input directory containing:
main_sequences.txtwith named protein sequence blocks.- One BioGRID-style export per seed protein, named
<PROTEIN>.txt.
- Optionally, a cached accession-to-sequence TSV.
- Edit
configs/job_sets.jsonto match your proteins and grouping strategy. - Put your input files under
input_data/. - Run:
python generate_outputs.py --repo-root .Generated files are written to generated_job_output/ by default.
main_sequences.txt uses named blocks like:
PROTEIN_A:
MSEQUENCEEXAMPLE
PROTEIN_B:
MSEQUENCEEXAMPLE
Each per-protein interaction file should be a BioGRID-style tab3 txt export for that protein.
configs/job_sets.jsonis only a generic example; replace every placeholder before use.