Hello vg team,
Thank you for developing and maintaining vg and giraffe.
I observed that vg giraffe can be extremely slow on Illumina paired-end WGS data mapped to a minigraph-cactus pangenome graph on an HPC system. Some samples (fungal pathogen) took more than a day to finish with default settings, even after trying common solutions such as colocating inputs and outputs, copying index files (including .dist) to local scratch etc. In contrast, when I disabled rescue using -A none, the same samples completed in minutes and produced .gam files consistently.
After disabling rescue, the .gam file looks like:
This suggests that rescue is the dominant performance bottleneck in my case. I would appreciate clarification on the trade-offs of disabling rescue, particularly regarding sensitivity loss and whether -A none is generally acceptable for high-coverage Illumina WGS and downstream SNP or population-genetic analyses.
Or do you have any other suggestions on how to speed up the process?
Thank you very much for your time and guidance.
Best regards,
Sumit