Skip to content

vg haplotypes errors about mc graph top-level chain being a loop #4772

@ASLeonard

Description

@ASLeonard

Hi,

I was trying to create a haplotype index for a 77-sample mc graph (built with v3.1) using vg 1.70. The steps were

vg gbwt --gbz-input graph.gbz --r-index graph.ri
vg index--no-nested-distance --dist-name graph.dist graph.gbz

vg haplotypes --haplotype-output graph.hapl --distance-index graph.dist --r-index graph.ri graph.gbz

But the haplotypes command reports an issue with top-level chains being a loop. This error seems to have appeared around v1.68 in c73f635. Going back to vg 1.67 gives the error

error: [vg haplotypes] HaplotypePartitioner::partition_haplotypes(): there are 1020 top-level chains and 1957 weakly connected components; haplotype sampling cannot be used with this graph

I believe asking cactus to make the hapl index does use reference tips to upweight nodes, so I can also check that later if necessary, but otherwise this should almost the same procedure.

Best,
Alex

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions