Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

conversion of GFA to rGFA #23

Open
gsc74 opened this issue Apr 27, 2022 · 4 comments
Open

conversion of GFA to rGFA #23

gsc74 opened this issue Apr 27, 2022 · 4 comments

Comments

@gsc74
Copy link

gsc74 commented Apr 27, 2022

@lh3, I'm curious to know , whether we can convert GFA to rGFA? . As per rGFA format, we need to maintain 3 additional information along with each segment lines (S). So,
Assuming we have GFA with segments links;

S	s1	AAT
S	s2	T

can't we just add ;

S	s1	AAT	LN:i:3	SN:Z:chr1	SO:i:90374744	SR:i:0
S	s2	T	LN:i:1	SN:Z:chr1	SO:i:176753158	SR:i:0

Will it be valid rGFA ?

@JosephLalli
Copy link

A GFA to rGFA tool would be extremely helpful for my use case at the moment. At a minimum, I could use more information about how the SO value is calculated; it's not 100% clear to me, and trying to determine what it should be while parsing the reference human pangenome freeze 1 (which only has intermittent SO tags) has proven to be unexpectedly complex!

@colindaven
Copy link

colindaven commented May 8, 2023

I think the reverse is true, a rGFA to GFA conversion tool is (also) severely lacking.

While minigraph is apparently the most efficient and promising pangenome tool at present, the use of rGFA prohibits downstream analysis using the odgi and vg toolkits (at least to my knowledge). Therefore, no SNP calling, odgi pavs calling etc is possible.

@peterdfields
Copy link

Has there been any progress made on this type of tool, either in gfatools or other toolchain?

@colindaven
Copy link

colindaven commented Aug 26, 2024

I think at present in 2024, I'd recommend creating new pangenomic graphs using Minigraph-cactus to get GFA format output (if possible).

There is discussion on this topic here too - https://www.biostars.org/p/9601440/#9601480

There is still a lot of room for improvement in the pangenomic tool GFA and rGFA landscape(s).

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

4 participants