collapse redundancy #8

yannickwurm · 2020-05-11T10:54:35Z

Hello,

thanks for this tool - extremely handy.

To keep the 1 row per SNP integrity of my VCF I want to use standard mode.
However, the many commas make subsequent parsing challenging.

It would be superb if there were a way to intelligently collapse the commas. For example:

if all are identical (e.g. for allele frequencies), just report one
or report the "worst possible outcome"

I'm supposing you haven't impleemnted an automated way of doing this?

Thanks
Yannick

rhpvorderman · 2020-05-12T08:08:44Z

Dear Yannick,

Unfortunately the original maintainer of this tool has left our organisation. This tool hasn't been actively maintained since.

What is it exactly that you want the tool to do. Can you provide an example?
Also can you give an example of the problems you get in subsequent parsing? Maybe there is an easier way to solve this problem.

Best regards,
Ruben

yannickwurm · 2020-05-12T11:14:56Z

Dear Ruben, thank you for the rapid reply. For example, in the gnomad frequency column, I was getting 0.4563,0.4563,0.4563,0.4563,0.4563 for a single entry - when I guess there were five transcripts for the gene. I was indeed able to resolve this issue, with the following R code ``` for (column_id in vcf_columns) { # eliminate redundancy where it exists vcf_tibble$fix[[column_id]] <- unlist(lapply(strsplit(x = as.character(vcf_tibble$fix[[column_id]]), split = ","), function(x) { paste(unique(x), collapse=",") })) } ``` Thanks again and kind regards, Yannick

…

On 12 May 2020, at 09:08, Ruben Vorderman ***@***.***> wrote: Dear Yannick, Unfortunately the original maintainer of this tool has left our organisation. This tool hasn't been actively maintained since. What is it exactly that you want the tool to do. Can you provide an example? Also can you give an example of the problems you get in subsequent parsing? Maybe there is an easier way to solve this problem. Best regards, Ruben — You are receiving this because you authored the thread. Reply to this email directly, view it on GitHub <https://eur01.safelinks.protection.outlook.com/?url=https%3A%2F%2Fgithub.com%2Fbiopet%2Fvepnormalizer%2Fissues%2F8%23issuecomment-627184343&data=02%7C01%7C%7C3ee46d135fc349e0a88208d7f64bba4c%7C569df091b01340e386eebd9cb9e25814%7C0%7C0%7C637248677428556748&sdata=2bud6iIjLx3rAPVT3pS00sZyc05fgjAyiW7kkjOh49c%3D&reserved=0>, or unsubscribe <https://eur01.safelinks.protection.outlook.com/?url=https%3A%2F%2Fgithub.com%2Fnotifications%2Funsubscribe-auth%2FAABEUYIN3MWIEMPS7QF72KLRRD7ZXANCNFSM4M5ZUDTA&data=02%7C01%7C%7C3ee46d135fc349e0a88208d7f64bba4c%7C569df091b01340e386eebd9cb9e25814%7C0%7C0%7C637248677428556748&sdata=3n4g0vtiAHmSit9t%2FUK2ntkJIp0mgVfKH8Ejev%2B30W4%3D&reserved=0>.

rhpvorderman · 2020-05-12T12:34:31Z

Dear Yannick,

I am happy you were able to solve the issue. Have a nice day!

Best regards,
Ruben

rhpvorderman closed this as completed May 12, 2020

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

collapse redundancy #8

collapse redundancy #8

yannickwurm commented May 11, 2020

rhpvorderman commented May 12, 2020

yannickwurm commented May 12, 2020 via email

rhpvorderman commented May 12, 2020

collapse redundancy #8

collapse redundancy #8

Comments

yannickwurm commented May 11, 2020

rhpvorderman commented May 12, 2020

yannickwurm commented May 12, 2020 via email

rhpvorderman commented May 12, 2020