-
Notifications
You must be signed in to change notification settings - Fork 0
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
collapse redundancy #8
Comments
Dear Yannick, Unfortunately the original maintainer of this tool has left our organisation. This tool hasn't been actively maintained since. What is it exactly that you want the tool to do. Can you provide an example? Best regards, |
Dear Ruben,
thank you for the rapid reply.
For example, in the gnomad frequency column, I was getting 0.4563,0.4563,0.4563,0.4563,0.4563 for a single entry - when I guess there were five transcripts for the gene.
I was indeed able to resolve this issue, with the following R code
```
for (column_id in vcf_columns) {
# eliminate redundancy where it exists
vcf_tibble$fix[[column_id]] <- unlist(lapply(strsplit(x = as.character(vcf_tibble$fix[[column_id]]),
split = ","),
function(x) { paste(unique(x), collapse=",") }))
}
```
Thanks again and kind regards,
Yannick
… On 12 May 2020, at 09:08, Ruben Vorderman ***@***.***> wrote:
Dear Yannick,
Unfortunately the original maintainer of this tool has left our organisation. This tool hasn't been actively maintained since.
What is it exactly that you want the tool to do. Can you provide an example?
Also can you give an example of the problems you get in subsequent parsing? Maybe there is an easier way to solve this problem.
Best regards,
Ruben
—
You are receiving this because you authored the thread.
Reply to this email directly, view it on GitHub <https://eur01.safelinks.protection.outlook.com/?url=https%3A%2F%2Fgithub.com%2Fbiopet%2Fvepnormalizer%2Fissues%2F8%23issuecomment-627184343&data=02%7C01%7C%7C3ee46d135fc349e0a88208d7f64bba4c%7C569df091b01340e386eebd9cb9e25814%7C0%7C0%7C637248677428556748&sdata=2bud6iIjLx3rAPVT3pS00sZyc05fgjAyiW7kkjOh49c%3D&reserved=0>, or unsubscribe <https://eur01.safelinks.protection.outlook.com/?url=https%3A%2F%2Fgithub.com%2Fnotifications%2Funsubscribe-auth%2FAABEUYIN3MWIEMPS7QF72KLRRD7ZXANCNFSM4M5ZUDTA&data=02%7C01%7C%7C3ee46d135fc349e0a88208d7f64bba4c%7C569df091b01340e386eebd9cb9e25814%7C0%7C0%7C637248677428556748&sdata=3n4g0vtiAHmSit9t%2FUK2ntkJIp0mgVfKH8Ejev%2B30W4%3D&reserved=0>.
|
Dear Yannick, I am happy you were able to solve the issue. Have a nice day! Best regards, |
Hello,
thanks for this tool - extremely handy.
To keep the 1 row per SNP integrity of my VCF I want to use standard mode.
However, the many commas make subsequent parsing challenging.
It would be superb if there were a way to intelligently collapse the commas. For example:
I'm supposing you haven't impleemnted an automated way of doing this?
Thanks
Yannick
The text was updated successfully, but these errors were encountered: