Skip to content

Annotate bed overlaps to fix #1806#1808

Merged
adamnovak merged 6 commits intovgteam:masterfrom
adamnovak:annotate-bed-overlaps
Aug 3, 2018
Merged

Annotate bed overlaps to fix #1806#1808
adamnovak merged 6 commits intovgteam:masterfrom
adamnovak:annotate-bed-overlaps

Conversation

@adamnovak
Copy link
Copy Markdown
Member

This should fix #1806 by allowing you to use vg annotate -x graph.xg -b features.bed -g reads.gam to annotate reads with the BED feature names that they overlap. The annotation is "features", accessible with jq as .annotation.features, and is string-vector-valued. BED records with no name get the empty string as their feature name. Multiple BED records with the same name appear only once in the annotation.

This is designed for large-ish, sparse-ish features spanning many nodes; it won't perform well with many small features on a single node, because within a node I am using a linear scan to find overlaps.

@ghost ghost assigned adamnovak Aug 2, 2018
@ghost ghost added the in progress label Aug 2, 2018
@cgcloud-jenkins
Copy link
Copy Markdown

@cgcloud-jenkins
Copy link
Copy Markdown

@adamnovak adamnovak force-pushed the annotate-bed-overlaps branch from 8546d03 to 64aa55a Compare August 2, 2018 23:25
@cgcloud-jenkins
Copy link
Copy Markdown

Jenkins vg tests complete for PR 1808. View the full report here.

19 tests passed, 0 tests failed and 10 tests skipped in 20025 seconds

Tests produced 2790 warnings. 2790 were for lower-than-expected alignment scores

@adamnovak adamnovak merged commit ae4cda8 into vgteam:master Aug 3, 2018
@ghost ghost removed the in progress label Aug 3, 2018
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

Allow vg annotate to annotate reads by overlap with one or more BED files

2 participants