Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

explore: Find unknown recombinants in dataset #12

Open
ktmeaton opened this issue Dec 5, 2023 · 0 comments
Open

explore: Find unknown recombinants in dataset #12

ktmeaton opened this issue Dec 5, 2023 · 0 comments

Comments

@ktmeaton
Copy link
Collaborator

ktmeaton commented Dec 5, 2023

A current limitation in rebar is that you somewhat need to known which populations in the dataset are recombinants. I'd like to write a new subcommand maybe explore that will check all pairwise combinations in a dataset.

So it would work like: rebar explore --dataset-dir dataset/sars-cov-2 --output-dir output/explore/sars-cov-2.

And you could supply the --min-parents/--max-parents arguments like in rebar run. If --min-parents 2 and --max-parents 2, rebar would compare every population against every other pairwise combination. It would be a pretty hefty calculation, but could be a good assessment for Issue #1 (efficiency improvements).

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant