Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Choose primers to trim #625

Closed
5 tasks done
donkirkby opened this issue Oct 15, 2020 · 2 comments
Closed
5 tasks done

Choose primers to trim #625

donkirkby opened this issue Oct 15, 2020 · 2 comments

Comments

@donkirkby
Copy link
Member

donkirkby commented Oct 15, 2020

We currently have three sets of primers that we can trim:

  • HCV
  • MidHCV
  • SARS-CoV-2

The SARS-CoV-2 set has over 100 primers in it, so it tends to randomly match a significant number of reads. Let users specify which primers are expected in the samples, and only trim those. For the micall_watcher script, we can either upload the sample name in a text file, and let the pipeline parse the project out of it, or upload the project code in a text file.

  • add project code to trimming step, and choose primer set
  • create sample_info.csv file in release_test_microtest.py and pass to singularity image
  • create sample_info.csv file in micall_watcher.py and upload to Kive
  • choose primer set in docker driver with command-line option
  • let BaseSpace user choose primer set
@donkirkby donkirkby added this to the 7.14 milestone Oct 15, 2020
@donkirkby donkirkby pinned this issue Oct 21, 2020
@donkirkby
Copy link
Member Author

I think a sample_info.csv file with the sample name and project code in separate columns should work. The sample name can be displayed in reports and graphs, and the project code wouldn't need to be parsed out of the sample name. If we decide we want more details later, we can add more columns. Each sample would receive a file with one row in it.

sample project
E1234-HIV_S1 HIV

This file would also be useful when we start collating all the result files for a run folder in the next release of Kive.

@donkirkby
Copy link
Member Author

@cbrumme, we don't want the mapping stage or the BLAST search to narrow the set of references based on the project code, do we? We want to look for cross contamination from other samples, right? This task is just for narrowing the set of primers to trim.

donkirkby added a commit that referenced this issue Dec 4, 2020
Regenerate 2170 microtest, because of changes to primer trimming, rename project code in 2190, and 2200 to check that primer trimming works.
Split primers into two sets: HCV and SARSCOV2.
@donkirkby donkirkby unpinned this issue Dec 9, 2020
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant