Add functions to extract UMI at the 3' end of reads #86

Closed
huipan1973 opened this issue Jun 20, 2024 · 5 comments

Comments

@huipan1973

Is there an option to extract UMI at the 3' end of reads? Could you add that function?

@huipan1973
Author

It looks like the pipeline performs UMI extraction before adaptor trimming. Our 3' UMI is upstream of the 3' adapter. Is there a way to trim the adapter and then extract the UMI?

@iraiosub
Owner

iraiosub commented Jun 21, 2024

If your read design is insert-UMI-adaptor, with the UMI located immediately upstream of the adaptor sequence, you can still use riboseq-flow (which uses UMI-tools) to extract the UMI and move it to the read header without removing the adaptor, and then trim the adaptor afterwards.

You can do this by passing the appropriate UMI-tools options through riboseq-flow: set --umi_extract_method (equivalent to --extract-method in UMI-tools) to 'regex', and set --umi_pattern (equivalent to --bc-pattern in UMI-tools) to a regular expression that matches your read structure. Please consult the UMI-tools documentation to work out the settings and expression appropriate for your case.
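To illustrate the idea, here is a minimal plain-Python sketch of what a regex-based extraction does for an insert-UMI-adaptor read. It does not call UMI-tools or riboseq-flow. The adaptor sequence, the 8-nt UMI length, and the helper names `extract_umi` and `trim_adaptor` are assumptions made up for this example. The `umi_1` group name follows the named-group convention UMI-tools uses for regex extraction, but the exact pattern for a real library should be checked against the UMI-tools documentation.

```python
import re

# Assumed values for illustration only; replace with your library's design.
ADAPTOR = "AGATCGGAAGAGC"  # hypothetical 3' adaptor sequence
UMI_LEN = 8                # hypothetical UMI length

# The UMI is the UMI_LEN bases immediately upstream of the adaptor.
# The adaptor is matched but not captured in the UMI group, so it stays in the read.
PATTERN = re.compile(rf"^.+?(?P<umi_1>.{{{UMI_LEN}}}){ADAPTOR}")


def extract_umi(name, seq):
    """Move the UMI to the read name and leave the adaptor in the sequence."""
    match = PATTERN.search(seq)
    if match is None:
        return None  # reads that do not match the pattern are dropped
    start, end = match.span("umi_1")
    umi = seq[start:end]
    return f"{name}_{umi}", seq[:start] + seq[end:]


def trim_adaptor(seq):
    """Remove the adaptor and everything downstream of it (exact match only)."""
    idx = seq.find(ADAPTOR)
    return seq if idx == -1 else seq[:idx]


insert = "TTGACCGTAGGCTAACGT"
read = insert + "ACGTACGT" + ADAPTOR + "TCGT"

# Step 1: UMI extraction, adaptor retained.
name, seq = extract_umi("read1", read)
# Step 2: adaptor trimming on the UMI-extracted read.
trimmed = trim_adaptor(seq)
print(name, trimmed)
```

Because the adaptor is still present after step 1, the trimming step can locate it as usual, so the order of the two steps does not need to change.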

As this functionality is already supported by riboseq-flow, I will now close this issue.

@huipan1973
Author

huipan1973 commented Jun 21, 2024 via email

@iraiosub
Owner

Hi Hui,

If you provide UMI-tools with a regex that retains the adaptor in the reads, you can extract the UMIs first and then trim the adaptor, without swapping the order of these steps in riboseq-flow. We implement UMI extraction before adaptor trimming for technical reasons. Since the pipeline already works with your library design, we will not modify the workflow to accommodate this use case.

@huipan1973
Author

huipan1973 commented Jun 21, 2024 via email
