Reads with / characters in the read names cause the pipeline to fail #57

Open
2 tasks done
marc-jones opened this issue Oct 1, 2021 · 1 comment
Labels
bug Something isn't working


Check Documentation

I have checked the following places for your error:

Description of the bug

If the read names contain / characters, the pipeline fails at the dedup step. The root cause is earlier, though: STAR truncates read names at the / character, which strips the UMI information that was added earlier in the pipeline and causes the dedup step to fail.
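To illustrate the failure mode described above, here is a minimal sketch in Python. It is not taken from the pipeline or from STAR: `append_umi` is a hypothetical stand-in for the move_umi step, and `truncate_read_name` only models the truncation as reported in this issue (keep the text before the first space, then cut at the first /).

```python
def append_umi(read_name: str, umi: str) -> str:
    """Hypothetical stand-in for move_umi: append the UMI to the first
    space-delimited token of the read name."""
    head, sep, rest = read_name.partition(" ")
    return f"{head}_{umi}{sep}{rest}"


def truncate_read_name(read_name: str) -> str:
    """Assumed model of the truncation reported in this issue: keep the
    text before the first space, then drop everything from the first '/'."""
    first_token = read_name.partition(" ")[0]
    return first_token.partition("/")[0]


# Read name with a space: the '/' sits after the space, so the UMI survives.
with_space = append_umi("ERR039787.1 HWI-EAS202_223:6:1:37:2026/1", "GTTCCAC")
print(truncate_read_name(with_space))

# Read name without a space: the UMI lands after the '/', so it is cut off.
no_space = append_umi("HWI-EAS202_223:6:1:37:2026/1", "GTTCCAC")
print(truncate_read_name(no_space))
```

Under these assumptions the first name keeps its `_GTTCCAC` suffix, while the second comes out with no UMI at all, which matches the dedup failure.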

Steps to reproduce

Steps to reproduce the behaviour:

Run the pipeline with reads that have a / in the read name. Example

Expected behaviour

Either a verbose error message saying that the reads are incompatible, or automatic removal or replacement of the / characters.
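Either option could look roughly like the sketch below. This is only an illustration of the two behaviours, not a proposed patch: the function names and the `-` replacement character are hypothetical, and it assumes that only a / in the first space-delimited token of the read name is a problem.

```python
def check_read_name(read_name: str) -> None:
    """Option 1: fail early with a verbose message (hypothetical helper)."""
    first_token = read_name.partition(" ")[0]
    if "/" in first_token:
        raise ValueError(
            f"Read name {read_name!r} contains '/' before the first space; "
            "the UMI appended to it would be lost when the name is truncated "
            "during mapping. Remove or replace the '/' and rerun."
        )


def sanitise_read_name(read_name: str, replacement: str = "-") -> str:
    """Option 2: replace '/' in the first token (hypothetical helper).
    Text after the first space is left untouched."""
    head, sep, rest = read_name.partition(" ")
    return head.replace("/", replacement) + sep + rest


print(sanitise_read_name("HWI-EAS202_223:6:1:37:2026/1"))
print(sanitise_read_name("ERR039787.1 HWI-EAS202_223:6:1:37:2026/1"))
```

The check would run on the raw FASTQ headers before the UMI is moved, so the user sees the problem immediately instead of at the dedup step.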

@marc-jones marc-jones added the bug Something isn't working label Oct 1, 2021

marc-jones commented Oct 1, 2021

Interestingly, when there's a space in the read name, it works correctly:

# initial format
@ERR039787.1 HWI-EAS202_223:6:1:37:2026/1

# format after move_umi
@ERR039787.2_GTTCCAC HWI-EAS202_223:6:1:41:30/1

# format after STAR mapping
ERR039788.1477620_GAAGCCC
