Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Chromap for processing the complex barcodes #42

Closed
lin-zhongbao opened this issue Dec 4, 2021 · 5 comments · Fixed by #92
Closed

Chromap for processing the complex barcodes #42

lin-zhongbao opened this issue Dec 4, 2021 · 5 comments · Fixed by #92

Comments

@lin-zhongbao
Copy link

Hi!

Apologies if I have missed this somewhere, and I am trying to process some scATAC-Seq data with chromap, there were more than one barcode and UMI molecular in the data, which there were two link sequence in the barcode reads and barcodes were not continuous.

chromap can process this data?

if so, how about the barcode mode is BC1+Adaptor1+BC2+Adaptor2+BC3+Adaptor3+BC4+UMI+PolyT?
BC1=8bp,BC2=8bp,BC3=8bp,BC4=8bp,UMI=12bp,Adaptor1=30bp,Adaptor2=25bp,Adaptor3=20bp

Thanks
Tao

@mourisl
Copy link
Collaborator

mourisl commented Dec 4, 2021

Hi Tao, Chromap only supports continuous barcode sequences, so you need to generate a new barcode file by concatenating those barcode segments.

@AlicePsyche
Copy link

Hi @lin-zhongbao

I am just wondering if you have found a more elegant solution now. I am working on a similar dataset as you, scATAC-seq with combinatory indexing. Is there any tool that can extract the CB just like kalliso? Seems to be a common feature in scRNA-seq mapping software but not in the ATAC field. Thanks in advance, any suggestions are appreciated.

Best,
Alice

@mourisl
Copy link
Collaborator

mourisl commented Apr 21, 2022

We have updated the --read-format option to accommodate the complex barcode by using multiple bc fields. UMI is not used in Chromap, but for the barcode, you can use the option: --read-format bc:0:7,bc:38:45,bc:66:73,bc:94:101

@AlicePsyche
Copy link

Hi, I just tried it on my new datasets. Combined with the whitelist, it worked extremely well!!!! Great thanks!! (Also, kind of shocked to see how fast it is now.....nearly 1min for 10M reads.....

@badoi
Copy link

badoi commented Sep 30, 2022

@AlicePsyche how are you using the whitelist using the split barcode? is your barcode whitelist for each split segment or for the combined sequence?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging a pull request may close this issue.

4 participants