Skip to content

This issue was moved to a discussion.

You can continue the conversation there. Go to discussion →

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

10X sc TCR sequencing outputs #1387

Closed
bshim181 opened this issue Oct 12, 2023 · 2 comments
Closed

10X sc TCR sequencing outputs #1387

bshim181 opened this issue Oct 12, 2023 · 2 comments

Comments

@bshim181
Copy link

Hello,

I was comparing the outputs from MiXCR and Cellranger and I noticed that there are some differences between number of cell barcodes identified and the chains it gets assigned with. I was first wondering what annotation MiXCR uses as default? is it based off of IMGT? or is it more of custom generated?

Furthermore, I noticed from two outputs, Cellranger did not have any cell barcodes that were supported under 25 reads and I was wondering if I could enforce some kind of threshold for contig/barcode detection with read number.

Lastly, I noticed that there were multi chain grouping functionality added to the MiXCR in Version 4.5.0. Is it possible to explain how the multi-chain clonotypes have been determined (brief explanation of the algorithm behind it)?

@bshim181 bshim181 changed the title 10 sc TCR sequencing outputs 10X sc TCR sequencing outputs Oct 16, 2023
@ejohnson643
Copy link

Bumping this! I would also really like to know how any multi-chain clonotypes are identified!

@mizraelson
Copy link
Member

Hi,

  • By default, MiXCR utilizes its own built-in library. However, if required, you can use IMGT.

  • MiXCR automatically calculates dynamic thresholds for Reads per UMI, UMIs per cell, and several other advanced filters. These filters are determined based on the barcode distribution in each sample to optimize performance when applied collectively. Filtering all cells by a fixed number of reads (e.g., 25) wouldn't be suitable in many scenarios. While it's possible to write a new preset to hardcode a 25 reads/cell limit, I'd strongly advise against it. Based on our experience, the filters we employ provide more accurate immune repertoire analysis.

  • The grouping algorithm operates in the following manner: We first filter out non-productive clones and cell barcodes that combine chains, which are obviously from duplets (e.g., a combination of TCR and BCR or IGK and IGL in a single cell). Clones are then grouped by CELL barcodes. Subsequently, we attempt to merge cell groups, adding to the most substantial group any group that is fully encompassed by the target. This method results in the formation of cell groups sharing the same chains. We plan on refining this algorithm in the next release to better cater to data with significant noise.

Sincerely,
Mark

@milaboratory milaboratory locked and limited conversation to collaborators Oct 23, 2023
@mizraelson mizraelson converted this issue into discussion #1400 Oct 23, 2023

This issue was moved to a discussion.

You can continue the conversation there. Go to discussion →

Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants