-
Notifications
You must be signed in to change notification settings - Fork 22
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
No subdirectories generated after assemble step #21
Comments
Hi Linda,
BraCeR is not intended for use with data generated using the 10x Genomics Chromium.
BraCeR expects sequencing reads to be demultiplexed into separate fastq files for each cell so that it can run on one cell at a time. If you’re not doing that it’s unlikely to work. Even if you are doing that, I don’t know what performance to expect if you use 10x data and, if you’re using the 10x VDJ assay, I recommend you use Cell Ranger to assemble your BCR sequences.
If you wish to use BraCeR’s clonality inference and lineage tree construction methods you should then be able to use the Cell Ranger VDJ sequences as input.
Best,
Mike
… On 13 Aug 2018, at 19:27, Linda-Lan ***@***.***> wrote:
Hi BraCeR team,
Thank you for the previous solutions which are work!
I use FASTQ file containing #1 and #2 mates from paired-end sequencing generated from 10x genomic system. This sample supposed to be detected 500+ cells +400 paired vdj through 10x’s software-cellranger vdj.
I got the following output which unlike test data contains 3 cells. Unsure whether I should
Set cell_name for 500+ cells that I won’t be able to know until BCR construction. Do you have any solution for this? Please let me know if you need more information!
Mac-Pro:outs patrickwilson$ cd ..
Mac-Pro:319vdj patrickwilson$ ls
319-VDJ_S11_L006_I1_001.fastq 319-VDJ_S11_L006_R1_001.fastq.gz outs
319-VDJ_S11_L006_I1_001.fastq.gz 319-VDJ_S11_L006_R2_001.fastq
319-VDJ_S11_L006_R1_001.fastq 319-VDJ_S11_L006_R2_001.fastq.gz
Mac-Pro:319vdj patrickwilson$ cd outs
Mac-Pro:outs patrickwilson$ pwd
/Users/patrickwilson/Desktop/linda/319vdj/outs
Mac-Pro:outs patrickwilson$ ls
319vdj
Mac-Pro:outs patrickwilson$ cd 319vdj/
Mac-Pro:319vdj patrickwilson$ ls
BLAST_output aligned_reads trimmed_reads
IgBLAST_output expression_quantification unfiltered_BCR_seqs
Trinity_output filtered_BCR_seqs
Mac-Pro:319vdj patrickwilson$
Thank you,
Linda
—
You are receiving this because you are subscribed to this thread.
Reply to this email directly, view it on GitHub, or mute the thread.
|
Hi Mike, |
Hi Mike and @idalind, Thank you for reply. I do intend to use BraCeR’s clonality inference and lineage tree construction methods because the visualization figure is clear and informative! Here is my output from Cellranger vdj, would like to know which would be input files? Best, [lindalan@midway-login2 vdj_out]$ ls |
Hi @mstubb , any news on this? |
Hi @Linda-Lan and @JohnMCMa . Apologies for the late reply. I will have a look at this in the next few days and will get back to you soon. |
Hi again. I would try parsing the 10x output file "filtered_contig.fasta" into separate fasta files for each cell (according to the cell barcodes given by 10x), for example in python like this:
This will give a fasta file for each cell (with the barcode as cell name, but you could change this if you like), which each needs to be provided to bracer assemble. This is very quick, but needs to be done for each fasta file. Example for one fasta file:
I hope these instructions help, and please let me know if you have any further issues. |
Hi idalind, Thank you for the python script. It successfully generate fasta files of each cells from "filtered_contig.fasta". Should I use assemble function or I can directly use summarise? Because the sequence in fasta looks already assembled by cell ranger, containing VDJ genes. Lindas-MacBook-Pro:~ lindalan$ cd 327vdj/ Lindas-MacBook-Pro:327vdj lindalan$ cat test/CACACAACAGTCGTGC.fasta
I then run this commend, Traceback (most recent call last): Do you have any thoughts on this issue? Thank you! |
Hi @Linda-Lan, |
Hi BraCeR team,
Thank you for the previous solutions which are work!
I use FASTQ file containing #1 and #2 mates from paired-end sequencing generated from 10x genomic system. This sample supposed to be detected 500+ cells +400 paired vdj through 10x’s software-cellranger vdj.
I got the following output which unlike test data contains 3 cells. Unsure whether I should
Set cell_name for 500+ cells that I won’t be able to know until BCR construction. Do you have any solution for this? Please let me know if you need more information!
Mac-Pro:outs patrickwilson$ cd ..
Mac-Pro:319vdj patrickwilson$ ls
319-VDJ_S11_L006_I1_001.fastq 319-VDJ_S11_L006_R1_001.fastq.gz outs
319-VDJ_S11_L006_I1_001.fastq.gz 319-VDJ_S11_L006_R2_001.fastq
319-VDJ_S11_L006_R1_001.fastq 319-VDJ_S11_L006_R2_001.fastq.gz
Mac-Pro:319vdj patrickwilson$ cd outs
Mac-Pro:outs patrickwilson$ pwd
/Users/patrickwilson/Desktop/linda/319vdj/outs
Mac-Pro:outs patrickwilson$ ls
319vdj
Mac-Pro:outs patrickwilson$ cd 319vdj/
Mac-Pro:319vdj patrickwilson$ ls
BLAST_output aligned_reads trimmed_reads
IgBLAST_output expression_quantification unfiltered_BCR_seqs
Trinity_output filtered_BCR_seqs
Mac-Pro:319vdj patrickwilson$
I put the last step showing on my end of assemble step for your reference.
##Running Kallisto##
##Making Kallisto indices##
[build] loading fasta file /scratch/outs/319vdj/expression_quantification/kallisto_index/319vdj_transcriptome.fa
[build] k-mer length: 31
[build] warning: clipped off poly-A tail (longer than 10)
from 1549 target sequences
[build] warning: replaced 4 non-ACGUT characters in the input sequence
with pseudorandom nucleotides
[build] counting k-mers ... done.
[build] building target de Bruijn graph ... done
[build] creating equivalence classes ... done
[build] target de Bruijn graph has 1285321 contigs and contains 126172909 k-mers
##Quantifying with Kallisto##
[quant] fragment length distribution will be estimated from the data
[index] k-mer length: 31
[index] number of targets: 200,371
[index] number of k-mers: 126,172,909
[index] number of equivalence classes: 825,477
[quant] running in paired-end mode
[quant] will process pair 1: /scratch/outs/319vdj/trimmed_reads/319-VDJ_S11_L006_R1_001_val_1.fq
/scratch/outs/319vdj/trimmed_reads/319-VDJ_S11_L006_R2_001_val_2.fq
[quant] finding pseudoalignments for the reads ... done
[quant] processed 6,016,781 reads, 2,008,887 reads pseudoaligned
[quant] estimated average fragment length: 182.116
[ em] quantifying the abundances ... done
[ em] the Expectation-Maximization algorithm ran for 648 rounds
Thank you,
Linda
The text was updated successfully, but these errors were encountered: