Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

Already on GitHub? Sign in to your account

fastq-multx not identifying barcodes #60

Open
SJRussell opened this Issue May 10, 2017 · 5 comments

Comments

Projects
None yet
4 participants

I have a fastq file with 1.7 million reads of 75 bps, and 28 different barcodes. My command:
``fastq-multx -l barcodes.txt all_samples.fastq -e -o %.fq -x"

Part of my barcode file: barcodes.txt:
image

Head of my fastq, with some barcodes highlighted:
image

My output, which leaves most reads multiplexed:
picture2

Suggestions? What does (shifted) mean?

Thanks,

Stewart

Having this problem as well, for some reason fastq-multx is not recognizing the barcode correctly.

Collaborator

drsuuzzz commented Jul 18, 2017

Can you give some more information about the type of assay and sequencing equipment used?

Sure, Illumina Miseq, paired ends. R1 forward and R3 reverse sequences.

/Users/###/miniconda2/bin/fastq-multx -B /Volumes/homes/###/SRA/Columbia.Gut.Murine/map/reversecomplement/reversecomplement.txt /Volumes/homes/###/SRA/Columbia.Gut.Murine/2_fastq/lane1_NoIndex_L001_R1_001.fastq /Volumes/homes/###/SRA/Columbia.Gut.Murine/2_fastq/lane1_NoIndex_L001_R3_001.fastq -o /Volumes/homes/###/SRA/Columbia.Gut.Murine/SRA/R3/R3.%.fastq /Volumes/homes/###/SRA/Columbia.Gut.Murine/SRA/R3/R1.%.fastq
/Users/###/miniconda2/bin/fastq-multx -B /Volumes/homes/###/SRA/Columbia.Gut.Murine/map/forward/forward2.txt /Volumes/homes/###/SRA/Columbia.Gut.Murine/2_fastq/lane1_NoIndex_L001_R1_001.fastq /Volumes/homes/###/SRA/Columbia.Gut.Murine/2_fastq/lane1_NoIndex_L001_R3_001.fastq -o /Volumes/homes/###/SRA/Columbia.Gut.Murine/SRA/R1/R1.%.fastq -o /Volumes/homes/###/SRA/Columbia.Gut.Murine/SRA/R1/R3.%.fastq
Using Barcode File: /Volumes/homes/###/SRA/Columbia.Gut.Murine/map/forward/forward2.txt

Returns:

  1. Empty fastq files

screen shot 2017-07-18 at 8 21 48 am

I know each of these samples should have 5000-10000 amplicons per sample.

The behavior is also very erratic, it works with some fastq files (demultiplexes appropriately) and on other runs it doesn't demultiplex at all and returns empty files.

Primer barcodes.
screen shot 2017-07-18 at 8 23 02 am

I am also using: Version: 1.3.1

Here is cat of the fastq file

+
AAAAAFFAA11>AEGGGFGCGGHGGGAEHHFHHGHHEGGGHH1B01BF/BFEGCEE@/B2222BFFFF1BEFACEHF1B@/EEGEE<FHH1F00?0<B11<//?>///1?/C////1?F0<0>FCGFF=<GEGG<-..<<E00:CGHHHH:
@MISEQ01:40:000000000-ART6C:1:1101:16675:2021 1:N:0:
GAAATATCCTTTGCAGTAGCGCCAATATGAGAAGAGCCATACCGCTGATTCTGCGTTTGCTGATGAACTAAGTCAACCTCAGCACTAACCTTGCGAGTCATTTCTTTGATTTGGTCATTGGTAAAATACTGACCAGCCGTTTGAGCTTGAG
+
AAA3AFFFFFFDGG54BDEGGCGGFFHHGGFCFHFHGHHDHHHGGGGDHHHHHHGGGGGHHFFGHHHHHHHGHHHFHHHHHHHFGGHGHHHHHHGGGGHGFHGHHHHGGHHHHHFHHGHGGHHGHHHHHHGHHFHHHGGGGHFHGFHHHEG
@MISEQ01:40:000000000-ART6C:1:1101:19362:2021 1:N:0:
TACGTAGGGGGCAAGCGTTATCCGGATTTACTGGGTGTAAAGGGGGCGCAGACGGCAATGCAAGCCAGGAGTGAAAGCCCGGGGCCCAACCCCGGGACTGCTCTTGGAACTGCATGGCTGGAGTACAGGCGGGGCAGGCGGAATTCCTAAT
+

Would you be able to attach some sample data that we can work with to recreate the problem?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment