Why my NA12878 test result is not same with NA12878_example_output_G.txt? #32

PavitaKae · 2019-11-22T03:45:07Z

This is my command.
~/HLA-LA/src/HLA-LA.pl --BAM NA12878.mini.cram --graph PRG_MHC_GRCh38_withIMGT --sampleID NA12878 --maxThreads 40

This is my test result.
R1_bestguess_G.txt

The text was updated successfully, but these errors were encountered:

AlexanderDilthey · 2019-12-20T13:48:47Z

Hi @PavitaKae, very difficult to tell - looking at the output file you provided, coverage on the class I genes (HLA-A, -B, -C) is very low. This would indicate that either the test file is corrupted, or that something with the read extraction process has gone wrong. Did you modify the reference extraction files in any way? Could you send an md5 of NA12878.mini.cram? And could you capture all of STDOUT and STDERR and post it here?

PavitaKae · 2019-12-25T04:41:15Z

This is my MD5sum for NA12878 file -> 45d1769ffed71418571c9a2414465a12
I didn't modify your reference graph, just download and make graph by following manual.
I attach file for .out and .err.

41436.out.txt
41436.err.txt

AlexanderDilthey · 2020-07-29T16:29:03Z

There is some issue with read extraction - in your output log, it says processBAM::extractSeeds(): getReadIDs 833136 reads, collected 402762 read IDs., whereas it should say processBAM::extractSeeds(): getReadIDs 13649900 reads, collected 1373415 read IDs..

In your error log, there is a message from Picard: To execute picard run: java -jar $EBROOTPICARD/picard.jar (also, there are some warning messages about the locale that come from Perl, but I don't think these matter too much).

If you go into the working directory for the sample (e.g. HLA-LA/working/NA12878_mini), R_1.fastq and R_2.fastq should both be about 500Mb in size (I would expect them to be smaller on your system), and extraction.bam should be a little bit larger than 310Mb (I would expect this to be the case on your system).

I think that there is some issue with Picard - if you execute the extraction command, i.e. /tarafs/biobank/data/modules/.local/easybuild/software/Miniconda3/4.4.10/envs/noon/bin/picard SamToFastq VALIDATION_STRINGENCY=LENIENT I=/tarafs/biobank/data/home/pkaewpro/proj0015/HLA-LA/working/NA_test/extraction.bam F=/tarafs/biobank/data/home/pkaewpro/proj0015/HLA-LA/working/NA_test/R_1.fastq F2=/tarafs/biobank/data/home/pkaewpro/proj0015/HLA-LA/working/NA_test/R_2.fastq FU=/tarafs/biobank/data/home/pkaewpro/proj0015/HLA-LA/working/NA_test/R_U.fastq 2>&1, manually, do you get an error message?

PavitaKae · 2021-01-26T07:08:03Z

Hi, AlexanderDilthey
I back to run again, it look good. Because i choose to install new picard program.
Thank you for your response. :D

AlexanderDilthey added the question label Dec 20, 2019

PavitaKae closed this as completed Jan 26, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Why my NA12878 test result is not same with NA12878_example_output_G.txt? #32

Why my NA12878 test result is not same with NA12878_example_output_G.txt? #32

PavitaKae commented Nov 22, 2019 •

edited

Loading

AlexanderDilthey commented Dec 20, 2019

PavitaKae commented Dec 25, 2019

AlexanderDilthey commented Jul 29, 2020

PavitaKae commented Jan 26, 2021

Why my NA12878 test result is not same with NA12878_example_output_G.txt? #32

Why my NA12878 test result is not same with NA12878_example_output_G.txt? #32

Comments

PavitaKae commented Nov 22, 2019 • edited Loading

AlexanderDilthey commented Dec 20, 2019

PavitaKae commented Dec 25, 2019

AlexanderDilthey commented Jul 29, 2020

PavitaKae commented Jan 26, 2021

PavitaKae commented Nov 22, 2019 •

edited

Loading