profile and optimize ApplyBQSR #1033

Closed
akiezun opened this issue Oct 21, 2015 · 3 comments

akiezun commented Oct 21, 2015

The goal is to be at least as fast as GATK 3.4 on a single thread. This is for the walker version of the tool.
The ticket can be split into a) profile and b) optimize, if needed.

akiezun commented Oct 23, 2015

Case 1:
chr10 of CEUTrio.HiSeq.WEx.b37.NA12892.bam from the GATK bundle, size 1.2Gb

Running on Mac OS X 10.9.5 x86_64; Java HotSpot(TM) 64-Bit Server VM 1.8.0_25-b17, local SSD

11.30 min GATK v3.4-46-gbc02625
9 min GATK 4.pre-alpha-51-ge098c21

GATK3 time:

real    11m21.538s
user    17m24.320s
sys     0m8.073s

GATK4 time:

real    9m7.691s
user    9m2.250s
sys     0m22.603s
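
For a quick comparison of the two wall-clock figures, here is a minimal sketch that converts the `real` values above to seconds and takes their ratio. The helper name `parse_time` is made up for illustration and is not part of GATK; it only handles the `NmS.SSSs` form that `time` printed here.

```python
import re


def parse_time(value: str) -> float:
    """Convert a `time`-style duration such as '11m21.538s' to seconds."""
    match = re.fullmatch(r"(\d+)m(\d+(?:\.\d+)?)s", value.strip())
    if match is None:
        raise ValueError(f"unrecognized duration: {value!r}")
    minutes, seconds = match.groups()
    return int(minutes) * 60 + float(seconds)


# Wall-clock ("real") times copied from the case 1 output above.
gatk3_real = parse_time("11m21.538s")
gatk4_real = parse_time("9m7.691s")

# Ratio above 1 means the GATK4 run finished sooner than the GATK3 run.
ratio = gatk3_real / gatk4_real
print(f"GATK3: {gatk3_real:.1f}s  GATK4: {gatk4_real:.1f}s  ratio: {ratio:.2f}")
```

This is a single run per tool, so the ratio is only a rough indication.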

The profile shows that almost all of the time goes into the BQSR work, and nothing obvious can be removed.

[image: profiler screenshot]

droazen commented Oct 23, 2015

Can you repeat this same test over an NFS filesystem from a Broad server?

akiezun commented Oct 26, 2015

Case 2, on NFS:
ref: /humgen/1kg/reference/human_g1k_v37.fasta
bam: /xchip/cga_home/akiezun/data/CEUTrio.HiSeq.WEx.b37.NA12892.chr10.bam (1.9Gb)

Running on host dataflow01
Linux 2.6.32-573.3.1.el6.x86_64 amd64; Java HotSpot(TM) 64-Bit Server VM 1.8.0-b132.

12.47 minutes GATK v3.4-46-gbc02625
11.8 minutes GATK 4.pre-alpha-45-g168cd60

GATK3 time:

real    12m32.499s
user    18m58.643s

GATK4 time:

real    11m49.027s
user    12m20.699s
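
Since the goal is a single-thread comparison, it can help to look at the `user`/`real` ratio as well as the wall-clock time. The sketch below computes it from the case 2 numbers above. As a general property of `time` output, `user` sums CPU time across threads, so a ratio well above 1 suggests that more than one core was busy on average; which threads are responsible is not something these numbers show. The helper `to_seconds` and the `runs` dictionary are illustrative names only.

```python
def to_seconds(minutes: int, seconds: float) -> float:
    """Combine the minutes and seconds parts of a `time` value."""
    return minutes * 60 + seconds


# "real" and "user" times copied from the case 2 output above.
runs = {
    "gatk3": {"real": to_seconds(12, 32.499), "user": to_seconds(18, 58.643)},
    "gatk4": {"real": to_seconds(11, 49.027), "user": to_seconds(12, 20.699)},
}

# CPU time consumed per second of wall-clock time.
ratios = {name: t["user"] / t["real"] for name, t in runs.items()}

for name, value in ratios.items():
    print(f"{name}: user/real = {value:.2f}")
```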
