# Germline BrevDev Blueprint

NVIDIA Parabricks® is the only GPU-accelerated computational genomics toolkit that delivers fast and accurate analysis for sequencing centers, clinical teams, genomics researchers, and next-generation sequencing instrument developers. Parabricks provides GPU-accelerated versions of tools that computational biologists and bioinformaticians use every day, which shortens runtimes, helps workflows scale, and lowers compute costs.

The toolkit is compatible with common workflow languages and managers (WDL, Nextflow, Cromwell), so GPU- and CPU-powered tasks can be combined in a single pipeline. It also supports cloud deployment on AWS, GCP, Terra, and DNAnexus.

[Workflow diagram]

## Dataset

This data is / comes from ...

In [None]:
! wget -O parabricks_sample.tar.gz https://s3.amazonaws.com/parabricks.sample/parabricks_sample.tar.gz

In [None]:
! tar xzvf parabricks_sample.tar.gz

In [None]:
! tree parabricks_sample
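
Before moving on, it can help to confirm that the files the later cells refer to were actually extracted. The following is a minimal sketch, not part of the Parabricks tooling: `check_inputs` is a hypothetical helper, and the three paths are simply the ones used in the Alignment cell.

```shell
# Hypothetical helper: report which of the given files exist.
# Returns 0 if all are present, 1 if any is missing.
check_inputs() {
    missing=0
    for f in "$@"; do
        if [ -f "$f" ]; then
            echo "found:   $f"
        else
            echo "missing: $f"
            missing=1
        fi
    done
    return "$missing"
}

# Paths taken from the Alignment cell.
check_inputs \
    parabricks_sample/Ref/Homo_sapiens_assembly38.fasta \
    parabricks_sample/Data/sample_1.fq.gz \
    parabricks_sample/Data/sample_2.fq.gz \
    || echo "Some inputs are missing; re-run the download and extract cells."
```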

Add an output directory
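
One possible way to do this is sketched below. The directory name `parabricks_sample/Output` is an assumption made for illustration; the cells that follow still write their results to `parabricks_sample/Data`, so their output paths would need to be adjusted to use it.

```shell
# Assumed output location; the name is illustrative, not part of the sample bundle.
OUT_DIR="parabricks_sample/Output"

# -p creates parent directories as needed and does not fail if the directory exists.
mkdir -p "${OUT_DIR}"
ls -ld "${OUT_DIR}"
```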

## Alignment

In [None]:
%%sh

DOCKER_IMAGE="nvcr.io/nvidia/clara/clara-parabricks:4.4.0-1"

DATA_DIR="parabricks_sample/Data"
REF_DIR="parabricks_sample/Ref"

REF="${REF_DIR}/Homo_sapiens_assembly38.fasta"
FASTQ_1="${DATA_DIR}/sample_1.fq.gz"
FASTQ_2="${DATA_DIR}/sample_2.fq.gz"
OUT_BAM="${DATA_DIR}/sample.bam"

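# Mount the current directory at the same path inside the container and
# make it the working directory so the relative paths above resolve.
docker run --gpus all --rm \
    -v "$(pwd)":"$(pwd)" \
    -w "$(pwd)" \
    ${DOCKER_IMAGE} pbrun fq2bam \
    --ref ${REF} \
    --in-fq ${FASTQ_1} ${FASTQ_2} \
    --out-bam ${OUT_BAM}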
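
After the alignment finishes, a quick sanity check on the output can catch a failed or truncated run early. The sketch below uses a hypothetical helper, `looks_like_bam`, that only tests whether the file is non-empty and begins with the gzip magic bytes `1f 8b` (BAM files are BGZF-compressed, which is gzip-compatible). It does not validate the BAM contents.

```shell
# Hypothetical helper: return 0 if the file is non-empty and starts with
# the gzip/BGZF magic bytes 1f 8b. This is a shallow check, not a full
# BAM validation.
looks_like_bam() {
    [ -s "$1" ] || return 1
    magic="$(head -c 2 "$1" | od -An -tx1 | tr -d ' \n')"
    [ "$magic" = "1f8b" ]
}

# Path taken from OUT_BAM in the Alignment cell.
BAM="parabricks_sample/Data/sample.bam"
if looks_like_bam "${BAM}"; then
    echo "${BAM} looks like a compressed BAM file"
else
    echo "${BAM} is missing, empty, or not gzip-compressed"
fi
```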


## Variant Calling

In [None]:
%%sh

DOCKER_IMAGE="nvcr.io/nvidia/clara/clara-parabricks:4.4.0-1"

DATA_DIR="parabricks_sample/Data"
REF_DIR="parabricks_sample/Ref"

REF="${REF_DIR}/Homo_sapiens_assembly38.fasta"
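IN_BAM="${DATA_DIR}/sample.bam"
OUT_VCF="${DATA_DIR}/sample.vcf"

# Mount the current directory at the same path inside the container and
# make it the working directory so the relative paths above resolve.
# Variants go to a separate VCF so the input BAM is not overwritten.
docker run --gpus all --rm \
    -v "$(pwd)":"$(pwd)" \
    -w "$(pwd)" \
    ${DOCKER_IMAGE} pbrun deepvariant \
    --ref ${REF} \
    --in-bam ${IN_BAM} \
    --out-variants ${OUT_VCF}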
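
As a first look at the results, the number of variant records can be counted directly from the VCF text. This is a minimal sketch that assumes the calls were written to an uncompressed VCF; the path `parabricks_sample/Data/sample.vcf` and the helper `count_variants` are assumptions made for illustration.

```shell
# Hypothetical helper: count non-header records in an uncompressed VCF.
# Header lines in a VCF start with '#'.
count_variants() {
    grep -vc '^#' "$1"
}

# Assumed output path; adjust to wherever the variant calls were written.
VCF="parabricks_sample/Data/sample.vcf"
if [ -f "${VCF}" ]; then
    echo "variant records: $(count_variants "${VCF}")"
else
    echo "no VCF found at ${VCF}"
fi
```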

## Next Steps