Second generation of the ICGC DCC release ETL built on Spark

ICGC DCC - Release

Second generation of the ICGC DCC ETL, built on Spark. For the first-generation ETL project, please see the dcc-etl repository.


To build the application, execute the following from the command line:

mvn clean package


For a high-level overview of the application please see

Sub-system modules:


For information on how to build a custom version of Spark, please see

Running Application

For general instructions on how to run data processing with the dcc-release application, please see

DCC Instructions

For DCC-specific instructions, please see the internal documentation.


For information on FATHMM, please see