Second generation of the ICGC DCC release ETL built on Spark

Second generation of the ICGC DCC ETL, built on Spark. For the first-generation ETL project, please see the dcc-etl repository.


To build the application, execute the following from the command line:

mvn clean package


For a high-level overview of the application please see

Sub-system modules:


For information on how to build a custom version of Spark please see

Running Application

For general instructions on how to run data processing with the dcc-release application please see

DCC Instructions

For DCC-specific instructions, please see the internal documentation.


For information on FATHMM, please see