Fluffy is an NIPT analysis pipeline written in Python and Nextflow. Fluffy predicts chromosomal trisomies using fetal fraction estimation by AMYCNE. Fluffy also computes multiQC data of the input sequence files. Fluffy outputs one csv file with a line per input sample which summarizes the main results per sample, including the Z-score of chromosomes prone to trisomies.
Fluffy can be directly run in Nextflow using:
nextflow main.nf --samplesheet /path/to/samplesheet.csv --fastq /path/to/fastq/folder --output /path/to/output/folder -c config.conf
Typically, the samplesheet and fastq folder is produced through demultiplexing. If the Samplesheet file has a line before the header (commonly "[Data]") add the following to the command:
--skipline true
The Fluffy pipeline requires two distinct inputs: the samplesheet and the a folder with fastq files per sample.
The samplesheet should have a header with the following column names:
FCID,Lane,Sample_ID,SampleRef,index,index2,SampleName,Control,Recipe,Operator,Sample_Project
There should be one comm-separated line for each sample to be analyzed
The folder with fastq files should contain one folder for each sample to be analysed. The folder names should contain {Sample_ID}. The standard prefix is 'Sample_' resulting in folder name Sample_{Sample_ID}. If folder prefix is different add:
--prefix prefix
to the run command.
Inside this folder should be all the fastq files for the sample. There should be at least one file for the forward reads and one file for reverse reads. If multiple lanes are used during sequencing this is not a problem. The file names should be '*{sample_ID}*_R1*fastq.gz' for forward reads and '*{sample_ID}*_R2*fastq.gz' for reverse reads. All fastq files should be gzipped
Fluffy produces several output files.
- summary.csv This file contains a header and one line per sample summarizing all the data extracted from the sample during analysis
- multiqc.html This file contains the multiQC report of all samples
- One folder per sample with folder name {Sample_ID}. In this folder there will be the output file of each separate step in the Fluffy pipeline
Dependencies:
Nextflow
Install the dependencies then download Fluffy.
git clone --recursive https://github.com/Clinical-Genomics/Fluffy_nextflow.git
You will need to download/create the following files:
Reference fasta (indexed using bwa)
WisecondorX reference files (created using the reference mode)
PREFACE model file (optional)
An example config file can be found in the configs folder. Default parameters are specied. Read the comments in the config.conf file for more information on the individual variables.