The dataset analyzed here consists of two sets of files and is described in:
DOI: TBD
All sequencing files are available via the NCBI database and were gathered using a MiSeq machine, with a paired-end 2*300 bp approach, based on amplicons.
The pipeline relies on snakemake for the automation of the analysis, which in turns uses conda to create the environment for each of the specific steps in the pipeline. The other python package necessary to get the pipeline started is Pandas.
Other softwares included in this project, separately from those available through conda include:
The snakefile corresponding to the analysis of the initial library of fragments, together with the definition of the different conda environments is available in the folder initial_library_analysis
The snakefile corresponding to the analysis of the positive interaction results, together with the definition of the different conda environments is available in the folder y2h_positives_analysis