Step 1: Download the relevant data

This repository hosts the source code and instructions to reproduce the analysis and results from our study on epilepsy and copy number variation.

If you just want the CNV calls, find them in the FigShare repository, or directly here.

To review the code and resulting graphs/numbers, have a look at the R-markdown reports in the reports folder and scripts in the src folder.

To rerun the analysis, follow these steps:

Step 1: Download the relevant data

The necessary data has been deposited on FigShare. Depending on the analysis, you might not need to download all the data.

Still, the easiest way is to download all the data (1.5 GB) and unzip it in the data folder.

Soon, we will prepare different packs that will be downloadable with:

downloadBenchmark.sh (XX Mb) for the in silico benchmark of PopSV and comparison with existing methods, using the Twin study and CageKid datasets.
downloadEpilepsy.sh (XX Mb) for the SV analysis in the epilepsy/control cohort.

Step 2: Install R dependencies

Many different packages are used throughout the analysis. The commands to install them are written in the installDependencies.R. To install all the necessary packages open R and run source("installDependencies.R").

Step 3: Compile the R-markdown reports

The raw R-markdown reports are located in the src folder. To recompile them simply run:

library(rmarkdown)
render("XXX.Rmd")

You can also compile a bunch of reports using the compile*.sh scripts. For example:

sh compileEpilepsy.sh

You can already see the reports produced by these scripts in the reports folder.

Notes

The code was tested on fresh dockerized Ubuntu with R 3.2.5. Windows is not recommended as the parallel package is not available.

Results might differ slightly from the ones in our paper because several results are based on permutations or sampling procedures.

The number of permutations for a few analysis have been reduced in order to be able to compute them on a laptop in a reasonable amount of time. For the paper we used high performance computers to increase the number of permutations.

For the most time-consuming steps, we used the caching option during the report compilation. It means that it will take the normal time to compile the first time, but will avoid rerunning the long steps on the following compilations.

Name		Name	Last commit message	Last commit date
Latest commit History 37 Commits
data		data
others		others
reports		reports
src		src
.gitignore		.gitignore
README.md		README.md
compileAll.sh		compileAll.sh
compileAnnotations.sh		compileAnnotations.sh
compileEpilepsy.sh		compileEpilepsy.sh
compilePopSVbenchmark.sh		compilePopSVbenchmark.sh
installDependencies.R		installDependencies.R

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

data

data

others

others

reports

reports

src

src

.gitignore

.gitignore

README.md

README.md

compileAll.sh

compileAll.sh

compileAnnotations.sh

compileAnnotations.sh

compileEpilepsy.sh

compileEpilepsy.sh

compilePopSVbenchmark.sh

compilePopSVbenchmark.sh

installDependencies.R

installDependencies.R

Repository files navigation

Step 1: Download the relevant data

Step 2: Install R dependencies

Step 3: Compile the R-markdown reports

Notes

About

Releases 5

Packages

Languages

jmonlong/epipopsv

Folders and files

Latest commit

History

Repository files navigation

Step 1: Download the relevant data

Step 2: Install R dependencies

Step 3: Compile the R-markdown reports

Notes

About

Resources

Stars

Watchers

Forks

Languages