SingleMoleculeFootprinting is an R package build around QuasR and tailored for the analysis Single Molecule Footprinting (SMF) data.
SMF is a high-throughput sequencing technology developed in the Krebs laboratory. It consists of marking accessible genomic cytosines in the GpC and CpG contexts using exogenous methytransferase enzymes and of subsequently performing bisulfite sequencing (BS). Consequently, cytosines protected by the binding of DNA-interacting proteins (e.g. TFs, nucleosomes, GTFs, etc..) will result unmethylated, while the accessible cytosines will result methylated.
With the present package, we provide functions to perform basic SMF data analysis starting from aligned bam files up to the biological interpretation of results over single sites.
To ensure compatibility with our downstream tools, we recommend aligning sequencing reads using the QuasR function qAlign as follows
cl = makeCluster(40)
prj = QuasR::qAlign(sampleFile = sampleFile,
genome = genome,
aligner = "Rbowtie",
projectName = "prj",
paired = "fr",
bisulfite = "undir",
alignmentParameter = "-e 70 -X 1000 -k 2 --best -strata",
alignmentsDir = "./",
cacheDir = tempdir(),
clObj = cl)
For more details on how to structure the sampleFile argument we refer to the qAlign documentation. For more details on SMF data preprocessing we refer to the computational steps of our methods manuscript link to pre-print.
To install SingleMoleculeFootprinting, execute the following
remotes::install_github(repo = "https://github.com/Krebslabrep/SingleMoleculeFootprinting.git", ref = "main", build_vignettes = FALSE)
For instructions on how to use the SingleMoleculeFootprinting package and an example of analysis, consult our vignette