Sherman

A simple Bisulfite FastQ Read Simulator (BiQRS)

Sherman can simulate ungapped high-throughput datasets for bisulfite sequencing (BS-Seq) or standard experiments. It allows the user to introduce various 'contaminants' into the sequences, such as basecall errors, SNPs, adapter fragments etc., in order to evaluate the influence of common problems observed in many Next-Gen Sequencing experiments. For more specific information please refor to the Sherman User Manual.

Generate any number of sequences
Generate either completely random sequences or use genomic sequences (genome can be specified)
Generates single-end or paired-end data
Adjustable bisulfite conversion rate from 0-100% for either all cytosines or cytosines in CH and CG context individually
Generate directional or non-directional libraries
Generate sequences in base-space or SOLiD color-space format
Adjustable default Phred quality score (Sanger encoding, Phred+33 format)
Sequences can have constant Phred qualities throughout the read or can have quality scores following an exponential decay curve, which will eventually result in basecall errors (note that this is handled slightly different for base- and color-space data)
Introduce a variable number of random SNPs into each read
Introduce a fixed amount of adapter sequence at the 3' end of all sequences
Introduce a variable amount of adapter sequence at various positions at the 3' end of reads
Write out a truth set of introduced bisulfite conversions ('positional_changes.txt')

Sherman_Nanopore

This version of Sherman doesn't do anyting particularly clever but it allows a maximum sequence length of 300,000 bp and therefore also a lot of SNPs (e.g 15% error rate and read lengths of 10000 bp). It also does't mind if the reference sequence contains Ns (which Sherman normally rejects). This version does not have the option --truth_set.

Credits

Sherman was written by Felix Krueger, as part of the Babraham Bioinformatics group, now maintained at Altos Labs, Cambridge Institute.

Name		Name	Last commit message	Last commit date
Latest commit History 32 Commits
docs		docs
CHANGELOG.md		CHANGELOG.md
LICENSE		LICENSE
README.md		README.md
Sherman		Sherman
Sherman_Nanopore		Sherman_Nanopore
Sherman_User_Manual.md		Sherman_User_Manual.md
_config.yml		_config.yml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Sherman

A simple Bisulfite FastQ Read Simulator (BiQRS)

Sherman_Nanopore

Credits

About

Releases 3

Packages

Languages

License

FelixKrueger/Sherman

Folders and files

Latest commit

History

Repository files navigation

Sherman

A simple Bisulfite FastQ Read Simulator (BiQRS)

Sherman_Nanopore

Credits

About

Resources

License

Stars

Watchers

Forks

Releases 3

Packages 0

Languages

Packages