GitHub - Ivan-Pchelin/genotyping-by-sequencing: Here one can find python scripts, used for sorting the world sample of Trichophyton mentagrophytes and T. interdigitale ITS region sequences.

Here one can find python scripts, used for sorting the world sample of Trichophyton mentagrophytes and T. interdigitale ITS region sequences.

The script puttorights.py selects nucleotide sequences from infile.txt (GenBank format), containing exact matches with test element. The element should be modified directly in the code.

The script compactor.py merges identical sequences into groups (genotypes). Its input is in FASTA format. The sequences in the input must be cropped.

The script attributetogenotypes.py compares two fasta files: the dataset "sequence.fasta" and the file "referenceset.fasta" with reference sequences. The result will appear in the file "attributed.txt", where the first column contains names from "sample.fasta" and the second one contains a name from the reference dataset, if found. The code and the T. mentagrophytes / T. interdigitale reference dataset of ribosomal ITS region sequences were described by Taghipour et al. 2019, the corrected genotyping approach and the updated dataset were published by Nikkholgh et al. 2023.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

About

Releases

Packages

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 42 Commits
LICENSE		LICENSE
README.md		README.md
attributetogenotypes.py		attributetogenotypes.py
compactor.py		compactor.py
puttorights.py		puttorights.py
referenceset.fasta		referenceset.fasta
sample.fasta		sample.fasta

License

Ivan-Pchelin/genotyping-by-sequencing

Folders and files

Latest commit

History

Repository files navigation

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages