-
Notifications
You must be signed in to change notification settings - Fork 9
Characterization of diploid genomes using k mer spectra analysis
This is a follow-up tutorial to the Introduction to K-mer spectra analysis.
resources needed
This tutorial expect following software to be installed:
If you use conda, you can follow this tutorial how to make a conda environment that includes all the software needed in this section.
Probably the easiest way to get a feel for fitting genomescope models is to go through multiple k-mer histograms and fit those models. We have collected histograms of several species in a collection you can download from following url.
wget https://github.com/KamilSJaron/oh-know/blob/main/data/data_for_genomescope.tar.gz
tar -xzvf data_for_genomescope.tar.gz
There histograms are showing various patterns and problems of wrongly fitted models, some of the common pitfalls are explained in the lecture introducing this section.
- Lecture on YouTube: Using K-mers for genome characterization
Now, let's fit all the genomescope models. If you are not sure how to go about it, see the crayfish example bellow. If you manage to fit the models 1. - 8., check also the bunch of pretty messed up k-mer spectra of nematodes in the funky_nematodes
direcotry.
- Begonia
- Bombina
- Cape Bees
- Crayfish
- Mercurialis
- Springtails
- Stick insects
- Strawberry
- BONUS Funky nematodes
You can check our guest lecture: allo or auto ploid? What do I expect my kmer spectra look like? by Hannes Becher to see other approaches how to fit models to a k-mer spectrum.
Introduction
k-mer spectra analysis
- 📖 Introduction to K-mer spectra analysis
- 📖 Basics of genome modeling
- ⚒ manual model fitting (for better understanding of the underlying model)
- ⚒ simple diploid
- 📖 Common difficulties in characterisation of diploid genomes using k mer spectra analysis
- ⚒ low coverage (pitfall) - to be merged
- ⚒ very homozygous diploid
- ⚒ highly heterozygous diploid
- ⚒ Genome size of a repetitive genome (pitfall)
- ⚒ Wrong ploidy (pitfall)
- 📖 Characterization of polyploid genomes using k mer spectra analysis
- ⚒ Autotetraploid
- ⚒ Allotetraploid
- ⚒ Estimating ploidy (smudgeplot)
- 📖 Genome modeling as a quality control
- ⚒ Contamination (pitfall)
- ⚒ k-mers in an assembly (Mercury/KAT)
- 📖 Analysing genome skimming data
Separation of chromosomes
- 📖Separate sub-genomes of an allopolyploid
- 📖Separating chromosomes by comparison of sequencing libraries
Species assignment using short k-mers
Others