Skip to content

rapidspeciation/biodiversity_genomics_course

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

529 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Welcome to the Biodiversity Genomics course

held in Mendoza, Argentina in November 2025

For the course taught in July 2024 at IKIAM University in Tena, Ecuador, see the folder 2024.

This course is taught by Karin Näsvall, Nicol Rueda, Fernando Seixas, and Joana Meier from the Wellcome Sanger Institute and by Melisa Olave (CONICET, Argentina). Some of the course material is based on the speciation genomics course by Joana Meier and Mark Ravinet.

The course website showing logistics and useful information about Mendoza is here.

Course structure

Day 0:

  • Slides basic introduction to unix and the command line
  • Exercise on how to use the command line
  • Exercise on more use of the command line
  • Exercise for more advanced users (awk, variables, arrays, writing a bash script)

Day 1:

  • Slides introducing biodiversity genomics and sequencing technologies
  • Slides introducing the structure of Illumina reads and fastq files
  • Exercise on exploring Illumina reads and visualising the quality with fastqc which uses these input files
  • Slides on interpreting fastqc output of RAD data
  • Exercise on filtering and trimming reads
  • Slides on assembling reference genomes

Day 2:

  • Slides on aligning reads to a reference genome
  • Exercise on aligning reads to a reference genome
  • Slides on calling variants and genotypes
  • Exercise on calling variants and genotypes

Day 3:

  • Excercise on filtering vcf files
  • Slides on population structure visualisation with PCA (principal components analysis)
  • Excercise on running a PCA
  • Slides on Phylogenomics: gene trees, species trees and networks
  • Excercise on phylogenomic networks with gene trees

Day 4:

Day 5:

  • slides on detecting selection at long time-scales
  • exercise on detecting selection at long time-scales
  • slides on large-scale sequencing initiatives
  • slides on comparative genomics
  • exercise on genome synteny

Advanced materials:

Learning more

Physalia courses

Physalia courses offers lots of great courses on genomics, bioinformatics, and related fields. The Speciation Genomics course was a course taught via Physalia.

Speciation genomics course website

https://speciationgenomics.github.io/ contains many more tutorials, including topics not covered here, such as demographic modeling, haplotype-based tests for selection, advanced unix and R tutorials, simulating data with SliM, etc.

Speciation and population genomics workshop

Workshop website contains many useful tutorials that are quite advanced, including machine learning and pangenomics.

tidypopgen (R package for population genomics):

website

Biodiversity Genomics Conference

28 October - 1 November 2024: Online, free conference on biodiversity genomics across all time zones: https://www.biodiversitygenomicsconference.org

For RAD or UCE data

If you have short-read data for only a subset of the genome because you used a reduced-representation technique (e.g. RAD or UCE), most of the tutorial will still be relevant. For RAD or UCE data you do not necessarily need a reference genome, unless you want to run the genome scans for finding regions with high differentiation or introgression. If you have RAD (restriction-enzyme associated DNA) data, you can either follow the steps in our tutorial with mapping reads to a reference genome or if you do not have a reference genome, you can do a de novo assembly, i.e. make your own reference for just the RAD loci. The most widely used tool for RAD data analysis is STACKS or if you have a phylogenetic dataset with many species, we can recommend ipyrad. If you are working with polyploids, check out polyRAD. If you have UCE (ultra-conserved elements) data, have a look at this website for guidance.

Publications we recommend:

Introduction to Unix

Reviews on biodiversity genomics:

Publications related to the examples in the course:

  • Genomics of Neotropical biodiversity indicators: two butterfly radiations with rampant chromosomal rearrangements and hybridisation van der Heijden et al. 2024
  • Genomic evidence reveals three W-autosome fusions in Heliconius butterflies Rueda et al. 2024

Papers or manuals for some of the tools mentioned in this course:

Reviews on phylogenomics

Reviews on speciation

Reviews or examples about introgression

Reviews or examples on chromosome evolution

Review on next-generation sequencies technologies

Population genomic diversity and divergence statistics

  • Patterns of Z chromosome divergence among Heliconius species highlight the importance of historical demography Van Belleghem, et al., 2017
  • Complex modular architecture around a simple toolkit of wing pattern genes Van Belleghem, 2018

About

biodiversity genomics course by Nicol Rueda, Karin Näsvall, Fernando Seixas and Joana Meier from the Wellcome Sanger Institute

Resources

Stars

Watchers

Forks

Packages

 
 
 

Contributors