Skip to content

Latest commit

 

History

History
170 lines (102 loc) · 6.63 KB

index.md

File metadata and controls

170 lines (102 loc) · 6.63 KB

Introduction to single-cell RNA-seq data analysis - Bitesize

7 week course

Bioinformatics Core Facility, Cancer Research UK Cambridge Institute

Github link

Repo

Instructors

  • Abbi Edwards - Bioinformatics Core, Cancer Research UK Cambridge Institute
  • Ashley Sawle - Bioinformatics Core, Cancer Research UK Cambridge Institute
  • Chandra Chilamakuri - Bioinformatics Core, Cancer Research UK Cambridge Institute
  • Jason Skelton - Genomics Core, Cancer Research UK Cambridge Institute
  • Kamal Kishore - Bioinformatics Core, Cancer Research UK Cambridge Institute
  • Matthew Eldridge - Bioinformatics Core, Cancer Research UK Cambridge Institute

Outline

This workshop is aimed at biologists interested in learning how to perform basic single-cell RNA-seq analyses.

This will focus on the droplet-based assay by 10X genomics and include running the accompanying cellranger pipeline to align reads to a genome reference and count the number of read per gene, reading the count data into R, quality control, normalisation, data set integration, clustering and identification of cluster marker genes, as well as differential expression and abundance analyses. You will also learn how to generate common plots for analysis and visualisation of gene expression data, such as TSNE, UMAP and violin plots.

(Materials linked to below will be updated closer to the time of delivery)

Prerequisites

Some basic experience of using a UNIX/LINUX command line is assumed

Some R knowledge is assumed and essential. Without it, you will struggle on this course. If you are not familiar with the R statistical programming language we strongly encourage you to work through an introductory R course before attempting these materials. We recommend our Introduction to R course You will also need a CI cluster account and be familiar with its use.

Data

  • The course data is based on 'CaronBourque2020' relating to pediatric leukemia, with four sample types, including:
    • pediatric Bone Marrow Mononuclear Cells (PBMMCs)
    • three tumour types: ETV6-RUNX1, HHD, PRE-T
  • The data used in the course can be downloaded from Dropbox (the file is 4.2GB compressed and XXGB when uncompressed, so make sure you have enough space on your computer). Please note that:
    • these data have been processed for teaching purposes and are therefore not suitable for research use;

How will it work?

Each week there will be a taught Tuesday session (1:00-3:30 pm) in the room 009 and a Friday clinic online session (2-3pm). Calendar invites for these meetings (on teams) should have been sent to your CRUK email address.

It is assumed you will try the exercises between the two sessions. At the clinic the speaker will go through the exercises and answer any questions. During the week you can also ask questions at anytime to your designated member of the bioinformatics core. You should have had an email identifying who this but if you are unsure please contact analysisteam-bioinformatics@cruk.cam.ac.uk.

Schedule

Week 1

  • Introduction to Single Cell Methods - Abbi Edwards
  • Preamble: data set and workflow - Chandra Chilamakuri
  • Alignment and cell counting with Cellranger - Chandra Chilamakuri

Week 2

Week 3

Week 4

Week 5

Week 6

Week 7

Additional Resources

Acknowledgements

This course was initialy based on the OSCA (Orchestrating Single-Cell Analysis) book with subsequent modifications by

Abigail Edwards

Ashley Sawle

Chandra Chilamakuri

Kamal Kishore

Stephane Ballereau

Hugo Tavares

Katarzyna Kania