This repository contains data and notebooks for the single-cell session in the 2022 EMBL-EBI T-cell bioinformatics course
The goal of these two sessions will be to introduce basic concepts for exploratory analysis of single-cell RNA and paired TCR sequencing data.
We will use a dataset from the HCA Tissue Immune Cell Atlas containing human T cells across a range of lymphoid and non-lymphoid tissues. This will allow us to cover a broad spectrum of T cell types.
-
scRNA-seq data available in the data folder of this repo and at Tissue Immune Cell Atlas
-
scVDJ-seq data request access to TCR contigs in GDrive
T-cell-bioinformatics-course_scRNAseq.ipynb
- workflow to perform QC, integration and annotation of single-cell gene expression (GEX)
T-cell-bioinformatics-course_scTCRseq.ipynb
- workflow to integrate VDJ and GEX data, determine clonotypes and repertoire sharing
pip3 install
pandas numpy scipy matplotlib seaborn
dict = dict(zip(keys, values))
df = pd.DataFrame()
list = []
AnnData
https://anndata.readthedocs.io/en/latest/
pd.read_csv()
.to_csv()
type()
Dataframes:
df.columns
df.shape
df['column'].unique()
df['column'].nunique()
df.head(2)
df.tail(2)
df['column.value_counts
https://www.w3schools.com/python/python_intro.asp
https://www.bioinformatics.babraham.ac.uk/training.html