Skip to content

Code and other figures for the "C3d protein in multiple myeloma: segregating tumor immunity and autoimmunity" article

Notifications You must be signed in to change notification settings

tstephie/C3d-protein-in-multiple-myeloma-segregating-tumor-immunity-and-autoimmunity-analysis

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

11 Commits
 
 
 
 
 
 
 
 

Repository files navigation

C3d protein in multiple myeloma: segregating tumor immunity and autoimmunity Analysis

Code and other plots from scRNA-seq analysis for the "C3d protein in multiple myeloma: segregating tumor immunity and autoimmunity" article

The analysis was done in two parts: CellRanger: Pre-processing of FASTQ to counts Seurat: Processing of counts to downstream analyses shown in the article

CellRanger

We used the human MYC transgene in our mice model, so we had to create a custom genome reference to align the reads and create counts. We used GRCm38 as mouse reference and added the human MYC gene (ENSG00000136997) from GRCh38. The steps for this process is below (files will be named differently for own analysis):

  1. find the MYC gene in GTF file in the human reference
grep ‘ENSG00000136997’ human_genes.gtf
  1. extract gene entries from GTF file in human reference
grep ‘ENSG00000136997’ human_genes.gtf > human_MYC_genes.gtf
  1. change seqname/chromosome entry in MYC GTF file
sed -i ‘s/chr8/chr8_human/’ human_MYC_genes.gtf
  1. change gene name entry in MYC GTF file
sed -i 's/gene_name "MYC";/gene_name "hMYC";/' human_MYC_genes.gtf
  1. find seqname/chromosome sequences in FASTA file in human reference (range)
grep ‘>chr8’ human_genome.fa
grep ‘>chr9’ human_genome.fa
  1. extract sequences from FASTA file in human reference (not include ‘>chr9’ line which is last line)
sed -n '/>chr8/,/>chr9/p' human_genome.fa | sed '$d' > human_MYC_chr8.fa
  1. change seqname/chromosome name to match GTF file in MYC FASTA file
sed -i ‘s/chr8/chr8_human/’ human_MYC_chr8.fa
  1. combine FASTA files
cat mouse_genome.fa human_MYC_chr8.fa > new_genome.fa
  1. combine GTF files
cat mouse_genes.gtf human_MYC_genes.gtf > new_genes.gtf
  1. We then indexed the new custom reference:
cellranger mkref --genome=new_genome --fasta=new_genome.fa --genes=new_genes.gtf
  1. Then we aligned the FASTQ files with this indexed reference in CellRanger. You can refer to the CellRanger manual for more information here: (https://support.10xgenomics.com/single-cell-gene-expression/software/pipelines/latest/using/tutorial_ct)

Seurat

R code can be found in the “mgdmb_c3d_scrnaseq_analysis_final.Rmd” file

Input

The input files for this part are:

  • Multiple_Myeloma_10X_mice_list_v3.xlsx = contain metadata for samples in dataset
  • {*}_filtered_feature_bc_matrix.h5 = contains filtered count matrix from CellRanger; should have one for every sample
  • {*}_filtered_contig_annotations.csv = contains filtered contigs (VDJ information) from CellRanger; should have one for every sample

The metadata file can be found in the tables_and_objects folder. The count matrices and contigs can be found in GEO.

QC

Filtering thresholds can be found at lines 155-222. Violin and scatter QC plots can be found in the plots/qc folder for before and after the filtering step.

Unbatched Analysis

We first ran through the standard processing pipeline to check for batch effect. We also checked for cell cycle batch effect at lines 331-346. You will need the mouse_cell_cycle_genes.rds file, which contains mouse cell cycle genes, to check for cell cycle batch effect. You can find the PCA and other QC plots in the plots/qc folder and the UMAP plots in the plot/umap folder with “_unbatch” in the file name.

Integration Analysis

We then ran through the integration pipeline since we found batch effect from the “run_id”, which indicates that the samples were sequenced at different times. You can find the PCA and other QC plots in the plots/qc folder and the UMAP plots in the plot/umap folder with “_integrated” or “_integrate” in the file name.

Plasma Cell Analysis

You can find the DE genes from a previous study in the gene_expression_in_MM_fig2a_data.xlsx file in the tables_and_objects folder.

Code for Figures Found in Article

You can find the code for the figures in the article at:

  • Figure 3A = lines 747-772
  • Figure S3A = lines 628-632
  • Figure S3B = lines 636-640
  • Figure S3C = lines 647-653

About

Code and other figures for the "C3d protein in multiple myeloma: segregating tumor immunity and autoimmunity" article

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published