Predictability of B cell clonal persistence and immunosurveillance in breast cancer

This github repository contains the code used to run the analyses and generate the figures in the manuscript:

Sammut SJ et al. Predictability of B cell clonal persistence and immunosurveillance in breast cancer.

Data access

Raw BCR sequencing data have been uploaded to the European Genome-Phenome Archive (EGA00002343328). If access to raw sequencing data is required please submit a request via the EGA. Once approval from the Data Access Committee is secured processed data can also be provided through direct communication with corresponding authors. This includes data within the ../data/processed/bcr-data, ../data/processed/tcr-data, ../data/processed/MrDarcy and ../data/unprocessed folders.

The metastatic breast cancer cohort was previously described in https://doi.org/10.1016/j.celrep.2019.04.098 whilst the early breast cancer cohort was previously described in https://doi.org/10.1038/s41586-021-04278-5.

Directory structure

This repository has seven folders:

Directory	Description
`../BCRNetworks`	Contains R scripts to generate BCR Network centrality analyses
`../data`	Contains the data required to generate the analyses described in the manuscript
`../metadata`	Contains sample metadata files
`../output`	Stores output generated by R scripts
`../python`	Contains python code required to screen antibody sequences against a database of antibody sequences known to bind to antigen
`../R`	Contains R scripts required to generate the analyses described in the manuscript
`../resources`	Contains resource files (such as gene lists) used within the analyses

The code and data reside in an encrypted compressed file while the manuscript is undergoing peer review. The password to unencrypt this file is the concatenation of the case-sensitive first and second words in the introduction, without a space delimiter.

Manuscript analyses

The scripts included in this repository will allow you to recreate the analyses described within our manuscript. We have used R version 4.1.2 and Python version 3.10.1.

Loading directory structure

To automatically load the directory and file structure, please specify the location of the root directory in a variable called dir.base and source the loadData.R file, as shown below:

    dir.base <- "~/BCR-Immunosurveillance/"
    source (paste0(dir.base,"R/loadData.R"))

BCR Network centrality analyses

In the manuscript we describe a BCR Network centrality analysis pipeline (Figure 4). To run this pipeline:

1.Create and activate a conda container https://docs.conda.io/projects/miniconda/en/latest/ and install networkx and numpy:

    conda create -n mrdarcy python=3.9
    conda activate mrdarcy
    conda install networkx
    conda install numpy

2.Download cdhit from https://github.com/weizhongli/cdhit/releases and install.

3.The code to run the analysis can be found in ../BCRNetworks/R folder. Example scripts that generate networks for the multiple sclerosis and early breast cancer cohorts have been provided in the ../BCRNetworks/examples folder.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

BCRNetworks

BCRNetworks

R

R

data_for_upload

data_for_upload

metadata

metadata

output

output

python

python

resources

resources

README.md

README.md

Repository files navigation

Predictability of B cell clonal persistence and immunosurveillance in breast cancer

Data access

Directory structure

Manuscript analyses

Loading directory structure

BCR Network centrality analyses

About

Releases

Packages

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 1 Commit
BCRNetworks		BCRNetworks
R		R
data_for_upload		data_for_upload
metadata		metadata
output		output
python		python
resources		resources
README.md		README.md

sjslab/BCR-Immunosurveillance

Folders and files

Latest commit

History

Repository files navigation

Predictability of B cell clonal persistence and immunosurveillance in breast cancer

Data access

Directory structure

Manuscript analyses

Loading directory structure

BCR Network centrality analyses

About

Resources

Stars

Watchers

Forks

Languages