Scripts and notebooks for reproducing analysis from the 2020 dropkick manuscript.
-
Run
install.shto installdropkickand dependencies for completing downstream analysis. It is recommended to initiate a new virtual python environment to avoid conflicts. -
(optional) Download six replicates of human placenta data and perform CellBender remove-background by following instructions in the
cellbender/directory. -
(optional) Build a high-background simulation from human PBMC data by following instructions in the
simulation/directory. -
To add EmptyDrops and CellRanger v2 labels to all raw
.h5adfiles indata/, runemptydrops_cellranger.sh. This will deposit labels into the original.h5adfile. -
run_and_test.shwill performdropkickfiltering on all.h5adfiles indata/and then compare outputs toCellRanger_2andEmptyDropslabels. -
test_3907.shwill combine the 3907 human colorectal cancer datasets and reduce dimensions, showing differences between filtering tools. -
test_placenta.shwill combine the human placenta datasets downloaded in step 2 and reduce dimensions. -
test_simulation.shwill reduce dimensions for the high-background simulation made in step 3 above. -
manualfilter_example.ipynboutlines our manual filtering approach performed on inDrop sequencing samples for comparison todropkick.
NOTE: This entire process may take several minutes to complete on a well-equipped machine. The emptydrops_cellranger.sh script typically completes in ~5 min per input file (.h5ad). The run_and_test.sh script may also take a couple minutes for dimension reduction, clustering, and embedding to visualize dropkick vs. EmptyDrops and CellRanger filtering.
Full documentation is available at KenLauLab.github.io/dropkick.
