# Tutorials

There are two ways to run `scar`. For Python users, we recommend the Python API; for R users, we recommend the command line tool.

## Run scar with API

* [sgRNA assignment in scCRISPRseq](scAR_tutorial_sgRNA_assignment.ipynb)
* [Denoising ADT in CITE-seq data](scAR_tutorial_denoising_CITEseq.ipynb)
* [Denoising mRNA](scAR_tutorial_mRNA_denoising.ipynb)

## Run scar with command line tool

`scAR` can also be run from the command line:

    scar raw_count_matrix.csv -t technology -o output

`raw_count_matrix.csv`, a file containing the raw count matrix (M x N), with cells in rows and features in columns.

| cells | gene_0  | gene_1    | ...   | gene_18000   |
|---:|:-------------|:-----------|:------|:------|
| *cell_0* | 12  | 3       | ...   | 82     |
| *cell_1* | 13  | 0    | ...   | 78     |
| *cell_2* | 35  | 30       | ...   | 170     |
| ... | ...  | ...    | ...   | ...     |
| *cell_8000* | 12  | 3       | ...   | 82     |

`technology`, a string, one of 'scRNAseq', 'CROPseq', or 'CITEseq'.
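
For readers preparing the input from Python, the layout described above (one row per cell, one column per feature) can be produced with the standard library alone. This is a minimal sketch with toy data: the helper name `write_count_matrix` is hypothetical, and using `cells` as the label of the first column simply mirrors the table above. It is an assumption that `scar` does not require a different label.

```python
import csv
import io


def write_count_matrix(handle, cell_ids, feature_ids, counts):
    """Write an M x N count matrix as CSV: cells in rows, features in columns."""
    if len(counts) != len(cell_ids):
        raise ValueError("one row of counts is needed per cell")
    writer = csv.writer(handle)
    # Header row: label of the index column, then the feature names.
    writer.writerow(["cells", *feature_ids])
    for cell_id, row in zip(cell_ids, counts):
        if len(row) != len(feature_ids):
            raise ValueError(f"{cell_id}: expected {len(feature_ids)} counts")
        writer.writerow([cell_id, *row])


# Toy data only; a real matrix has thousands of cells and features.
buffer = io.StringIO()
write_count_matrix(
    buffer,
    cell_ids=["cell_0", "cell_1", "cell_2"],
    feature_ids=["gene_0", "gene_1", "gene_2"],
    counts=[[12, 3, 82], [13, 0, 78], [35, 30, 170]],
)
print(buffer.getvalue())
```

To write an actual file, pass `open("raw_count_matrix.csv", "w", newline="")` as `handle` instead of the in-memory buffer.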


<div class="alert alert-info">

Note

An extra argument ``ambient_profile`` is recommended to achieve better results.

``ambient_profile`` represents the probability of occurrence of each ambient transcript and can be empirically estimated by averaging cell-free droplets.
</div>
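
One way to read "averaging cell-free droplets" is to pool the counts of all cell-free droplets per feature and divide by the grand total, which yields a vector of frequencies that sums to one. The sketch below follows that reading. It is an assumption, not necessarily the exact procedure `scar` uses internally, and it presumes the cell-free droplets have already been identified. The function name `estimate_ambient_profile` is hypothetical.

```python
def estimate_ambient_profile(cell_free_counts):
    """Return per-feature frequencies pooled over cell-free droplets.

    cell_free_counts: list of droplets, each a list of N raw counts.
    """
    if not cell_free_counts:
        raise ValueError("at least one cell-free droplet is required")
    n_features = len(cell_free_counts[0])
    totals = [0] * n_features
    for droplet in cell_free_counts:
        if len(droplet) != n_features:
            raise ValueError("all droplets must have the same number of features")
        for j, count in enumerate(droplet):
            totals[j] += count
    grand_total = sum(totals)
    if grand_total == 0:
        raise ValueError("cell-free droplets contain no counts")
    # Normalise so that the profile sums to one.
    return [total / grand_total for total in totals]


# Toy example: three cell-free droplets, three features.
cell_free = [[3, 0, 1], [2, 1, 1], [1, 0, 1]]
profile = estimate_ambient_profile(cell_free)
print(profile)
```

The resulting list has one entry per feature, in the same order as the columns of the count matrix.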


Then, we can run `scAR` by:

    scar raw_count_matrix.csv -t technology -e ambient_profile.csv -o output

`ambient_profile.csv`, a file containing the ambient profile (N x 1) estimated from cell-free droplets.

| genes | ambient profile  |
|---:|:-------------|
| **gene_0** | .0003  |
| **gene_1** | .00004  |
| **gene_2** |  .00003 |
| ... | ...  |
| **gene_18000** | .0012 |

<div class="alert alert-warning">

Warning

``ambient_profile`` should sum to one. The gene order should be consistent with the columns of `raw_count_matrix.csv`.
</div>
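
Both requirements in the warning can be checked before launching a run. The following is a minimal sketch: the function name `check_ambient_profile` and the tolerance `tol` are assumptions chosen for illustration, not values taken from `scar`.

```python
import math


def check_ambient_profile(profile_genes, profile_values, matrix_genes, tol=1e-6):
    """Raise ValueError if the profile violates either requirement."""
    # Requirement 1: same genes, in the same order, as the count matrix columns.
    if list(profile_genes) != list(matrix_genes):
        raise ValueError("gene order differs between profile and count matrix")
    if any(value < 0 for value in profile_values):
        raise ValueError("ambient profile contains negative values")
    # Requirement 2: the profile sums to one (within an assumed tolerance).
    total = math.fsum(profile_values)
    if not math.isclose(total, 1.0, abs_tol=tol):
        raise ValueError(f"ambient profile sums to {total}, expected 1")


# Toy example: a valid three-gene profile passes silently.
genes = ["gene_0", "gene_1", "gene_2"]
check_ambient_profile(genes, [0.6, 0.1, 0.3], genes)
```

A profile that fails the sum check can be rescaled by dividing each value by the total. A gene-order mismatch should be fixed by reordering the profile to match the matrix columns.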


For other optional arguments and parameters, run:

    scar --help

The output folder contains four files, plus `assignment.csv` in 'CROPseq' mode:

	output
	├── denoised_counts.csv
	├── expected_noise_ratio.csv
	├── BayesFactor.csv
	├── expected_native_freq.csv 
	└── assignment.csv
    
    
* `denoised_counts.csv`, denoised count matrix.
* `expected_noise_ratio.csv`, estimated noise ratio.
* `BayesFactor.csv`, Bayes factor of ambient contamination.
* `expected_native_freq.csv`, estimated native frequencies.
* `assignment.csv`, feature assignment, e.g., sgRNAs, tags, etc. Generated in 'CROPseq' mode.

<div class="alert alert-info">

Note

`.pickle` files are also supported, in addition to `.csv` files.
</div>
