Skip to content

KazukiNakamae/temp

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

23 Commits
 
 

Repository files navigation

MaChIAto_logo

MaChIAto is a third party software of CRISPResso (https://github.com/pinellolab/CRISPResso2) to get a stricter classification of the targeted amplicon sequencing (also called deep sequencing) and profile the relationship with genomic context (e.g., Thermodynamics and mutation patterns) in the CRISPR-Cas9/Prime Editor targeting site.

MaChIAto would be helpful for people who want to

  • get a more accurate classification of knock-out, homology-based knock-in, and Prime Editing. (Case I)
  • quantify the rate of imprecise editing reads, including the reads with donor-reversed integration. (Case II)
  • obtain the local alignment of the target site and knock-in junction. (Case III)
  • analyze the sequence involved with InDels and imprecise editing. (Case IV)
  • find the useful features for the efficiency prediction. (Case V)

If you have the same purpose with the above cases, you can check the Case Studies section!

Quick Start

We demonstrate a simple example. Before that, you have to build the environment according to the section of Preparation.

1. Download MaChIAto with example files

Click "Download ZIP".

download_zip.png

Alternatively, enter the following command terminal.

git clone https://github.com/KazukiNakamae/MaChIAto.git;

2. Open a terminal and go the directory.

cd (...)/MaChIAto

3. Re-classify the allele frequency table derived from CRISPResso2.

source activate MaChIAto_env; # *You do not need to enter it again once you did it.

The re-classification command of MaChIAto (Classifier)

python MaChIAto/MaChIAto.py \
-m CRISPResso2 \
-ccf (Alleles_frequency_table.zip of the CRISPResso2 output) \
-o (output directory) \
-a (wt amplicon sequence) \
-g (protospaser sequence) \
(other parameters);
conda deactivate;

The other parameters varies per experiment type {knock-out, homology-based knokc-in, Prime Editing}.

The example is for the knock-out analysis.

Case of knock-out
python MaChIAto/MaChIAto.py \
-ccf (Alleles_frequency_table.zip of the CRISPResso2 output) \
-o (output directory) \
-a (wt amplicon sequence) \
-g (protospaser sequence);

*The case of the knock-out and knock-in is shown in the section of MaChIAto (MaChIAto Classifier).

4. Check the classification result.

You can get the classification result of the editing. The result is visualized as the pie chart based on the ALL_dataframe.txt.

We provide the detailed description how to read them in the section of the MaChIAtoClassifier output.

Quick Start (Optional)

MaChIAto can profile and visualize the MaChIAto output using MaChIAto Aligner, MaChIAto Analyzer, and MaChIAto Reviewer.

5. Get local alignment and the mutation profiling using MaChIAto Aligner.

source activate MaChIAto_Aligner_env; # *You do not need to enter it again once you did it.

The alignment command of MaChIAto Aligner

Rscript MaChIAto_Aligner/MaChIAtoAligner.R \
(directory of MaChIAto Classifier) \
(output directory);
conda deactivate;

If your donor insert has the extra sequence outside of the homology arm, you should enter it. Please refer to the section of MaChIAtoAligner.

6. Check the alignment result.

You can get the local alignment of the editing. The result is visualized as the map.

We provide a detailed description of how to read them in the section of the MaChIAtoAligner output.

7. Aggregate the multiple data.

If you have multiple data (n>3) to profile the characteristics, you should aggregate it using the following command.

source activate MaChIAto_Analyzer_env; # *You do not need to enter it again once you did it.
python MaChIAto_Analyzer/collect_MaChIAto_data.py \
-i (the prefix of MaChIAto Classifier output) \
-o (output directory) \
-ol (knock-out label) \
-t (calculation target) \
conda deactivate;

It is used for the knock-out analysis. The calculation target for MaChIAto Analyzer/Reviewer is protospacer. You can target the homology arm and RT template using another command. Please see the section of MaChIAtoAnalyzer.

8. Investigate the relationship with (epi-)genomic context

You can see the correlation with the >70 (epi-)genomic context using MaChIAto Analyzer.

Rscript MaChIAto_Analyzer/MaChIAtoAnalyzer.R \
(directory of collect_MaChIAto_data.py) \
(output directory);
conda deactivate;

MaChIAtoAnalyzer basically use > 70 genomic context. If you want to use epigenomic context or other context, please see the section of MaChIAtoAnalyzer.

9. Check the correlation.

You can get the correlation between the editing efficacy and the context. The result is visualized as the scatter plot.

We provide a detailed description of how to read them in the section of the MaChIAtoAnalyzer output.

10. Profile the mutation tendency.

source activate MaChIAto_Reviewer_env; # *You do not need to enter it again once you did it.
Rscript MaChIAto_Reviewer/MaChIAtoReviewer.R \
(the prefix of MaChIAto Classifier output) \
(the prefix of MaChIAto Aligner output) \
(directory of collect_MaChIAto_data.py) \
(output directory);
conda deactivate;

11. Check the mutation profile.

You can get the mutation profile of the editing. The result is visualized as the bar plot and pie chart.

We provide a detailed description of how to read them in the section of the MaChIAtoReviewer output.

Preparation

Change default shell

1. Set bash as default shell

chsh -s /bin/bash;

Install software

1. install miniconda3 from Conda

(https://docs.conda.io/en/latest/miniconda.html)

2. install R (>version R-4.0.1) from the CRAN

(https://cran.ism.ac.jp)

3. install Xcode from the App Store

(https://apps.apple.com/jp/app/xcode/id497799835?mt=12)

sudo xcodebuild -license;

4. install XQuartz from the XQuartz project

(xquartz.macosforge.org)

Build the environment using conda

1. Environment for MaChIAto_(MaChIAto Classifier)

conda create --name MaChIAto_env;
source activate MaChIAto_env;
conda install -c anaconda python=3.8;
pip install --upgrade pip;
pip install regex tqdm argparse biopython numpy matplotlib GPy gpyopt datetime pandas;
conda deactivate;

2. Environment for MaChIAtoAligner

conda create --name MaChIAto_Aligner_env;
source activate MaChIAto_Aligner_env;
conda install -c bioconda python=3.8 bwa=0.7.17 samtools=1.9;
conda deactivate;

3. Environment for MaChIAtoAnalyzer

conda create --name MaChIAto_Analyzer_env;
source activate MaChIAto_Analyzer_env;
conda install -c anaconda python=3.8 wget;
conda install -c bioconda emboss;
conda install -c bioconda oligoarrayaux;
pip install --upgrade pip;
pip install numpy regex tqdm pandas;
conda deactivate;

4. Environment for MaChIAtoReviewer

conda create -n MaChIAto_Reviewer_env --clone MaChIAto_Aligner_env;
source activate MaChIAto_Reviewer_env;
conda deactivate;

Usage

Usage of MaChIAto Classifier

Parameter list

-m or --mode {str}: This parameter allows for the specification of the type of analysis: “CRISPResso” and “CRISPResso2” is allowed in the latest version

-a or --amplicon_seq {str}: This parameter allows the user to enter the amplicon sequence used for the CRISPResso. The length should be >105bp due to the setting for the other parameter.

-g or --guide_seq {str}: This parameter allows for the specification of the sgRNA sequence used for the CRISPResso. The length of the sequence should be 20nt without PAM. The MaChIAto convention is to depict the expected cleavage position using the value of the parameter three nt 3' from the end of the guide.", required=True)

-cf or --crispreeso_file {str}: This parameter allows for the specification of the “Alleles_frequency_table.txt” from CRISPResso. When this parameter is used, “CRISPResso” should be entered as -m parameter.', (default: "./Alleles_frequency_table.txt") (optional)

-ccf or --crispreeso2_file {str}: This parameter allows for the specification of the “Alleles_frequency_table.zip” from CRISPResso2. When this parameter is used, “CRISPResso2” should be entered as -m parameter.', (default: "./Alleles_frequency_table.zip") (optional)

-d or --donor_seq {str}:This parameter allows for the specification of the expected HDR amplicon used for the CRISPResso. The length of sequence should be >12bp in knock-out/knock-in analysis. In knock-out analysis, this parameter should not be entered, and then the value will be given fake parameter (“TTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTT”), and some internal settings are changed for knock-out analysis. However, the fake parameter will be poly-C|G|A if amplicon sequence contains poly-T sequence.', (default: "") (optional)

-e or --expected_ki_amplicon_seq {str}: This parameter allows for the specification of the expected knock-in amplicon sequence used for the CRISPResso after HDR. The length of the sequence should be >12bp. In knock-out analysis, this parameter should not be entered, and then the value will be given fake parameter including fake donor sequence, and some internal settings are changed for knock-out analysis.', (default: "") (optional)

-o or --output_folder {str}: This parameter allows for the specification of the output directory to use for the analysis (default: current directory) (default: "./") (optional)

-lh or --length_left_homologyarm {int}: This parameter allows for the specification of the length of 5’ homology arm (default: 20). The length of the sequence should be >17bp, and the flanking sequence with the homology arm needs the length of >24bp in the expected amplicon sequence. In the prime editing analysis, the 5’ homology arm is considered as the RT template (default: 20) (optional)

-rh or --length_right_homologyarm {int}: This parameter allows for the specification of the length of 3’ homology arm (default: 20). The length of the sequence should be >17bp, and the flanking sequence with the homology arm needs the length of >24bp in the expected amplicon sequence. In the prime editing analysis, the 3’ homology arm is considered as the prime binding site (default: 20) (optional)

-cn or --location_comp_nick {int}: This parameter allows for the specification of the complementary strand nick location [3prime direction is +] (default: 90). This parameter is used in the prime editing and should be over the homology arm's length to which the nickase is adjacent' (default: 90) (optional)

-n or --name : This parameter allows for the specification of the name, which will be included output directory (default: “untitled-X”). If MaChIAto Analyzer and MaChIAto Reviewer will be used in the following analysis, the value should be “{target_name}-{sample label }” (e.g., DBF4B-C) and underbar "_" should not be used', (default: "untitled-X") (optional)

--primeediting_analysis : Re-classify the data as prime editing analysis. This option forces the setting to change for prime editing analysis.' (optional)

Advanced parameter (for developers)

--force_knockout_analysis : Usually, MaChIAto re-classify the data as knock-in analysis. This option forces the setting to change for knock-out analysis. Under this mode, the length of indicator on the knock-in donor is the maximum value, and the threshold value for alignment of the knock-in sequence is 1.0. If this parameter is not entered, MaChIAto can automatically set up this mode by finding some characteristics of the knock-out sample. For example, MaChIAto checks that there is no donor sequence or expected knock-in sequence as input, and there are less three kinds HDR variants among input data. (optional)

--skip_optimization : Usually, MaChIAto runs Bayesian optimization for finding the optimized setting. This option allows MaChIAto to skip the process of optimization. The option is made for debugging. So, the option should not be used in the usual analysis. However, if the optimization process disturbs an accurate analysis, this option might be useful. (optional)

--copy_optimization : Usually, MaChIAto runs Bayesian optimization for finding the optimized setting. This option allows MaChIAto to use the provided optimization data instead of the optimization. If you analyze NGS data with the setting of previously analyzed data, this option is useful. Especially in substitution editing, we can recommend applying the option to the negative/positive control sample. However, you should understand that a classification error might frequently occur. (optional)

--provided_optimization_file : This parameter allows for the specification of the “MaChIAto_optimized_param.csv” from the analyzed MaChIAto folder. When this parameter is used, --copy_optimization parameter is required.', (default: "./MaChIAto_optimized_param.csv") (optional)

Template command

Case of knock-out
python MaChIAto/MaChIAto.py \
-m CRISPResso2 \
-ccf (Alleles_frequency_table.zip of the CRISPResso2 output) \
-o (output directory) \
-a (wt amplicon sequence) \
-g (protospaser sequence) \
-n (sample name)-(label name);
Case of homology-based knock-in
python MaChIAto/MaChIAto.py \
-m CRISPResso2 \
-ccf (Alleles_frequency_table.zip of the CRISPResso2 output) \
-o (output directory) \
-a (wt amplicon sequence) \
-g (protospaser sequence) \
-e (expected editing amplicon seqeunce) \
-d (donor insert sequence) \
-lh (length of 5\' homology arm) \
-rh (length of 3\' homology arm) \
-n (sample name)-(label name);
Case of Prime Editing (Substitution/Deletion editing)
python MaChIAto/MaChIAto.py \
-m CRISPResso2 \
-ccf (Alleles_frequency_table.zip of the CRISPResso2 output) \
-o (output directory) \
-a (wt amplicon sequence) \
-g (protospaser sequence) \
-e (expected editing amplicon seqeunce) \
-lh (length of prime binding site) \
-rh (length of RT template) \
-cn (distance between two nick sites) \
--primeediting_analysis \
-n (sample name)-(label name);
Case of Prime Editing (Insertion editing)
python MaChIAto/MaChIAto.py \
-m CRISPResso2 \
-ccf (Alleles_frequency_table.zip of the CRISPResso2 output) \
-o (output directory) \
-a (wt amplicon sequence) \
-g (protospaser sequence) \
-e (expected editing amplicon seqeunce) \
-d (donor insert sequence) \
-lh (length of prime binding site) \
-rh (length of RT template) \
-cn (distance between two nick sites) \
--primeediting_analysis \
-n (sample name)-(label name);

If you want to use the result of CRISPResso version1, the -ccf should be replased into -cf. Example for CRISPResso version1

python MaChIAto/MaChIAto.py \
-m CRISPResso \
-cf (Alleles_frequency_table.txt of the CRISPResso output) \
-o (output directory) \
-a (wt amplicon sequence) \
-g (protospaser sequence) \
-n (sample name)-(label name);

*The sample name and label name can be arbitrary. If you analyze the multiple experiment (e.g. knock-out and knock-in), the name should be different from others.

The example of the command

Case of knock-out
python MaChIAto/MaChIAto.py \
-m CRISPResso2 \
-ccf example/data/CRISPResso_on_20190702171907_AAVS1-B/Alleles_frequency_table.zip \
-o example/MaChIAto_Classifier_output \
-a CAGGGGTCCGAGAGCTCAGCTAGTCTTCTTCCTCCAACCCGGGCCCCTATGTCCACTTCAGGACAGCATGTTTGCTGCCTCCAGGGATCCTGTGTCCCCGAGCTGGGACCACCTTATATTCCCAGGGCCGGTTAATGTGGCTCTGGTTCTGGGTACTTTTATCTGTCCCCTCCACCCCACAGTGGGGCCACTAGGGACAGGATTGGTGACAGAAAAGCCCCATCCTTAGGCCTCCTCCTTCCTAGTCTCCTGATATTGGGTCTAACCCCCACCTCCTGTTAGGCAGATTCCTTATCTGGTGACACACCCCCATTTCCTGGAGCCATCTCTCT \
-g ACCCCACAGTGGGGCCACTA \
-n AAVS1-KO;
python MaChIAto/MaChIAto.py \
-m CRISPResso2 \
-ccf example/data/CRISPResso_on_20190702174222_DBF4B-B/Alleles_frequency_table.zip \
-o example/MaChIAto_Classifier_output \
-a TTTCTGAGCAAAGAAGTAAGTTACATCGTGTCCAGCCGCAGAGAAGTAAAGGCAGAGAGCAGTGGGAAAAGCCATAGAGGCTGCCCTAGCCCTAGCCCCAGTGAGGTCAGAGTGGAAACATCGGCCATGGTTGATCCAAAAGGCAGCCACCCCAGGCCTTCACGGAAACCCGTTGACTCGGTAAGAACCTCATGTAGGAAAGGTATGCTGTGTAAACAAAGGAAGTAGGCTTTGGTGAGCCTTTTAAGGCAGTCTCTGGTTTTAATTTCATCTGGATTAATTTAATCTCTCATTTGAATTCAGGGACTGAGAATATTTTATCATAGCTCTG \
-g TTCACGGAAACCCGTTGACT \
-n DBF4B-KO;
python MaChIAto/MaChIAto.py \
-m CRISPResso2 \
-ccf example/data/CRISPResso_on_20190702185531_CD40LG-B/Alleles_frequency_table.zip \
-o example/MaChIAto_Classifier_output \
-a TGTAGAGGCAGATATCATTAGCTGTATTCTCCTTCCGAATGACATTTATCATATCCTTGTTATTCCAAAATAGATAGAAGATGAAAGGAATCTTCATGAAGATTTTGTATTCATGAAAACGATACAGAGATGCAACACAGGAGAAAGATCCTTATCCTTACTGAACTGTGAGGAGATTAAAAGCCAGTTTGAAGGCTTTGTGAAGGTAAGCAGCTTAATTACTGGTAAAAGTGTCATTGAAATATTTTACTACATTTGCTAGATCGGGAAACTGACAATGCCAATGTTTAAAGATTGGTTATAGACACAGACACACAGACACACACACACATAT \
-g CTTATCCTTACTGAACTGTG \
-n CD40LG-KO;
Case of homology-based knock-in
python MaChIAto/MaChIAto.py \
-m CRISPResso2 \
-ccf example/data/CRISPResso_on_20190702172008_AAVS1-C/Alleles_frequency_table.zip \
-o example/MaChIAto_Classifier_output \
-a CAGGGGTCCGAGAGCTCAGCTAGTCTTCTTCCTCCAACCCGGGCCCCTATGTCCACTTCAGGACAGCATGTTTGCTGCCTCCAGGGATCCTGTGTCCCCGAGCTGGGACCACCTTATATTCCCAGGGCCGGTTAATGTGGCTCTGGTTCTGGGTACTTTTATCTGTCCCCTCCACCCCACAGTGGGGCCACTAGGGACAGGATTGGTGACAGAAAAGCCCCATCCTTAGGCCTCCTCCTTCCTAGTCTCCTGATATTGGGTCTAACCCCCACCTCCTGTTAGGCAGATTCCTTATCTGGTGACACACCCCCATTTCCTGGAGCCATCTCTCT \
-g ACCCCACAGTGGGGCCACTA \
-e CAGGGGTCCGAGAGCTCAGCTAGTCTTCTTCCTCCAACCCGGGCCCCTATGTCCACTTCAGGACAGCATGTTTGCTGCCTCCAGGGATCCTGTGTCCCCGAGCTGGGACCACCTTATATTCCCAGGGCCGGTTAATGTGGCTCTGGTTCTGGGTACTTTTATCTGTCCCCTCCACCCCACAGTGGGGCCAGACTATAAGGACCACGACGGAGACTACAAGGATCATGATATTGATTACAAAGACGATGACGATAAGCTAGGGACAGGATTGGTGACAGAAAAGCCCCATCCTTAGGCCTCCTCCTTCCTAGTCTCCTGATATTGGGTCTAACCCCCACCTCCTGTTAGGCAGATTCCTTATCTGGTGACACACCCCCATTTCCTGGAGCCATCTCTCT \
-d GACTATAAGGACCACGACGGAGACTACAAGGATCATGATATTGATTACAAAGACGATGACGATAAG \
-n AAVS1-KI1;
python MaChIAto/MaChIAto.py \
-m CRISPResso2 \
-ccf example/data/CRISPResso_on_20190702174343_DBF4B-C/Alleles_frequency_table.zip \
-o example/MaChIAto_Classifier_output \
-a TTTCTGAGCAAAGAAGTAAGTTACATCGTGTCCAGCCGCAGAGAAGTAAAGGCAGAGAGCAGTGGGAAAAGCCATAGAGGCTGCCCTAGCCCTAGCCCCAGTGAGGTCAGAGTGGAAACATCGGCCATGGTTGATCCAAAAGGCAGCCACCCCAGGCCTTCACGGAAACCCGTTGACTCGGTAAGAACCTCATGTAGGAAAGGTATGCTGTGTAAACAAAGGAAGTAGGCTTTGGTGAGCCTTTTAAGGCAGTCTCTGGTTTTAATTTCATCTGGATTAATTTAATCTCTCATTTGAATTCAGGGACTGAGAATATTTTATCATAGCTCTG \
-g TTCACGGAAACCCGTTGACT \
-e TTTCTGAGCAAAGAAGTAAGTTACATCGTGTCCAGCCGCAGAGAAGTAAAGGCAGAGAGCAGTGGGAAAAGCCATAGAGGCTGCCCTAGCCCTAGCCCCAGTGAGGTCAGAGTGGAAACATCGGCCATGGTTGATCCAAAAGGCAGCCACCCCAGGCCTTCACGGAAACCCGTTGGACTATAAGGACCACGACGGAGACTACAAGGATCATGATATTGATTACAAAGACGATGACGATAAGACTCGGTAAGAACCTCATGTAGGAAAGGTATGCTGTGTAAACAAAGGAAGTAGGCTTTGGTGAGCCTTTTAAGGCAGTCTCTGGTTTTAATTTCATCTGGATTAATTTAATCTCTCATTTGAATTCAGGGACTGAGAATATTTTATCATAGCTCTG \
-d GACTATAAGGACCACGACGGAGACTACAAGGATCATGATATTGATTACAAAGACGATGACGATAAG \
-n DBF4B-KI1;
python MaChIAto/MaChIAto.py \
-m CRISPResso2 \
-ccf example/data/CRISPResso_on_20190702185650_CD40LG-C/Alleles_frequency_table.zip \
-o example/MaChIAto_Classifier_output \
-a TGTAGAGGCAGATATCATTAGCTGTATTCTCCTTCCGAATGACATTTATCATATCCTTGTTATTCCAAAATAGATAGAAGATGAAAGGAATCTTCATGAAGATTTTGTATTCATGAAAACGATACAGAGATGCAACACAGGAGAAAGATCCTTATCCTTACTGAACTGTGAGGAGATTAAAAGCCAGTTTGAAGGCTTTGTGAAGGTAAGCAGCTTAATTACTGGTAAAAGTGTCATTGAAATATTTTACTACATTTGCTAGATCGGGAAACTGACAATGCCAATGTTTAAAGATTGGTTATAGACACAGACACACAGACACACACACACATAT \
-g CTTATCCTTACTGAACTGTG \
-e TGTAGAGGCAGATATCATTAGCTGTATTCTCCTTCCGAATGACATTTATCATATCCTTGTTATTCCAAAATAGATAGAAGATGAAAGGAATCTTCATGAAGATTTTGTATTCATGAAAACGATACAGAGATGCAACACAGGAGAAAGATCCTTATCCTTACTGAACTGACTATAAGGACCACGACGGAGACTACAAGGATCATGATATTGATTACAAAGACGATGACGATAAGGTGAGGAGATTAAAAGCCAGTTTGAAGGCTTTGTGAAGGTAAGCAGCTTAATTACTGGTAAAAGTGTCATTGAAATATTTTACTACATTTGCTAGATCGGGAAACTGACAATGCCAATGTTTAAAGATTGGTTATAGACACAGACACACAGACACACACACACATAT \
-d GACTATAAGGACCACGACGGAGACTACAAGGATCATGATATTGATTACAAAGACGATGACGATAAG \
-n CD40LG-KI1;
Case of untreated sample for negative control (Optional)
python MaChIAto/MaChIAto.py \
-m CRISPResso2 \
-ccf example/data/CRISPResso_on_20190702171854_AAVS1-A/Alleles_frequency_table.zip \
-o example/MaChIAto_Classifier_output \
-a CAGGGGTCCGAGAGCTCAGCTAGTCTTCTTCCTCCAACCCGGGCCCCTATGTCCACTTCAGGACAGCATGTTTGCTGCCTCCAGGGATCCTGTGTCCCCGAGCTGGGACCACCTTATATTCCCAGGGCCGGTTAATGTGGCTCTGGTTCTGGGTACTTTTATCTGTCCCCTCCACCCCACAGTGGGGCCACTAGGGACAGGATTGGTGACAGAAAAGCCCCATCCTTAGGCCTCCTCCTTCCTAGTCTCCTGATATTGGGTCTAACCCCCACCTCCTGTTAGGCAGATTCCTTATCTGGTGACACACCCCCATTTCCTGGAGCCATCTCTCT \
-g ACCCCACAGTGGGGCCACTA \
-n AAVS1-NC;
python MaChIAto/MaChIAto.py \
-m CRISPResso2 \
-ccf example/data/CRISPResso_on_20190702174209_DBF4B-A/Alleles_frequency_table.zip \
-o example/MaChIAto_Classifier_output \
-a TTTCTGAGCAAAGAAGTAAGTTACATCGTGTCCAGCCGCAGAGAAGTAAAGGCAGAGAGCAGTGGGAAAAGCCATAGAGGCTGCCCTAGCCCTAGCCCCAGTGAGGTCAGAGTGGAAACATCGGCCATGGTTGATCCAAAAGGCAGCCACCCCAGGCCTTCACGGAAACCCGTTGACTCGGTAAGAACCTCATGTAGGAAAGGTATGCTGTGTAAACAAAGGAAGTAGGCTTTGGTGAGCCTTTTAAGGCAGTCTCTGGTTTTAATTTCATCTGGATTAATTTAATCTCTCATTTGAATTCAGGGACTGAGAATATTTTATCATAGCTCTG \
-g TTCACGGAAACCCGTTGACT \
-n DBF4B-NC;
python MaChIAto/MaChIAto.py \
-m CRISPResso2 \
-ccf example/data/CRISPResso_on_20190702185510_CD40LG-A/Alleles_frequency_table.zip \
-o example/MaChIAto_Classifier_output \
-a TGTAGAGGCAGATATCATTAGCTGTATTCTCCTTCCGAATGACATTTATCATATCCTTGTTATTCCAAAATAGATAGAAGATGAAAGGAATCTTCATGAAGATTTTGTATTCATGAAAACGATACAGAGATGCAACACAGGAGAAAGATCCTTATCCTTACTGAACTGTGAGGAGATTAAAAGCCAGTTTGAAGGCTTTGTGAAGGTAAGCAGCTTAATTACTGGTAAAAGTGTCATTGAAATATTTTACTACATTTGCTAGATCGGGAAACTGACAATGCCAATGTTTAAAGATTGGTTATAGACACAGACACACAGACACACACACACATAT \
-g CTTATCCTTACTGAACTGTG \
-n CD40LG-NC;
Case of homology-based knock-in that is the different condition from the above method (Optional)
python MaChIAto/MaChIAto.py \
-m CRISPResso2 \
-ccf example/data/CRISPResso_on_20190702172115_AAVS1-D/Alleles_frequency_table.zip \
-o example/MaChIAto_Classifier_output \
-a CAGGGGTCCGAGAGCTCAGCTAGTCTTCTTCCTCCAACCCGGGCCCCTATGTCCACTTCAGGACAGCATGTTTGCTGCCTCCAGGGATCCTGTGTCCCCGAGCTGGGACCACCTTATATTCCCAGGGCCGGTTAATGTGGCTCTGGTTCTGGGTACTTTTATCTGTCCCCTCCACCCCACAGTGGGGCCACTAGGGACAGGATTGGTGACAGAAAAGCCCCATCCTTAGGCCTCCTCCTTCCTAGTCTCCTGATATTGGGTCTAACCCCCACCTCCTGTTAGGCAGATTCCTTATCTGGTGACACACCCCCATTTCCTGGAGCCATCTCTCT \
-g ACCCCACAGTGGGGCCACTA \
-e CAGGGGTCCGAGAGCTCAGCTAGTCTTCTTCCTCCAACCCGGGCCCCTATGTCCACTTCAGGACAGCATGTTTGCTGCCTCCAGGGATCCTGTGTCCCCGAGCTGGGACCACCTTATATTCCCAGGGCCGGTTAATGTGGCTCTGGTTCTGGGTACTTTTATCTGTCCCCTCCACCCCACAGTGGGGCCAGACTATAAGGACCACGACGGAGACTACAAGGATCATGATATTGATTACAAAGACGATGACGATAAGCTAGGGACAGGATTGGTGACAGAAAAGCCCCATCCTTAGGCCTCCTCCTTCCTAGTCTCCTGATATTGGGTCTAACCCCCACCTCCTGTTAGGCAGATTCCTTATCTGGTGACACACCCCCATTTCCTGGAGCCATCTCTCT \
-d GACTATAAGGACCACGACGGAGACTACAAGGATCATGATATTGATTACAAAGACGATGACGATAAG \
-n AAVS1-KI2;
python MaChIAto/MaChIAto.py \
-m CRISPResso2 \
-ccf example/data/CRISPResso_on_20190702174513_DBF4B-D/Alleles_frequency_table.zip \
-o example/MaChIAto_Classifier_output \
-a TTTCTGAGCAAAGAAGTAAGTTACATCGTGTCCAGCCGCAGAGAAGTAAAGGCAGAGAGCAGTGGGAAAAGCCATAGAGGCTGCCCTAGCCCTAGCCCCAGTGAGGTCAGAGTGGAAACATCGGCCATGGTTGATCCAAAAGGCAGCCACCCCAGGCCTTCACGGAAACCCGTTGACTCGGTAAGAACCTCATGTAGGAAAGGTATGCTGTGTAAACAAAGGAAGTAGGCTTTGGTGAGCCTTTTAAGGCAGTCTCTGGTTTTAATTTCATCTGGATTAATTTAATCTCTCATTTGAATTCAGGGACTGAGAATATTTTATCATAGCTCTG \
-g TTCACGGAAACCCGTTGACT \
-e TTTCTGAGCAAAGAAGTAAGTTACATCGTGTCCAGCCGCAGAGAAGTAAAGGCAGAGAGCAGTGGGAAAAGCCATAGAGGCTGCCCTAGCCCTAGCCCCAGTGAGGTCAGAGTGGAAACATCGGCCATGGTTGATCCAAAAGGCAGCCACCCCAGGCCTTCACGGAAACCCGTTGGACTATAAGGACCACGACGGAGACTACAAGGATCATGATATTGATTACAAAGACGATGACGATAAGACTCGGTAAGAACCTCATGTAGGAAAGGTATGCTGTGTAAACAAAGGAAGTAGGCTTTGGTGAGCCTTTTAAGGCAGTCTCTGGTTTTAATTTCATCTGGATTAATTTAATCTCTCATTTGAATTCAGGGACTGAGAATATTTTATCATAGCTCTG \
-d GACTATAAGGACCACGACGGAGACTACAAGGATCATGATATTGATTACAAAGACGATGACGATAAG \
-n DBF4B-KI2;
python MaChIAto/MaChIAto.py \
-m CRISPResso2 \
-ccf example/data/CRISPResso_on_20190702185815_CD40LG-D/Alleles_frequency_table.zip \
-o example/MaChIAto_Classifier_output \
-a TGTAGAGGCAGATATCATTAGCTGTATTCTCCTTCCGAATGACATTTATCATATCCTTGTTATTCCAAAATAGATAGAAGATGAAAGGAATCTTCATGAAGATTTTGTATTCATGAAAACGATACAGAGATGCAACACAGGAGAAAGATCCTTATCCTTACTGAACTGTGAGGAGATTAAAAGCCAGTTTGAAGGCTTTGTGAAGGTAAGCAGCTTAATTACTGGTAAAAGTGTCATTGAAATATTTTACTACATTTGCTAGATCGGGAAACTGACAATGCCAATGTTTAAAGATTGGTTATAGACACAGACACACAGACACACACACACATAT \
-g CTTATCCTTACTGAACTGTG \
-e TGTAGAGGCAGATATCATTAGCTGTATTCTCCTTCCGAATGACATTTATCATATCCTTGTTATTCCAAAATAGATAGAAGATGAAAGGAATCTTCATGAAGATTTTGTATTCATGAAAACGATACAGAGATGCAACACAGGAGAAAGATCCTTATCCTTACTGAACTGACTATAAGGACCACGACGGAGACTACAAGGATCATGATATTGATTACAAAGACGATGACGATAAGGTGAGGAGATTAAAAGCCAGTTTGAAGGCTTTGTGAAGGTAAGCAGCTTAATTACTGGTAAAAGTGTCATTGAAATATTTTACTACATTTGCTAGATCGGGAAACTGACAATGCCAATGTTTAAAGATTGGTTATAGACACAGACACACAGACACACACACACATAT \
-d GACTATAAGGACCACGACGGAGACTACAAGGATCATGATATTGATTACAAAGACGATGACGATAAG \
-n CD40LG-KI2;
Case of Prime Editing (Substitution editing)
python MaChIAto/MaChIAto.py \
-m CRISPResso2 \
-ccf example/data/CRISPResso_on_SRR10287821/Alleles_frequency_table.zip \
-o example/MaChIAto_Classifier_output \
-a acttggtgccaaattcttctcccctgggaagcatccctggacacttcccaaaggaccccagtcactccagcctgttggctgccgctcactttgatgtctgcaggccagatgagggctccagatggcacattgtcagagggacacactgtggcccctgtgcccagccctgggctctctgtacatgaagcaactccagtcccaaatatgtagctgtttgggaggtcagaaatagggggtccaggagcaaactccccccaccccctttccaaagcccattccctctttag \
-g gatgtctgcaggccagatga \
-e acttggtgccaaattcttctcccctgggaagcatccctggacacttcccaaaggaccccagtcactccagcctgttggctgccgctcactttgatgtctgcaggccagaAgagggctccagatggcacattgtcagagggacacactgtggcccctgtgcccagccctgggctctctgtacatgaagcaactccagtcccaaatatgtagctgtttgggaggtcagaaatagggggtccaggagcaaactccccccaccccctttccaaagcccattccctctttag \
-lh 13 \
-rh 22 \
-cn 57 \
-n VEGFA_1TtoA_rep1-PE \
--primeediting_analysis;
Case of Prime Editing (Deletion editing)
python MaChIAto/MaChIAto.py \
-m CRISPResso2 \
-ccf example/data/CRISPResso_on_SRR10287989/Alleles_frequency_table.zip \
-o example/MaChIAto_Classifier_output \
-a ACACTCTTTCCCTACACGACGCTCTTCCGATCtttacacgtctcatatgccccttggcagtcatcttagtcattacctgaggtgttcgttgtaactcatataaactgagttcccatgttttgcttaatggttgagttccgtttgtctgcacagcctgagacattgctggaaataaagaagagagaaaaacaattttagtatttggaagggaagtgctatggtctgaatgtatgtgtcccaccaaaattcctacgt \
-g gtcatcttagtcattacctg \
-e ACACTCTTTCCCTACACGACGCTCTTCCGATCtttacacgtctcatatgccccttggcagtcatcttagtcattacGagTtgttcgttgtaactcatataaactgagttcccatgttttgcttaatggttgagttccgtttgtctgcacagcctgagacattgctggaaataaagaagagagaaaaacaattttagtatttggaagggaagtgctatggtctgaatgtatgtgtcccaccaaaattcctacgt \
-lh 15 \
-rh 12 \
-cn 41 \
-n RNF2_12CTdel_6GtoT-PE \
--primeediting_analysis;
Case of Prime Editing (Insertion editing)
python MaChIAto/MaChIAto.py \
-m CRISPResso2 \
-ccf example/data/CRISPResso_on_SRR10287978/Alleles_frequency_table.zip \
-o example/MaChIAto_Classifier_output \
-a ACACTCTTTCCCTACACGACGCTCTTCCGATCtttacacgtctcatatgccccttggcagtcatcttagtcattacctgaggtgttcgttgtaactcatataaactgagttcccatgttttgcttaatggttgagttccgtttgtctgcacagcctgagacattgctggaaataaagaagagagaaaaacaattttagtatttggaagggaagtgctatggtctgaatgtatgtgtcccaccaaaattcctacgt \
-g gtcatcttagtcattacctg \
-e ACACTCTTTCCCTACACGACGCTCTTCCGATCtttacacgtctcatatgccccttggcagtcatcttagtcattaccAAtggtgttcgttgtaactcatataaactgagttcccatgttttgcttaatggttgagttccgtttgtctgcacagcctgagacattgctggaaataaagaagagagaaaaacaattttagtatttggaagggaagtgctatggtctgaatgtatgtgtcccaccaaaattcctacgt \
-d cAA \
-lh 15 \
-rh 14 \
-cn 41 \
-n RNF2_2AAins_34GAdel-PE \
--primeediting_analysis;

MaChIAtoClassifier output

The classification result is visualized in the "MaChIAto_alignment_pie_chart.png" "MaChIAto_alignment_pie_chart.png" has the following labels:

  • Unmodified: The unedited amplicon sequence
  • Perfect Editing: The edited sequence that a user intended to generate (e.g., precise knock-in sequence, expected substitution/deletion/insertion...)
  • Imperfect Editing: The sequence that is simular to "Perfect Editing" sequence. However, the junction sequence is imperfect. *The class is specilized for the homology-based knock-in. In knock-out and Prime Editing, the "Imperfect Editing" reads can be condidered as just InDels. InDels: The sequence that has a trace of editing (e.g., deletion/insertion/frequent substitutions) around the target site.
  • Deletion: The InDel sequence whose length is less than that of the unedited amplicon sequence
  • Insertion: The InDel sequence whose length is more than that of the unedited amplicon sequence
  • Frequent substitutions: The InDel sequence whose length is equal to that of the unedited amplicon sequence UC: Unclear sequence that has the insufficient information to be recognized as the focusing sequence.
  • UC (Ambiguous Editing): There is the expected sequence (e.g., knock-in cassette and Reverse transcript). However, the information of the end of amplicon is unclear. For example, The edited reads, which do not have the site of complementary strand nicking, are classified into this class.
  • UC (Others): There is not enough sequence informaton for the classification. The unedited amplicon read with sequence errors and read amplified from the other locus can be classified into the this class.

MaChIAto Aligner

Parameter list

Input directory: The directory that MaChIAto Classifier generated.

Output prefix: The directory into which the output directory is saved.

Left extra sequence (optional): The outside sequence flanking 5'-homology arm of the donor.

Right extra sequence (optional): The outside sequence flanking 3'-homology arm of the donor.

Template command

Rscript MaChIAto_Aligner/MaChIAtoAligner.R \
(Input directory) \
(Output prefix) \
(Left extra sequence) \
(Right extra sequence);

*when you use external storage for save output, the process can be too slow. We recommend that you use the tool in the local storage.

Example of the command

Case of knock-out
Rscript MaChIAto_Aligner/MaChIAtoAligner.R \
example/MaChIAto_Classifier_output/MaChIAto_from_CRISPResso2_at_20200722112127_on_AAVS1-KO \
example/MaChIAto_Aligner_output;
Rscript MaChIAto_Aligner/MaChIAtoAligner.R \
example/MaChIAto_Classifier_output/MaChIAto_from_CRISPResso2_at_20200723215418_on_DBF4B-KO \
example/MaChIAto_Aligner_output;
Rscript MaChIAto_Aligner/MaChIAtoAligner.R \
example/MaChIAto_Classifier_output/MaChIAto_from_CRISPResso2_at_20200723215432_on_CD40LG-KO \
example/MaChIAto_Aligner_output;
Case of homology-based knock-in
Rscript MaChIAto_Aligner/MaChIAtoAligner.R \
example/MaChIAto_Classifier_output/MaChIAto_from_CRISPResso2_at_20200722112140_on_DBF4B-KI1 \
example/MaChIAto_Aligner_output \
GTTTGG \
CCAAAC;
Rscript MaChIAto_Aligner/MaChIAtoAligner.R \
example/MaChIAto_Classifier_output/MaChIAto_from_CRISPResso2_at_20200723225210_on_AAVS1-KI1 \
example/MaChIAto_Aligner_output \
GTTTGG \
CCAAAC;
Rscript MaChIAto_Aligner/MaChIAtoAligner.R \
example/MaChIAto_Classifier_output/MaChIAto_from_CRISPResso2_at_20200723215508_on_CD40LG-KI1 \
example/MaChIAto_Aligner_output \
GTTTGG \
CCAAAC;
Case of untreated sample for negative control (Optional)
Rscript MaChIAto_Aligner/MaChIAtoAligner.R \
example/MaChIAto_Classifier_output/MaChIAto_from_CRISPResso2_at_20200723215528_on_AAVS1-NC \
example/MaChIAto_Aligner_output;
Rscript MaChIAto_Aligner/MaChIAtoAligner.R \
example/MaChIAto_Classifier_output/MaChIAto_from_CRISPResso2_at_20200723215540_on_DBF4B-NC \
example/MaChIAto_Aligner_output;
Rscript MaChIAto_Aligner/MaChIAtoAligner.R \
example/MaChIAto_Aligner_output;
Case of homology-based knock-in that is the different condition from the above method (Optional)
Rscript MaChIAto_Aligner/MaChIAtoAligner.R \
example/MaChIAto_Classifier_output/MaChIAto_from_CRISPResso2_at_20200723225231_on_AAVS1-KI2 \
example/MaChIAto_Aligner_output \
GTTTGG \
CCAAAC;
Rscript MaChIAto_Aligner/MaChIAtoAligner.R \
example/MaChIAto_Classifier_output/MaChIAto_from_CRISPResso2_at_20200723215620_on_DBF4B-KI2 \
example/MaChIAto_Aligner_output \
GTTTGG \
CCAAAC;
Rscript MaChIAto_Aligner/MaChIAtoAligner.R \
example/MaChIAto_Classifier_output/MaChIAto_from_CRISPResso2_at_20200723215633_on_CD40LG-KI2 \
example/MaChIAto_Aligner_output \
GTTTGG \
CCAAAC;
Case of Prime Editing (Substitution editing)
Rscript MaChIAto_Aligner/MaChIAtoAligner.R \
example/MaChIAto_Classifier_output/MaChIAto_from_CRISPResso2_at_20200722112157_on_VEGFA1TtoArep1-PE \
example/MaChIAto_Aligner_output;
Case of Prime Editing (Deletion editing)
Rscript MaChIAto_Aligner/MaChIAtoAligner.R \
example/MaChIAto_Classifier_output/MaChIAto_from_CRISPResso2_at_20200722112511_on_RNF212CTdel6GtoT-PE \
example/MaChIAto_Aligner_output;
Case of Prime Editing (Insertion editing)
Rscript MaChIAto_Aligner/MaChIAtoAligner.R \
example/MaChIAto_Classifier_output/MaChIAto_from_CRISPResso2_at_20200722112446_on_RNF22AAins34GAdel-PE \
example/MaChIAto_Aligner_output;

MaChIAtoAligner output

[Coming soon]

MaChIAto Analyzer

collect_MaChIAto_data.py

Parameter list

-i or --indir {str}: input directory which includes the outputs of MaChIAto Classifier

-o or --outdir {str}: output directory

-ul or --untreated_label {str}: untreated label (This must be one.)' (default: "machiato_dummy_sample") (optional)

-ol or --knock_out_label {str}: knock-out label' (default: []) (optional)

-il or --knock_in_label {str}: knock-in label' (default: []) (optional)

-sc or --scaffold_seq {str}: scaffold sequence of sgRNA (default: "gttttagagctagaaatagcaagttaaaataaggctagtccgttatcaacttgaaaaagtggcaccgagtcggtgc") (optional)

-t or --target_type {str}: target sequence type {"lmh" | "rmh" | "bmh" | "elmh" | "ermh" | "ebmh" | "protospacer"} (default: "bmh") (optional)

--ignore_list {str}: The list of ignore target set contains target names which are not desired to analyze for some reasons. The data (e.g. DBF4B-A, DBF4B-B, DBF4B-C, DBF4B-D) including target name (e.g. DBF4B) shown in the list is skipped through the process of MaChIAto Analyzer. The format should be comma-separated like “TargetA, TargetB, …” “example_data” directory has “ignore_list.csv” as example. (default: "") (optional)

Template command

Case of Double knock-in analysis (*ADVANCE: The analysis includes the comparison between two knock-in methods.)
python MaChIAto_Analyzer/collect_MaChIAto_data.py \
-i (the prefix of MaChIAto Classifier output) \
-o (output directory) \
-sc (scaffold sequence of sgRNA) \
-ul (untreated label) \
-ol (knock-out label) \
-il (knock-in label 1) (knock-in label 2) \
-t (calculation target) \
--ignore_list (list of samples ignored);
Case of Single knock-in analysis (*STANDARD)
python MaChIAto_Analyzer/collect_MaChIAto_data.py \
-i (the prefix of MaChIAto Classifier output) \
-o (output directory) \
-sc (scaffold sequence of sgRNA) \
-ul (untreated label) \
-ol (knock-out label) \
-il (knock-in label) \
-t (calculation target) \
--ignore_list (list of samples ignored);
Case of Simple knock-in analysis (*SIMPLE: The analysis can be applied when there is no knock-out sample used as control.)
python MaChIAto_Analyzer/collect_MaChIAto_data.py \
-i (the prefix of MaChIAto Classifier output) \
-o (output directory) \
-sc (scaffold sequence of sgRNA) \
-il (knock-in label) \
-t (calculation target) \
--ignore_list (list of samples ignored);
Case of Double knock-out analysis (*ADVANCE: The analysis includes the comparison between two knock-out methods.)
python MaChIAto_Analyzer/collect_MaChIAto_data.py \
-i (the prefix of MaChIAto Classifier output) \
-o (output directory) \
-sc (scaffold sequence of sgRNA) \
-ul (untreated label) \
-ol (knock-out label 1) (knock-out label 2) \
-t (calculation target) \
--ignore_list (list of samples ignored);
Case of Single knock-out analysis (*STANDARD)
python MaChIAto_Analyzer/collect_MaChIAto_data.py \
-i (the prefix of MaChIAto Classifier output) \
-o (output directory) \
-sc (scaffold sequence of sgRNA) \
-ul (untreated label) \
-ol (knock-out label) \
-t (calculation target) \
--ignore_list (list of samples ignored);

*If you do not enter "-ul (untreated label)", the process can work. However, some filtering process will be skipped.

Example of the command

Double knock-in analysis: PITCh and LoAD knock-in using MS2-tagged sgRNA/calculation target: both homology arm
python MaChIAto_Analyzer/collect_MaChIAto_data.py \
-i example/MaChIAto_Classifier_output \
-o example/collections/Double_knockin_analysis \
-sc gttttagagctaggccaacatgaggatcacccatgtctgcagggcctagcaagttaaaataaggctagtccgttatcaacttggccaacatgaggatcacccatgtctgcagggccaagtggcaccgagtcggtgc \
-ul NC \
-ol KO \
-il KI1 KI2 \
-t bmh \
--ignore_list example/data/ignore_list_for_KOKI_analysis.csv;
Single knock-in analysis: PITCh using MS2-tagged sgRNA/calculation target: both homology arm
python MaChIAto_Analyzer/collect_MaChIAto_data.py \
-i example/MaChIAto_Classifier_output \
-o example/collections/Single_knockin_analysis \
-sc gttttagagctaggccaacatgaggatcacccatgtctgcagggcctagcaagttaaaataaggctagtccgttatcaacttggccaacatgaggatcacccatgtctgcagggccaagtggcaccgagtcggtgc \
-ul NC \
-ol KO \
-il KI1 \
-t bmh \
--ignore_list example/data/ignore_list_for_KOKI_analysis.csv;
Simple knock-in analysis: PITCh using MS2-tagged sgRNA/calculation target: both homology arm
python MaChIAto_Analyzer/collect_MaChIAto_data.py \
-i example/MaChIAto_Classifier_output \
-o example/collections/Simple_knockin_analysis \
-sc gttttagagctaggccaacatgaggatcacccatgtctgcagggcctagcaagttaaaataaggctagtccgttatcaacttggccaacatgaggatcacccatgtctgcagggccaagtggcaccgagtcggtgc \
-il KI1 \
-t bmh \
--ignore_list example/data/ignore_list_for_KOKI_analysis.csv;
Double knock-out analysis: MS2-tagged sgRNA/calculation target: protospacer
python MaChIAto_Analyzer/collect_MaChIAto_data.py \
-i example/MaChIAto_Classifier_output \
-o example/collections/Double_knockout_analysis \
-sc gttttagagctaggccaacatgaggatcacccatgtctgcagggcctagcaagttaaaataaggctagtccgttatcaacttggccaacatgaggatcacccatgtctgcagggccaagtggcaccgagtcggtgc \
-ul NC \
-ol KO KI1 \
-t protospacer \
--ignore_list example/data/ignore_list_for_KOKI_analysis.csv;
Single knock-out analysis: knock-out using MS2-tagged sgRNA/calculation target: protospacer
python MaChIAto_Analyzer/collect_MaChIAto_data.py \
-i example/MaChIAto_Classifier_output \
-o example/collections/Single_knockout_analysis \
-sc gttttagagctaggccaacatgaggatcacccatgtctgcagggcctagcaagttaaaataaggctagtccgttatcaacttggccaacatgaggatcacccatgtctgcagggccaagtggcaccgagtcggtgc \
-ol KO \
-t protospacer \
--ignore_list example/data/ignore_list_for_KOKI_analysis.csv;
Simple knock-in analysis: Prime Editing/calculation target: RT template (= edited right homology arm)
python MaChIAto_Analyzer/collect_MaChIAto_data.py \
-i example/MaChIAto_Classifier_output \
-o example/collections/PrimeEditing_RT_analysis \
-sc gttttagagctaggccaacatgaggatcacccatgtctgcagggcctagcaagttaaaataaggctagtccgttatcaacttggccaacatgaggatcacccatgtctgcagggccaagtggcaccgagtcggtgc \
-il PE \
-t ermh \
--ignore_list example/data/ignore_list_for_primeediting_analysis.csv;
Simple knock-in analysis: Prime Editing/calculation target: protospacer
python MaChIAto_Analyzer/collect_MaChIAto_data.py \
-i example/MaChIAto_Classifier_output \
-o example/collections/PrimeEditing_protospacer_analysis \
-sc gttttagagctaggccaacatgaggatcacccatgtctgcagggcctagcaagttaaaataaggctagtccgttatcaacttggccaacatgaggatcacccatgtctgcagggccaagtggcaccgagtcggtgc \
-il PE \
-t protospacer \
--ignore_list example/data/ignore_list_for_primeediting_analysis.csv;

MaChIAto Analyzer

Parameter list

Summary directory: The summary directory generated with “collect_MaChIAto_data.py”.

Output prefix: The directory into which the output directory is saved.

Name of extra data (optional): The name of the feature group that the next argument includes.

Table of extra data (optional): The pathname of extra data added by the user. The data should be a .csv file.

Template command

Rscript MaChIAto_Analyzer/MaChIAtoAnalyzer.R \
(Summary directory) \
(Output prefix) \
(Name of extra data) \
(Table of extra data);

Example of the command

Double knock-in analysis: PITCh and LoAD knock-in using MS2-tagged sgRNA/calculation target: both homology arm
Rscript MaChIAto_Analyzer/MaChIAtoAnalyzer.R \
example/collections/Double_knockin_analysis \
example/MaChIAto_Analyzer_output \
InDelphi \
example/data/bmh_extra_data_InDelphi.csv \
Accessibility \
example/data/bmh_extra_data_accessibility.csv \
Genome_Property \
example/data/bmh_extra_data_genomeprop.csv;
Single knock-in analysis: PITCh using MS2-tagged sgRNA/calculation target: both homology arm
Rscript MaChIAto_Analyzer/MaChIAtoAnalyzer.R \
example/collections/Single_knockin_analysis \
example/MaChIAto_Analyzer_output \
InDelphi \
example/data/bmh_extra_data_InDelphi.csv \
Accessibility \
example/data/bmh_extra_data_accessibility.csv \
Genome_Property \
example/data/bmh_extra_data_genomeprop.csv;
Simple knock-in analysis: PITCh using MS2-tagged sgRNA/calculation target: both homology arm
Rscript MaChIAto_Analyzer/MaChIAtoAnalyzer.R \
example/collections/Simple_knockin_analysis \
example/MaChIAto_Analyzer_output \
InDelphi \
example/data/bmh_extra_data_InDelphi.csv \
Accessibility \
example/data/bmh_extra_data_accessibility.csv \
Genome_Property \
example/data/bmh_extra_data_genomeprop.csv;
Double knock-out analysis: MS2-tagged sgRNA/calculation target: protospacer
Rscript MaChIAto_Analyzer/MaChIAtoAnalyzer.R \
example/collections/Double_knockout_analysis \
example/MaChIAto_Analyzer_output \
InDelphi \
example/data/bmh_extra_data_InDelphi.csv \
Accessibility \
example/data/bmh_extra_data_accessibility.csv \
Genome_Property \
example/data/bmh_extra_data_genomeprop.csv;
Single knock-out analysis: knock-out using MS2-tagged sgRNA/calculation target: protospacer
Rscript MaChIAto_Analyzer/MaChIAtoAnalyzer.R \
example/collections/Single_knockout_analysis \
example/MaChIAto_Analyzer_output \
InDelphi \
example/data/bmh_extra_data_InDelphi.csv \
Accessibility \
example/data/bmh_extra_data_accessibility.csv \
Genome_Property \
example/data/bmh_extra_data_genomeprop.csv;
Simple knock-in analysis: Prime Editing/calculation target: RT template (= edited right homology arm)
Rscript MaChIAto_Analyzer/MaChIAtoAnalyzer.R \
example/collections/PrimeEditing_RT_analysis \
example/MaChIAto_Analyzer_output;
Simple knock-in analysis: Prime Editing/calculation target: protospacer
Rscript MaChIAto_Analyzer/MaChIAtoAnalyzer.R \
example/collections/PrimeEditing_protospacer_analysis \
example/MaChIAto_Analyzer_output;

MaChIAtoAnalyzer output

[Coming soon]

MaChIAto Reviewer

Parameter list

Summary directory: The summary directory generated with “collect_MaChIAto_data.py”.

Output prefix: The directory into which the output directory is saved.

Template command

Rscript MaChIAto_Reviewer/MaChIAtoReviewer.R
(Summary directory)
(Output prefix);

Example of the command

Double knock-in analysis: PITCh and LoAD knock-in using MS2-tagged sgRNA/calculation target: both homology arm
Rscript MaChIAto_Reviewer/MaChIAtoReviewer.R \
example/MaChIAto_Classifier_output \
example/MaChIAto_Aligner_output \
example/collections/Double_knockin_analysis \
example/MaChIAto_Reviewer_output;
Single knock-in analysis: PITCh using MS2-tagged sgRNA/calculation target: both homology arm
Rscript MaChIAto_Reviewer/MaChIAtoReviewer.R \
example/MaChIAto_Classifier_output \
example/MaChIAto_Aligner_output \
example/collections/Single_knockin_analysis \
example/MaChIAto_Reviewer_output;
Simple knock-in analysis: PITCh using MS2-tagged sgRNA/calculation target: both homology arm
Rscript MaChIAto_Reviewer/MaChIAtoReviewer.R \
example/MaChIAto_Classifier_output \
example/MaChIAto_Aligner_output \
example/collections/Simple_knockin_analysis \
example/MaChIAto_Reviewer_output;
Double knock-out analysis: MS2-tagged sgRNA/calculation target: protospacer
Rscript MaChIAto_Reviewer/MaChIAtoReviewer.R \
example/MaChIAto_Classifier_output \
example/MaChIAto_Aligner_output \
example/collections/Double_knockout_analysis \
example/MaChIAto_Reviewer_output;
Single knock-out analysis: knock-out using MS2-tagged sgRNA/calculation target: protospacer
Rscript MaChIAto_Reviewer/MaChIAtoReviewer.R \
example/MaChIAto_Classifier_output \
example/MaChIAto_Aligner_output \
example/collections/Single_knockout_analysis \
example/MaChIAto_Reviewer_output;
Simple knock-in analysis: Prime Editing/calculation target: RT template (= edited right homology arm)
Rscript MaChIAto_Reviewer/MaChIAtoReviewer.R \
example/MaChIAto_Classifier_output \
example/MaChIAto_Aligner_output \
example/collections/PrimeEditing_RT_analysis \
example/MaChIAto_Reviewer_output;
Simple knock-in analysis: Prime Editing/calculation target: protospacer
Rscript MaChIAto_Reviewer/MaChIAtoReviewer.R \
example/MaChIAto_Classifier_output \
example/MaChIAto_Aligner_output \
example/collections/PrimeEditing_protospacer_analysis \
example/MaChIAto_Reviewer_output;

MaChIAtoReviewer output

[Coming soon]

CaseStudies

[Coming soon]

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages