MaChIAto is a third party software of CRISPResso (https://github.com/pinellolab/CRISPResso2) to get a stricter classification of the targeted amplicon sequencing (also called deep sequencing) and profile the relationship with genomic context (e.g., Thermodynamics and mutation patterns) in the CRISPR-Cas9/Prime Editor targeting site.
MaChIAto would be helpful for people who want to
- get a more accurate classification of knock-out, homology-based knock-in, and Prime Editing. (Case I)
- quantify the rate of imprecise editing reads, including the reads with donor-reversed integration. (Case II)
- obtain the local alignment of the target site and knock-in junction. (Case III)
- analyze the sequence involved with InDels and imprecise editing. (Case IV)
- find the useful features for the efficiency prediction. (Case V)
If you have the same purpose with the above cases, you can check the Case Studies section!
We demonstrate a simple example. Before that, you have to build the environment according to the section of Preparation.
Click "Download ZIP".
Alternatively, enter the following command terminal.
git clone https://github.com/KazukiNakamae/MaChIAto.git;
cd (...)/MaChIAto
source activate MaChIAto_env; # *You do not need to enter it again once you did it.
The re-classification command of MaChIAto (Classifier)
python MaChIAto/MaChIAto.py \
-m CRISPResso2 \
-ccf (Alleles_frequency_table.zip of the CRISPResso2 output) \
-o (output directory) \
-a (wt amplicon sequence) \
-g (protospaser sequence) \
(other parameters);
conda deactivate;
The other parameters varies per experiment type {knock-out, homology-based knokc-in, Prime Editing}.
The example is for the knock-out analysis.
python MaChIAto/MaChIAto.py \
-ccf (Alleles_frequency_table.zip of the CRISPResso2 output) \
-o (output directory) \
-a (wt amplicon sequence) \
-g (protospaser sequence);
*The case of the knock-out and knock-in is shown in the section of MaChIAto (MaChIAto Classifier).
You can get the classification result of the editing. The result is visualized as the pie chart based on the ALL_dataframe.txt.
We provide the detailed description how to read them in the section of the MaChIAtoClassifier output.
MaChIAto can profile and visualize the MaChIAto output using MaChIAto Aligner, MaChIAto Analyzer, and MaChIAto Reviewer.
source activate MaChIAto_Aligner_env; # *You do not need to enter it again once you did it.
The alignment command of MaChIAto Aligner
Rscript MaChIAto_Aligner/MaChIAtoAligner.R \
(directory of MaChIAto Classifier) \
(output directory);
conda deactivate;
If your donor insert has the extra sequence outside of the homology arm, you should enter it. Please refer to the section of MaChIAtoAligner.
You can get the local alignment of the editing. The result is visualized as the map.
We provide a detailed description of how to read them in the section of the MaChIAtoAligner output.
If you have multiple data (n>3) to profile the characteristics, you should aggregate it using the following command.
source activate MaChIAto_Analyzer_env; # *You do not need to enter it again once you did it.
python MaChIAto_Analyzer/collect_MaChIAto_data.py \
-i (the prefix of MaChIAto Classifier output) \
-o (output directory) \
-ol (knock-out label) \
-t (calculation target) \
conda deactivate;
It is used for the knock-out analysis. The calculation target for MaChIAto Analyzer/Reviewer is protospacer. You can target the homology arm and RT template using another command. Please see the section of MaChIAtoAnalyzer.
You can see the correlation with the >70 (epi-)genomic context using MaChIAto Analyzer.
Rscript MaChIAto_Analyzer/MaChIAtoAnalyzer.R \
(directory of collect_MaChIAto_data.py) \
(output directory);
conda deactivate;
MaChIAtoAnalyzer basically use > 70 genomic context. If you want to use epigenomic context or other context, please see the section of MaChIAtoAnalyzer.
You can get the correlation between the editing efficacy and the context. The result is visualized as the scatter plot.
We provide a detailed description of how to read them in the section of the MaChIAtoAnalyzer output.
source activate MaChIAto_Reviewer_env; # *You do not need to enter it again once you did it.
Rscript MaChIAto_Reviewer/MaChIAtoReviewer.R \
(the prefix of MaChIAto Classifier output) \
(the prefix of MaChIAto Aligner output) \
(directory of collect_MaChIAto_data.py) \
(output directory);
conda deactivate;
You can get the mutation profile of the editing. The result is visualized as the bar plot and pie chart.
We provide a detailed description of how to read them in the section of the MaChIAtoReviewer output.
chsh -s /bin/bash;
(https://docs.conda.io/en/latest/miniconda.html)
(https://apps.apple.com/jp/app/xcode/id497799835?mt=12)
sudo xcodebuild -license;
(xquartz.macosforge.org)
conda create --name MaChIAto_env;
source activate MaChIAto_env;
conda install -c anaconda python=3.8;
pip install --upgrade pip;
pip install regex tqdm argparse biopython numpy matplotlib GPy gpyopt datetime pandas;
conda deactivate;
conda create --name MaChIAto_Aligner_env;
source activate MaChIAto_Aligner_env;
conda install -c bioconda python=3.8 bwa=0.7.17 samtools=1.9;
conda deactivate;
conda create --name MaChIAto_Analyzer_env;
source activate MaChIAto_Analyzer_env;
conda install -c anaconda python=3.8 wget;
conda install -c bioconda emboss;
conda install -c bioconda oligoarrayaux;
pip install --upgrade pip;
pip install numpy regex tqdm pandas;
conda deactivate;
conda create -n MaChIAto_Reviewer_env --clone MaChIAto_Aligner_env;
source activate MaChIAto_Reviewer_env;
conda deactivate;
-m or --mode {str}: This parameter allows for the specification of the type of analysis: “CRISPResso” and “CRISPResso2” is allowed in the latest version
-a or --amplicon_seq {str}: This parameter allows the user to enter the amplicon sequence used for the CRISPResso. The length should be >105bp due to the setting for the other parameter.
-g or --guide_seq {str}: This parameter allows for the specification of the sgRNA sequence used for the CRISPResso. The length of the sequence should be 20nt without PAM. The MaChIAto convention is to depict the expected cleavage position using the value of the parameter three nt 3' from the end of the guide.", required=True)
-cf or --crispreeso_file {str}: This parameter allows for the specification of the “Alleles_frequency_table.txt” from CRISPResso. When this parameter is used, “CRISPResso” should be entered as -m parameter.', (default: "./Alleles_frequency_table.txt") (optional)
-ccf or --crispreeso2_file {str}: This parameter allows for the specification of the “Alleles_frequency_table.zip” from CRISPResso2. When this parameter is used, “CRISPResso2” should be entered as -m parameter.', (default: "./Alleles_frequency_table.zip") (optional)
-d or --donor_seq {str}:This parameter allows for the specification of the expected HDR amplicon used for the CRISPResso. The length of sequence should be >12bp in knock-out/knock-in analysis. In knock-out analysis, this parameter should not be entered, and then the value will be given fake parameter (“TTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTT”), and some internal settings are changed for knock-out analysis. However, the fake parameter will be poly-C|G|A if amplicon sequence contains poly-T sequence.', (default: "") (optional)
-e or --expected_ki_amplicon_seq {str}: This parameter allows for the specification of the expected knock-in amplicon sequence used for the CRISPResso after HDR. The length of the sequence should be >12bp. In knock-out analysis, this parameter should not be entered, and then the value will be given fake parameter including fake donor sequence, and some internal settings are changed for knock-out analysis.', (default: "") (optional)
-o or --output_folder {str}: This parameter allows for the specification of the output directory to use for the analysis (default: current directory) (default: "./") (optional)
-lh or --length_left_homologyarm {int}: This parameter allows for the specification of the length of 5’ homology arm (default: 20). The length of the sequence should be >17bp, and the flanking sequence with the homology arm needs the length of >24bp in the expected amplicon sequence. In the prime editing analysis, the 5’ homology arm is considered as the RT template (default: 20) (optional)
-rh or --length_right_homologyarm {int}: This parameter allows for the specification of the length of 3’ homology arm (default: 20). The length of the sequence should be >17bp, and the flanking sequence with the homology arm needs the length of >24bp in the expected amplicon sequence. In the prime editing analysis, the 3’ homology arm is considered as the prime binding site (default: 20) (optional)
-cn or --location_comp_nick {int}: This parameter allows for the specification of the complementary strand nick location [3prime direction is +] (default: 90). This parameter is used in the prime editing and should be over the homology arm's length to which the nickase is adjacent' (default: 90) (optional)
-n or --name : This parameter allows for the specification of the name, which will be included output directory (default: “untitled-X”). If MaChIAto Analyzer and MaChIAto Reviewer will be used in the following analysis, the value should be “{target_name}-{sample label }” (e.g., DBF4B-C) and underbar "_" should not be used', (default: "untitled-X") (optional)
--primeediting_analysis : Re-classify the data as prime editing analysis. This option forces the setting to change for prime editing analysis.' (optional)
--force_knockout_analysis : Usually, MaChIAto re-classify the data as knock-in analysis. This option forces the setting to change for knock-out analysis. Under this mode, the length of indicator on the knock-in donor is the maximum value, and the threshold value for alignment of the knock-in sequence is 1.0. If this parameter is not entered, MaChIAto can automatically set up this mode by finding some characteristics of the knock-out sample. For example, MaChIAto checks that there is no donor sequence or expected knock-in sequence as input, and there are less three kinds HDR variants among input data. (optional)
--skip_optimization : Usually, MaChIAto runs Bayesian optimization for finding the optimized setting. This option allows MaChIAto to skip the process of optimization. The option is made for debugging. So, the option should not be used in the usual analysis. However, if the optimization process disturbs an accurate analysis, this option might be useful. (optional)
--copy_optimization : Usually, MaChIAto runs Bayesian optimization for finding the optimized setting. This option allows MaChIAto to use the provided optimization data instead of the optimization. If you analyze NGS data with the setting of previously analyzed data, this option is useful. Especially in substitution editing, we can recommend applying the option to the negative/positive control sample. However, you should understand that a classification error might frequently occur. (optional)
--provided_optimization_file : This parameter allows for the specification of the “MaChIAto_optimized_param.csv” from the analyzed MaChIAto folder. When this parameter is used, --copy_optimization parameter is required.', (default: "./MaChIAto_optimized_param.csv") (optional)
python MaChIAto/MaChIAto.py \
-m CRISPResso2 \
-ccf (Alleles_frequency_table.zip of the CRISPResso2 output) \
-o (output directory) \
-a (wt amplicon sequence) \
-g (protospaser sequence) \
-n (sample name)-(label name);
python MaChIAto/MaChIAto.py \
-m CRISPResso2 \
-ccf (Alleles_frequency_table.zip of the CRISPResso2 output) \
-o (output directory) \
-a (wt amplicon sequence) \
-g (protospaser sequence) \
-e (expected editing amplicon seqeunce) \
-d (donor insert sequence) \
-lh (length of 5\' homology arm) \
-rh (length of 3\' homology arm) \
-n (sample name)-(label name);
python MaChIAto/MaChIAto.py \
-m CRISPResso2 \
-ccf (Alleles_frequency_table.zip of the CRISPResso2 output) \
-o (output directory) \
-a (wt amplicon sequence) \
-g (protospaser sequence) \
-e (expected editing amplicon seqeunce) \
-lh (length of prime binding site) \
-rh (length of RT template) \
-cn (distance between two nick sites) \
--primeediting_analysis \
-n (sample name)-(label name);
python MaChIAto/MaChIAto.py \
-m CRISPResso2 \
-ccf (Alleles_frequency_table.zip of the CRISPResso2 output) \
-o (output directory) \
-a (wt amplicon sequence) \
-g (protospaser sequence) \
-e (expected editing amplicon seqeunce) \
-d (donor insert sequence) \
-lh (length of prime binding site) \
-rh (length of RT template) \
-cn (distance between two nick sites) \
--primeediting_analysis \
-n (sample name)-(label name);
If you want to use the result of CRISPResso version1, the -ccf should be replased into -cf. Example for CRISPResso version1
python MaChIAto/MaChIAto.py \
-m CRISPResso \
-cf (Alleles_frequency_table.txt of the CRISPResso output) \
-o (output directory) \
-a (wt amplicon sequence) \
-g (protospaser sequence) \
-n (sample name)-(label name);
*The sample name and label name can be arbitrary. If you analyze the multiple experiment (e.g. knock-out and knock-in), the name should be different from others.
python MaChIAto/MaChIAto.py \
-m CRISPResso2 \
-ccf example/data/CRISPResso_on_20190702171907_AAVS1-B/Alleles_frequency_table.zip \
-o example/MaChIAto_Classifier_output \
-a CAGGGGTCCGAGAGCTCAGCTAGTCTTCTTCCTCCAACCCGGGCCCCTATGTCCACTTCAGGACAGCATGTTTGCTGCCTCCAGGGATCCTGTGTCCCCGAGCTGGGACCACCTTATATTCCCAGGGCCGGTTAATGTGGCTCTGGTTCTGGGTACTTTTATCTGTCCCCTCCACCCCACAGTGGGGCCACTAGGGACAGGATTGGTGACAGAAAAGCCCCATCCTTAGGCCTCCTCCTTCCTAGTCTCCTGATATTGGGTCTAACCCCCACCTCCTGTTAGGCAGATTCCTTATCTGGTGACACACCCCCATTTCCTGGAGCCATCTCTCT \
-g ACCCCACAGTGGGGCCACTA \
-n AAVS1-KO;
python MaChIAto/MaChIAto.py \
-m CRISPResso2 \
-ccf example/data/CRISPResso_on_20190702174222_DBF4B-B/Alleles_frequency_table.zip \
-o example/MaChIAto_Classifier_output \
-a TTTCTGAGCAAAGAAGTAAGTTACATCGTGTCCAGCCGCAGAGAAGTAAAGGCAGAGAGCAGTGGGAAAAGCCATAGAGGCTGCCCTAGCCCTAGCCCCAGTGAGGTCAGAGTGGAAACATCGGCCATGGTTGATCCAAAAGGCAGCCACCCCAGGCCTTCACGGAAACCCGTTGACTCGGTAAGAACCTCATGTAGGAAAGGTATGCTGTGTAAACAAAGGAAGTAGGCTTTGGTGAGCCTTTTAAGGCAGTCTCTGGTTTTAATTTCATCTGGATTAATTTAATCTCTCATTTGAATTCAGGGACTGAGAATATTTTATCATAGCTCTG \
-g TTCACGGAAACCCGTTGACT \
-n DBF4B-KO;
python MaChIAto/MaChIAto.py \
-m CRISPResso2 \
-ccf example/data/CRISPResso_on_20190702185531_CD40LG-B/Alleles_frequency_table.zip \
-o example/MaChIAto_Classifier_output \
-a TGTAGAGGCAGATATCATTAGCTGTATTCTCCTTCCGAATGACATTTATCATATCCTTGTTATTCCAAAATAGATAGAAGATGAAAGGAATCTTCATGAAGATTTTGTATTCATGAAAACGATACAGAGATGCAACACAGGAGAAAGATCCTTATCCTTACTGAACTGTGAGGAGATTAAAAGCCAGTTTGAAGGCTTTGTGAAGGTAAGCAGCTTAATTACTGGTAAAAGTGTCATTGAAATATTTTACTACATTTGCTAGATCGGGAAACTGACAATGCCAATGTTTAAAGATTGGTTATAGACACAGACACACAGACACACACACACATAT \
-g CTTATCCTTACTGAACTGTG \
-n CD40LG-KO;
python MaChIAto/MaChIAto.py \
-m CRISPResso2 \
-ccf example/data/CRISPResso_on_20190702172008_AAVS1-C/Alleles_frequency_table.zip \
-o example/MaChIAto_Classifier_output \
-a CAGGGGTCCGAGAGCTCAGCTAGTCTTCTTCCTCCAACCCGGGCCCCTATGTCCACTTCAGGACAGCATGTTTGCTGCCTCCAGGGATCCTGTGTCCCCGAGCTGGGACCACCTTATATTCCCAGGGCCGGTTAATGTGGCTCTGGTTCTGGGTACTTTTATCTGTCCCCTCCACCCCACAGTGGGGCCACTAGGGACAGGATTGGTGACAGAAAAGCCCCATCCTTAGGCCTCCTCCTTCCTAGTCTCCTGATATTGGGTCTAACCCCCACCTCCTGTTAGGCAGATTCCTTATCTGGTGACACACCCCCATTTCCTGGAGCCATCTCTCT \
-g ACCCCACAGTGGGGCCACTA \
-e CAGGGGTCCGAGAGCTCAGCTAGTCTTCTTCCTCCAACCCGGGCCCCTATGTCCACTTCAGGACAGCATGTTTGCTGCCTCCAGGGATCCTGTGTCCCCGAGCTGGGACCACCTTATATTCCCAGGGCCGGTTAATGTGGCTCTGGTTCTGGGTACTTTTATCTGTCCCCTCCACCCCACAGTGGGGCCAGACTATAAGGACCACGACGGAGACTACAAGGATCATGATATTGATTACAAAGACGATGACGATAAGCTAGGGACAGGATTGGTGACAGAAAAGCCCCATCCTTAGGCCTCCTCCTTCCTAGTCTCCTGATATTGGGTCTAACCCCCACCTCCTGTTAGGCAGATTCCTTATCTGGTGACACACCCCCATTTCCTGGAGCCATCTCTCT \
-d GACTATAAGGACCACGACGGAGACTACAAGGATCATGATATTGATTACAAAGACGATGACGATAAG \
-n AAVS1-KI1;
python MaChIAto/MaChIAto.py \
-m CRISPResso2 \
-ccf example/data/CRISPResso_on_20190702174343_DBF4B-C/Alleles_frequency_table.zip \
-o example/MaChIAto_Classifier_output \
-a TTTCTGAGCAAAGAAGTAAGTTACATCGTGTCCAGCCGCAGAGAAGTAAAGGCAGAGAGCAGTGGGAAAAGCCATAGAGGCTGCCCTAGCCCTAGCCCCAGTGAGGTCAGAGTGGAAACATCGGCCATGGTTGATCCAAAAGGCAGCCACCCCAGGCCTTCACGGAAACCCGTTGACTCGGTAAGAACCTCATGTAGGAAAGGTATGCTGTGTAAACAAAGGAAGTAGGCTTTGGTGAGCCTTTTAAGGCAGTCTCTGGTTTTAATTTCATCTGGATTAATTTAATCTCTCATTTGAATTCAGGGACTGAGAATATTTTATCATAGCTCTG \
-g TTCACGGAAACCCGTTGACT \
-e TTTCTGAGCAAAGAAGTAAGTTACATCGTGTCCAGCCGCAGAGAAGTAAAGGCAGAGAGCAGTGGGAAAAGCCATAGAGGCTGCCCTAGCCCTAGCCCCAGTGAGGTCAGAGTGGAAACATCGGCCATGGTTGATCCAAAAGGCAGCCACCCCAGGCCTTCACGGAAACCCGTTGGACTATAAGGACCACGACGGAGACTACAAGGATCATGATATTGATTACAAAGACGATGACGATAAGACTCGGTAAGAACCTCATGTAGGAAAGGTATGCTGTGTAAACAAAGGAAGTAGGCTTTGGTGAGCCTTTTAAGGCAGTCTCTGGTTTTAATTTCATCTGGATTAATTTAATCTCTCATTTGAATTCAGGGACTGAGAATATTTTATCATAGCTCTG \
-d GACTATAAGGACCACGACGGAGACTACAAGGATCATGATATTGATTACAAAGACGATGACGATAAG \
-n DBF4B-KI1;
python MaChIAto/MaChIAto.py \
-m CRISPResso2 \
-ccf example/data/CRISPResso_on_20190702185650_CD40LG-C/Alleles_frequency_table.zip \
-o example/MaChIAto_Classifier_output \
-a TGTAGAGGCAGATATCATTAGCTGTATTCTCCTTCCGAATGACATTTATCATATCCTTGTTATTCCAAAATAGATAGAAGATGAAAGGAATCTTCATGAAGATTTTGTATTCATGAAAACGATACAGAGATGCAACACAGGAGAAAGATCCTTATCCTTACTGAACTGTGAGGAGATTAAAAGCCAGTTTGAAGGCTTTGTGAAGGTAAGCAGCTTAATTACTGGTAAAAGTGTCATTGAAATATTTTACTACATTTGCTAGATCGGGAAACTGACAATGCCAATGTTTAAAGATTGGTTATAGACACAGACACACAGACACACACACACATAT \
-g CTTATCCTTACTGAACTGTG \
-e TGTAGAGGCAGATATCATTAGCTGTATTCTCCTTCCGAATGACATTTATCATATCCTTGTTATTCCAAAATAGATAGAAGATGAAAGGAATCTTCATGAAGATTTTGTATTCATGAAAACGATACAGAGATGCAACACAGGAGAAAGATCCTTATCCTTACTGAACTGACTATAAGGACCACGACGGAGACTACAAGGATCATGATATTGATTACAAAGACGATGACGATAAGGTGAGGAGATTAAAAGCCAGTTTGAAGGCTTTGTGAAGGTAAGCAGCTTAATTACTGGTAAAAGTGTCATTGAAATATTTTACTACATTTGCTAGATCGGGAAACTGACAATGCCAATGTTTAAAGATTGGTTATAGACACAGACACACAGACACACACACACATAT \
-d GACTATAAGGACCACGACGGAGACTACAAGGATCATGATATTGATTACAAAGACGATGACGATAAG \
-n CD40LG-KI1;
python MaChIAto/MaChIAto.py \
-m CRISPResso2 \
-ccf example/data/CRISPResso_on_20190702171854_AAVS1-A/Alleles_frequency_table.zip \
-o example/MaChIAto_Classifier_output \
-a CAGGGGTCCGAGAGCTCAGCTAGTCTTCTTCCTCCAACCCGGGCCCCTATGTCCACTTCAGGACAGCATGTTTGCTGCCTCCAGGGATCCTGTGTCCCCGAGCTGGGACCACCTTATATTCCCAGGGCCGGTTAATGTGGCTCTGGTTCTGGGTACTTTTATCTGTCCCCTCCACCCCACAGTGGGGCCACTAGGGACAGGATTGGTGACAGAAAAGCCCCATCCTTAGGCCTCCTCCTTCCTAGTCTCCTGATATTGGGTCTAACCCCCACCTCCTGTTAGGCAGATTCCTTATCTGGTGACACACCCCCATTTCCTGGAGCCATCTCTCT \
-g ACCCCACAGTGGGGCCACTA \
-n AAVS1-NC;
python MaChIAto/MaChIAto.py \
-m CRISPResso2 \
-ccf example/data/CRISPResso_on_20190702174209_DBF4B-A/Alleles_frequency_table.zip \
-o example/MaChIAto_Classifier_output \
-a TTTCTGAGCAAAGAAGTAAGTTACATCGTGTCCAGCCGCAGAGAAGTAAAGGCAGAGAGCAGTGGGAAAAGCCATAGAGGCTGCCCTAGCCCTAGCCCCAGTGAGGTCAGAGTGGAAACATCGGCCATGGTTGATCCAAAAGGCAGCCACCCCAGGCCTTCACGGAAACCCGTTGACTCGGTAAGAACCTCATGTAGGAAAGGTATGCTGTGTAAACAAAGGAAGTAGGCTTTGGTGAGCCTTTTAAGGCAGTCTCTGGTTTTAATTTCATCTGGATTAATTTAATCTCTCATTTGAATTCAGGGACTGAGAATATTTTATCATAGCTCTG \
-g TTCACGGAAACCCGTTGACT \
-n DBF4B-NC;
python MaChIAto/MaChIAto.py \
-m CRISPResso2 \
-ccf example/data/CRISPResso_on_20190702185510_CD40LG-A/Alleles_frequency_table.zip \
-o example/MaChIAto_Classifier_output \
-a TGTAGAGGCAGATATCATTAGCTGTATTCTCCTTCCGAATGACATTTATCATATCCTTGTTATTCCAAAATAGATAGAAGATGAAAGGAATCTTCATGAAGATTTTGTATTCATGAAAACGATACAGAGATGCAACACAGGAGAAAGATCCTTATCCTTACTGAACTGTGAGGAGATTAAAAGCCAGTTTGAAGGCTTTGTGAAGGTAAGCAGCTTAATTACTGGTAAAAGTGTCATTGAAATATTTTACTACATTTGCTAGATCGGGAAACTGACAATGCCAATGTTTAAAGATTGGTTATAGACACAGACACACAGACACACACACACATAT \
-g CTTATCCTTACTGAACTGTG \
-n CD40LG-NC;
python MaChIAto/MaChIAto.py \
-m CRISPResso2 \
-ccf example/data/CRISPResso_on_20190702172115_AAVS1-D/Alleles_frequency_table.zip \
-o example/MaChIAto_Classifier_output \
-a CAGGGGTCCGAGAGCTCAGCTAGTCTTCTTCCTCCAACCCGGGCCCCTATGTCCACTTCAGGACAGCATGTTTGCTGCCTCCAGGGATCCTGTGTCCCCGAGCTGGGACCACCTTATATTCCCAGGGCCGGTTAATGTGGCTCTGGTTCTGGGTACTTTTATCTGTCCCCTCCACCCCACAGTGGGGCCACTAGGGACAGGATTGGTGACAGAAAAGCCCCATCCTTAGGCCTCCTCCTTCCTAGTCTCCTGATATTGGGTCTAACCCCCACCTCCTGTTAGGCAGATTCCTTATCTGGTGACACACCCCCATTTCCTGGAGCCATCTCTCT \
-g ACCCCACAGTGGGGCCACTA \
-e CAGGGGTCCGAGAGCTCAGCTAGTCTTCTTCCTCCAACCCGGGCCCCTATGTCCACTTCAGGACAGCATGTTTGCTGCCTCCAGGGATCCTGTGTCCCCGAGCTGGGACCACCTTATATTCCCAGGGCCGGTTAATGTGGCTCTGGTTCTGGGTACTTTTATCTGTCCCCTCCACCCCACAGTGGGGCCAGACTATAAGGACCACGACGGAGACTACAAGGATCATGATATTGATTACAAAGACGATGACGATAAGCTAGGGACAGGATTGGTGACAGAAAAGCCCCATCCTTAGGCCTCCTCCTTCCTAGTCTCCTGATATTGGGTCTAACCCCCACCTCCTGTTAGGCAGATTCCTTATCTGGTGACACACCCCCATTTCCTGGAGCCATCTCTCT \
-d GACTATAAGGACCACGACGGAGACTACAAGGATCATGATATTGATTACAAAGACGATGACGATAAG \
-n AAVS1-KI2;
python MaChIAto/MaChIAto.py \
-m CRISPResso2 \
-ccf example/data/CRISPResso_on_20190702174513_DBF4B-D/Alleles_frequency_table.zip \
-o example/MaChIAto_Classifier_output \
-a TTTCTGAGCAAAGAAGTAAGTTACATCGTGTCCAGCCGCAGAGAAGTAAAGGCAGAGAGCAGTGGGAAAAGCCATAGAGGCTGCCCTAGCCCTAGCCCCAGTGAGGTCAGAGTGGAAACATCGGCCATGGTTGATCCAAAAGGCAGCCACCCCAGGCCTTCACGGAAACCCGTTGACTCGGTAAGAACCTCATGTAGGAAAGGTATGCTGTGTAAACAAAGGAAGTAGGCTTTGGTGAGCCTTTTAAGGCAGTCTCTGGTTTTAATTTCATCTGGATTAATTTAATCTCTCATTTGAATTCAGGGACTGAGAATATTTTATCATAGCTCTG \
-g TTCACGGAAACCCGTTGACT \
-e TTTCTGAGCAAAGAAGTAAGTTACATCGTGTCCAGCCGCAGAGAAGTAAAGGCAGAGAGCAGTGGGAAAAGCCATAGAGGCTGCCCTAGCCCTAGCCCCAGTGAGGTCAGAGTGGAAACATCGGCCATGGTTGATCCAAAAGGCAGCCACCCCAGGCCTTCACGGAAACCCGTTGGACTATAAGGACCACGACGGAGACTACAAGGATCATGATATTGATTACAAAGACGATGACGATAAGACTCGGTAAGAACCTCATGTAGGAAAGGTATGCTGTGTAAACAAAGGAAGTAGGCTTTGGTGAGCCTTTTAAGGCAGTCTCTGGTTTTAATTTCATCTGGATTAATTTAATCTCTCATTTGAATTCAGGGACTGAGAATATTTTATCATAGCTCTG \
-d GACTATAAGGACCACGACGGAGACTACAAGGATCATGATATTGATTACAAAGACGATGACGATAAG \
-n DBF4B-KI2;
python MaChIAto/MaChIAto.py \
-m CRISPResso2 \
-ccf example/data/CRISPResso_on_20190702185815_CD40LG-D/Alleles_frequency_table.zip \
-o example/MaChIAto_Classifier_output \
-a TGTAGAGGCAGATATCATTAGCTGTATTCTCCTTCCGAATGACATTTATCATATCCTTGTTATTCCAAAATAGATAGAAGATGAAAGGAATCTTCATGAAGATTTTGTATTCATGAAAACGATACAGAGATGCAACACAGGAGAAAGATCCTTATCCTTACTGAACTGTGAGGAGATTAAAAGCCAGTTTGAAGGCTTTGTGAAGGTAAGCAGCTTAATTACTGGTAAAAGTGTCATTGAAATATTTTACTACATTTGCTAGATCGGGAAACTGACAATGCCAATGTTTAAAGATTGGTTATAGACACAGACACACAGACACACACACACATAT \
-g CTTATCCTTACTGAACTGTG \
-e TGTAGAGGCAGATATCATTAGCTGTATTCTCCTTCCGAATGACATTTATCATATCCTTGTTATTCCAAAATAGATAGAAGATGAAAGGAATCTTCATGAAGATTTTGTATTCATGAAAACGATACAGAGATGCAACACAGGAGAAAGATCCTTATCCTTACTGAACTGACTATAAGGACCACGACGGAGACTACAAGGATCATGATATTGATTACAAAGACGATGACGATAAGGTGAGGAGATTAAAAGCCAGTTTGAAGGCTTTGTGAAGGTAAGCAGCTTAATTACTGGTAAAAGTGTCATTGAAATATTTTACTACATTTGCTAGATCGGGAAACTGACAATGCCAATGTTTAAAGATTGGTTATAGACACAGACACACAGACACACACACACATAT \
-d GACTATAAGGACCACGACGGAGACTACAAGGATCATGATATTGATTACAAAGACGATGACGATAAG \
-n CD40LG-KI2;
python MaChIAto/MaChIAto.py \
-m CRISPResso2 \
-ccf example/data/CRISPResso_on_SRR10287821/Alleles_frequency_table.zip \
-o example/MaChIAto_Classifier_output \
-a acttggtgccaaattcttctcccctgggaagcatccctggacacttcccaaaggaccccagtcactccagcctgttggctgccgctcactttgatgtctgcaggccagatgagggctccagatggcacattgtcagagggacacactgtggcccctgtgcccagccctgggctctctgtacatgaagcaactccagtcccaaatatgtagctgtttgggaggtcagaaatagggggtccaggagcaaactccccccaccccctttccaaagcccattccctctttag \
-g gatgtctgcaggccagatga \
-e acttggtgccaaattcttctcccctgggaagcatccctggacacttcccaaaggaccccagtcactccagcctgttggctgccgctcactttgatgtctgcaggccagaAgagggctccagatggcacattgtcagagggacacactgtggcccctgtgcccagccctgggctctctgtacatgaagcaactccagtcccaaatatgtagctgtttgggaggtcagaaatagggggtccaggagcaaactccccccaccccctttccaaagcccattccctctttag \
-lh 13 \
-rh 22 \
-cn 57 \
-n VEGFA_1TtoA_rep1-PE \
--primeediting_analysis;
python MaChIAto/MaChIAto.py \
-m CRISPResso2 \
-ccf example/data/CRISPResso_on_SRR10287989/Alleles_frequency_table.zip \
-o example/MaChIAto_Classifier_output \
-a ACACTCTTTCCCTACACGACGCTCTTCCGATCtttacacgtctcatatgccccttggcagtcatcttagtcattacctgaggtgttcgttgtaactcatataaactgagttcccatgttttgcttaatggttgagttccgtttgtctgcacagcctgagacattgctggaaataaagaagagagaaaaacaattttagtatttggaagggaagtgctatggtctgaatgtatgtgtcccaccaaaattcctacgt \
-g gtcatcttagtcattacctg \
-e ACACTCTTTCCCTACACGACGCTCTTCCGATCtttacacgtctcatatgccccttggcagtcatcttagtcattacGagTtgttcgttgtaactcatataaactgagttcccatgttttgcttaatggttgagttccgtttgtctgcacagcctgagacattgctggaaataaagaagagagaaaaacaattttagtatttggaagggaagtgctatggtctgaatgtatgtgtcccaccaaaattcctacgt \
-lh 15 \
-rh 12 \
-cn 41 \
-n RNF2_12CTdel_6GtoT-PE \
--primeediting_analysis;
python MaChIAto/MaChIAto.py \
-m CRISPResso2 \
-ccf example/data/CRISPResso_on_SRR10287978/Alleles_frequency_table.zip \
-o example/MaChIAto_Classifier_output \
-a ACACTCTTTCCCTACACGACGCTCTTCCGATCtttacacgtctcatatgccccttggcagtcatcttagtcattacctgaggtgttcgttgtaactcatataaactgagttcccatgttttgcttaatggttgagttccgtttgtctgcacagcctgagacattgctggaaataaagaagagagaaaaacaattttagtatttggaagggaagtgctatggtctgaatgtatgtgtcccaccaaaattcctacgt \
-g gtcatcttagtcattacctg \
-e ACACTCTTTCCCTACACGACGCTCTTCCGATCtttacacgtctcatatgccccttggcagtcatcttagtcattaccAAtggtgttcgttgtaactcatataaactgagttcccatgttttgcttaatggttgagttccgtttgtctgcacagcctgagacattgctggaaataaagaagagagaaaaacaattttagtatttggaagggaagtgctatggtctgaatgtatgtgtcccaccaaaattcctacgt \
-d cAA \
-lh 15 \
-rh 14 \
-cn 41 \
-n RNF2_2AAins_34GAdel-PE \
--primeediting_analysis;
The classification result is visualized in the "MaChIAto_alignment_pie_chart.png" "MaChIAto_alignment_pie_chart.png" has the following labels:
- Unmodified: The unedited amplicon sequence
- Perfect Editing: The edited sequence that a user intended to generate (e.g., precise knock-in sequence, expected substitution/deletion/insertion...)
- Imperfect Editing: The sequence that is simular to "Perfect Editing" sequence. However, the junction sequence is imperfect. *The class is specilized for the homology-based knock-in. In knock-out and Prime Editing, the "Imperfect Editing" reads can be condidered as just InDels. InDels: The sequence that has a trace of editing (e.g., deletion/insertion/frequent substitutions) around the target site.
- Deletion: The InDel sequence whose length is less than that of the unedited amplicon sequence
- Insertion: The InDel sequence whose length is more than that of the unedited amplicon sequence
- Frequent substitutions: The InDel sequence whose length is equal to that of the unedited amplicon sequence UC: Unclear sequence that has the insufficient information to be recognized as the focusing sequence.
- UC (Ambiguous Editing): There is the expected sequence (e.g., knock-in cassette and Reverse transcript). However, the information of the end of amplicon is unclear. For example, The edited reads, which do not have the site of complementary strand nicking, are classified into this class.
- UC (Others): There is not enough sequence informaton for the classification. The unedited amplicon read with sequence errors and read amplified from the other locus can be classified into the this class.
Input directory: The directory that MaChIAto Classifier generated.
Output prefix: The directory into which the output directory is saved.
Left extra sequence (optional): The outside sequence flanking 5'-homology arm of the donor.
Right extra sequence (optional): The outside sequence flanking 3'-homology arm of the donor.
Rscript MaChIAto_Aligner/MaChIAtoAligner.R \
(Input directory) \
(Output prefix) \
(Left extra sequence) \
(Right extra sequence);
*when you use external storage for save output, the process can be too slow. We recommend that you use the tool in the local storage.
Rscript MaChIAto_Aligner/MaChIAtoAligner.R \
example/MaChIAto_Classifier_output/MaChIAto_from_CRISPResso2_at_20200722112127_on_AAVS1-KO \
example/MaChIAto_Aligner_output;
Rscript MaChIAto_Aligner/MaChIAtoAligner.R \
example/MaChIAto_Classifier_output/MaChIAto_from_CRISPResso2_at_20200723215418_on_DBF4B-KO \
example/MaChIAto_Aligner_output;
Rscript MaChIAto_Aligner/MaChIAtoAligner.R \
example/MaChIAto_Classifier_output/MaChIAto_from_CRISPResso2_at_20200723215432_on_CD40LG-KO \
example/MaChIAto_Aligner_output;
Rscript MaChIAto_Aligner/MaChIAtoAligner.R \
example/MaChIAto_Classifier_output/MaChIAto_from_CRISPResso2_at_20200722112140_on_DBF4B-KI1 \
example/MaChIAto_Aligner_output \
GTTTGG \
CCAAAC;
Rscript MaChIAto_Aligner/MaChIAtoAligner.R \
example/MaChIAto_Classifier_output/MaChIAto_from_CRISPResso2_at_20200723225210_on_AAVS1-KI1 \
example/MaChIAto_Aligner_output \
GTTTGG \
CCAAAC;
Rscript MaChIAto_Aligner/MaChIAtoAligner.R \
example/MaChIAto_Classifier_output/MaChIAto_from_CRISPResso2_at_20200723215508_on_CD40LG-KI1 \
example/MaChIAto_Aligner_output \
GTTTGG \
CCAAAC;
Rscript MaChIAto_Aligner/MaChIAtoAligner.R \
example/MaChIAto_Classifier_output/MaChIAto_from_CRISPResso2_at_20200723215528_on_AAVS1-NC \
example/MaChIAto_Aligner_output;
Rscript MaChIAto_Aligner/MaChIAtoAligner.R \
example/MaChIAto_Classifier_output/MaChIAto_from_CRISPResso2_at_20200723215540_on_DBF4B-NC \
example/MaChIAto_Aligner_output;
Rscript MaChIAto_Aligner/MaChIAtoAligner.R \
example/MaChIAto_Aligner_output;
Rscript MaChIAto_Aligner/MaChIAtoAligner.R \
example/MaChIAto_Classifier_output/MaChIAto_from_CRISPResso2_at_20200723225231_on_AAVS1-KI2 \
example/MaChIAto_Aligner_output \
GTTTGG \
CCAAAC;
Rscript MaChIAto_Aligner/MaChIAtoAligner.R \
example/MaChIAto_Classifier_output/MaChIAto_from_CRISPResso2_at_20200723215620_on_DBF4B-KI2 \
example/MaChIAto_Aligner_output \
GTTTGG \
CCAAAC;
Rscript MaChIAto_Aligner/MaChIAtoAligner.R \
example/MaChIAto_Classifier_output/MaChIAto_from_CRISPResso2_at_20200723215633_on_CD40LG-KI2 \
example/MaChIAto_Aligner_output \
GTTTGG \
CCAAAC;
Rscript MaChIAto_Aligner/MaChIAtoAligner.R \
example/MaChIAto_Classifier_output/MaChIAto_from_CRISPResso2_at_20200722112157_on_VEGFA1TtoArep1-PE \
example/MaChIAto_Aligner_output;
Rscript MaChIAto_Aligner/MaChIAtoAligner.R \
example/MaChIAto_Classifier_output/MaChIAto_from_CRISPResso2_at_20200722112511_on_RNF212CTdel6GtoT-PE \
example/MaChIAto_Aligner_output;
Rscript MaChIAto_Aligner/MaChIAtoAligner.R \
example/MaChIAto_Classifier_output/MaChIAto_from_CRISPResso2_at_20200722112446_on_RNF22AAins34GAdel-PE \
example/MaChIAto_Aligner_output;
[Coming soon]
-i or --indir {str}: input directory which includes the outputs of MaChIAto Classifier
-o or --outdir {str}: output directory
-ul or --untreated_label {str}: untreated label (This must be one.)' (default: "machiato_dummy_sample") (optional)
-ol or --knock_out_label {str}: knock-out label' (default: []) (optional)
-il or --knock_in_label {str}: knock-in label' (default: []) (optional)
-sc or --scaffold_seq {str}: scaffold sequence of sgRNA (default: "gttttagagctagaaatagcaagttaaaataaggctagtccgttatcaacttgaaaaagtggcaccgagtcggtgc") (optional)
-t or --target_type {str}: target sequence type {"lmh" | "rmh" | "bmh" | "elmh" | "ermh" | "ebmh" | "protospacer"} (default: "bmh") (optional)
--ignore_list {str}: The list of ignore target set contains target names which are not desired to analyze for some reasons. The data (e.g. DBF4B-A, DBF4B-B, DBF4B-C, DBF4B-D) including target name (e.g. DBF4B) shown in the list is skipped through the process of MaChIAto Analyzer. The format should be comma-separated like “TargetA, TargetB, …” “example_data” directory has “ignore_list.csv” as example. (default: "") (optional)
Case of Double knock-in analysis (*ADVANCE: The analysis includes the comparison between two knock-in methods.)
python MaChIAto_Analyzer/collect_MaChIAto_data.py \
-i (the prefix of MaChIAto Classifier output) \
-o (output directory) \
-sc (scaffold sequence of sgRNA) \
-ul (untreated label) \
-ol (knock-out label) \
-il (knock-in label 1) (knock-in label 2) \
-t (calculation target) \
--ignore_list (list of samples ignored);
python MaChIAto_Analyzer/collect_MaChIAto_data.py \
-i (the prefix of MaChIAto Classifier output) \
-o (output directory) \
-sc (scaffold sequence of sgRNA) \
-ul (untreated label) \
-ol (knock-out label) \
-il (knock-in label) \
-t (calculation target) \
--ignore_list (list of samples ignored);
Case of Simple knock-in analysis (*SIMPLE: The analysis can be applied when there is no knock-out sample used as control.)
python MaChIAto_Analyzer/collect_MaChIAto_data.py \
-i (the prefix of MaChIAto Classifier output) \
-o (output directory) \
-sc (scaffold sequence of sgRNA) \
-il (knock-in label) \
-t (calculation target) \
--ignore_list (list of samples ignored);
Case of Double knock-out analysis (*ADVANCE: The analysis includes the comparison between two knock-out methods.)
python MaChIAto_Analyzer/collect_MaChIAto_data.py \
-i (the prefix of MaChIAto Classifier output) \
-o (output directory) \
-sc (scaffold sequence of sgRNA) \
-ul (untreated label) \
-ol (knock-out label 1) (knock-out label 2) \
-t (calculation target) \
--ignore_list (list of samples ignored);
python MaChIAto_Analyzer/collect_MaChIAto_data.py \
-i (the prefix of MaChIAto Classifier output) \
-o (output directory) \
-sc (scaffold sequence of sgRNA) \
-ul (untreated label) \
-ol (knock-out label) \
-t (calculation target) \
--ignore_list (list of samples ignored);
*If you do not enter "-ul (untreated label)", the process can work. However, some filtering process will be skipped.
Double knock-in analysis: PITCh and LoAD knock-in using MS2-tagged sgRNA/calculation target: both homology arm
python MaChIAto_Analyzer/collect_MaChIAto_data.py \
-i example/MaChIAto_Classifier_output \
-o example/collections/Double_knockin_analysis \
-sc gttttagagctaggccaacatgaggatcacccatgtctgcagggcctagcaagttaaaataaggctagtccgttatcaacttggccaacatgaggatcacccatgtctgcagggccaagtggcaccgagtcggtgc \
-ul NC \
-ol KO \
-il KI1 KI2 \
-t bmh \
--ignore_list example/data/ignore_list_for_KOKI_analysis.csv;
python MaChIAto_Analyzer/collect_MaChIAto_data.py \
-i example/MaChIAto_Classifier_output \
-o example/collections/Single_knockin_analysis \
-sc gttttagagctaggccaacatgaggatcacccatgtctgcagggcctagcaagttaaaataaggctagtccgttatcaacttggccaacatgaggatcacccatgtctgcagggccaagtggcaccgagtcggtgc \
-ul NC \
-ol KO \
-il KI1 \
-t bmh \
--ignore_list example/data/ignore_list_for_KOKI_analysis.csv;
python MaChIAto_Analyzer/collect_MaChIAto_data.py \
-i example/MaChIAto_Classifier_output \
-o example/collections/Simple_knockin_analysis \
-sc gttttagagctaggccaacatgaggatcacccatgtctgcagggcctagcaagttaaaataaggctagtccgttatcaacttggccaacatgaggatcacccatgtctgcagggccaagtggcaccgagtcggtgc \
-il KI1 \
-t bmh \
--ignore_list example/data/ignore_list_for_KOKI_analysis.csv;
python MaChIAto_Analyzer/collect_MaChIAto_data.py \
-i example/MaChIAto_Classifier_output \
-o example/collections/Double_knockout_analysis \
-sc gttttagagctaggccaacatgaggatcacccatgtctgcagggcctagcaagttaaaataaggctagtccgttatcaacttggccaacatgaggatcacccatgtctgcagggccaagtggcaccgagtcggtgc \
-ul NC \
-ol KO KI1 \
-t protospacer \
--ignore_list example/data/ignore_list_for_KOKI_analysis.csv;
python MaChIAto_Analyzer/collect_MaChIAto_data.py \
-i example/MaChIAto_Classifier_output \
-o example/collections/Single_knockout_analysis \
-sc gttttagagctaggccaacatgaggatcacccatgtctgcagggcctagcaagttaaaataaggctagtccgttatcaacttggccaacatgaggatcacccatgtctgcagggccaagtggcaccgagtcggtgc \
-ol KO \
-t protospacer \
--ignore_list example/data/ignore_list_for_KOKI_analysis.csv;
Simple knock-in analysis: Prime Editing/calculation target: RT template (= edited right homology arm)
python MaChIAto_Analyzer/collect_MaChIAto_data.py \
-i example/MaChIAto_Classifier_output \
-o example/collections/PrimeEditing_RT_analysis \
-sc gttttagagctaggccaacatgaggatcacccatgtctgcagggcctagcaagttaaaataaggctagtccgttatcaacttggccaacatgaggatcacccatgtctgcagggccaagtggcaccgagtcggtgc \
-il PE \
-t ermh \
--ignore_list example/data/ignore_list_for_primeediting_analysis.csv;
python MaChIAto_Analyzer/collect_MaChIAto_data.py \
-i example/MaChIAto_Classifier_output \
-o example/collections/PrimeEditing_protospacer_analysis \
-sc gttttagagctaggccaacatgaggatcacccatgtctgcagggcctagcaagttaaaataaggctagtccgttatcaacttggccaacatgaggatcacccatgtctgcagggccaagtggcaccgagtcggtgc \
-il PE \
-t protospacer \
--ignore_list example/data/ignore_list_for_primeediting_analysis.csv;
Summary directory: The summary directory generated with “collect_MaChIAto_data.py”.
Output prefix: The directory into which the output directory is saved.
Name of extra data (optional): The name of the feature group that the next argument includes.
Table of extra data (optional): The pathname of extra data added by the user. The data should be a .csv file.
Rscript MaChIAto_Analyzer/MaChIAtoAnalyzer.R \
(Summary directory) \
(Output prefix) \
(Name of extra data) \
(Table of extra data);
Double knock-in analysis: PITCh and LoAD knock-in using MS2-tagged sgRNA/calculation target: both homology arm
Rscript MaChIAto_Analyzer/MaChIAtoAnalyzer.R \
example/collections/Double_knockin_analysis \
example/MaChIAto_Analyzer_output \
InDelphi \
example/data/bmh_extra_data_InDelphi.csv \
Accessibility \
example/data/bmh_extra_data_accessibility.csv \
Genome_Property \
example/data/bmh_extra_data_genomeprop.csv;
Rscript MaChIAto_Analyzer/MaChIAtoAnalyzer.R \
example/collections/Single_knockin_analysis \
example/MaChIAto_Analyzer_output \
InDelphi \
example/data/bmh_extra_data_InDelphi.csv \
Accessibility \
example/data/bmh_extra_data_accessibility.csv \
Genome_Property \
example/data/bmh_extra_data_genomeprop.csv;
Rscript MaChIAto_Analyzer/MaChIAtoAnalyzer.R \
example/collections/Simple_knockin_analysis \
example/MaChIAto_Analyzer_output \
InDelphi \
example/data/bmh_extra_data_InDelphi.csv \
Accessibility \
example/data/bmh_extra_data_accessibility.csv \
Genome_Property \
example/data/bmh_extra_data_genomeprop.csv;
Rscript MaChIAto_Analyzer/MaChIAtoAnalyzer.R \
example/collections/Double_knockout_analysis \
example/MaChIAto_Analyzer_output \
InDelphi \
example/data/bmh_extra_data_InDelphi.csv \
Accessibility \
example/data/bmh_extra_data_accessibility.csv \
Genome_Property \
example/data/bmh_extra_data_genomeprop.csv;
Rscript MaChIAto_Analyzer/MaChIAtoAnalyzer.R \
example/collections/Single_knockout_analysis \
example/MaChIAto_Analyzer_output \
InDelphi \
example/data/bmh_extra_data_InDelphi.csv \
Accessibility \
example/data/bmh_extra_data_accessibility.csv \
Genome_Property \
example/data/bmh_extra_data_genomeprop.csv;
Simple knock-in analysis: Prime Editing/calculation target: RT template (= edited right homology arm)
Rscript MaChIAto_Analyzer/MaChIAtoAnalyzer.R \
example/collections/PrimeEditing_RT_analysis \
example/MaChIAto_Analyzer_output;
Rscript MaChIAto_Analyzer/MaChIAtoAnalyzer.R \
example/collections/PrimeEditing_protospacer_analysis \
example/MaChIAto_Analyzer_output;
[Coming soon]
Summary directory: The summary directory generated with “collect_MaChIAto_data.py”.
Output prefix: The directory into which the output directory is saved.
Rscript MaChIAto_Reviewer/MaChIAtoReviewer.R
(Summary directory)
(Output prefix);
Double knock-in analysis: PITCh and LoAD knock-in using MS2-tagged sgRNA/calculation target: both homology arm
Rscript MaChIAto_Reviewer/MaChIAtoReviewer.R \
example/MaChIAto_Classifier_output \
example/MaChIAto_Aligner_output \
example/collections/Double_knockin_analysis \
example/MaChIAto_Reviewer_output;
Rscript MaChIAto_Reviewer/MaChIAtoReviewer.R \
example/MaChIAto_Classifier_output \
example/MaChIAto_Aligner_output \
example/collections/Single_knockin_analysis \
example/MaChIAto_Reviewer_output;
Rscript MaChIAto_Reviewer/MaChIAtoReviewer.R \
example/MaChIAto_Classifier_output \
example/MaChIAto_Aligner_output \
example/collections/Simple_knockin_analysis \
example/MaChIAto_Reviewer_output;
Rscript MaChIAto_Reviewer/MaChIAtoReviewer.R \
example/MaChIAto_Classifier_output \
example/MaChIAto_Aligner_output \
example/collections/Double_knockout_analysis \
example/MaChIAto_Reviewer_output;
Rscript MaChIAto_Reviewer/MaChIAtoReviewer.R \
example/MaChIAto_Classifier_output \
example/MaChIAto_Aligner_output \
example/collections/Single_knockout_analysis \
example/MaChIAto_Reviewer_output;
Simple knock-in analysis: Prime Editing/calculation target: RT template (= edited right homology arm)
Rscript MaChIAto_Reviewer/MaChIAtoReviewer.R \
example/MaChIAto_Classifier_output \
example/MaChIAto_Aligner_output \
example/collections/PrimeEditing_RT_analysis \
example/MaChIAto_Reviewer_output;
Rscript MaChIAto_Reviewer/MaChIAtoReviewer.R \
example/MaChIAto_Classifier_output \
example/MaChIAto_Aligner_output \
example/collections/PrimeEditing_protospacer_analysis \
example/MaChIAto_Reviewer_output;
[Coming soon]
[Coming soon]