Date: 17-Jul-2023
Author: Moeko Okada
List of commands used in the polyploidy summer school.
- Data download
- Quality control
- Homoeologous Mapping
- Read sorting by EAGLE-RC
- Read count
- read count
- make a count matrix
$ cd src
src $ wget -O genome.tar.gz https://drive.switch.ch/index.php/s/ZQzcTZ5lGJEcCbA/download
src $ ls
genome.tar.gz scripts
src $ tar xvzf genome.tar.gz
src $ ls
genome genome.tar.gz scripts
src $
src $ wget -O fastq.tar.gz https://drive.switch.ch/index.php/s/0ny11xweoA5WhEX/download
src $ ls
fastq.tar.gz genome genome.tar.gz scripts
src $ tar xvzf fastq.tar.gz
src $ ls
fastq fastq.tar.gz genome genome.tar.gz scripts
src $ ls fastq
MUR_1_R1_ds.fastq.gz MUR_3_R1_ds.fastq.gz MUR_R48h_2_R1_ds.fastq.gz trimmomatic_adapters.txt
MUR_2_R1_ds.fastq.gz MUR_R48h_1_R1_ds.fastq.gz MUR_R48h_3_R1_ds.fastq.gz
src $
Use a shell script to run all samples.
Make sure you are in the "src" directory.
src $ bash scripts/1_qc.sh
src $
Use a shell script to run all samples.
Make sure you are in the "src" directory.
src $ bash scripts/2_map.sh
src $
Use EAGLE-RC software to classify reads.
Use a shell script to run all samples.
Make sure you are in the "src" directory.
src $ bash scripts/3_eagle.sh
src $
Count the number of reads using featureCounts software.
Use a shell script to run all samples.
Make sure you are in the "src" directory.
src $ bash scripts/4_count.sh
src $
src $ cd 4_count
4_count $ (echo "gene_id";ls *_hal_counts.txt; ) | sed -e s/_hal_counts.txt//g | (awk '{if (NR > 1) printf "\t"} {printf "%s_hal", $0}' && echo) > hal_counts.tsv
4_count $ (echo "gene_id";ls *_lyr_counts.txt; ) | sed -e s/_lyr_counts.txt//g | (awk '{if (NR > 1) printf "\t"} {printf "%s_hal", $0}' && echo) > lyr_counts.tsv
4_count $ python /tmp/eagle/scripts/tablize.py -skip 1 -a -i 0 -c 6 *_hal_counts.txt >> hal_counts.tsv
4_count $ python /tmp/eagle/scripts/tablize.py -skip 1 -a -i 0 -c 6 *_lyr_counts.txt >> lyr_counts.tsv
4_count $