Project: Transcriptome Annotation of Equus caballus
_{^{User Manual}}
_{^{03-713: Bioinformatics Data Integration Practicum}}
_{^{Team-2: Taylor Ayers, Tao Luo, Lilin Huang, Sarah Oladejo}}

Prepare the directory

git clone https://github.com/luotao9728/annotation

This directory contains files:

start_pipeline.sh
build_index.sh
pipeline.sh
annotation.yml
README.md

Prepare the environment

Make sure Anaconda3 is installed on your computer
Make sure in your current working environment has the following packages:

sickle-trim (Trim illumina short reads)

LoRDEC (Fix long reads by short reads)

hisat2 (Short RNA-seq Alignment)

minimap2 (Long RNA-seq Alignment)

seqtk (Convert FASTA and FASTQ format)

SamTools (Sort and Convert sam to bam)

StringTie (Annotation)

Alternatively, you could directly create a new working conda environment using the following command (make sure you have annotation.yml file in your working directory):

conda env create -n annotation --file annotation.yml

conda activate annotation

Instruction for the pipeline

Requirements for input files

Reference genome: fasta/fna

illumina RNA-seq (forward/reverse): fastq

PacBio RNA-seq: fastq

Reference annotation: gff

Keyword: name of this annotation The input files should be in the annotation directory

Make sure you have the environment (with all packages) ready.
Download the input files into the cloned directory.
Execute the command and follow the prompt:

bash start_pipeline.sh

Follow the instructions to enter the file names.
Be patient. The annotation process may take a long time. Have a great day! :)

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Project: Transcriptome Annotation of Equus caballus
_{^{User Manual}}
_{^{03-713: Bioinformatics Data Integration Practicum}}
_{^{Team-2: Taylor Ayers, Tao Luo, Lilin Huang, Sarah Oladejo}}

Prepare the directory

Prepare the environment

Instruction for the pipeline

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 68 Commits
README.md		README.md
annotation.yml		annotation.yml
build_index.sh		build_index.sh
pipeline.sh		pipeline.sh
start_pipeline.sh		start_pipeline.sh

Folders and files

Latest commit

History

Repository files navigation

Project: Transcriptome Annotation of Equus caballus User Manual 03-713: Bioinformatics Data Integration Practicum Team-2: Taylor Ayers, Tao Luo, Lilin Huang, Sarah Oladejo

Prepare the directory

Prepare the environment

Instruction for the pipeline

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Project: Transcriptome Annotation of Equus caballus
_{^{User Manual}}
_{^{03-713: Bioinformatics Data Integration Practicum}}
_{^{Team-2: Taylor Ayers, Tao Luo, Lilin Huang, Sarah Oladejo}}

Packages