Earwig_genome_project

This repository contains all the scripts used to assemble and annotate the Earwig genome. The pipeline is presented in three parts:

Genome assembly
Denovo repeat library
Genome annotation

1. Genome assembly

Genome is assembled using linked reads from 10x chromium and long reads from Oxford nanopore. Long and linked reads were individually assembled and then merged together. After multiple iterations of scaffolding, gapclosing, and haplotigs and contaminants removal, assembly was polished with mRNA-seq reads to obtain final assembly. Schematic representation in figure below (Created with BioRender.com).

Workflow and Scripts:

2. Denovo repeat library

A comprehensive denovo repeat library is prepared for the assembled genome. It was used for repeat content analysis, repeat masking and as input for annotation pipeline.

Workflow:

Repeat library preparation
Concatenating, filtering, and classifying repeats
1. RepeatClassifier
Repeat masking the genome
1. RepeatMasker

3. Genome annotation

Maker2 pipeline is used for genome annotation. mRNA-seq data is denovo assembled using Trinity. Other relavant publicly available datasets were downloaded and used as input.

Name		Name	Last commit message	Last commit date
Latest commit History 102 Commits
Denovo_repeat_library		Denovo_repeat_library
Genome_annotation		Genome_annotation
Genome_assembly		Genome_assembly
LICENSE		LICENSE
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Earwig_genome_project

1. Genome assembly

2. Denovo repeat library

3. Genome annotation

About

Releases

Packages

Languages

License

upendrabhattarai/Earwig_genome_project

Folders and files

Latest commit

History

Repository files navigation

Earwig_genome_project

1. Genome assembly

2. Denovo repeat library

3. Genome annotation

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages