Skip to content

The codes of "the Third Generation Sequencing of 111 rice genomes reveals a huge size of novel genome sequence in the RICE PAN-genome"

Notifications You must be signed in to change notification settings

Minyoung-sss/TGSRICEPAN

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

9 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

TGSRICEPAN

Introduction

Rice is one of the most important crops for human. The third-generation sequencing (TGS, also called long-read sequencing, LRS) helps us assemble more high-quality genomes and construct more complete pan-genomes. Here are codes of the article "Long-read sequencing of 111 rice genomes reveals significantly larger pan-genomes".

Codes

The main pipelines are as following and the main self-writen scripts are in "scripts" directory.

Reads2Genomes.sh

The pipeline of preprocessing, assembling, polishing and scaffolding from raw long/short reads to assemblies.

Reads2SV.sh

The pipeline of SV calling, filtering and merging from long reads to SVs.

Gapfill.sh

The pipeline of filling gaps in Nipponbare genomes with corrected reads and assembled contigs.

Genomes2Pan.sh

The pipeline of constructing pan-genomes from genomes.

About

The codes of "the Third Generation Sequencing of 111 rice genomes reveals a huge size of novel genome sequence in the RICE PAN-genome"

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Python 68.6%
  • Shell 31.4%