All data is organized into three main directories:
-
code/
Contains all scripts/code for each tool and process. -
data/
Stores outcomes, statistics, and plots in formats like.html,.txt,.csv, and.pdf. -
images/
Exclusively for plots in.pngformat (GitHub wiki-friendly).- Note: R-generated plots are found only here, while other plots (for example
.pdf) are also available inData/.
- Note: R-generated plots are found only here, while other plots (for example
Paper: Chromosome-level genome assembly of Niphotrichum japonicum provides new insights into heat stress responses in mosses
Authors: Xuping Zhou, Tao Peng, Yuying Zeng, Yuqing Cai, Qin Zuo, Li Zhang, Shanshan Dong, Yang Liu
This paper presents a high-quality de novo chromosome-level genome assembly of the moss Niphotrichum japonicum to understand heat defense mechanisms induced by gene activity. Nanopore long reads combined with Hi-C scaffolding produced 14 pseudochromosomes, followed by comparative genomics. Both previously known and species-specific genes showed differential expression under 12 hours of heat stress exposure. These discoveries are crucial for understanding resilient plant responses to extreme conditions.
In this project, I will attempt to recreate the results by:
- Building chromosome 3 of Niphotrichum japonicum
- Performing differential expression analysis
-
Reads Pre-processing
- FastQC
- Trimmomatic
-
Genome Assembly
- FLYE
-
Assembly Quality Assessment
- BUSCO
- QUAST
-
DNA Mapping
- BWA
-
Genome Polishing
- Pilon
-
Polish Quality Assessment
- QUAST
-
Polished DNA Mapping
- BWA
-
Duplicate Marking
- Samtools
-
Hi-C Scaffolding (extra analysis)
- YAHS
- Juicer
-
Scaffold Quality Assessment
- QUAST
-
Repeat Masking
- RepeatMasker
-
RNA Mapping
- hisat2
-
Structural Annotation
- BRAKER2
-
Functional Annotation
- eggNOGmapper
-
Differential Expression Analysis
- featureCounts
- DESeq2
-
Chloroplast Assembly & Analysis (extra analysis)
- GetOrganelle
- QUAST