Skip to content

shu251/pacificocean-metaT

Repository files navigation

pacificocean-metaT

Hosts website to code all code and analysis for Pacific Ocean-wide metatranscriptome survey study:

https://shu251.github.io/pacificocean-metaT/

Updates

June 2023

  • Converting to quarto document. Previously Rmd files will remain or be placed in ARCHIVE. Any exisiting code on HPC will be removed and replaced with individual .R scripts from here. Quarto document will show entire code as blocks and explanation

Sequences

Sequence read table, 03-abundance_tables/ReadTable_ByContig.csv, has raw counts for all sequences that hit contigs (via Salmon).

to do: sequence count table includes sequences that did not have any annotations. This % needs to be determined and these sequences should be placed somewhere else for now. Query the annotation table (below), for number of sequences with annotations.

Taxonomy and function information

This file, 02-annotation_table/TaxonomicAndFunctionalAnnotations.csv, includes the sequence IDs that have taxonomy and/or function.

Within the annotation table, there are duplicated ShortSeqIDs. Salmon count table is generated by running reads agains nucleotide sequences. Taxonomy is determined by EUKulele and uses proteins. Since sometimes there are multiple ORFs on the same contig (Transdecoder).

Determine transcript level gene counts

Using tximport and DESeq2 for proper library normalization.

About

No description, website, or topics provided.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages