Align sequencing reads or peptides to this modified and heavily curated UniRef database to get simultaneous quantitative community phylogeny and functions.

TealFurnholm/Universal_Microbiomics_Alignment_Database

UMRAD: Universal Multiomics Reference and Alignment Database

This script compiles the data from databases 1-3 below, plus functional information from UniProt and other public databases, to annotate the UniRef100 set of deduplicated proteins. Collectively, these annotations make it possible to unify various meta-omics studies: metabolomes, metaproteomes, metatranscriptomes, and metagenomes. This is #4 in a series of pipelines that together make up this Universal Reference for 'omics data analysis.
1. Universal Taxonomy Database: found here
2. Universal Compounds Database: found here
3. Universal Reactions Database: found here
4. Universal Protein Alignment Database: this repository
5. Universal ncRNA Alignment Database: found here

Goal

Ideally there should be a Universal Reference Database that:

  • can be used on any NGS/MS data type
  • is composed of unified compound, protein, reaction, and pathway reference data
  • has functional annotations and cross-linked IDs that are as complete as possible
  • covers all kingdoms of life: eukaryotes, bacteria, archaea, and MoNA (mobile nucleic acids: viruses, phage, plasmids, IS, constructs)
  • and thus relies on a well-curated and standardized taxonomy database

UMRAD usage:

  1. With this data you can align any NGS reads or proteomics data using the DIAMOND alignment software and get simultaneous whole-community phylogenetic and functional outputs.
  2. A centralized repo allows cross-study comparisons, since all studies would share the same compound, function, pathway, protein, and taxon IDs.
  3. Because it contains all sequenced organisms, it can be used on environmental and host-associated data sets alike.
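As a rough illustration of the first usage point, the sketch below shows how alignment hits could be turned into taxon and function counts in a single pass. It assumes DIAMOND's default 12-column tabular output (`--outfmt 6`). The subject IDs, the annotation dictionary, and the `best_hits` and `summarize` helpers are hypothetical and only stand in for whatever annotation layout the database actually ships with. Counting one best hit per read is a simplification, not the pipeline's own quantification method.

```python
import csv
import io
from collections import Counter

# Column names of DIAMOND's default tabular output (--outfmt 6).
COLUMNS = ["qseqid", "sseqid", "pident", "length", "mismatch", "gapopen",
           "qstart", "qend", "sstart", "send", "evalue", "bitscore"]

# Hypothetical alignment rows; real subject IDs would be UniRef100 entries.
EXAMPLE_TSV = "\n".join("\t".join(fields) for fields in [
    ["read1", "UniRef100_EXAMPLE1", "95.0", "50", "2", "0",
     "1", "150", "10", "59", "1e-20", "90.1"],
    ["read1", "UniRef100_EXAMPLE2", "80.0", "50", "10", "0",
     "1", "150", "10", "59", "1e-10", "60.0"],
    ["read2", "UniRef100_EXAMPLE2", "88.0", "50", "6", "0",
     "1", "150", "20", "69", "1e-15", "75.5"],
    ["read3", "UniRef100_EXAMPLE3", "70.0", "50", "15", "0",
     "1", "150", "5", "54", "1e-5", "40.0"],
])

# Hypothetical annotation table: subject ID -> taxon and function IDs.
EXAMPLE_ANNOTATIONS = {
    "UniRef100_EXAMPLE1": {"taxon": "Taxon_A", "functions": ["Func_1", "Func_2"]},
    "UniRef100_EXAMPLE2": {"taxon": "Taxon_B", "functions": ["Func_1"]},
}


def best_hits(handle):
    """Return {query ID: subject ID}, keeping the highest-bitscore hit per query."""
    best = {}
    for row in csv.reader(handle, delimiter="\t"):
        if not row:
            continue  # skip blank lines
        hit = dict(zip(COLUMNS, row))
        score = float(hit["bitscore"])
        query = hit["qseqid"]
        if query not in best or score > best[query][1]:
            best[query] = (hit["sseqid"], score)
    return {query: subject for query, (subject, _) in best.items()}


def summarize(hits, annotations):
    """Count reads per taxon and per function; unannotated subjects are skipped."""
    taxa, functions = Counter(), Counter()
    for subject in hits.values():
        info = annotations.get(subject)
        if info is None:
            continue
        taxa[info["taxon"]] += 1
        for function_id in info["functions"]:
            functions[function_id] += 1
    return taxa, functions


if __name__ == "__main__":
    hits = best_hits(io.StringIO(EXAMPLE_TSV))
    taxa, functions = summarize(hits, EXAMPLE_ANNOTATIONS)
    print(dict(taxa))
    print(dict(functions))
```

Because every read is resolved to one annotated reference protein, the same pass yields both the taxonomic and the functional profile. Count tables built this way from different studies can be compared directly, since they are keyed by the same IDs.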

Get Started HERE
