Skip to content
AOE (All Of gene Expression): Index and meta-analysis of gene expression data that works
Perl Shell Common Workflow Language
Branch: master
Clone or download
Fetching latest commit…
Cannot retrieve the latest commit at this time.
Type Name Latest commit message Commit time
Failed to load latest commit information. modified for new DBCLS SRA API Sep 4, 2019 Update Dec 15, 2017 version3 Aug 24, 2018 update for the new API Sep 5, 2019 GSE column added Apr 12, 2018 Bug fixed Apr 12, 2018 updated AOE tab generation script Jul 28, 2017 instrument_model commented Jul 19, 2018 Add missing scripts Jan 25, 2019 Add to generate aetab for AOE Apr 4, 2018 Add missing scripts Jan 25, 2019 Add missing scripts Jan 25, 2019 Add missing scripts Jan 25, 2019 AOE level3 added Apr 12, 2019 AOE level3 added Apr 12, 2019 update for the integration Sep 6, 2019 Add files via upload Dec 20, 2018 added Jul 19, 2018 Add files via upload Jul 17, 2018
gethoge-and-pigz.cwl Integration of two workflows Dec 13, 2018
perl-gethoge.cwl Integration of two workflows Dec 13, 2018
pigz.cwl Integration of two workflows Dec 13, 2018
xRX2instrument_model.txt.gz xRX(SRA experiment) -> instrument_model relation Jul 28, 2017

All of gene expression (AOE)

All of gene expression (AOE) has been an index for public transcriptome database.

The version 2 of AOE includes scripts to extract transcriptome sequencing records from Sequence Read Archive (SRA). Data extracted from SRA will be merged with current version of AOE(AOE1). API for SRA data by DBCLS SRA project is fully used to generate the data.

Currently transcriptome data from NCBI GEO is not included in EBI ArrayExpress (since 2017), and we are going to integrate the data from GEO into AOE utilizing DBCLS SRA API.

From ArrayExpress

  • Generate AOE tab file from ArrayExpress mirror at DDBJ.

Output Example:

ID ProjID AEID Description Date ArrayType ArrayGroup Technology Instrument NGSGroup Organisms Rep_organism
1 NA E-DORD-69 Translation profiling of Arabidopsis cell cultures exposed to elevated temperature and high salinity 2010-07-07 Agilent Arabidopsis 3 Oligo Microarray 4x44K 015059 G2519F (Gene ID version)(A-DORD-1)[24] Agilent array assay NA NA Arabidopsis thaliana[24] Arabidopsis thaliana


  • Extract GEO associated SRA/BioProject/BioSample data from DBCLS SRA API.
  • Make subset of AOE tab file from extracted json file(s).
  • Get metadata from D/E/SRX (Experiment).

From RNA-seq data not in GEO and ArrayExpress (DBCLS SRA API)

  • Extract Instrument_model from SRA Experiment data.
  • Fetch BioProject JSON by IDs via DBCLS SRA API.
You can’t perform that action at this time.