Skip to content

pelingundogdu/mlfpm-esr9-d1.15

Repository files navigation

Project structure

├── environment.yml              <- The environment file
│
├── README.md                    <- The top-level README for developers using this project.
│
├── data
│   ├── external                 <- Data from third party sources.
│   ├── processed                <- The final, canonical data sets for modeling.
│   └── raw                      <- The original, immutable data dump.
│
├── models                       <- Trained models, model predictions, or model summaries
│   ├── NN                       <- training and testing models
│   └── CV                       <- cross-validation performance result
│
├── notebooks                    <- Jupyter notebooks. Naming convention is a number (for ordering),
│                                   the creator's initials, and a short `-` delimited description, 
│                                   e.g. `1.0-jqp-initial-data-exploration`
│
├── references                   <- Data dictionaries, manuals, and all other explanatory materials
│
├── reports                      <- Generated analysis as HTML, PDF, LaTeX, etc.
│   ├── activation               <- pathways activation results
│   ├── CV                       <- the results of cross-validation performance
│   ├── encoding                 <- the results of encoding infomation
│   ├── figures                  <- Generated graphics and figures to be used in reporting
│   └── retrieval                <- the results of retrieval performance
│
└── scripts                      <- The helper scripts

Project based on the cookiecutter data science project template. #cookiecutterdatascience

Codes in this repository covers the content from two articles, which are called Article#1 and Article#2.

Article Titles and Links

Article#1 -- Integrating pathway knowledge with deep neural networks to reduce the dimensionality in single-cell RNA-seq data ( 10.1186/s13040-021-00285-4 )

Article#2 -- Enhanced analysis and validation performances with multiple dataset to annotate cell type ( 10.5281/zenodo.5542325 )


Usage example,

  • 2-layer signaling pathway network definition with mouse - learning set
... $ python notebooks/4.0-pg-model.py
            -design pathways_2_layer
            -first_hidden_layer_pbk pbk_layer_mmu_sig.txt
            -first_hidden_layer_dense 0
            -second_hidden_layer True
            -optimizer SGD
            -activation tanh
            -ds processed/exper_mouse/mouse_learning_sw_gw.pck
            -analysis clustering
            -filter_gene_space False
            -hp_tuning False

About

Deliverable 1.15

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published