Unifying base data object #49

RemyLau · 2022-11-08T16:39:56Z

Currently, there are several different dataset objects specialized for each task and model (e.g., CellTypeDataset, ClusteringDataset), each of them takes a variety of specialized arguments that are not directly related to the underlying data, e.g., save path, processing scheme, choice of tissue. This complexity makes it quite hard to maintain the code base and implement new methods/datasets.

To improve this situation, we need to isolate raw dataset objects from transformation/processing methods.

Base data object
- Take AnnData as an input and save it as a private attribute (read-only?).
- Construct data loaders that load g, x, y, etc., to be passed to the model for training/evaluation.
Dataset object
- Download option
- Transformation option
- Dataset from paper (preprocessed) -> used to benchmark the reproducibility of the reimplemented model
Transformation
- Leverage functionalities from scanpy (recall that now the base data object store an AnnData object as a (private) attribute

To fix

Single modality

Spatial

examples/spatial/spatial_domain/stagate.py (update stagate example script to use dance data object #127)
examples/spatial/spatial_domain/louvain.py (update louvain example script to use dance dataobject #124)
examples/spatial/spatial_domain/stlearn.py (update stlearn example script to use dance data object #126)
examples/spatial/spatial_domain/spagcn.py (refactor spagcn graph construct; use dance data object in the example spagcn script #83)
examples/spatial/cell_type_deconvo/spotlight.py (update spotlight example script to use dance data object #107)
examples/spatial/cell_type_deconvo/dstg.py (update dstg example script to use dance data, refactor dstg graph construct #103)
examples/spatial/cell_type_deconvo/card.py (update card cell-type deconvolution example script using dance data object #93)
examples/spatial/cell_type_deconvo/spatialdecon.py (update spatialdeconv to use dance data object #94)

Multi modality

The text was updated successfully, but these errors were encountered:

RemyLau added the Priority-P0 Top priority label Dec 31, 2022

RemyLau pinned this issue Dec 31, 2022

RemyLau mentioned this issue Dec 31, 2022

Some questions about general wrapper for datasets #38

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Unifying base data object #49

Unifying base data object #49

RemyLau commented Nov 8, 2022 •

edited

Loading

Unifying base data object #49

Unifying base data object #49

Comments

RemyLau commented Nov 8, 2022 • edited Loading

To fix

Single modality

Spatial

Multi modality

RemyLau commented Nov 8, 2022 •

edited

Loading