Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Implement reference mapping components #138

Merged
merged 63 commits into from
Dec 28, 2022
Merged

Conversation

VladimirShitov
Copy link
Collaborator

  • Implement integrate/scarches component to map query data to the reference (currently only supports HLCA)
  • Implement integrate/scvi_tools component to build reference and map query data on it
  • Add scVI model support for integrate/scvi_tools
  • Add scanVI model support for integrate/scvi_tools
  • Add totalVI model support for integrate/scvi_tools
  • Add scripts for downloading test files for reference mapping

@VladimirShitov
Copy link
Collaborator Author

The repository is ready for review. I removed unnecessary details and fixed bugs. Local tests are running fine, but the test pipeline will fail due to the lack of test files. @rcannood , can you please run the script workflows/resources_test_scripts/hlca_reference_model.sh and put the resulting files on AWS so that test script could use them?

Copy link
Member

@DriesSchaumont DriesSchaumont left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Thanks for the PR! Well structured 👍 I left some minor comments/questions.

src/integrate/scarches/config.vsh.yaml Outdated Show resolved Hide resolved
src/integrate/scarches/config.vsh.yaml Show resolved Hide resolved
src/integrate/scarches/script.py Show resolved Hide resolved

with tempfile.TemporaryDirectory() as directory:
model_path = _download_HLCA_reference_model(directory)
vae_query, adata_query = map_to_existing_reference(adata_query, model_path=model_path)
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

True. One way to do this is to make the reference type file and set the default to the url (https://zenodo.org/record/6337966/files/HLCA_reference_model.zip). However, this also means that if we want to pass another reference, this also needs to be a path/url. FYI: if we upload them to s3, s3:// urls are also supported. I think just using the zenodo URL is a better solution.

Correct me if im wrong @rcannood

@DriesSchaumont
Copy link
Member

LGTM! Very nice job @VladimirShitov, thanks!

@DriesSchaumont DriesSchaumont merged commit f83e6c7 into main Dec 28, 2022
@DriesSchaumont DriesSchaumont deleted the Implement_scArches branch January 19, 2023 08:29
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

None yet

4 participants