Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

GHA for doublet-detection module #462

Merged
merged 15 commits into from
May 28, 2024

Conversation

sjspielman
Copy link
Member

Purpose/implementation Section

Please link to the GitHub issue that this pull request addresses.

#461

Note that should be considered "stacked on" #454

What is the goal of this pull request?

This PR adds a GHA for the module, including a comment placeholder for we'll probably want to download data eventually, later in this module's development.

Is there anything that you want to discuss further?

It may be too early in this module's development to be filing this PR, so it's fine if it hangs out for a little bit since there shouldn't be any conflicts!.

Author checklists

Analysis module and review

Reproducibility checklist

  • Code in this pull request has been added to the GitHub Action workflow that runs this module.
  • The dependencies required to run the code in this pull request have been added to the analysis module Dockerfile.
  • If applicable, the dependencies required to run the code in this pull request have been added to the analysis module conda environment.yml file.
  • If applicable, R package dependencies required to run the code in this pull request have been added to the analysis module renv.lock file.

Co-authored-by: Joshua Shapiro <josh.shapiro@ccdatalab.org>
@allyhawkins
Copy link
Member

Just noting that I don't think this check actually ran because you didn't change any of the files in the module directory. To test, you probably just want to make a minor change and then revert before merging.

@sjspielman
Copy link
Member Author

To test, you probably just want to make a minor change and then revert before merging.

Yes, definitely! I probably should have marked this as a Draft PR, since it can't be tested until #454 goes in. I'll go ahead and draft this now.

@sjspielman sjspielman marked this pull request as draft May 24, 2024 16:58
@sjspielman sjspielman removed the request for review from allyhawkins May 24, 2024 17:33
@jashapiro
Copy link
Member

jashapiro commented May 24, 2024

While I support adding a Rproj file in 72c0048, this is also an argument for using rprojroot::find_root(rprojroot::is_renv_project) or similar for more explicit root finding than here::here()

@sjspielman
Copy link
Member Author

sjspielman commented May 24, 2024

this is also an argument for using rprojroot::find_root(rprojroot::is_renv_project) or similar for more explicit root

Indeed!!! Will wait for after this build. If renv doesn't get cached...oooof.

@sjspielman
Copy link
Member Author

@jashapiro do you have any insight into why the conda-lock environment doesn't seem to be fully activated/available here? https://github.com/AlexsLemonade/OpenScPCA-analysis/actions/runs/9270012159/job/25502032871#step:9:15143

Co-authored-by: Joshua Shapiro <josh.shapiro@ccdatalab.org>
@sjspielman sjspielman marked this pull request as ready for review May 28, 2024 15:41
@sjspielman
Copy link
Member Author

GHA passed! ✅

Copy link
Member

@allyhawkins allyhawkins left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM!

@sjspielman
Copy link
Member Author

Wanted to get an opinion here before merging @allyhawkins @jashapiro - running this workflow takes quite a bit of time! I'm wondering if, for now, we want to remove the trigger to run on every change to this module, and instead just add to the workflow that runs all modules. Notably, I did not actually add to the run-all workflow in this PR, so I suppose I'm also seeking opinions on should I go ahead and do that!

@jashapiro
Copy link
Member

jashapiro commented May 28, 2024

Wanted to get an opinion here before merging @allyhawkins @jashapiro - running this workflow takes quite a bit of time! I'm wondering if, for now, we want to remove the trigger to run on every change to this module, and instead just add to the workflow that runs all modules. Notably, I did not actually add to the run-all workflow in this PR, so I suppose I'm also seeking opinions on should I go ahead and do that!

You are running the workflow on all your benchmarking datasets, in full. This is not something we want to have happen in general. The goal of these actions is to see that the code works with example data, so they should not be running with real data in GHA.

If you have a small dataset that you can download and run instead, that would be the preferred solution: you can set a ENV variable to select that option when running your wrapper script. At a minimum, you should run the script on only one dataset, but it would be better to find something completely different, if you can.

For now, I would probably completely disable this action. In general, I don't think benchmarking is going to be something we want running repeatedly in GHA, so it will not be a big problem. When you get to running on the ScPCA samples, you will use simulated data anyway.

@sjspielman
Copy link
Member Author

If you have a small dataset that you can download and run instead, that would be the preferred solution: you can set a ENV variable to select that option when running your wrapper script.

I had been wondering about this too. I think this is a separate PR, to not automatically download the full benchmark zip, but include an option to use smaller zip file with dummy data. I'd also have to change the dataset array here, too.

For now, I'll just turn it off. Thanks!

@sjspielman sjspielman merged commit ce1b734 into AlexsLemonade:main May 28, 2024
2 checks passed
@sjspielman sjspielman deleted the sjspielman/461-doublet-gha branch May 28, 2024 18:50
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

None yet

3 participants