-
Notifications
You must be signed in to change notification settings - Fork 8
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
GHA for doublet-detection module #462
GHA for doublet-detection module #462
Conversation
Co-authored-by: Joshua Shapiro <josh.shapiro@ccdatalab.org>
Just noting that I don't think this check actually ran because you didn't change any of the files in the module directory. To test, you probably just want to make a minor change and then revert before merging. |
Yes, definitely! I probably should have marked this as a Draft PR, since it can't be tested until #454 goes in. I'll go ahead and draft this now. |
While I support adding a Rproj file in 72c0048, this is also an argument for using |
Indeed!!! Will wait for after this build. If |
@jashapiro do you have any insight into why the conda-lock environment doesn't seem to be fully activated/available here? https://github.com/AlexsLemonade/OpenScPCA-analysis/actions/runs/9270012159/job/25502032871#step:9:15143 |
Co-authored-by: Joshua Shapiro <josh.shapiro@ccdatalab.org>
GHA passed! ✅ |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM!
Wanted to get an opinion here before merging @allyhawkins @jashapiro - running this workflow takes quite a bit of time! I'm wondering if, for now, we want to remove the trigger to run on every change to this module, and instead just add to the workflow that runs all modules. Notably, I did not actually add to the run-all workflow in this PR, so I suppose I'm also seeking opinions on should I go ahead and do that! |
You are running the workflow on all your benchmarking datasets, in full. This is not something we want to have happen in general. The goal of these actions is to see that the code works with example data, so they should not be running with real data in GHA. If you have a small dataset that you can download and run instead, that would be the preferred solution: you can set a ENV variable to select that option when running your wrapper script. At a minimum, you should run the script on only one dataset, but it would be better to find something completely different, if you can. For now, I would probably completely disable this action. In general, I don't think benchmarking is going to be something we want running repeatedly in GHA, so it will not be a big problem. When you get to running on the ScPCA samples, you will use simulated data anyway. |
I had been wondering about this too. I think this is a separate PR, to not automatically download the full benchmark zip, but include an option to use smaller For now, I'll just turn it off. Thanks! |
Purpose/implementation Section
Please link to the GitHub issue that this pull request addresses.
#461
Note that should be considered "stacked on" #454
What is the goal of this pull request?
This PR adds a GHA for the module, including a comment placeholder for we'll probably want to download data eventually, later in this module's development.
Is there anything that you want to discuss further?
It may be too early in this module's development to be filing this PR, so it's fine if it hangs out for a little bit since there shouldn't be any conflicts!.
Author checklists
Analysis module and review
README.md
has been updated to reflect code changes in this pull request.Reproducibility checklist
Dockerfile
.environment.yml
file.renv.lock
file.