Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Create robert-allaway-project #121

Merged

Conversation

allaway
Copy link
Contributor

@allaway allaway commented Jul 22, 2022

Adding a new project for personal testing and pilot research projects.

@allaway allaway requested a review from a team as a code owner July 22, 2022 19:59
@BrunoGrandePhD
Copy link
Contributor

BrunoGrandePhD commented Jul 25, 2022

@allaway: I hesitate to merge this because AWS imposes a hard limit of 50 AWS Batch compute environments (CEs) per account. This is something we discovered relatively recently. Since each Tower project creates 3 CEs and we already have 10 active Tower projects in nextflow-prod, we're getting close to our limit, so I want to ask some follow-up questions.

  • Would one of your existing Tower projects make sense for the testing you plan on carrying out?
  • If not, do you plan on processing PHI or controlled-access data? (Asking to see if you need a production-grade account)
  • How urgent is this request? (Asking to see if we can spin up a new AWS account, which will likely take a few weeks)

Copy link
Contributor

@BrunoGrandePhD BrunoGrandePhD left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I forgot to post my previous comment as a review. Doing this now to silence GitHub reminders.

@allaway
Copy link
Contributor Author

allaway commented Jul 25, 2022

Hi Bruno,

Thanks for the info! I did not realize that otherwise I would have been more parsimonious with our accounts.

Would one of your existing Tower projects make sense for the testing you plan on carrying out?

Unfortunately, this is for an unfunded pilot project, so I don't want these costs to be billed against/recorded as being part of the other projects, which all have distinct & different program codes.

If not, do you plan on processing PHI or controlled-access data? (Asking to see if you need a production-grade account)

Yes, the data are from dbGaP and EGA :)

How urgent is this request? (Asking to see if we can spin up a new AWS account, which will likely take a few weeks)

It would be very helpful to be able to process the data sooner so that we can send the counts to some external collaborators who are working on it. If I am not able to use Tower, I would probably resort to setting up a beefy EC2 instance with service catalog and running everything there, but I think that is probably less ideal.

I'm curious, let's say I process everything within a week or two and then can deposit the reads on Synapse and "tear down" the project. Does this allow you to recover the CEs?

Cheers,
Robert

@BrunoGrandePhD
Copy link
Contributor

Yes, there is a way to decommission a Tower project (i.e. delete the CEs) while maintaining read-access to the workflow runs. Since this is intended as a short-lived effort, I'll deploy the project. You don't need to rush anything, but I might ping you if we run into our CE limit (which is unlikely in the coming months).

@BrunoGrandePhD BrunoGrandePhD merged commit 69546d6 into Sage-Bionetworks-Workflows:main Jul 26, 2022
@allaway
Copy link
Contributor Author

allaway commented Jul 26, 2022

That sounds good. Thank you Bruno!

@allaway allaway deleted the allaway/new-proj branch July 26, 2022 23:58
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants