
Add support for selecting minimum CPU platform on GCP #1633

Merged: 14 commits, Jun 22, 2020

Conversation

@hnawar (Contributor) commented Jun 16, 2020

This PR adds support for specifying the minimum CPU platform used by a run when using the Google Cloud Life Sciences executor.

@pditommaso (Member)

Thanks for submitting this PR. I was thinking this would make more sense at the process (job) level instead of pipeline-wide. What do you think?

@hnawar (Contributor, Author) commented Jun 17, 2020

I thought about that. There is currently no additional cost for specifying Skylake, and Cascade Lake is only available on N2 instances, so users would need to specify a different machineType altogether.

The setting specifies the minimum CPU platform; even when it is not specified, you are still likely to end up on Skylake.

It was mainly easier for me to implement this at the profile level rather than at the process level, and the benefit of process-level support did not seem worth it.

I'll check with Andrew to get his opinion.

@hnawar (Contributor, Author) commented Jun 18, 2020

I had a quick sanity check with our team, and I think pipeline-level config should be sufficient.
Let me know if you are happy with that.

@pditommaso (Member)

OK, that's reasonable. In that case, the value needs to be read from the Nextflow config file.

See for example here:

final boolean disableBinDir = config.navigate('google.lifeSciences.disableRemoteBinDir',false)
final preemptible = config.navigate("google.lifeSciences.preemptible", false) as boolean
final bootDiskSize = config.navigate('google.lifeSciences.bootDiskSize') as MemoryUnit
final sshDaemon = config.navigate('google.lifeSciences.sshDaemon', false) as boolean
final sshImage = config.navigate('google.lifeSciences.sshImage', DEFAULT_SSH_IMAGE) as String
final copyImage = config.navigate('google.lifeSciences.copyImage', DEFAULT_COPY_IMAGE) as String
final debugMode = config.navigate('google.lifeSciences.debug', System.getenv('NXF_DEBUG'))
final privateAddr = config.navigate('google.lifeSciences.usePrivateAddress') as boolean
final requesterPays = config.navigate('google.enableRequesterPaysBuckets') as boolean
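
Following the same pattern, the new option could be read with one more line; a minimal sketch, using the property name settled on later in this thread:

final cpuPlatform = config.navigate('google.lifeSciences.cpuPlatform') as String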

Also, unit tests should be added for the corresponding changes. It would also be nice to add an entry in the docs for the Google Life Sciences executor.

Tx!

@hnawar (Contributor, Author) commented Jun 18, 2020

Thanks,
I added the cpuPlatform option to the Groovy config.
I ran some small tests after compiling from scratch and verified that it is passed correctly to the API.
This is how it looks in my profile to specify Skylake:
google.lifeSciences.cpuPlatform = 'Intel Skylake'
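
For context, a fuller profile might look like the following; a minimal sketch, where the project, region and bucket names are placeholders:

// sketch of a nextflow.config using the new option; project, region and bucket are placeholders
google {
    project = 'my-project'
    region  = 'europe-west2'
    lifeSciences.cpuPlatform = 'Intel Skylake'
}
process.executor = 'google-lifesciences'
workDir = 'gs://my-bucket/work'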

I've added it to the documentation as an example, since it will be the most common option for most users.

I'll look at how to add unit tests as well, but I could certainly use some help on that.

[Review thread on docs/google.rst: outdated, resolved]
@pditommaso (Member)

Thanks for these changes. For the tests, it could be enough to verify that cpuPlatform is read correctly from the config; for example, look here:

def 'should set requester pays' () {
    when:
    def config = GoogleLifeSciencesConfig.fromSession0([google:[project:'foo', region:'x', lifeSciences: [:]]])
    then:
    config.enableRequesterPaysBuckets == false

    when:
    config = GoogleLifeSciencesConfig.fromSession0([google:[project:'foo', region:'x', enableRequesterPaysBuckets:true]])
    then:
    config.enableRequesterPaysBuckets == true
}
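
An analogous test for the new option could look like this; a minimal sketch, assuming cpuPlatform is exposed as a property on GoogleLifeSciencesConfig:

def 'should set cpu platform' () {
    when:
    def config = GoogleLifeSciencesConfig.fromSession0([google:[project:'foo', region:'x', lifeSciences: [:]]])
    then:
    config.cpuPlatform == null

    when:
    config = GoogleLifeSciencesConfig.fromSession0([google:[project:'foo', region:'x', lifeSciences: [cpuPlatform: 'Intel Skylake']]])
    then:
    config.cpuPlatform == 'Intel Skylake'
}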

and then that it is included in the request; for example, look here:

when:
def req = handler.createPipelineRequest()
then:
task.getConfig() >> new TaskConfig(machineType: 'n1-1234')
and:
req.machineType == 'n1-1234'
req.project == 'my-project'
req.zone == ['my-zone']
req.region == ['my-region']
req.diskName == GoogleLifeSciencesTaskHandler.DEFAULT_DISK_NAME
req.diskSizeGb == null
!req.preemptible
req.taskName == "nf-bad893071e9130b866d43a4fcabb95b6"
req.containerImage == 'my/image'
req.workDir.toUriString() == 'gs://my-bucket/work/dir'
req.sharedMount.getPath() == '/work/dir'
req.sharedMount.getDisk() == GoogleLifeSciencesTaskHandler.DEFAULT_DISK_NAME
!req.sharedMount.getReadOnly()
req.bootDiskSizeGb == null
req.entryPoint == GoogleLifeSciencesConfig.DEFAULT_ENTRY_POINT
!req.usePrivateAddress
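
For the new option, a matching assertion could be added alongside the existing ones; a sketch, assuming the submit request gains a cpuPlatform field populated from the pipeline-level config:

and:
// assumes the stubbed executor config sets cpuPlatform = 'Intel Skylake'
req.cpuPlatform == 'Intel Skylake'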

and

def 'should configure resources correctly'() {
    given:
    def SCOPES = ["https://www.googleapis.com/auth/cloud-platform"]
    def type = "testType"
    def zone = ["testZone1","testZone2"]
    def region = ["testRegion1","testRegion2"]
    def diskName = "testDisk"
    def preEmptible = true
    def acc = new AcceleratorResource(request: 4, type: 'nvidia-tesla-k80')
    def helper = new GoogleLifeSciencesHelper()

    when:
    def resources1 = helper.createResources(new GoogleLifeSciencesSubmitRequest(
            machineType:type,
            zone:zone,
            diskName: diskName,
            diskSizeGb: 100,
            preemptible: true))
    then:
    with(resources1) {
        getVirtualMachine().getMachineType() == type
        getZones() == zone
        getRegions() == null
        getVirtualMachine().getDisks().get(0).getName() == diskName
        getVirtualMachine().getDisks().get(0).getSizeGb() == 100
        getVirtualMachine().getServiceAccount().getScopes() == SCOPES
        getVirtualMachine().getPreemptible() == preEmptible
        !getVirtualMachine().getAccelerators()
        !getVirtualMachine().getNetwork()?.getUsePrivateAddress()
    }

    when:
    def resources2 = helper.createResources(new GoogleLifeSciencesSubmitRequest(
            machineType:type,
            region:region,
            diskName:diskName,
            diskSizeGb: 200,
            preemptible: true))
    then:
    with(resources2) {
        getVirtualMachine().getMachineType() == type
        getZones() == null
        getRegions() == region
        getVirtualMachine().getDisks().get(0).getName() == diskName
        getVirtualMachine().getDisks().get(0).getSizeGb() == 200
        getVirtualMachine().getServiceAccount().getScopes() == SCOPES
        getVirtualMachine().getPreemptible() == preEmptible
        !getVirtualMachine().getAccelerators()
        !getVirtualMachine().getNetwork()?.getUsePrivateAddress()
    }

    when:
    def resources3 = helper.createResources( new GoogleLifeSciencesSubmitRequest(
            machineType:type,
            zone:zone,
            diskName:diskName,
            preemptible: false,
            accelerator: acc,
            bootDiskSizeGb: 75,
            usePrivateAddress: true ))
    then:
    with(resources3) {
        getVirtualMachine().getMachineType() == type
        getZones() == zone
        getVirtualMachine().getDisks().get(0).getName() == diskName
        getVirtualMachine().getServiceAccount().getScopes() == SCOPES
        !getVirtualMachine().getDisks().get(0).getSizeGb()
        !getVirtualMachine().getPreemptible()
        getVirtualMachine().getAccelerators().size()==1
        getVirtualMachine().getAccelerators()[0].getCount()==4
        getVirtualMachine().getAccelerators()[0].getType()=='nvidia-tesla-k80'
        getVirtualMachine().getBootDiskSizeGb() == 75
        getVirtualMachine().getNetwork().getUsePrivateAddress()
    }
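
A fourth case could exercise the new field in the same style; a sketch, assuming GoogleLifeSciencesSubmitRequest carries a cpuPlatform field and the Life Sciences VirtualMachine model exposes it via getCpuPlatform():

    when:
    def resources4 = helper.createResources(new GoogleLifeSciencesSubmitRequest(
            machineType: type,
            zone: zone,
            diskName: diskName,
            cpuPlatform: 'Intel Skylake'))
    then:
    with(resources4) {
        // the minimum CPU platform should be propagated to the VM spec
        getVirtualMachine().getCpuPlatform() == 'Intel Skylake'
    }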

Also, please make sure to sign off the commits to fulfill the DCO bot requirement. Thanks!

hnawar added 13 commits June 20, 2020 20:24
Adding cpuPlatform parameter to the request

Signed-off-by: hnawar <hnawar@google.com>
Add support for CPU Platform selection

Signed-off-by: hnawar <hnawar@google.com>
Add support for CPU platform

Signed-off-by: hnawar <hnawar@google.com>
Adding support for CPU Platform selection

Signed-off-by: hnawar <hnawar@google.com>
Signed-off-by: hnawar <hnawar@google.com>
Signed-off-by: hnawar <hnawar@google.com>
Add the google.lifeSciences.cpuPlatform to the documentation

Signed-off-by: hnawar <hnawar@google.com>
Add quotes to the example of google.lifeSciences.cpuPlatform

Signed-off-by: hnawar <hnawar@google.com>
Add link to min CPU platform documentation

Signed-off-by: hnawar <hnawar@google.com>
Add unit test for cpuPlatform in config

Signed-off-by: hnawar <hnawar@google.com>
Signed-off-by: hnawar <hnawar@google.com>
Signed-off-by: hnawar <hnawar@google.com>
Fix Accidental deletion

Signed-off-by: hnawar <hnawar@google.com>
Signed-off-by: hnawar <hnawar@google.com>
@hnawar (Contributor, Author) commented Jun 20, 2020

I have added the unit tests and signed the commits for the DCO, and the Travis build passed.

@hnawar (Contributor, Author) commented Jun 21, 2020

I have also staged changes to add support for specifying the disk type per process. I can push these changes as well and put everything in one PR if that is more convenient, or submit a separate pull request.

@pditommaso (Member) left a comment

Thanks a lot for this contribution. Merging it.

@pditommaso merged commit afc4375 into nextflow-io:master on Jun 22, 2020
@pditommaso (Member)

As for specifying the disk type per process, feel free to draft a separate PR for it. However, I would like to keep a portable model for this, i.e. one that can also support other computing services.

@pditommaso (Member)

Any suggestions on how to retrieve the list of available CPU platforms for a given zone using the Java API?

Can't find an immediate way using the com.google.api.services.compute.Compute client.
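
One possible route is the Compute Engine zones.get call, whose Zone resource includes an availableCpuPlatforms list; a minimal sketch, assuming an already initialised Compute client and placeholder project/zone names:

// 'compute' is an initialised com.google.api.services.compute.Compute client;
// the project and zone below are placeholders
def zoneInfo = compute.zones().get('my-project', 'europe-west2-b').execute()
def platforms = zoneInfo.getAvailableCpuPlatforms()   // list of platform names, or null
println platforms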
