[NOT YET] All of Leo: code review #29

helgridly · 2017-09-01T16:26:50Z

Once we've got Leo working end-to-end I'd like us to swarm on a code review and do some housekeeping tasks. This PR tracks the current state of develop against the beginning of time so it contains everything we've done.

code review date TBD but not yet

Add dockerfile and build scripts

* Add project status to README * Add Verily to LICENSE * Add placeholder for external contribution guidelines until we roll out our full new CLA process.

Remove a specific reference to Broad in favor of 'the copyright holder'

…ce and update Leo code to use it (#89) * Initial refactoring. Main compiles, tests don't. * Tests compile, don't pass * Add DB migration, tests pass now * Added Sam ServiceAccountProvider implementation, clean up some code * Oops, fix name * Beef up pet swat a little bit * Rebase from develop * Minor fixup * post rebase fix * Fix LeonardoModelCopy + ClusterKluge * Add stub method to remove creds from instance metadata, if an override service account is used. * Fix label swat test * s/overrideServiceAccount/notebookServiceAccount/g * Fix comment * Fix substring check * Dummy commit to test github * PR feedback: made ServiceAccountProvider return both the Leo SA and the pem file * Sleep a little more * Jam a recover on removeServiceAccountKey so the delete doesn't fail * Fix swat test flakiness, fix TODOs * retrieving service account keys from Google is SUPER inconsistent and flakey

* Implement new endpoint in HttpSamDAO. Fix some bugs as well * Add PetsPerProjectServiceAccountProvider * rebase oops * Switch unit tests to PPP * Update libs depdendency * Stab at a swat test case, untested * PR feedback, and actually switch unit tests to use a PPP implementation * Fix swat label check and init script bug * Fix spec bug; try NOT adding dataproc.worker to the pet SA * Fix test, temporarily * Revert "Fix test, temporarily" This reverts commit 937abc0. * Put back dataproc worker * Init script bugfix attempt. Committing to run SWAT * More swat * Need to set GOOGLE_APPLICATION_CREDENTIALS _conditionally_ in the docker container, not always * Bash fail * the env file needs to exist * Should make swat pass * Try and prevent spurious selenium timeouts * Even more patience * Increase selenium timeouts * Update to non-SNAP

* Fix concurrency bug relating to removing Dataproc Worker IAM role from pets * Spec * Comment improvement * Change method signature; add missing DB indexes * Fix comment lie

* also removed a false comment

* Cross domain cookie auth - make the Leo proxy accept a token via cookie OR Authorization header * Make the cookie params configurable * Add explicit login/logout endpoints per discussion with jmandel * Add Google client ID param to the create cluster endpoint, per discussion with jmandel * Change notebook extension to call gapi.authorize on a timer instead of init/signIn (thanks again jmandel) * Post rebase fixes * Change cookie logic to be more correct * Minor fixes * Remove googleClientId from the API and templating logic, in its own commit * Stab at postMessage implementation to pass the client ID to the notebook extension. Swat still in progress. * Temporarily comment out allowedOrigins to facilitate testing * Revert "Temporarily comment out allowedOrigins to facilitate testing" This reverts commit 73c1177. * PR feedback, and add swagger * Need to set CORS headers for cross-domain cookies * Fix unit test * Support preflight OPTIONS requests for ajax * ProxyRoutes bug * Hopefully fix my proxy routes woes * slight CORS fix, getting close * Missed a couple cors headers * Make postMessage data an object to be consistent * Dummy page for testing. It works against a fiab :D. Needs swatification * Oops, committed an .orig file * JS tweaks and do cors for all proxy requests * Fix tests * dummy client fixes * SWAT * Debug error on Jenkins * implement awaitLoaded in DummyClientPage * Post rebase fixes * Minor cleanup * docker fail * Check postMessage origin in dummy client

* add AoU client library

* give clusterserviceaccount access to dataproc bucket

* Change cookie name: FCtoken -> LeoToken * Just check the origin

* Add Bigquery scopes to Dataproc cluster SA * BigQuery Swat test * Update formatting for new version of executeCell() * Copy Google Role endpoints from UI-Repo Swat

* Bump handleResponse/awaitResponse times to 1 sec and billing account creation to 10 min

* Dockerfile reduction and refactoring * docker seventeen * Split Dockerfile into separate layers for prereqs + Java + Spark/Hadoop/Hail + pips * Dockerfile updates based on feedback * I think I need to mount Hive * More spark confs that seem to help * More tries * useless Java option * Reduce spark.yarn.am.memory * Link JIRA in comment

* Add println * Increase route timeout

* initial - probs doesn't work * make the notebook action strings correct * fix execution context bug * try a different way of authing * i dunno * logging stuff * fiab stuff * trying to figure out why resourceAction is giving me a false * hmmm - are we actually getting a boolean and passing in the right stuff? * remove whitelist D: * approximately * import statements * stuff * add back recreate cluster * fixing samauth test * add cache stuff to config * make it actually talk to Sam * oops * use a real fake pem * ignore test for now... * forgot this * update for sam changes * resolve conflicts * let's not be too clever * tiny correction * log some stuff * will this work idk * decode * logging stuff * more logging * add back whitelist + update for sam changes * do it right * log more things * maybe, idk * omg this might work i think * no, but maybe this? * wild guesswork * this has to be it * as json this time * try this * idk * oh this actually worked * comments and test stuff * switch auth for tests * src * im sleepy * some initial review changes * Change strings to model classes * change tests * oops broke tests * new auth provider interface * fix whitelist * fix reference.conf * fix sam server url * use mockLeoAuthProvider in LeonardoServiceSpec for now * go back to whitelist * sam auth provider spec * list clusters courtesy of Rob and Hussein * override canSeeAllClustersInProject * hussein's nitpick + executionContext * bracket * samclient prelim * move project and notebook actions back to SamAuthProvider * [nomerge] Better LeoService interop w/ auth providers (#120) * new auth provider interface * wip * better * commentary * tests for optimized leo list clusters * whitelist fixes * mockito test * removing extra stuff from commontestdata * silly * fix tests * fix tests for real * maybe don't use mockito * add billing project manually to mock * tiny mistake * maybe this * I can't seem to do anything right * now we're getting somewhere * wait no, add back mockito * why can't this be over * what's in a name * stop reading my commit messages, the pressure to entertain is too much * I miss the days when my commit messages were just for me * please pass * clean it up make it pretty * get and contain issue * that which we call a rose By any other word would smell as sweet but would probs be confusing * change artifactory * review suggestions * why would I get it right the first time * something broke again * fix for notebookclusters too * stupid race conditions * change cacheExpiryTime to FiniteDuration * fix reference.conf cacheExpiryTime * ergh * missed one * maybe this * I think I'm getting closer * well, maybe * update sam swagger client version * so muh simpler * oops naming resources * remove logging * fix the automated tests * have more patience * typo * remove logging messages * do the simple thing first * fix the owner delete bug * don't need userEmail in internalDeleteCluster anymore * pass in full cluster to notify methods * fix tests * go back to using email project and cluster name * refactor swat tests a bit * missed two... * change it again sigh * fix tests

* Make notebooky tests reuse the same cluster - new LeonardoTestUtils trait - Run ClusterMonitorSpec in parallel - Don't need to explicitly mixin WebBrowserSpec - centralize hail file vals - Ensure withOpenNotebook/withNewNotebook always terminate their kernels - Removed redudant test which uploads but doesn't execute notebook

* use workbench libs * auth token for leo, other deletions, tweaks * adding methods to configs * adding some more info * forgot to render users.json * build/push swat tests docker image in jenkins build script * all up to date * syncing users json * Add campaignManagers to users * workbench-libs needs these (TODO remove) * fix for app conf * real fix for app conf * Hermione and Ron * more app conf fix * more app conf ctmpl * Update scalatest version for workbench-libs compatibility * implicits are the devil * dammit weasley * MORE PATIENCE * fix build script when pushing to develop * save git SHA during build script

Added link to the production swagger UI to README.md

* Add error handling to providers * Add tests * Clean up * Add logging * Exception recovery not needed on SamAuthProvider * Better specs and error handling * Add comment * Log text nit * Add comment to specs

* Delete cluster resources in Sam only after deletion is complete in Google and the Leo DB * Integration Tests: Downgrade selenium/node-chrome to 3.8.1-erbium

Hussein Elgridly and others added 30 commits July 11, 2017 18:34

legalities

75b1e15

gitignore

531ddf0

placeholder initial route and test

f2ca14d

hello travis

8a43494

rename travis

54aee79

coveralls

c3be7ea

add build badges

a1f508b

coveralls fixes

87626dd

Get coveralls working

0ce4830

update coveralls badge

f67298e

adding simple Dockerfile

59f62aa

add scripts to build jar/docker image

3cb25a8

oops

420f443

add git_model_hash back in

57e7713

Merge pull request #4 from broadinstitute/dockerfile_as

dbcc9e1

Add dockerfile and build scripts

GAWB-2339 Add swagger (#3)

030a2dc

Pulled in workbench-libs, removed ErrorReport copypasta (#5)

00ec0de

GAWB-2000: first try, jam some files in

4f8d7ef

GAWB-2000: changes to put route

1a9224b

GAWB-2000: googleutilites for now

49ed394

GAWB-2000: util for now

b06d7ba

GAWB-2000: dependencies for now, leo service, model, servicespec

d135cd2

GAWB-2000: more changes

29c7a7f

rebase fixes

2528393

GAWB-2000: tests

9f7f3b3

GAWB-2000: fix swagger

3cd1e77

GAWB-2000: ClusterResponse, review suggestions

28ad75b

GAWB-2000: change variable nameg

014a354

GAWB-2000: adding labels to clusters + test dummy conf

7224807

GAWB-2000: fixing test conf

4661d71

Bradley R Taylor and others added 29 commits December 8, 2017 11:50

Minor updates to repo documentation (#91)

91be516

* Add project status to README * Add Verily to LICENSE * Add placeholder for external contribution guidelines until we roll out our full new CLA process.

Update LICENSE.txt

d8b38f9

Remove a specific reference to Broad in favor of 'the copyright holder'

3 + 2 = fix

e3492a7

fix for minor swagger lies (#96)

7cb3eb1

Multiline keys must be indented (#97)

41d48cf

Dataproc Worker concurrency bug (#99)

34601da

* Fix concurrency bug relating to removing Dataproc Worker IAM role from pets * Spec * Comment improvement * Change method signature; add missing DB indexes * Fix comment lie

added cluster creator to default set of labels (#98)

50346da

Selenium upgrade to fix download issues (#100)

ec6ee59

* also removed a false comment

GAWB-2386 stub /localize endpoint in Jupyter API (#101)

b4ad84d

remove dependency on dsde-toolbox (#106)

e94bbe4

GAWB-2386, GAWB-2407 loc/deloc (#104)

ddf7869

GAWB 2608 - add AoU client library (#107)

d261216

* add AoU client library

GAWB 3026 - give clusterserviceaccount access to dataproc bucket (#111)

d066831

* give clusterserviceaccount access to dataproc bucket

s/FCtoken/LeoToken (#109)

148aab0

* Change cookie name: FCtoken -> LeoToken * Just check the origin

GAWB-2838: BigQuery inside notebooks (#113)

6d404ab

* Add Bigquery scopes to Dataproc cluster SA * BigQuery Swat test * Update formatting for new version of executeCell() * Copy Google Role endpoints from UI-Repo Swat

GAWB-3023 Create new billing project for every new-cluster test (#114)

3fdd99b

* Bump handleResponse/awaitResponse times to 1 sec and billing account creation to 10 min

GAWB-3082: Fix intermittent unit test failure in ProxyRoutesSpec (#124)

0b51524

Fix ProxyRoutesSpec for real (#125)

98dd9ef

* Add println * Increase route timeout

Update README.md

e043ffd

Added link to the production swagger UI to README.md

configurable providers are live now

f04c6e9

Leo/Sam auth: handle Sam API exceptions in SamAuthProvider (#138)

99bacf5

* Add error handling to providers * Add tests * Clean up * Add logging * Exception recovery not needed on SamAuthProvider * Better specs and error handling * Add comment * Log text nit * Add comment to specs

GAWB-3108 Defer Deletion of Sam Cluster Resources (#139)

5258227

* Delete cluster resources in Sam only after deletion is complete in Google and the Leo DB * Integration Tests: Downgrade selenium/node-chrome to 3.8.1-erbium

rtitle closed this Feb 1, 2018

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[NOT YET] All of Leo: code review #29

[NOT YET] All of Leo: code review #29

helgridly commented Sep 1, 2017 •

edited

Loading

[NOT YET] All of Leo: code review #29

[NOT YET] All of Leo: code review #29

Conversation

helgridly commented Sep 1, 2017 • edited Loading

helgridly commented Sep 1, 2017 •

edited

Loading