Upgrade Jobserver to 2.4.4 Spark #1283
Commits on Feb 25, 2020
-
feat(jobserver): Upgrade spark to 2.4.4
* Hive doesn't clean up some of the data after tables are dropped. Since the Jobserver project currently has 2 tests that use Hive, add a LOCATION parameter to table creation and use different paths for the metastore.
* Also add a dedicated warehouse configuration to hive-site.xml, so the data can be told apart in the future.
* Explicitly enable the REST API (spark.master.rest.enabled) for the master, because it defaults to false in 2.4.4.
Change-Id: Id8604d0156f60970c494dd373bfaaafdbcb4d63f
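The LOCATION approach from the first bullet could be sketched roughly as follows. This is a minimal illustration, not the code from this commit: the helper names `create_table_ddl` and `isolated_location` are hypothetical, and the statement is only built as a string so the sketch runs without Spark or Hive installed.

```python
import tempfile
from pathlib import Path


def create_table_ddl(table: str, columns: dict, location: Path) -> str:
    """Build a CREATE TABLE statement that pins the table data to `location`.

    Hypothetical helper: with an explicit LOCATION, the data files land in a
    directory the test controls instead of a shared warehouse directory.
    """
    cols = ", ".join(f"{name} {sql_type}" for name, sql_type in columns.items())
    return (
        f"CREATE TABLE IF NOT EXISTS {table} ({cols}) "
        f"LOCATION '{location.as_posix()}'"
    )


def isolated_location(test_name: str) -> Path:
    """Create a fresh directory so leftovers from one test cannot leak into another."""
    return Path(tempfile.mkdtemp(prefix=f"{test_name}-"))
```

In a real test the resulting string would presumably be handed to the session's SQL entry point, and the directory removed once the test finishes.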
Commit 1244df7
-
feat(jobserver): Disable hive support from SparkSession
Change-Id: Ic1c66785ea8c8645789c86dae83154e80458b7ec
Commit f44aa50
-
feat(jobserver): Add missing C* dependency
C* connector 2.4 indirectly depends on commons-configuration, which Hadoop 2.7 brings onto the classpath. This dependency changed in Hadoop 3.x, so C* connector 2.4 is broken there. Until that is fixed, jobserver puts the dependency on the classpath itself.
https://datastax-oss.atlassian.net/browse/SPARKC-566
Change-Id: I532ab22d2bb97dc5fd118c7178f67207b06bf885
Commit d88b956
-
fix(ci): Update CI scripts to 2.4.4
Change-Id: Ic5128c5f306250c289f8c59d22f53d31bf87674e
Commit da9dfd4
Commits on Mar 2, 2020
-
jobserver(python): Enable secure communication
After the upgrade to 2.4.4, Python tests and contexts started to throw warnings like "You are trying to pass an insecure Py4j gateway to Spark. This presents a security risk." This change addresses the problem by passing a token to the Python subprocess. The subprocess uses the token for communication, and the py4j gateway only accepts it if the token is valid.
Change-Id: I61e82b2996fd830315db1dc72af549578fc9a7a4
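The token handshake described above can be illustrated with a standard-library-only sketch. It does not use py4j and is not the code from this commit: the environment variable name `GATEWAY_AUTH_TOKEN`, the child script, and the functions `token_is_valid` and `run_child` are all assumptions made for the example. The parent hands a secret to the subprocess and accepts the subprocess only if it presents the same secret back.

```python
import hmac
import os
import secrets
import subprocess
import sys

TOKEN_ENV = "GATEWAY_AUTH_TOKEN"  # hypothetical variable name, chosen for this sketch

# Stand-in for the Python subprocess: it reads the token from its environment
# and presents it back to the parent on stdout.
CHILD_SCRIPT = (
    "import os, sys\n"
    f"sys.stdout.write(os.environ.get('{TOKEN_ENV}', ''))\n"
)


def token_is_valid(expected: str, presented: str) -> bool:
    """Compare tokens in constant time to avoid leaking information via timing."""
    return hmac.compare_digest(expected.encode(), presented.encode())


def run_child(expected_token: str, child_token=None) -> bool:
    """Start the child with `child_token` (or none) and report whether it is accepted."""
    env = dict(os.environ)
    env.pop(TOKEN_ENV, None)
    if child_token is not None:
        env[TOKEN_ENV] = child_token
    result = subprocess.run(
        [sys.executable, "-c", CHILD_SCRIPT],
        env=env,
        capture_output=True,
        text=True,
        check=True,
    )
    return token_is_valid(expected_token, result.stdout)
```

A parent would typically create the secret with `secrets.token_hex(32)` and pass it as both arguments; a child started with a wrong or missing token is rejected.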
Commit 31fd78c
Commits on Mar 17, 2020
-
fix(tests): Stop context cleanly
The tests checking that hive is disabled were failing because the context from the previous test case was not shut down properly and still had hive enabled. This fix makes sure the context is cleaned up and stopped.
Change-Id: If6f9cb26fcc6f8f2af3243057825ef75585378d8
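One common way to guarantee that kind of cleanup is to wrap context creation in a context manager, so that `stop()` runs even when a test body raises. The sketch below shows the pattern only and is not taken from this commit: `FakeContext` is a hypothetical stand-in for a real Spark context, and `managed_context` is an illustrative name.

```python
from contextlib import contextmanager


class FakeContext:
    """Hypothetical stand-in for a Spark context; records whether it was stopped."""

    def __init__(self, hive_enabled: bool):
        self.hive_enabled = hive_enabled
        self.stopped = False

    def stop(self):
        self.stopped = True


@contextmanager
def managed_context(factory):
    """Create a context and always stop it, even if the test body fails."""
    ctx = factory()
    try:
        yield ctx
    finally:
        # Runs on success and on exceptions, so no context outlives its test.
        ctx.stop()
```

Each test case would then open its own context with `with managed_context(...) as ctx:` and could not inherit settings from a context left running by an earlier case.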
Commit d53f507
-
refactor(python): Make subprocess.py PEP8 compliant
The open-source Jobserver uses "pycodestyle" to keep the Python files PEP8 compliant. subprocess.py was not compliant, which caused the open-source build to fail.
Change-Id: I93f9718ed30e122441d6e775045fff0711342f08
Commit 875407a
-
feat(python): Disable hive support from Python Spark Session
Change-Id: I692a8ff1387aee99ea4db7863d4676f6dd8fa5c9
Commit 66f6150