
Spark multiversion support #1325

Open: wants to merge 85 commits into master

Conversation

sumwale (Contributor) commented Jun 10, 2019

Changes proposed in this pull request

  • support for multiple Spark versions from the same code base
  • a SparkInternals interface to abstract out the internal APIs used by the SnappyData/AQP layers that
    changed between Spark 2.1 and 2.4, with implementations for 2.1.0/2.1.1/2.3.2 (a sketch follows this list)
  • updated build to allow a "spark.connector.version" property for building the smart connector against a
    non-default Spark version
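
A minimal sketch of the shape such an abstraction could take, for context: the trait name and the idea of one implementation per Spark version come from this PR, while the specific members and placeholder types below are illustrative assumptions.

```scala
// Sketch only: placeholder types standing in for the real SnappyData classes.
class SnappySession
abstract class SnappySessionState

// One trait collecting the internal Spark APIs that changed across releases;
// each supported Spark version ships a concrete implementation of it.
trait SparkInternals {
  // Exact Spark version this implementation targets, e.g. "2.3.2" (assumed member).
  def version: String

  // Session state construction is one version-dependent hook discussed
  // later in this conversation.
  def newSnappySessionState(session: SnappySession): SnappySessionState
}
```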

Patch testing

precheckin

ReleaseNotes.txt changes

Documentation for multiple Spark version support in smart connector mode

Other PRs

TIBCOSoftware/snappy-store#478
https://github.com/SnappyDataInc/snappy-aqp/pull/187

Sumedh Wale added 17 commits October 22, 2018 15:41
remaining build failures = 34 with Spark 2.3.x (was > 200 originally)
- the product will always use the compatible Spark version but the connector can use a different one
- added a couple of sub-projects (core-product and aqp-product) that will always use the compatible version
  while the connector build can use a different one
- the cluster will depend on the normal build if the connector version is the same, else it will use core-product
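
A minimal sketch of how the matching implementation might be resolved at runtime; the match on org.apache.spark.SPARK_VERSION is an assumed mechanism rather than what this PR necessarily does, and the per-version class names follow the files mentioned later in this conversation:

```scala
import org.apache.spark.SPARK_VERSION

// Resolve the version-specific implementation once, based on the Spark
// runtime actually on the classpath (assuming the per-version classes
// Spark210Internals/Spark232Internals from this PR exist).
object SparkInternals {
  lazy val instance: SparkInternals = SPARK_VERSION match {
    case "2.1.0" | "2.1.1" => new Spark210Internals
    case v if v.startsWith("2.3.") => new Spark232Internals
    case v => throw new IllegalStateException(s"unsupported Spark version: $v")
  }
}
```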
ashishkshukla commented Jun 19, 2019

@sumwale - I was looking into this code base and trying to make a build for Spark 2.3.2.
I noticed that we have defined a def newSnappySessionState(snappySession: SnappySession): SnappySessionState in the SparkInternals trait which creates an instance of SnappySessionState for the given Spark version.
It seems this needs to be implemented for every Spark version we support, as it is called directly from SnappySession.sessionState.
We have its implementation in Spark210Internals.scala for Spark 2.1.1 and Spark 2.1.0, but its implementation is missing for Spark 2.3.2. Do we need to work on the implementation, or are we handling it in a different way? I cannot see its implementation in Spark232Internals.scala.
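
To make the question concrete, a sketch of what the missing 2.3.2 implementation might look like; only the method signature comes from the trait described above, and SnappySessionState23 is a hypothetical name:

```scala
// Hypothetical sketch: a 2.3-specific session state subclass would presumably
// be needed, since SessionState internals changed between Spark 2.1 and 2.3.
class SnappySessionState23(session: SnappySession) extends SnappySessionState

class Spark232Internals extends SparkInternals {
  override def version: String = "2.3.2"

  override def newSnappySessionState(session: SnappySession): SnappySessionState =
    new SnappySessionState23(session)
}
```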

made CREATE FUNCTION consistent with Spark
- add search and explicit cleanup of broadcast exchanges at the end of query execution
  (else they would only be cleared when GC collects the reference); a sketch follows below
- corrected GUI plan timings and cleaned up the END message to deliver it reliably
  and not leave dangling SQL tasks running forever in some cases
- other test changes for Spark 2.4.5 to fix failures
- fix code generation issue seen in TPCH Q20
- correct SD's SQL listener to link any jobs during the planning and execution phases of a query correctly
  (reintroduced SparkListenerSQLPlanExecutionStart/End and handle SparkListenerSQLExecutionStart to search
   for any existing execution from SparkListenerSQLPlanExecutionStart, then mark it as active instead of creating a new one)
also fix a few dunit test failures in ColumnBatchAndExternalTableDUnitTest
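
A minimal sketch of the explicit broadcast cleanup described in the commits above, using stock Spark 2.3 APIs; it assumes the caller lives under an org.apache.spark.sql package (so the private[sql] relationFuture is accessible) and that cleanup runs only after the query has finished:

```scala
import scala.concurrent.Await
import scala.concurrent.duration.Duration

import org.apache.spark.sql.execution.SparkPlan
import org.apache.spark.sql.execution.exchange.BroadcastExchangeExec

// Walk the executed plan and destroy the broadcast variable behind every
// broadcast exchange, instead of waiting for GC to collect the reference.
def cleanupBroadcasts(executedPlan: SparkPlan): Unit = {
  executedPlan.foreach {
    case b: BroadcastExchangeExec =>
      // relationFuture completes once the broadcast has been built
      val broadcast = Await.result(b.relationFuture, Duration.Inf)
      broadcast.destroy()
    case _ =>
  }
}
```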