sql-spark-connector slower than older azure-sqldb-spark connector

I ran the same test, using the same 8.5 GB file, with the old connector (https://github.com/Azure/azure-sqldb-spark) and with this one (sql-spark-connector). Both runs used the very same parameters: bulk copy, batchsize 150000, tablelock true. With the old connector (Spark 2.4.0, Scala code) the job finished in 16 minutes; with this connector (Spark 3.3.1, PySpark code) it took around 38 minutes. What could be the reason?
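For reference, here is a minimal sketch of the kind of PySpark write described above, not the exact job. It assumes the connector's documented format name `com.microsoft.sqlserver.jdbc.spark` and its `batchsize` and `tableLock` options; the helper names, the JDBC URL, the table name, the credentials, and the `append` mode are all hypothetical placeholders.

```python
# Format name used by sql-spark-connector for DataFrame reads and writes.
SQL_SPARK_FORMAT = "com.microsoft.sqlserver.jdbc.spark"


def build_write_options(url, table, user, password,
                        batch_size=150000, table_lock=True):
    """Collect the write options as strings (hypothetical helper)."""
    return {
        "url": url,
        "dbtable": table,
        "user": user,
        "password": password,
        "batchsize": str(batch_size),          # rows per bulk-copy batch
        "tableLock": str(table_lock).lower(),  # "true" requests a table lock
    }


def write_dataframe(df, options, mode="append"):
    """Write a Spark DataFrame through the connector (mode is an assumption)."""
    df.write.format(SQL_SPARK_FORMAT).mode(mode).options(**options).save()


# Placeholder connection details; substitute real values before running.
options = build_write_options(
    url="jdbc:sqlserver://<server>:1433;databaseName=<database>",
    table="dbo.<target_table>",
    user="<user>",
    password="<password>",
)
```

Calling `write_dataframe(df, options)` on the DataFrame read from the 8.5 GB file would then issue the bulk write with the settings listed above.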

sql-spark-connector slower than older azure-sqldb-spark connector #206
