You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Describe the bug
Configuring a dataset to use the Spark Data Connector and setting the parameter to connect locally to sc://localhost:15002 results in a transport error.
Spice.ai runtime starting...
2024-05-01T00:19:11.836924Z INFO spiced: Metrics listening on 127.0.0.1:9000
2024-05-01T00:19:12.018627Z WARN runtime: Failed to get data connector from source for dataset traces: Unable to initialize data connector spark: transport error
Use pyspark to register a table from a parquet file at traces:
frompyspark.sqlimportSparkSessionSparkSession.builder.master("local[*]").getOrCreate().stop()
spark=SparkSession.builder.remote("sc://localhost:15002").getOrCreate()
# Replace the path with a path to a parquet file on your systemspark.catalog.createTable("traces", path="/path/to/traces.parquet", source="parquet")
Start the Spice runtime with the above Spicepod that connects to the Spark Cluster, and observe the transport error and this error from Spark:
24/05/01 09:19:13 INFO connections: Transport failed io.netty.handler.codec.http2.Http2Exception: HTTP/2 client preface string missing or corrupt. Hex dump for received bytes: 16030100f0010000ec03033053ae382b06852dedb277ee05
Expected behavior
The connection is made and I can query my table.
Additional context
I believe this is because we try to connect over TLS by default - we should provide an option to bypass TLS for this local scenario.
The text was updated successfully, but these errors were encountered:
Describe the bug
Configuring a dataset to use the Spark Data Connector and setting the parameter to connect locally to
sc://localhost:15002
results in atransport
error.This is the Spicepod I used:
To Reproduce
Follow this quickstart to start a Spark Connect cluster locally: https://spark.apache.org/docs/latest/api/python/getting_started/quickstart_connect.html
Use
pyspark
to register a table from a parquet file attraces
:Start the Spice runtime with the above Spicepod that connects to the Spark Cluster, and observe the transport error and this error from Spark:
24/05/01 09:19:13 INFO connections: Transport failed io.netty.handler.codec.http2.Http2Exception: HTTP/2 client preface string missing or corrupt. Hex dump for received bytes: 16030100f0010000ec03033053ae382b06852dedb277ee05
Expected behavior
The connection is made and I can query my table.
Additional context
I believe this is because we try to connect over TLS by default - we should provide an option to bypass TLS for this local scenario.
The text was updated successfully, but these errors were encountered: