Execution never ends with a bad Redshift connection #22

rafaelleinio · 2019-09-16T17:00:46Z

I experienced this issue testing aws-data-wrangler in the "Spark to Redshift" use case.

I created a Redshift connection with the method Redshift.generate_connection, and when I tried to load the dataframe into Redshift with the method session.spark.to_redshift, the execution never finished, so I had to cancel it manually. The temporary files on S3 were created, but nothing was written to Redshift. Later I realized that I was passing the port as a string to the connection, and I think the method expects an int argument. When I changed this, the load worked just fine!

My guess is that with a wrong connection setup wrangler can't reach Redshift but there is no timeout to stop the execution, so the code gets stuck.
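To illustrate the missing timeout, here is a minimal sketch of a reachability check that gives up after a fixed number of seconds instead of blocking forever. It uses only the Python standard library; the helper name can_reach and the 5-second default are assumptions made for this example and are not part of aws-data-wrangler.

```python
import socket


def can_reach(host: str, port: int, timeout: float = 5.0) -> bool:
    """Return True if a TCP connection to host:port opens within `timeout` seconds.

    Hypothetical helper for illustration only; not part of aws-data-wrangler.
    """
    try:
        # create_connection raises an OSError subclass on refusal,
        # DNS failure, or when the timeout expires.
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        return False
```

Running a check like this before starting the load would turn a silent hang into an immediate, visible failure.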

igorborgest · 2019-09-16T17:20:10Z

Thanks @rafaelleinio!

I solved the port typing issue, and also created a more generic connection validation mechanism.

Make sense?
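As a rough idea of what handling the port type can look like, the sketch below accepts the port as either an int or a numeric string and fails early with a clear error otherwise. The function name normalize_port is hypothetical, and this is not the code merged for this issue, only an illustration of the approach.

```python
def normalize_port(port) -> int:
    """Coerce a port given as an int or numeric string to an int.

    Hypothetical helper for illustration only; raises instead of letting
    a malformed value reach the database driver.
    """
    # bool is a subclass of int, so reject it explicitly.
    if isinstance(port, bool) or not isinstance(port, (int, str)):
        raise TypeError(f"port must be an int or a numeric string, got {type(port).__name__}")
    try:
        value = int(port)
    except ValueError as exc:
        raise ValueError(f"invalid port: {port!r}") from exc
    if not 1 <= value <= 65535:
        raise ValueError(f"port out of range: {value}")
    return value
```

With this in place, passing "5439" and passing 5439 would both produce the same int before the connection is opened.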

igorborgest self-assigned this Sep 16, 2019

igorborgest added the enhancement label Sep 16, 2019

igorborgest mentioned this issue Sep 16, 2019

Add mechanism to make Redshift handle bad connections #23

Merged

igorborgest closed this as completed Sep 17, 2019
