Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Support structured streaming writes to BigQuery #201

Merged

Conversation

varundhussa
Copy link
Contributor

Add a new Streaming sink
Make load requests per batch DataFrame

@varundhussa
Copy link
Contributor Author

@davidrabinowitz this allows loads for each batch

@davidrabinowitz
Copy link
Member

/gcbrun

@davidrabinowitz
Copy link
Member

@varundhussa Thanks for the clean up! Sorry, I forgot to ask, can you please add an integration test - as it is a new functionality I want to make sure it is properly covered.

@davidrabinowitz
Copy link
Member

/gcbrun

@varundhussa
Copy link
Contributor Author

Thanks @davidrabinowitz. I'll write a dataframe selected from BigQuery to a MemoryStream and write tests on top of it.

@varundhussa
Copy link
Contributor Author

@davidrabinowitz I have added a append integration test. Not sure how to run it with the build process.

@davidrabinowitz
Copy link
Member

/gcbrun

@davidrabinowitz davidrabinowitz merged commit 041b07a into GoogleCloudDataproc:master Jul 13, 2020
@varundhussa varundhussa deleted the spark_streaming branch July 14, 2020 05:59
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants