Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Enable automatic URL tracking for Spark applications #2669

Merged
merged 2 commits into from Mar 28, 2019

Conversation

GoodDok
Copy link
Contributor

@GoodDok GoodDok commented Mar 12, 2019

Description

#2661 has provided the mechanism of tracking URL in external programs.
So now we can enable it for Spark applications:
If spark application is run in cluster mode, the string matching the pattern tracking URL: (https?://.*)\s is searched.
If spark application is run in client mode, the string matching the pattern Bound (?:.*) to (?:.*), and started at (https?://.*)\s is searched.
In both cases the searching is performed in stderr stream.

Motivation and Context

To enable URL tracking in Spark applications.

Have you tested this? If so, how?

The PR contains some new unit tests + existing tests were run locally

GoodDok and others added 2 commits March 6, 2019 19:11
Sync with the main repo
- changed `track_url_in_stderr` to `stream_for_searching_tracking_url` according to the latest changes in external_program.py
- added tests for spark url tracking in cluster and client modes
Copy link
Contributor

@Tarrasch Tarrasch left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Let's once merge once you've confirmed it works in prod!

@GoodDok
Copy link
Contributor Author

GoodDok commented Mar 13, 2019

Ok, I'll be back when it's been working long enough.

@GoodDok
Copy link
Contributor Author

GoodDok commented Mar 26, 2019

Hi @Tarrasch, just wanted to inform that the pipeline with these changes applied has been working for last ~2 weeks with no issues observed.

@Tarrasch
Copy link
Contributor

Awesome. Let's merge then!

@Tarrasch Tarrasch merged commit 62fbaf2 into spotify:master Mar 28, 2019
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

None yet

2 participants