Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[BEAM-2143]: Fix default temp location for DataflowRunner #2907

Closed
wants to merge 3 commits into from

Conversation

vikkyrk
Copy link
Contributor

@vikkyrk vikkyrk commented May 5, 2017

Be sure to do all of the following to help us incorporate your contribution
quickly and easily:

  • Make sure the PR title is formatted like:
    [BEAM-<Jira issue #>] Description of pull request
  • Make sure tests pass via mvn clean verify.
  • Replace <Jira issue #> in the title with the actual Jira issue
    number, if there is one.
  • If this contribution is large, please file an Apache
    Individual Contributor License Agreement.

@vikkyrk
Copy link
Contributor Author

vikkyrk commented May 5, 2017

R: @bjchambers @davorbonaci

Copy link
Member

@davorbonaci davorbonaci left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM; leaving final review & merge to @bjchambers.

@coveralls
Copy link

Coverage Status

Coverage increased (+0.002%) to 70.491% when pulling ea165bf on vikkyrk:df_temp_loc into b130d7a on apache:master.

@coveralls
Copy link

Coverage Status

Coverage decreased (-0.005%) to 70.484% when pulling e8421d8 on vikkyrk:df_temp_loc into 3bffe0e on apache:master.

Copy link
Contributor

@bjchambers bjchambers left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Can you comment on what the logs are like now when running the word count example and not specifying the tempLocation?


@Override
public String create(PipelineOptions options) {
GcsOptions gcsOptions = options.as(GcsOptions.class);
LOG.info("No staging location provided, falling back to temp location.");
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

suggest formatting the options as options: "No stagingLocation provided, falling back to gcpTempLocation"

@@ -278,7 +278,7 @@ static String tryCreateDefaultBucket(
}
final String bucketName =
"dataflow-staging-" + region + "-" + projectNumber;
LOG.info("No staging location provided, attempting to use default bucket: {}",
LOG.info("No temp location provided, attempting to use default bucket: {}",
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

"No tempLocation specified, attempting to use default bucket:"

@coveralls
Copy link

Coverage Status

Coverage increased (+0.006%) to 70.495% when pulling e8421d8 on vikkyrk:df_temp_loc into 3bffe0e on apache:master.

@vikkyrk
Copy link
Contributor Author

vikkyrk commented May 5, 2017

If tempLocation is not specified:
INFO: No tempLocation specified, attempting to use default bucket: dataflow-staging-us-central1-996210899482

If both tempLocation and stagingLocation is not specified
INFO: No tempLocation specified, attempting to use default bucket: dataflow-staging-us-central1-996210899482
INFO: No stagingLocation provided, falling back to gcpTempLocation

@vikkyrk
Copy link
Contributor Author

vikkyrk commented May 5, 2017

Done

@coveralls
Copy link

Coverage Status

Coverage increased (+0.01%) to 70.5% when pulling 02b50fd on vikkyrk:df_temp_loc into 3bffe0e on apache:master.

@bjchambers
Copy link
Contributor

The exception I get from this is still not particularly useful. It says:

...
Caused by: java.lang.IllegalArgumentException: Missing object or bucket in path: 'gs://dataflow-staging-us-central1-927334603519/', did you mean: 'gs://some-bucket/dataflow-staging-us-central1-927334603519'?
...

@vikkyrk
Copy link
Contributor Author

vikkyrk commented May 5, 2017

What command did you run? You shouldn't see an exception after this change for missing either of tempLocation or stagingLocation

@bjchambers
Copy link
Contributor

Ah -- it looks like it wasn't using the patched version of the SDK. Looks good now.

@vikkyrk
Copy link
Contributor Author

vikkyrk commented May 5, 2017

@bjchambers @davorbonaci Looks good to be merged.

@asfgit asfgit closed this in e919083 May 5, 2017
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

None yet

4 participants