Conversation

qlzh727 (Member) commented Mar 27, 2018

Also update the benchmark logger and bigquery schema for the
errors found during the integration test.

qlzh727 added 3 commits March 27, 2018 12:24
This was causing an error since the Kokoro test has TF_PKG=tf-nightly
injected during the test.
This library requires the Google Cloud BigQuery library as a dependency, which
can be installed with:
> pip install --upgrade google-cloud-bigquery

Contributor

Also add to requirements.txt

Member Author

Done.

  logging_dir: string, logging directory that contains the benchmark log.
  gcp_project: string, the name of the GCP project that the log will be
    uploaded to. The default project name will be detected from local
    environment if no value is provide.

Contributor

µnit: provided.

Member Author

Done

  credentials: google.auth.credentials. The credential to access the
    BigQuery service. The default service account credential will be
    detected from local environment if no value is provided. Please use
    google.oauth2.service_account.Credentials to load credential from local

Contributor

µnit: space before "Credentials"

Member Author

google.oauth2.service_account is the module name and Credentials is the class name. Do you want me to split them with a space?

Contributor

I can't read. Ignore, and I'm sorry.
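
For reference, a minimal sketch of loading an explicit credential from a local key file (the path is hypothetical; by default the uploader detects credentials from the environment):

from google.oauth2 import service_account

# Hypothetical key file path; omit the credential entirely to fall back to
# the default service account detected from the local environment.
credentials = service_account.Credentials.from_service_account_file(
    "/path/to/service_account_key.json")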

metrics = []
for l in lines:
  if not l.strip(): continue
  metric = json.loads(l)

Contributor

I would be inclined to wrap this in a try-except (and log error). That way a single malformed line doesn't ruin all entries.

Member Author

The inline if is trying to catch the case of the last line of the file, which only contains "\n". This is kind of an expected entry; maybe I should just do an inline strip and filter of the "line"?

Contributor

I don't remember the exact failure modes, but because JSON is a fragile container (1 bad byte can corrupt the entire entry), I've always seen code that skips invalid entries.
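
A minimal sketch of that defensive pattern, wrapping the loop quoted above in a helper (the function name and logger call are illustrative):

import json
import logging

def parse_metric_lines(lines):
  """Parse JSON metric lines, skipping blank and malformed entries."""
  metrics = []
  for line in lines:
    if not line.strip():
      # Skip blank lines, e.g. the trailing newline at the end of the file.
      continue
    try:
      metrics.append(json.loads(line))
    except ValueError:
      # One malformed entry should not discard the rest of the log.
      logging.warning("Failed to parse benchmark log line: %s", line)
  return metrics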

FLAGS = None


class BigQueryUploader(object):

Contributor

I have a conceptual proposal:

  1. Pull dataset_name and run_id into __init__(). In effect, make all local information part of the class state.
  2. Support intermediate uploading. So if I call upload_metric() with 500 lines, and then call it again with 1000, the second call will upload lines 501 to 1000.

The reason is that it would allow intermediate uploading during the training loop, rather than at the very end. Depending on the sort of monitoring we want to build on top of BigQuery, that could be desirable. Discuss.

Member Author

Discussed offline. Since BigQuery does not care about the quota, I think I can update the BenchmarkLogger to contain an instance of the uploader in the future, and do a direct upload alongside writing to the local file, treating BigQuery as a file system.

Will address that in a separate change.
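
A rough sketch of the proposed shape, with hypothetical names, just to make the idea concrete:

class BigQueryUploader(object):
  """Sketch only: dataset name and run id become constructor state."""

  def __init__(self, dataset_name, run_id):
    self._dataset_name = dataset_name
    self._run_id = run_id
    self._uploaded_count = 0  # How many metric lines have been uploaded so far.

  def upload_metric(self, metric_lines):
    # Upload only the lines that arrived since the previous call, enabling
    # intermediate uploads during the training loop.
    new_lines = metric_lines[self._uploaded_count:]
    self._uploaded_count = len(metric_lines)
    # ... send new_lines to the BigQuery dataset ...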



if __name__ == "__main__":
  parser = argparse.ArgumentParser()

Contributor

Can we make this a class?

Member Author

k, moving this to arg_parser.

)
parser.add_argument(
    "--bigquery_data_set", "-bds", default="test_benchmark",
    help="The Bigquery dataset name where the benchmark will be uploaded.",

Contributor

How come there are no "[default: %(default)s]" for the rest of these?

Member Author

Done.
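
For context, a minimal sketch of the argparse pattern in question (the flag is reused from the diff; the rest is illustrative):

import argparse

parser = argparse.ArgumentParser()
parser.add_argument(
    "--bigquery_data_set", "-bds", default="test_benchmark",
    # argparse expands %(default)s, so --help displays the configured default.
    help="The BigQuery dataset name where the benchmark will be uploaded. "
         "[default: %(default)s]")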

" be uploaded.",
metavar="<BMT>"
)
FLAGS, unparsed = parser.parse_known_args()

Contributor

Use parse_args() instead of parse_known_args() so it blows up with unrecognized args.

Member Author

Done. Moved to use arg_parser
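
A quick illustration of the difference (argument values are made up):

import argparse

parser = argparse.ArgumentParser()
parser.add_argument("--bigquery_data_set", "-bds", default="test_benchmark")

# parse_known_args() silently collects unknown flags instead of failing:
FLAGS, unparsed = parser.parse_known_args(["--bigquery_data_set=x", "--typo=1"])

# parse_args() exits with an error on any unrecognized flag, surfacing typos:
FLAGS = parser.parse_args(["--bigquery_data_set=x"])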

    self.log_metric(key, eval_results[key], global_step=global_step)

-  def log_metric(self, name, value, unit=None, global_step=None, extras=None):
+  def log_metric(self, name, value, unit=None, global_step=None, extras={}):

Contributor

Mutable types should not be default args.

Member Author

Done.
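
The standard fix, sketched against the signature from the diff:

def log_metric(self, name, value, unit=None, global_step=None, extras=None):
  # Default to None and create a fresh dict per call; a mutable default
  # ({}) would be shared by every call that omits the argument.
  extras = extras or {}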


def setUp(self):
  super(BenchmarkLoggerTest, self).setUp()
  self.original_environ = dict(os.environ)

Contributor

Could you put a comment about why this was necessary and when others would need to worry about environment variables?

Member Author

Done.
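
A minimal sketch of the snapshot-and-restore pattern (the tearDown half is an assumption mirroring this setUp; the rationale comment reflects the docstrings earlier in the thread):

def setUp(self):
  super(BenchmarkLoggerTest, self).setUp()
  # Snapshot os.environ: the logger picks up GCP settings from environment
  # variables, and the tests mutate them, so restore the snapshot afterwards
  # to avoid leaking state between tests.
  self.original_environ = dict(os.environ)

def tearDown(self):
  os.environ.clear()
  os.environ.update(self.original_environ)
  super(BenchmarkLoggerTest, self).tearDown()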

qlzh727 merged commit 932364b into tensorflow:master on Mar 28, 2018
qlzh727 deleted the metric-upload branch on March 28, 2018 at 20:07