Add data schema for the benchmark run in Bigquery. #3585


Merged

qlzh727 merged 3 commits into tensorflow:master from qlzh727:benchmark-schema

Mar 14, 2018

Member

qlzh727 commented Mar 13, 2018

The current schema covers entity information about the model and
the training data metadata, as well as the machine config. A future
change will add the benchmark metrics.

The JSON schema can be used to create a BigQuery table. A sample
table can be found at
https://bigquery.cloud.google.com/table/tf-benchmark-dashboard:test_benchmark.benchmark_run.
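
As a rough illustration of the JSON schema format that BigQuery table definitions use, the sketch below builds a small schema and checks it recursively. The field names `model_name`, `name` and `value` and the helper `validate_schema` are hypothetical and are not taken from benchmark_run.json; only the `mode`/`name`/`type` keys and the `attribute` record mirror what appears in this PR's diff.

```python
import json

# Modes and a subset of the types used by BigQuery's JSON schema format.
VALID_MODES = {"NULLABLE", "REQUIRED", "REPEATED"}
VALID_TYPES = {"STRING", "INTEGER", "FLOAT", "BOOLEAN", "TIMESTAMP", "RECORD"}


def validate_schema(fields, path=""):
    """Return a list of problems found in a list of schema field dicts."""
    problems = []
    for field in fields:
        name = field.get("name")
        where = f"{path}{name}" if name else f"{path}<unnamed>"
        if not name:
            problems.append(f"{where}: missing name")
        if field.get("type") not in VALID_TYPES:
            problems.append(f"{where}: unknown type {field.get('type')!r}")
        # A missing mode is treated as NULLABLE.
        if field.get("mode", "NULLABLE") not in VALID_MODES:
            problems.append(f"{where}: unknown mode {field.get('mode')!r}")
        if field.get("type") == "RECORD":
            nested = field.get("fields")
            if not nested:
                problems.append(f"{where}: RECORD needs nested fields")
            else:
                # Recurse into the nested fields of a RECORD column.
                problems.extend(validate_schema(nested, path=f"{where}."))
    return problems


# Hypothetical two-column schema, for illustration only.
SAMPLE_SCHEMA = json.loads("""
[
  {"mode": "NULLABLE", "name": "model_name", "type": "STRING"},
  {"mode": "REPEATED", "name": "attribute", "type": "RECORD",
   "fields": [
     {"mode": "NULLABLE", "name": "name", "type": "STRING"},
     {"mode": "NULLABLE", "name": "value", "type": "STRING"}
   ]}
]
""")

if __name__ == "__main__":
    print(validate_schema(SAMPLE_SCHEMA) or "schema looks well-formed")
```

A check like this could run before the schema file is handed to BigQuery, so that a malformed nested record is caught locally rather than at table creation time.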


          Add data schema for the benchmark run in Bigquery.

83a867b


qlzh727 requested review from k-w-w, karmel, robieta and yhliang2018

March 13, 2018 22:19

qlzh727 requested a review from nealwu as a code owner

March 13, 2018 22:19

googlebot added the cla: yes label

qlzh727 requested a review from tfboyd

March 13, 2018 22:19


          Add data schema of benchmark metric for bigquery.

0bfab87

Member Author

qlzh727 commented Mar 14, 2018

Sample tables can be found in https://bigquery.cloud.google.com/dataset/tf-benchmark-dashboard:test_benchmark.

karmel removed the request for review from nealwu

March 14, 2018 16:39

karmel reviewed


official/benchmark/datastore/schema/benchmark_run.json

+                  "mode": "REPEATED",
+                  "name": "attribute",
+                  "type": "RECORD"
+                },

Contributor

karmel Mar 14, 2018

Some other things that would be nice:

- Commit hash indicating exactly what code was run.
- Command line used to run the model.
- Any env variables set outside of the code that are relevant to the model itself rather than the compute environment (ie, TF_ENABLE_WINOGRAD_NONFUSED could be set from outside the code and would change algorithm choice inside the code).
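
A minimal sketch of how these three items might be collected at run time, assuming the benchmark is launched from inside a git checkout. The helper names and the `TF_` prefix filter are hypothetical and are not part of this PR.

```python
import os
import shlex
import subprocess
import sys


def get_commit_hash(repo_dir="."):
    """Return the current git commit hash, or None if it cannot be read."""
    try:
        out = subprocess.check_output(
            ["git", "rev-parse", "HEAD"], cwd=repo_dir,
            stderr=subprocess.DEVNULL)
    except (OSError, subprocess.CalledProcessError):
        # git is missing, or repo_dir is not inside a git checkout.
        return None
    return out.decode("utf-8").strip()


def filter_env(environ, prefixes=("TF_",)):
    """Keep only variables whose names start with one of the prefixes."""
    return {k: v for k, v in sorted(environ.items())
            if k.startswith(tuple(prefixes))}


def collect_run_context(argv=None, environ=None):
    """Gather the commit hash, command line, and model-related env vars."""
    argv = sys.argv if argv is None else argv
    environ = os.environ if environ is None else environ
    return {
        "commit_hash": get_commit_hash(),
        # Quote each argument so the command can be pasted back into a shell.
        "command_line": " ".join(shlex.quote(arg) for arg in argv),
        "env_vars": filter_env(environ),
    }


if __name__ == "__main__":
    print(collect_run_context())
```

Filtering by prefix is only one possible policy; an explicit allowlist of variable names would avoid recording unrelated or sensitive values.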

Member Author

qlzh727 Mar 14, 2018

Good point. Adding TF version info and environment variables. The command line info should be captured by the attributes.

official/benchmark/datastore/schema/benchmark_run.json Outdated

+                  "type": "RECORD"
+                },
+                {
+                  "description": "The list of hyper parameter of the model.",

Contributor

karmel Mar 14, 2018

nit: hyperparameters

Member Author

qlzh727 Mar 14, 2018

Done.

official/benchmark/datastore/schema/benchmark_run.json Outdated

+                    }
+                  ],
+                  "mode": "REPEATED",
+                  "name": "hyper_parameter",

Contributor

karmel Mar 14, 2018

same nit: one word

Member Author

qlzh727 Mar 14, 2018

Done.

official/benchmark/datastore/schema/benchmark_run.json

+                        {
+                          "mode": "NULLABLE",
+                          "name": "model",
+                          "type": "STRING"

Contributor

karmel Mar 14, 2018

We probably want to indicate in some way how the GPUs are configured:

- Cuda version
- Topology params we care about (cf @tfboyd)?
- Number of hosts will eventually become relevant, if we know it (ie, number of separate boards that all these GPUs live on)

Not sure if we want to anticipate these upfront, or just add as they come.

Member Author

qlzh727 Mar 14, 2018

Added cuda_version, which is standard; not sure whether we can capture the other info easily.

official/benchmark/datastore/schema/benchmark_run.json

+                          "name": "version",
+                          "type": "STRING"
+                        }
+                      ]

Contributor

karmel Mar 14, 2018

Do we want to try to capture cloud info here? ie, running on k8s versus a VM versus metal?

Member Author

qlzh727 Mar 14, 2018

Done. Added a section with minimal cloud info, and a free format key-value pair for the moment.
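
One way to picture the free-format key-value section: a `REPEATED` `RECORD` column can hold one row per entry. The sketch below flattens a dict into that shape. The field names `name` and `value` and the sample keys are assumptions for illustration and may differ from what benchmark_run.json actually uses.

```python
def to_key_value_records(mapping):
    """Convert a flat dict into a list of {"name", "value"} rows.

    Values are stringified so every row fits a STRING column.
    """
    return [{"name": str(key), "value": str(mapping[key])}
            for key in sorted(mapping)]


def from_key_value_records(records):
    """Inverse of to_key_value_records (values stay strings)."""
    return {row["name"]: row["value"] for row in records}


if __name__ == "__main__":
    # Hypothetical keys, for illustration only.
    print(to_key_value_records({"platform": "k8s", "num_nodes": 2}))
```

Storing everything as strings keeps the column schema fixed while the set of keys changes, at the cost of losing type information for numeric values.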

official/benchmark/datastore/schema/benchmark_run.json

+                      "mode": "NULLABLE",
+                      "name": "memory_available",
+                      "type": "STRING"
+                    }

Contributor

karmel Mar 14, 2018

Capturing env variables relevant to the compute environment would be good-- ie, CUDA_VISIBLE_DEVICES, whether to share GPU memory (sorry, forgetting what that one is right now, but it should suffice to say, there are many).

Member Author

qlzh727 Mar 14, 2018

Ack, for the moment, we will just dump them into env variables.

official/benchmark/datastore/schema/benchmark_run.json

+                        }
+                      ]
+                    },
+                    {

Contributor

karmel Mar 14, 2018

I see you went with JSON instead of Proto, which WFM if you find it preferable. But, to make the question more complicated-- what about YAML? We will have a bunch of those for k8s anyhow, and it's much more human-readable without all these brackets. Thoughts?

Member Author

qlzh727 Mar 14, 2018

The schema file is used to create the BigQuery table, and BigQuery only accepts JSON as the schema format. I don't have another option here.

official/benchmark/datastore/schema/benchmark_run.json

+                    }
+                  ]
+                },
+                {

Contributor

karmel Mar 14, 2018

Where should information about parameter server configuration live? I guess that's mostly about how the model itself is run. Maybe we don't need to explicitly store that as long as we capture the command line and code commit that was run.

Member Author

qlzh727 Mar 14, 2018

Ack. I am not worrying about that for the moment; we can update the data schema in the future if we want.

Contributor

karmel commented Mar 14, 2018

Oh, and another thought: tensorflow build/version should be represented somewhere.


          Address the comment from code review.

7da439b

1. Added TensorFlow version information.
2. Added environment variables.
3. Fixed the typo in hyperparameters.
4. Added cloud-related information.

Member Author

qlzh727 commented Mar 14, 2018

Ping

karmel approved these changes


qlzh727 merged commit bf5186b into tensorflow:master

qlzh727 deleted the benchmark-schema branch

March 20, 2018 20:04

