Add unit test, official flags, and benchmark logs for recommendation model #4343
Conversation
Woo, looking so real.
def define_data_download_flags():
  """Add flags specifying data download arguments."""
  flags.DEFINE_string(
      name="data_dir", short_name="dd", default="/tmp/movielens-data/",
@robieta, what did we decide: no short names for flags defined within individual modules? I think that will make us happier in the end.
I will remove the short_name for now. We can always add it back if necessary. :)
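For reference, dropping the short name would look something like this (a sketch; the help text is paraphrased, not copied from the diff):

def define_data_download_flags():
  """Add flags specifying data download arguments."""
  flags.DEFINE_string(
      name="data_dir", default="/tmp/movielens-data/",
      help=flags_core.help_wrap(
          "Directory to download and extract the MovieLens data to."))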
    enum_values=["ml-1m", "ml-20m"], case_sensitive=False,
    help=flags_core.help_wrap(
        "Dataset to be trained and evaluated. Two datasets are available "
        "for now: ml-1m and ml-20m."))
nit: We don't have any plans to add others, so no need to specify "for now"
official/recommendation/ncf_main.py
Outdated
# Return estimator's last checkpoint as global_step for estimator
def _get_global_step(estimator):
  return int(estimator.latest_checkpoint().split("-")[-1])
Hmm. This is fragile and unreliable. Can you try estimator.get_variable_value(tf.GraphKeys.GLOBAL_STEP)? (Sorry, that's from memory, so may not be exactly the right syntax.) Or actually reading in the checkpoint and grabbing the global step?
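A minimal sketch of the suggested alternative, assuming a TF 1.x tf.estimator.Estimator (untested against the PR's code):

import tensorflow as tf

def _get_global_step(estimator):
  # tf.GraphKeys.GLOBAL_STEP is the canonical variable name ("global_step");
  # reading the variable directly avoids parsing the checkpoint filename,
  # which breaks if the naming scheme ever changes.
  return int(estimator.get_variable_value(tf.GraphKeys.GLOBAL_STEP))

Alternatively, tf.train.load_variable(estimator.model_dir, tf.GraphKeys.GLOBAL_STEP) reads the value straight from the checkpoint file.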
official/recommendation/ncf_main.py
Outdated
flags.DEFINE_float(
    name="mf_regularization", short_name="mr", default=0.0,
    help=flags_core.help_wrap("The Regularization for MF embeddings."))
nit: Regularization what? Factor? Feels like it's missing a noun. Also, maybe some advice on reasonable values, defaults, etc?
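One possible rewording that supplies the missing noun (an illustration only; the value guidance is generic, not from the PR):

flags.DEFINE_float(
    name="mf_regularization", default=0.0,
    help=flags_core.help_wrap(
        "The L2 regularization factor for MF embeddings. Defaults to 0.0 "
        "(no regularization); small positive values such as 1e-6 penalize "
        "large embedding weights."))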
official/recommendation/ncf_main.py
Outdated
    enum_values=["ml-1m", "ml-20m"], case_sensitive=False,
    help=flags_core.help_wrap(
        "Dataset to be trained and evaluated. Two datasets are "
        "available for now: ml-1m and ml-20m."))
There should be a way with flags to display the choices and defaults for each of these without hard-coding them.
absl will automatically display allowed values for enums, so it can be removed altogether.
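For example, absl renders an enum flag in --help as <ml-1m|ml-20m>: <help text>, so the choices can be dropped from the help string entirely (a sketch):

flags.DEFINE_enum(
    name="dataset", default="ml-1m",
    enum_values=["ml-1m", "ml-20m"], case_sensitive=False,
    help=flags_core.help_wrap("Dataset to be trained and evaluated."))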
official/recommendation/ncf_main.py
Outdated
"available for now: ml-1m and ml-20m.")) | ||
|
||
flags.DEFINE_integer( | ||
name="num_factors", short_name="nf", default=8, |
See the above discussion re: short names.
official/recommendation/ncf_main.py
Outdated
flags.DEFINE_list(
    name="layers", short_name="ly", default=[64, 32, 16, 8],
    help=flags_core.help_wrap("The size of hidden layers for MLP."))
This should be a list? How does one enter the list? Comma-separated ints? Please define expectations for the user, display default.
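For context: absl list flags are entered on the command line as a single comma-separated string (--layers=64,32,16,8) and arrive in code as a list of strings, so an explicit int cast is needed. A sketch of a more explicit help text plus the conversion (names are illustrative):

flags.DEFINE_list(
    name="layers", default=["64", "32", "16", "8"],
    help=flags_core.help_wrap(
        "Comma-separated sizes of the MLP hidden layers, e.g. "
        "--layers=64,32,16,8. Defaults to 64,32,16,8."))

# Inside main(), after flags are parsed:
hidden_layer_sizes = [int(size) for size in flags.FLAGS.layers]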
official/recommendation/ncf_main.py
Outdated
"If passed, training will stop when the evaluation metric HR is " | ||
"greater than or equal to hr_threshold. For dataset ml-1m, the " | ||
"desired hr_threshold is 0.68; For dataset ml-20m, the threshold can " | ||
"be set as 0.95.")) |
Are these set auto-magically? They should be, and that should be clear from the help text as well.
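One way to make the thresholds automatic, sketched with hypothetical helper names (the dictionary values are the ones quoted in the help text above):

# HR targets per dataset, taken from the help text above.
_HR_THRESHOLDS = {"ml-1m": 0.68, "ml-20m": 0.95}

def resolve_hr_threshold(flags_obj):
  """Fall back to the dataset's default HR target if none was passed."""
  if flags_obj.hr_threshold is None:
    return _HR_THRESHOLDS[flags_obj.dataset]
  return flags_obj.hr_threshold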
Thank you for the comments, Karmel, especially the regularization one, which helped me identify a model-layer bug. Thanks a lot!
A few minor nits.
    help=flags_core.help_wrap(
        "Dataset to be trained and evaluated. Two datasets are available "
-       "for now: ml-1m and ml-20m."))
+       ": ml-1m and ml-20m."))
nit: no defaults
official/recommendation/ncf_main.py
Outdated
flags.DEFINE_list(
    name="mlp_regularization", default=["0.", "0.", "0.", "0."],
    help=flags_core.help_wrap(
        "The regularization factor for each MLP layer. See ml_regularization "
What is ml_regularization? Should this be mf_regularization?
Hi All,
I have added official flags and benchmark logs for the recommendation model. Based on previous comments, a simple unit test for dataset.py has also been added. Since dataset.py needs to read csv files, I added a folder "unittest_data" for those files.