Add embedding table compression script #261

xiangzez · 2022-07-06T09:52:00Z

Description

Brief Description of the PR:

Hi, This PR contains a script to compress TFRA dynamic embedding tables' value datatype from float32 to float16. It can be used to process exported SavedModel and can reduce ~50% size of models that have very large embedding tables. It is integrated into movielens-100k-estimator demo and I hope it can be a reference to other users who have similar needs.

The script only edits CuckooHashTable related ops in the model graph and will add a cast after CuckooHashTableFind op to keep outputs the same, so other parts of the model are not affected.

Type of change

Checklist:

I've properly formatted my code according to the guidelines
- By running yapf
- By running clang-format
This PR addresses an already submitted issue for TensorFlow Recommenders-Addons
I have made corresponding changes to the documentation
I have added tests that prove my fix is effective or that my feature works

How Has This Been Tested?

If you're adding a bugfix or new feature please describe the tests that you ran to verify your changes:
*

Add a script to compress TFRA dynamic embedding tables' value datatype from float32 to float16. It can be used for exported SavedModel.

demo/dynamic_embedding/movielens-100k-estimator/movielens-100k-estimator.py

rhdong · 2022-07-06T10:23:38Z

Hi @xiangzez , thank you for your first contribution, it's very valuable! I would like to know how much does compression affect the model's accuracy? I mean if you have done some comparison or benchmark, you can add it to the Readme of the demo for other user. Thank you!

xiangzez · 2022-07-07T09:18:05Z

Hi @rhdong, this is a work we did for our customers, so we don't have accuracy change data of their production models. However this demo shows comparison of loss before/after compression. I have added example outputs in readme.

rhdong

LGTM

Add embedding table compression script

f59949f

Add a script to compress TFRA dynamic embedding tables' value datatype from float32 to float16. It can be used for exported SavedModel.

rhdong reviewed Jul 6, 2022

View reviewed changes

demo/dynamic_embedding/movielens-100k-estimator/movielens-100k-estimator.py Outdated Show resolved Hide resolved

rhdong self-requested a review July 6, 2022 10:24

Move to a separate demo

78f3d26

rhdong requested a review from luliyucoordinate July 18, 2022 01:22

rhdong approved these changes Jul 19, 2022

View reviewed changes

rhdong merged commit 0b81f24 into tensorflow:master Jul 19, 2022

xiangzez deleted the compress_model branch July 22, 2022 07:18

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add embedding table compression script #261

Add embedding table compression script #261

xiangzez commented Jul 6, 2022

rhdong commented Jul 6, 2022 •

edited

xiangzez commented Jul 7, 2022

rhdong left a comment

Add embedding table compression script #261

Add embedding table compression script #261

Conversation

xiangzez commented Jul 6, 2022

Description

Type of change

Checklist:

How Has This Been Tested?

rhdong commented Jul 6, 2022 • edited

xiangzez commented Jul 7, 2022

rhdong left a comment

Choose a reason for hiding this comment

rhdong commented Jul 6, 2022 •

edited