
Should I regenerate the savedmodel when I use a new project #10

Closed
Colibrow opened this issue Jun 7, 2021 · 22 comments

Comments

@Colibrow commented Jun 7, 2021

Hi @yundiqian, I have migrated the demo setup to the chrome/v8 project and got a 5% reduction in binary size. I'd like to know whether I need to regenerate the saved model or whether I can reuse the exact one generated for Fuchsia, etc.

@Colibrow (Author) commented Jun 7, 2021

After testing, the original saved model could not be reused, so I'm closing this!

@Colibrow closed this as completed Jun 7, 2021
@mtrofin (Collaborator) commented Jun 7, 2021

When you say "the original saved model could not be reused", do you mean you could not build the compiler with it embedded, or that its performance wasn't as good as that of the model you trained on chrome/v8?

@Colibrow (Author) commented Jun 8, 2021

> When you say "the original saved model could not be reused", do you mean you could not build the compiler with it embedded, or that its performance wasn't as good as that of the model you trained on chrome/v8?

The compiler builds okay, but the binary size is bigger than the one that doesn't use the model, so I am training a new model for my personal project.

@yundiqian (Collaborator) commented:

I see, that is possible: Fuchsia code may be quite different from the v8 code, so the model trained on Fuchsia does not work well on v8.

@Colibrow (Author) commented Jun 9, 2021

> I see, that is possible: Fuchsia code may be quite different from the v8 code, so the model trained on Fuchsia does not work well on v8.

@yundiqian @mtrofin
Sadly, I found that the model trained specifically for this project didn't work either: the .so size was bigger than the one built without the model. Is there any way to find out which compile flag influences the result? Do I need to disable the -flto flag?

@mtrofin (Collaborator) commented Jun 9, 2021

We don't support LTO currently.

The model included with llvm is a reasonable reference, but we didn't use an overly comprehensive codebase when we trained it; that's why Fuchsia, for example, builds their own, which holds up well over time (as their codebase and as the compiler evolve).

@Colibrow (Author) commented Jun 9, 2021

> We don't support LTO currently.
>
> The model included with llvm is a reasonable reference, but we didn't use an overly comprehensive codebase when we trained it; that's why Fuchsia, for example, builds their own, which holds up well over time (as their codebase and as the compiler evolve).

So do I need to regenerate the model with an LLVM build that has LTO disabled?

@mtrofin (Collaborator) commented Jun 9, 2021

Yes, when training your own model, disable LTO, and (of course) make sure you're passing -Oz to clang.
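
For reference, a minimal sketch of that flag setup (file names here are placeholders; -Oz and -flto are the flags discussed above):

```sh
# Size-optimized build used while training: keep -Oz, no LTO.
clang++ -Oz -c foo.cpp -o foo.o

# Avoid this while training -- LTO isn't supported by the training flow:
# clang++ -Oz -flto -c foo.cpp -o foo.o
```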

@Colibrow (Author) commented Jun 9, 2021

I'll try it. Thanks!

@yundiqian (Collaborator) commented:

> Hi @yundiqian, I have migrated the demo setup to the chrome/v8 project and got a 5% reduction in binary size. I'd like to know whether I need to regenerate the saved model or whether I can reuse the exact one generated for Fuchsia, etc.

I'm a little confused. To be clear, which model caused the 5% size reduction, on which binary?

@Colibrow (Author) commented Jun 9, 2021

> Hi @yundiqian, I have migrated the demo setup to the chrome/v8 project and got a 5% reduction in binary size. I'd like to know whether I need to regenerate the saved model or whether I can reuse the exact one generated for Fuchsia, etc.
>
> I'm a little confused. To be clear, which model caused the 5% size reduction, on which binary?

emm.. I have tried three projects using ml-compiler-opt: I got a 7% size reduction on the Fuchsia demo, 5% on the chrome/v8 build (trained 100*2000 to save time), and -2% on my personal project (-.-)
I am retraining the third model because I trained it with -flto last time.
Plus, I am migrating the model into an Android CMake project whose stripped binary size is 1.7 MB or so.

@yundiqian (Collaborator) commented:

> Hi @yundiqian, I have migrated the demo setup to the chrome/v8 project and got a 5% reduction in binary size. I'd like to know whether I need to regenerate the saved model or whether I can reuse the exact one generated for Fuchsia, etc.
>
> I'm a little confused. To be clear, which model caused the 5% size reduction, on which binary?
>
> emm.. I have tried three projects using ml-compiler-opt: I got a 7% size reduction on the Fuchsia demo, 5% on the chrome/v8 build (trained 100*2000 to save time), and -2% on my personal project (-.-)
> I am retraining the third model because I trained it with -flto last time.
> Plus, I am migrating the model into an Android CMake project whose stripped binary size is 1.7 MB or so.

Got it, so it's 3 projects instead of 2 :) Is the "Android CMake project whose stripped binary size is 1.7 MB or so" a 4th project, different from your personal project?

In addition to retraining without -flto, you can also try our model included in llvm --- this is a model that we found generalizable across SPEC, so probably generalizable to your project as well.

@Colibrow (Author) commented Jun 9, 2021

Okay

> In addition to retraining without -flto, you can also try our model included in llvm --- this is a model that we found generalizable across SPEC, so probably generalizable to your project as well.

I will try it ASAP. So many thanks~

@Colibrow (Author) commented Jun 9, 2021

> In addition to retraining without -flto, you can also try our model included in llvm --- this is a model that we found generalizable across SPEC, so probably generalizable to your project as well.

Unfortunately, the size after training is bigger than the original one, which is built with -flto -faddrsig / -flto -Wl,-z,norelro,-z,lazy,--icf=all, by about 3% or so.

@mtrofin (Collaborator) commented Jun 9, 2021

To make sure I understand: you trained a model on your project (without LTO, but with -Oz), and then built with that model (also without LTO, and with -Oz)?

How does that size compare, all other options being the same, with building with the default heuristic?

@Colibrow (Author) commented Jun 9, 2021

Here are the approaches (a rough sketch of these build configurations follows the list):

  1. Build the project normally with nothing changed: the binary is 1705 kb.
  2. Delete -flto, with the other flags unchanged: the binary is 1755 kb.
  3. Build LLVM with the latest model in llvm-project, with LLVM_ENABLE_LTO false and TENSORFLOW_AOT_PATH set;
     then delete -flto and add -mllvm -enable-ml-inliner=release: the binary is 1823 kb.
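
A minimal sketch of what configuration 3 could look like, assuming hypothetical paths (LLVM_ENABLE_LTO, TENSORFLOW_AOT_PATH, and -mllvm -enable-ml-inliner=release are the flags named in the list):

```sh
# Build clang with the released inlining model AOT-compiled in.
cmake -G Ninja ../llvm \
  -DCMAKE_BUILD_TYPE=Release \
  -DLLVM_ENABLE_LTO=OFF \
  -DTENSORFLOW_AOT_PATH=/path/to/tensorflow    # hypothetical: where the tensorflow package lives
ninja clang

# Build the project with that clang: keep -Oz, drop -flto, enable the ML inliner.
clang++ -Oz -mllvm -enable-ml-inliner=release -c foo.cpp -o foo.o    # foo.cpp is a placeholder
```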

@Colibrow (Author) commented Jun 9, 2021

I haven't built the project-specific model yet because it needs a lot of time to train; when it's done, I'll post the result here~

@mtrofin (Collaborator) commented Jun 9, 2021

I see now - thanks!

(fwiw - LLVM_ENABLE_LTO can be enabled for clang - just no -flto for your project)
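
In other words, something like the following (paths hypothetical):

```sh
# LTO for the clang binary itself is fine:
cmake -G Ninja ../llvm -DLLVM_ENABLE_LTO=ON -DTENSORFLOW_AOT_PATH=/path/to/tensorflow
# ...as long as the project compiled by that clang does not pass -flto.
```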

@Colibrow (Author) commented:

FYI, I've tested my personal project again, with and without the SPEC model, and found that the project-specific model is better than the SPEC one but still worse than the original build.
Here is some data (sizes in bytes):
original build (nothing changed): 1712312
-flto removed: 1763256
-flto removed, project-specific model (enable-ml-inliner): 1757576
-flto kept, project-specific model: 1716344

@yundiqian (Collaborator) commented:

Hmm... interesting; we need to look into what happens during training to debug this.

Can you share your log file from training with tensorboard.dev, following the instructions here: https://tensorboard.dev/#get-started? (It's basically running two command lines.)

When running "tensorboard dev upload --logdir logs...", set the --logdir flag to the root_dir flag you used when running train_locally.py.
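
For reference, the two command lines from the tensorboard.dev get-started page are roughly the following (the log directory below is a placeholder for your actual root_dir):

```sh
pip install -U tensorboard
tensorboard dev upload --logdir /path/to/root_dir    # the same directory passed as root_dir to train_locally.py
```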

@Colibrow (Author) commented:

> Hmm... interesting; we need to look into what happens during training to debug this.
>
> Can you share your log file from training with tensorboard.dev, following the instructions here: https://tensorboard.dev/#get-started? (It's basically running two command lines.)
>
> When running "tensorboard dev upload --logdir logs...", set the --logdir flag to the root_dir flag you used when running train_locally.py.

Okay, I'll try it.

@Colibrow (Author) commented:

I've also tried the cronet project, and the reduction is likewise not obvious... I suspect my training process may be wrong?
Here are the details of how I applied the model: https://gist.github.com/Colibrow/9d2b31bc7eff127cfe74c807fce86451
I also found that using -flto alone may reduce size more than applying the trained model alone... I will post the log file later~
