State serialization/deserialization overhaul #247
Conversation
Wow, that was quick, with lots of awesome features.
I found a minor fix in the README doc, plus some additional updates.
[`bincode`](https://github.com/bincode-org/bincode) (for compactness) and included as part of the final wasm output. The MNIST model is initialized with trained weights from memory during the runtime.

The inference API for JavaScript is exposed with the help of [`wasm-bindgen`](https://github.com/rustwasm/wasm-bindgen)'s library and tools.
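As a rough illustration of what this README excerpt describes (not the project's actual code), here is a minimal sketch of embedding bincode-serialized weights in the wasm binary and exposing inference via wasm-bindgen; the state file path, the `MnistState` struct, and the `infer` function are hypothetical names.

```rust
// Minimal sketch only: `mnist_state.bin`, `MnistState`, and `infer` are
// illustrative names, not the crate's real API.
use wasm_bindgen::prelude::*;

// The bincode-serialized state is baked into the wasm binary at compile time,
// so the weights are loaded from memory rather than fetched at runtime.
static WEIGHTS: &[u8] = include_bytes!("mnist_state.bin");

#[derive(serde::Deserialize)]
struct MnistState {
    weights: Vec<f32>,
}

#[wasm_bindgen]
pub fn infer(input: &[f32]) -> Vec<f32> {
    // Decode the embedded state from memory.
    let state: MnistState = bincode::deserialize(WEIGHTS).expect("embedded state decodes");
    // The real forward pass would run here; this just scales the input.
    let scale = state.weights.first().copied().unwrap_or(1.0);
    input.iter().map(|x| x * scale).collect()
}
```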
Under future improvements, you can now remove #201 and #202.
You can probably also remove the references to the wasm file since it is smaller, so that we don't have to keep updating it.
The state is not saved with f16 anymore, since it doesn't compile correctly without std, so I'll keep the link to issue 202 as a potential improvement.
> You can probably also remove the references to the wasm file since it is smaller, so that we don't have to keep updating it.

I'm not sure what you are referring to.
Sorry, I meant in the README file: the document talks about the file size of the wasm output under the comparison section (e.g. 1,509,747 bytes).
It's okay. I'll update the README file not to talk about the file sizes.
It's the same size because it's still in f32... for now :)
Looks good to me. I had a few questions regarding the f16 changes; please see if we need to modify the code further.
```diff
@@ -64,14 +65,21 @@ pub fn run<B: ADBackend>(device: B::Device) {
     .metric_valid_plot(AccuracyMetric::new())
     .metric_train_plot(LossMetric::new())
     .metric_valid_plot(LossMetric::new())
-    .with_file_checkpointer::<f32>(2)
+    .with_file_checkpointer::<burn::tensor::f16>(2, StateFormat::default())
```
Is it still f16, since you reverted your f16 changes?
The checkpoints during training are f16 with the compressed bin format (bin.gz), but we now save the final model somewhere else; we don't use the checkpoints.
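A rough sketch of the precision-conversion step being discussed, assuming the `half` crate; the function below is illustrative and not burn's actual checkpointing code.

```rust
// Illustrative only: converts f32 weights to f16 so a checkpoint takes
// roughly half the space before it is serialized and compressed.
use half::f16;

fn to_half_precision(weights: &[f32]) -> Vec<f16> {
    weights.iter().copied().map(f16::from_f32).collect()
}

fn main() {
    let full = vec![0.1f32, -0.25, 3.5];
    let halved = to_half_precision(&full);
    // Each element now occupies 2 bytes instead of 4.
    assert_eq!(halved[2], f16::from_f32(3.5));
}
```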
```diff
@@ -78,17 +78,18 @@ pub fn train<B: ADBackend, D: TextClassificationDataset + 'static>(
     .metric_valid(AccuracyMetric::new())
     .metric_train_plot(LossMetric::new())
     .metric_valid_plot(LossMetric::new())
-    .with_file_checkpointer::<f32>(2)
+    .with_file_checkpointer::<burn::tensor::f16>(2, StateFormat::default())
```
Do you need to revert this change, or does it work because serialization works with std?
Serialization works with std.
Fix #202: I saved models with f16, but it's also possible to save weights with bf16; I don't expect much difference in performance between the two.
Fix #201: by default we save states in compressed bincode, which is extremely small :)
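For illustration, a compressed-bincode save/load could look roughly like the sketch below; this assumes the `bincode` and `flate2` crates, and the `save_state`/`load_state` helpers are hypothetical names, not burn's actual API.

```rust
use flate2::{read::GzDecoder, write::GzEncoder, Compression};
use serde::{de::DeserializeOwned, Serialize};
use std::fs::File;
use std::io::{Read, Write};

// Hypothetical helper: serialize a state with bincode, then gzip it (bin.gz).
fn save_state<S: Serialize>(state: &S, path: &str) -> std::io::Result<()> {
    let bytes = bincode::serialize(state).expect("state should serialize");
    let mut enc = GzEncoder::new(File::create(path)?, Compression::default());
    enc.write_all(&bytes)?;
    enc.finish()?;
    Ok(())
}

// Hypothetical helper: the reverse of `save_state`.
fn load_state<S: DeserializeOwned>(path: &str) -> std::io::Result<S> {
    let mut bytes = Vec::new();
    GzDecoder::new(File::open(path)?).read_to_end(&mut bytes)?;
    Ok(bincode::deserialize(&bytes).expect("state should deserialize"))
}
```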