add pytorch backend trainer #247

Merged — 2 commits merged into main from haifeng-torch on Jun 2, 2023
Conversation

@haifeng-jin (Member) commented Jun 2, 2023

Have a basic working example of using Keras with PyTorch backend.
Some notable changes to other parts of the code:

  • Use backend.standardize_dtype() when possible, since dtypes in PyTorch differ from the other backends: a torch dtype has no .name attribute and is not a str (see the sketch after this list).
  • In ProgBar, use ops instead of np for computing the mean. Tensors from the other backends work seamlessly with np, but torch.Tensor does not.
  • A minor change in torch/numpy.py: torch.zeros(*shape) does not work when shape is an empty tuple (also shown in the sketch below). Do the other ops support this corner case? @nkovela1 @chenmoneygithub
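
For illustration only (not part of the PR), a minimal sketch of the two corner cases called out above, assuming NumPy and a PyTorch 2.x install:

    import numpy as np
    import torch

    # NumPy dtypes carry a `.name` string such as "float64"; per the
    # description above, torch dtypes are torch.dtype objects without
    # that attribute, and str(torch.float32) is "torch.float32" rather
    # than "float32" -- hence backend.standardize_dtype().
    print(np.zeros(3).dtype.name)   # "float64"
    print(str(torch.float32))       # "torch.float32"

    # The zeros corner case: per the description above, unpacking an
    # empty shape tuple (torch.zeros(*shape)) fails, while passing the
    # tuple itself produces a 0-d tensor.
    shape = ()
    print(np.zeros(shape).shape)     # ()
    print(torch.zeros(shape).shape)  # torch.Size([])
    # torch.zeros(*shape)            # reported above to fail when shape == ()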

Missing features to be added later:

  • Use TorchScript for the training step (an illustrative sketch follows this list).
  • Support steps_per_execution.
  • Support validation data. (implement .evaluate())
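
The compiled training step is deferred by this PR. Purely as an illustration of that item (not the PR's implementation), a plain-PyTorch step can be wrapped with torch.compile, used here as a stand-in for the TorchScript work; the toy model, optimizer, and loss below are assumptions of this sketch:

    import torch
    from torch import nn

    model = nn.Linear(10, 1)
    optimizer = torch.optim.SGD(model.parameters(), lr=1e-2)
    loss_fn = nn.MSELoss()

    def train_step(x, y):
        # One eager training step: forward, loss, backward, update.
        optimizer.zero_grad()
        loss = loss_fn(model(x), y)
        loss.backward()
        optimizer.step()
        return loss

    # Stand-in for the "compile the training step" item above
    # (torch.compile requires PyTorch 2.x).
    compiled_step = torch.compile(train_step)
    print(compiled_step(torch.randn(8, 10), torch.randn(8, 1)))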

@haifeng-jin haifeng-jin requested a review from fchollet June 2, 2023 06:36
@fchollet (Member) left a comment:

Great work! It looks very clean and close to the TF trainer.

The top priority now will be implementing evaluate/predict and adding compilation support. Do you expect any major issue with compilation?

steps_per_execution is less important and won't be a launch blocker. I'd rather focus on overall performance.

x=np.random.rand(100, 10),
y=np.random.rand(100, 1),
epochs=10,
shuffle=False,
@fchollet (Member) commented on the hunk above:

QQ: are you able to run integration_tests/numerical_test.py to check the end-to-end numerics?

@haifeng-jin (Member Author) replied:

Not yet, since it requires .evaluate() to be implemented.

data = data[0]
x, y, sample_weight = data_adapter_utils.unpack_x_y_sample_weight(data)

self.train()
@fchollet (Member) commented on the hunk above:

Can you comment on what this does?

@haifeng-jin (Member Author) replied:

This one should be removed, since it duplicates the call in .fit(). I added a comment on the one in the .fit() function.
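
For context (not part of the diff): in PyTorch, Module.train() switches a module and its children into training mode, which changes the behavior of layers such as Dropout and BatchNorm; Module.eval() is the inference-mode counterpart.

    from torch import nn

    m = nn.Dropout(p=0.5)
    m.train()           # training mode: dropout is active
    print(m.training)   # True
    m.eval()            # eval mode: dropout becomes the identity
    print(m.training)   # False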

# TODO: respect compiled trainable state
if validation_split and validation_data is None:
# Create the validation data using the training data. Only supported
# for TF/numpy/jax arrays.
@fchollet (Member) commented on the hunk above:

Should work for torch tensors as well -- we should start adding data adapter tests targeting torch tensors (in a new PR)
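
A rough sketch of the kind of end-to-end check being suggested here (not part of this PR; the toy model, the keras_core import style, and whether validation_split already accepts torch tensors at this commit are all assumptions of the sketch):

    import torch
    import keras_core

    model = keras_core.Sequential([keras_core.layers.Dense(1)])
    model.compile(optimizer="sgd", loss="mse")

    # Torch tensors fed straight into fit() with validation_split,
    # mirroring the existing numpy/TF/jax array paths.
    x = torch.randn(100, 10)
    y = torch.randn(100, 1)
    model.fit(x, y, validation_split=0.2, epochs=1, batch_size=16)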

keras_core/backend/torch/trainer.py (review thread marked outdated and resolved)
@@ -159,7 +158,7 @@ def update(self, current, values=None, finalize=None):
             for k in self._values_order:
                 info += f" - {k}:"
                 if isinstance(self._values[k], list):
-                    avg = np.mean(
+                    avg = ops.mean(
@fchollet (Member) commented on the hunk above:

By the time we hit this I'd expect the logs to be Python/numpy, is that not the case?

@haifeng-jin (Member Author) replied:

When I switch the backend to TF, the values here are still tf.Tensor. This is because Trainer.compute_metrics returns backend-specific tensors rather than the plain types the docs say it should.

@fchollet (Member) replied:

Ok, we can leave it as is. Thanks!
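
For reference, a small illustration (not from the PR) of why np.mean can break on torch tensors, which is what motivated switching ProgBar to ops.mean; the exact failure depends on the tensor (grad tracking, CUDA placement):

    import numpy as np
    import torch

    t = torch.ones(3, requires_grad=True)
    try:
        np.mean(t)  # implicit numpy conversion of a grad-tracking tensor fails
    except (RuntimeError, TypeError) as err:
        print("np.mean failed:", err)

    # Detaching (or using backend-dispatching ops, as the PR does) avoids this.
    print(np.mean(t.detach().numpy()))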

@haifeng-jin (Member Author) commented:

For TorchScript, I do not see any major issues, but I would expect a series of small ones.

@haifeng-jin haifeng-jin requested a review from fchollet June 2, 2023 17:46
@fchollet fchollet merged commit 0724e27 into main Jun 2, 2023
4 checks passed
@haifeng-jin haifeng-jin deleted the haifeng-torch branch June 2, 2023 18:32