Load TorchScript modules #644
Conversation
return this;
}

#if false // These functions "work," but the native code doesn't seem to find any interesting information.
I want to try to figure out why the scripts don't seem to contain any information on element type or dimensions.
src/TorchSharp/JIT/ScriptModule.cs
{
    if (!System.IO.File.Exists(filename))
        throw new System.IO.FileNotFoundException(filename);
    return new ScriptModule(THSJIT_load(filename));
It will blow up in some fashion. We're just interfacing with the libtorch support for TorchScript, so whatever it does is what we get. I'll add a negative unit test.
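For illustration, a minimal sketch of such a negative test, assuming xUnit and a hypothetical ScriptModule.Load entry point wrapping the THSJIT_load call shown above:

    [Fact]
    public void LoadMissingScriptModule()
    {
        // The File.Exists guard above throws before libtorch is ever invoked,
        // so a missing file surfaces as a FileNotFoundException rather than
        // an opaque native error.
        Assert.Throws<System.IO.FileNotFoundException>(
            () => { ScriptModule.Load("no_such_module.pt"); });
    }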
Added a few comments. LGTM otherwise.
Adding article on TorchScript.
public unsafe override Tensor forward(Tensor tensor)
{
    using (var parray = new PinnedArray<IntPtr>()) {
@tarekgh -- is there some way to avoid the new[]{} and use stackalloc here, instead?
You don't need to use PinnedArray here at all. You could try the following, which lets you write the whole method without any unsafe code and without making any managed allocations:

    ReadOnlySpan<IntPtr> buffer = stackalloc IntPtr[1] { tensor.Handle };
    var res = THSJIT_Module_forward(handle, (IntPtr)buffer, 1);
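For context, here is a hedged sketch of how that suggestion could slot into forward. Note that casting a span straight to IntPtr, as in the snippet above, does not compile; pinning the stack buffer with fixed (which keeps the method unsafe but still avoids managed allocations) is one way to get the pointer. The THSJIT_Module_forward signature, the handle field, and the Tensor constructor are assumptions based on the surrounding diff:

    public unsafe override Tensor forward(Tensor tensor)
    {
        // One IntPtr on the stack instead of a pinned managed array.
        Span<IntPtr> buffer = stackalloc IntPtr[1];
        buffer[0] = tensor.Handle;

        fixed (IntPtr* p = buffer)
        {
            // Stack memory never moves; 'fixed' here only extracts the pointer.
            var res = THSJIT_Module_forward(handle, (IntPtr)p, 1);
            return new Tensor(res);
        }
    }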
Added unit test for a script module formed from a function in Pytorch.
I assume this is related to this functionality: torch.jit.save. For consistency, would it make sense to rename the extension *.dat in the unit tests to *.pt?
This will help separate three different file formats. *.pth and *.data [the TorchSharp-specific format for import and export] are discussed in saveload.md; with this new implementation, *.pt is now introduced to saveload.md as well. Instead of using model.ts in the unit tests, the consensus seems to be to keep it as model.pt (for TorchScript files, load or save).
The file extensions used in unit tests don't really matter.
Added doc comments.
But, yes, there will now be three file formats.
I think *.ts is a good extension for the TorchSharp-specific format used with importsd.py and exportsd.py.
Following the consensus practice of the Rust libtorch wrapper, *.pt is meant for TorchScript: created by torch.jit.save in Pytorch, then loaded and saved by the Rust wrapper. I believe these can all be interchanged; I hope this is right. If so, *.pt could be a good interchange format between Pytorch, TorchSharp, and perhaps other libtorch wrappers.
Another thought! .pt is used between Pytorch and libtorch C++ wrappers, including TorchSharp. Since a .pt file includes both the network architecture and the trained weights, it could be converted into ONNX using TorchSharp for interchange with ML.NET. Likewise, that conversion could be done in ML.NET. However, I think it makes more sense to have a utility to convert .pt to ONNX in TorchSharp.
That's what it's for. Please note that TorchSharp will lack the ability to create TorchScript files, even after this PR. We will just be able to load and save.
Agreed; however, TorchSharp is able to load TorchScript files and save them back as TorchScript through the libtorch C++ API, so the result is loadable by Pytorch or other libtorch wrappers.
I believe that's exactly how the ONNX exporter for Pytorch works. Since you have to start with Pytorch (not TorchSharp) in order to get a TorchScript file, I don't know why you would want to export to ONNX from TorchSharp. What is the scenario you have in mind?
Exported ONNX helps keep the .NET community within .NET :-) Imagine: many Pytorch communities share models in TorchScript format. TorchSharp could serve as the starting point for importing these TorchScript models, allowing the .NET community to do further training or inference within TorchSharp, or to export them to ONNX for inference using ML.NET.
Doing so would address the need for deep learning support in ML.NET identified in a survey done in April 2021.
Not until you can start from scratch in TorchSharp. Right now, you cannot have a TorchScript file without starting in Python, where you can also create an ONNX file from the same model. Once we can do torch.jit.script() in TorchSharp, sure, but until then this is still very limited functionality.
We do not have to start from scratch in TorchSharp :-)
Likewise, many .NET users do not care where ONNX models come from, as long as they are available for use in ML.NET.
In order to write an ONNX exporter, we would need to go through the same work that we need to do in order to export to TorchScript. So, until we're in a position to do that, ONNX export will also have to wait. I agree that it's important, but it's separate from this functionality, which allows you to import TorchScript, modify it, and save it, but nothing else. One thing at a time.
Agreed, one thing at a time. The final goal of keeping deep machine learning within .NET is happening!
With this PR, TorchSharp supports loading modules that were created using torch.jit.{trace,script} in Pytorch. There is no support for traced or scripted functions yet, and the PR also does not add support for tracing or scripting in TorchSharp.
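To illustrate the workflow this enables, a minimal sketch follows. The ScriptModule.Load, save, and tensor-factory names are assumptions for illustration, not necessarily the final public API; model.pt is assumed to have been produced in Pytorch via torch.jit.trace (or torch.jit.script) followed by torch.jit.save:

    // Load a TorchScript module that was traced/scripted and saved in Pytorch.
    // ScriptModule.Load is the hypothetical entry point wrapping THSJIT_load.
    var module = ScriptModule.Load("model.pt");

    // Run inference; forward marshals the tensor handle into libtorch.
    var input = Float32Tensor.ones(new long[] { 1, 3, 224, 224 });
    var output = module.forward(input);

    // The module can be modified and saved back as TorchScript,
    // loadable again from Pytorch or other libtorch wrappers.
    module.save("model_updated.pt");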