How TorchSharp & Onnx can address the pain points of the ~900 ML.NET Apr2021 survey responses? #5874

GeorgeS2019 · 2021-07-09T17:56:31Z

Is your feature request related to a problem? Please describe.

The pain points of the Apr 2021 ML.NET survey and the result discussions

Describe the solution you'd like

It is clear that NLP is high on priority
This means **more deep learning NLP use cases ** e.g. using ML.NET to load pretrained Hugging Face transformer models using OnnxRuntime

We can do that by porting some of the PyTorch NLP Transformer codes to c# to address the pain points of ML.NET!!

The porting process is now more feasible due to [the recent renaming effort of TorchSharp] (dotnet/TorchSharp#308 (comment)) which make TorchSharp codes MORE resemble PyTorch

michaelgsharp · 2021-07-22T21:57:17Z

@briacht can you take a look at this?

briacht · 2021-07-23T17:44:41Z

Yes! I had a conversation with @GeorgeS2019 about deep learning in .NET.

As part of our deep learning plan, we will enable NLP scenarios.

I believe these suggestions will start at the TorchSharp level.

GeorgeS2019 · 2021-07-30T12:07:44Z

@michaelgsharp @briacht
Just identified an error in the c# example for the method 2 of OnnxCatalog.ApplyOnnxModel

The method 2 involves shapeDictionary, which is particularly useful for working with variable dimension inputs and outputs.

All examples provided by OnnxCatalog.ApplyOnnxMode use Image Classification through squeezenet onnx from Onnx zoo models

The users of the April survey requested NLP use case.

There are more image related examples involving ML.NET than NLP. Image use case, unlike NLP, which often does not involve variable dimension inputs and outputs.

I suggest the document provides, in addition to image, NLP examples from e.g. Onnx Zoo model e.g. GPT-2 => which will show HOW TO DEAL with VARIABLE dimension and the need to use ShapeDictionary.

shapeDictionary, which is particularly useful for working with variable dimension inputs and outputs.

This statement was introduced to the documentation through this PR to address the need for handling of variable axes of ONNX-models often found in NLP use case

We need expand the documentation to elaborate how to handle variable axes (e.g. using ShapeDictionary) especially in NLP use case

briacht · 2021-07-30T20:32:22Z

Thanks for the suggestion @GeorgeS2019!

@luisquintanilla can we add an issue to the docs repo for this?

GeorgeS2019 · 2021-08-17T17:31:28Z

@briacht

As correctly pointed out by @antoniovs1029

ML.NET doesn't currently have any transformer to do tensor reshaping, and it's necessary for users to actually implement their own reshape logic

This missing feature has been raised by @yaeldekel here Add "Reshape Transform"

=> I request to look into this "tensor reshaping" and the variable dimension discussed above. Perhaps both are related.

Here are NLP cases applying OnnxCatalog.ApplyOnnxModel onnx models

Perhaps by implementing the "Reshape Transform", this could address challenges when working with NLP with ML.NET?

michaelgsharp added the onnx Exporting ONNX models or loading ONNX models label Jul 22, 2021

briacht added the Deep Learning label Jul 29, 2021

This was referenced Aug 12, 2021

How TorchSharp can address the pain points of ~900 ML.NET Apr2021 survey responses dotnet/TorchSharp#308

Closed

Using TorchSharp with OnnxRuntime C# API dotnet/TorchSharp#197

Closed

GeorgeS2019 mentioned this issue Aug 17, 2021

Feedback to the plan for Torch integration to ML.NET dotnet/TorchSharp#328

Closed

briacht mentioned this issue Aug 30, 2021

Plan for Deep Learning in .NET #5918

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

How TorchSharp & Onnx can address the pain points of the ~900 ML.NET Apr2021 survey responses? #5874

How TorchSharp & Onnx can address the pain points of the ~900 ML.NET Apr2021 survey responses? #5874

GeorgeS2019 commented Jul 9, 2021 •

edited

Loading

michaelgsharp commented Jul 22, 2021

briacht commented Jul 23, 2021

GeorgeS2019 commented Jul 30, 2021 •

edited

Loading

briacht commented Jul 30, 2021

GeorgeS2019 commented Aug 17, 2021

How TorchSharp & Onnx can address the pain points of the ~900 ML.NET Apr2021 survey responses? #5874

How TorchSharp & Onnx can address the pain points of the ~900 ML.NET Apr2021 survey responses? #5874

Comments

GeorgeS2019 commented Jul 9, 2021 • edited Loading

Is your feature request related to a problem? Please describe.

Describe the solution you'd like

michaelgsharp commented Jul 22, 2021

briacht commented Jul 23, 2021

GeorgeS2019 commented Jul 30, 2021 • edited Loading

briacht commented Jul 30, 2021

GeorgeS2019 commented Aug 17, 2021

GeorgeS2019 commented Jul 9, 2021 •

edited

Loading

GeorgeS2019 commented Jul 30, 2021 •

edited

Loading