Is there any way to pass pre-loaded image into MLContext pipeline? #3460

ma-gu · 2019-04-22T13:45:41Z

System information

OS version/distro: Windows 10.0.17134
.NET Version (eg., dotnet --info): .NET Core 2.2.202

Issue

What did you do?
My intention was to implement object detection in video. Please see attached code sample and explanation below.
What happened?
I wasn't able to find proper API methods for configuring MLContext pipeline.
What did you expect?
I expected it should be possible to pass individual in-memory representation of image(as a Bitmap object for example) into pipeline.

Source code / logs

Let's pretend we have a task to run object detection model on frames coming from video stream.
It seems natural to pass 1 frame one by one as they arriving from camera.

I can't find such possibility with current MLContext pipeline and API available.

Here's example pipeline from machinelearning-samples :

            var pipeline = mlContext.Transforms.LoadImages(outputColumnName: "image", imageFolder: imagesFolder, inputColumnName: nameof(ImageNetData.ImagePath))
                            .Append(mlContext.Transforms.ResizeImages(outputColumnName: "image", imageWidth: ImageNetSettings.imageWidth, imageHeight: ImageNetSettings.imageHeight, inputColumnName: "image"))
                            .Append(mlContext.Transforms.ExtractPixels(outputColumnName: "image"))
                            .Append(mlContext.Transforms.ApplyOnnxModel(modelFile: modelLocation, outputColumnNames: new[] { TinyYoloModelSettings.ModelOutput }, inputColumnNames: new[] { TinyYoloModelSettings.ModelInput }));

Having LoadImages extension method overload accepting in-memory image representation (Bitmap or whatever) would definetly solve a problem:

public static ImageLoadingEstimator LoadImages(this TransformsCatalog catalog, params Bitmap[] images);

Am I missing some way implement this without storing frames onto the disk? Or does it mean ML.NET wasn't designed for realtime video processing?

Thanks in advance.

The text was updated successfully, but these errors were encountered:

ma-gu · 2019-04-22T13:59:10Z

Basically it seems to be a duplicate of #3369 but re-worded.

yaeldekel · 2019-04-22T17:46:35Z

Hi @ma-gu ,
Currently, the only way to skip the LoadImages step in the pipeline and pass a Bitmap directly to the pipeline, is by adding a custom IDataView implementation, similar to the one in this sample:
https://github.com/dotnet/machinelearning/blob/master/docs/samples/Microsoft.ML.Samples/Dynamic/SimpleDataViewImplementation.cs#L116

The sample implementation creates an IDataView from an IEnumerable of InputObjects.

You can replace that with a class that has a public field Image of type Bitmap, and replace the getters array in line 197 with an array of length 1 containing a ValueGetter<Bitmap> that returns the Image field of the current object in the IEnumerable.

You can then start your pipeline with a ResizeImages estimator, and fit it using an instance of this custom IDataView class.

ma-gu · 2019-04-23T06:55:51Z

Hi @yaeldekel ,
Thanks for instructions, I've already found SimpleDataViewImplementation and simialr advice, but I got stuck with implementing Cursor. Will try your suggestion and get back.

Thanks.

ma-gu · 2019-04-28T19:44:18Z

Hi @yaeldekel,

I've implemented BitmapDataView, could you please to have a look if you spot anything obviously wrong?

An issue I have currently is ML.NET throwing the following exception during prediction engine construction:

Could not determine an IDataView type for member Schema
Parameter name: rawType

   at Microsoft.ML.Data.InternalSchemaDefinition.GetVectorAndItemType(Type rawType, String name, Boolean& isVector, Type& itemType)
   at Microsoft.ML.Data.SchemaDefinition.Create(Type userType, Direction direction)
   at Microsoft.ML.Data.DataViewConstructionUtils.CreateInputRow[TRow](IHostEnvironment env, SchemaDefinition schemaDefinition)
   at Microsoft.ML.PredictionEngineBase`2..ctor(IHostEnvironment env, ITransformer transformer, Boolean ignoreMissingColumns, SchemaDefinition inputSchemaDefinition, SchemaDefinition outputSchemaDefinition)
   at Microsoft.ML.PredictionEngineExtensions.CreatePredictionEngine[TSrc,TDst](ITransformer transformer, IHostEnvironment env, Boolean ignoreMissingColumns, SchemaDefinition inputSchemaDefinition, SchemaDefinition outputSchemaDefinition)
   at ObjectDetection.OnnxModelScorer.LoadModel(BitmapDataView imageData) in C:\Users\Maxim\Source\Repos\machinelearning-samples\samples\csharp\getting-started\DeepLearning_ObjectDetection_Onnx_Visualization\ObjectDetection.Core\OnnxModelScorer.cs:line 70
   at ObjectDetection.OnnxModelScorer.Score(Bitmap image) in C:\Users\Maxim\Source\Repos\machinelearning-samples\samples\csharp\getting-started\DeepLearning_ObjectDetection_Onnx_Visualization\ObjectDetection.Core\OnnxModelScorer.cs:line 49
   at TestConsole.Program.Main(String[] args) in C:\Users\Maxim\Source\Repos\machinelearning-samples\samples\csharp\getting-started\DeepLearning_ObjectDetection_Onnx_Visualization\TestConsole\Program.cs:line 21

Exception is throwing from the last line in the following snippet(linked above to code in my fork of machinelearning-samples):

    var imageData = new BitmapDataView(image);
    var pipeline = mlContext.Transforms.ResizeImages(outputColumnName: "image", imageWidth: ImageNetSettings.imageWidth, imageHeight: ImageNetSettings.imageHeight, inputColumnName: "image")
       .Append(mlContext.Transforms.ExtractPixels(outputColumnName: "image"))
       .Append(mlContext.Transforms.ApplyOnnxModel(modelFile: modelLocation, outputColumnNames: new[] { TinyYoloModelSettings.ModelOutput }, inputColumnNames: new[] { TinyYoloModelSettings.ModelInput }));
    var model = pipeline.Fit(imageData);
    var predictionEngine = mlContext.Model.CreatePredictionEngine<BitmapDataView, ImageNetPrediction>(model);

Unfortunately I stuck at this point and seems Exception messages from ML.NET doesn't look helpful for those who isn't familiar with this library. I have a guess that something is wrong with my BitmapDataView implementation but can't figure out what exactly.

Thanks

codemzs · 2019-05-22T16:54:45Z

@wschin Does your fix address this?

jamsoft · 2021-03-07T14:40:51Z

@ma-gu Did you ever get this working?

ma-gu · 2021-03-07T16:09:53Z

@jamsoft Almost 2 years passed already... I believe I just gave up - otherwise I should provide working code snippet here.
I tired fighting with ML.NET writing unnecessary code in attempts to make it working when the same feature lives as one-liner in python stack(keras/tensorflow).

jamsoft · 2021-03-07T17:25:37Z

Thanks for your reply. Yeah, it's cost me a day to find out no-one can get it working and it seems like the most obvious use case as well. So odd. I also cannot use my 3090 with ML.NET as it needs drivers from 2019. I think I'm going to go look at bundling a Python exe again.

bbday · 2021-05-26T07:18:54Z

Also me have same issue with image processing, need to look into python trainer to avoid this issue.
In my case i need to rotate images at runtime during training to avoid to create X copies on my hdd.

This was referenced Apr 25, 2019

Making a prediction from ONNX model in ML.net #3562

Closed

Realtime image classification using pretrained onnx model #3582

Closed

CESARDELATORRE mentioned this issue May 13, 2019

How to pass a Stream of Images in Object Detection End-to-End dotnet/machinelearning-samples#445

Closed

This was referenced May 14, 2019

IDataView Type system needs to be extensible to properly support image, date, and other types #3274

Closed

IDataView Type System needs to be extensible to support Image scenarios properly #3723

Closed

codemzs assigned wschin May 22, 2019

codemzs added the P0 Priority of the issue for triage purpose: IMPORTANT, needs to be fixed right away. label May 22, 2019

wschin mentioned this issue May 24, 2019

Add DataView Type Register (supporting loading in-memory image and other in-memory custom types) #3263

Merged

wschin closed this as completed in #3263 May 28, 2019

jamsoft mentioned this issue Mar 18, 2021

Loading Existing ML.NET Image Classification Model and Adapting Predict To Use InMemory Images #5718

Closed

dotnet locked as resolved and limited conversation to collaborators Mar 22, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Is there any way to pass pre-loaded image into MLContext pipeline? #3460

Is there any way to pass pre-loaded image into MLContext pipeline? #3460

ma-gu commented Apr 22, 2019

ma-gu commented Apr 22, 2019

yaeldekel commented Apr 22, 2019

ma-gu commented Apr 23, 2019

ma-gu commented Apr 28, 2019

codemzs commented May 22, 2019

jamsoft commented Mar 7, 2021

ma-gu commented Mar 7, 2021 •

edited

jamsoft commented Mar 7, 2021 •

edited

bbday commented May 26, 2021

Is there any way to pass pre-loaded image into MLContext pipeline? #3460

Is there any way to pass pre-loaded image into MLContext pipeline? #3460

Comments

ma-gu commented Apr 22, 2019

System information

Issue

Source code / logs

ma-gu commented Apr 22, 2019

yaeldekel commented Apr 22, 2019

ma-gu commented Apr 23, 2019

ma-gu commented Apr 28, 2019

codemzs commented May 22, 2019

jamsoft commented Mar 7, 2021

ma-gu commented Mar 7, 2021 • edited

jamsoft commented Mar 7, 2021 • edited

bbday commented May 26, 2021

ma-gu commented Mar 7, 2021 •

edited

jamsoft commented Mar 7, 2021 •

edited