Tensorflow Object Detection API for 1-channel grayscale image #3369

baimukashev · 2018-02-12T10:52:08Z

System information

What is the top-level directory of the model you are using:
Have I written custom code (as opposed to using a stock example script provided in TensorFlow):
OS Platform and Distribution (e.g., Linux Ubuntu 16.04):
TensorFlow installed from (source or binary):
TensorFlow version (use command below):
Bazel version (if compiling from source):
CUDA/cuDNN version:
GPU model and memory:
Exact command to reproduce:

Describe the problem

Is there any way to use pre-trained models in Object Detection API of Tensorflow, which trained for RGB images, for single channel grayscale images(depth) ?

Source code / logs

mttang · 2018-02-12T22:03:35Z

You could convert a single channel grayscale image to a 3 channel RGB image to use the pre-trained RGB models. Alternatively, you could train a grayscale model yourself by using convert_to_grayscale in models/research/object_detection/protos/image_resizer.proto.

vinay0410 · 2018-04-13T14:46:00Z

Hi @mttang,
I did that but when I try to run inferences on a single grayscale ( depth ) image using the ipynb notebook available. I get an error stating ValueError: Cannot feed value of shape (1, 480, 640) for Tensor u'image_tensor:0', which has shape '(?, ?, ?, 3)'

cmbowyer13 · 2018-05-14T06:09:16Z

Have you figured a workout around for grey scale images @vinay0410 using the object detection api?

richknight · 2018-07-04T04:02:01Z

@mttang - how do you convert a single channel grayscale image to an RGB image to use with one of the well known models ?

renandepadua · 2018-07-05T16:31:12Z

@richknight you can use OpenCV's applyColorMap function. I do that to apply the common models on IR image. https://docs.opencv.org/2.4/modules/contrib/doc/facerec/colormaps.html

TwistedHardware · 2018-08-10T07:57:19Z

Hi @vinay0410,

ValueError: Cannot feed value of shape (1, 480, 640) for Tensor u'image_tensor:0', which has shape '(?, ?, ?, 3)'

The expected shape of the tensor is (BATCH_SIZE, WIDTH, HEIGHT, CHANNELS). and you are passing (BATCH_SIZE, WIDTH, HEIGHT) without channel.

When you reshape your images you need to keep them as rank-3 images. meaning (480, 640, 1) instead of (480, 640). To do that in numpy:

import numpy as np
img = np.random.random((480,640)) # <-- this is rank-2 image
img = img.reshape(x.shape[0],x.shape[1],1) # <-- this makes it a rank-3 image

This changes your rank-2 image of shape 480 x 640 to a rank-3 image of shape 480 x 640 x 1

Arturexe · 2018-08-16T09:37:33Z

I managed to get it to work with the help of the Stackoverflow community. Take a look: https://stackoverflow.com/questions/51872412/tensorflow-numpy-image-reshape

The model is taken from Sentdexs tutorial on Youtube: https://www.youtube.com/watch?v=srPndLNMMpk&list=PLQVvvaa0QuDcNK5GeCQnxYnSSaar2tpku&index=6

Zrufy · 2020-08-05T12:12:15Z

@vinay0410 how do you solve that error?I'm in stuck with this problem!

denisb411 · 2020-08-06T21:19:13Z

Hi @vinay0410,

ValueError: Cannot feed value of shape (1, 480, 640) for Tensor u'image_tensor:0', which has shape '(?, ?, ?, 3)'

The expected shape of the tensor is (BATCH_SIZE, WIDTH, HEIGHT, CHANNELS). and you are passing (BATCH_SIZE, WIDTH, HEIGHT) without channel.

When you reshape your images you need to keep them as rank-3 images. meaning (480, 640, 1) instead of (480, 640). To do that in numpy:
import numpy as np
img = np.random.random((480,640)) # <-- this is rank-2 image
img = img.reshape(x.shape[0],x.shape[1],1) # <-- this makes it a rank-3 image
This changes your rank-2 image of shape 480 x 640 to a rank-3 image of shape 480 x 640 x 1

I can't see this working as the expected shape has 3 channels. Anyone tested it?

hongym7 · 2021-07-14T02:40:50Z

img_path = os.path.join(image_subdirectory, data['filename'])
with tf.gfile.GFile(img_path, 'rb') as fid:
encoded_jpg = fid.read()

encoded_jpg_io = io.BytesIO(encoded_jpg)
create_image = PIL.Image.open(encoded_jpg_io)

input image is grayscale image.
create_image is [ w, h, 3]

bignamehyp closed this as completed Feb 23, 2018

mr-onion-2 mentioned this issue Sep 8, 2019

False detections at night / 1-channel greyscale issue? blakeblackshear/frigate#67

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Tensorflow Object Detection API for 1-channel grayscale image #3369

Tensorflow Object Detection API for 1-channel grayscale image #3369

baimukashev commented Feb 12, 2018

mttang commented Feb 12, 2018

vinay0410 commented Apr 13, 2018 •

edited

cmbowyer13 commented May 14, 2018

richknight commented Jul 4, 2018

renandepadua commented Jul 5, 2018 •

edited

TwistedHardware commented Aug 10, 2018

Arturexe commented Aug 16, 2018 •

edited

Zrufy commented Aug 5, 2020

denisb411 commented Aug 6, 2020

hongym7 commented Jul 14, 2021

Tensorflow Object Detection API for 1-channel grayscale image #3369

Tensorflow Object Detection API for 1-channel grayscale image #3369

Comments

baimukashev commented Feb 12, 2018

System information

Describe the problem

Source code / logs

mttang commented Feb 12, 2018

vinay0410 commented Apr 13, 2018 • edited

cmbowyer13 commented May 14, 2018

richknight commented Jul 4, 2018

renandepadua commented Jul 5, 2018 • edited

TwistedHardware commented Aug 10, 2018

Arturexe commented Aug 16, 2018 • edited

Zrufy commented Aug 5, 2020

denisb411 commented Aug 6, 2020

hongym7 commented Jul 14, 2021

vinay0410 commented Apr 13, 2018 •

edited

renandepadua commented Jul 5, 2018 •

edited

Arturexe commented Aug 16, 2018 •

edited