Input Image format #4

dgschwend · 2017-03-01T06:56:02Z

Hi David,

I am working on a similar project. Modifying the network according to the FPGA resources I have. I would like to know the input image format being followed.

I have gone through changing image files to lmdb sets. but the code you have implemented takes .bin image.

I would like to know how did you define input formats.

Any help in this is highly appreciable.

Thanks
Divya

dgschwend · 2017-03-01T06:57:31Z

Hi Divya
The input image is currently extracted from the Caffe model. Using "classify.py" from DIGITS/Caffe, the network is run forward and then the blob from layer "data" is dumped in binary format. See "classify.py" here https://www.dropbox.com/sh/i0h55cvhhoymc9s/AADDaBbM7YfOKHZCcK3RkqAXa?dl=0
There would surely be a more elegant way, if you investigate how Caffe stores the image data internally.
Regards
David

dgschwend · 2017-03-01T12:56:22Z

PS: Here's some Python code which does the JPG->.bin conversion:

#!/usr/bin/env python3

# need openBLAS
# need libraries: pip install pillow, numpy
import PIL.Image                    # pillow image library
import sys
import numpy as np
#import scipy.misc, scipy.ndimage

"""
Load an image from disk and write back as binary input data.

Arguments:
path -- path to an image on disk
width -- resize dimension
height -- resize dimension
"""

def convert_image(path, width, height, outfile):

    image = PIL.Image.open(path)
    image = image.convert('RGB')
    image = np.array(image)

    # Transform to desired size (half-crop, half-fill)
    # height_ratio = float(image.shape[0])/height
    # width_ratio = float(image.shape[1])/width
    # new_ratio = (width_ratio + height_ratio) / 2.0
    # resize_width = int(round(image.shape[1] / new_ratio))
    # resize_height = int(round(image.shape[0] / new_ratio))
    # if width_ratio > height_ratio and (height - resize_height) % 2 == 1:
    #     resize_height += 1
    # elif width_ratio < height_ratio and (width - resize_width) % 2 == 1:
    #     resize_width += 1
    # image = scipy.misc.imresize(image, (resize_height, resize_width), interp='bicubic')
    # if width_ratio > height_ratio:
    #     start = int(round((resize_width-width)/2.0))
    #     image = image[:,start:start+width]
    # else:
    #     start = int(round((resize_height-height)/2.0))
    #     image = image[start:start+height,:]
    #
    # # Fill ends of dimension that is too short with random noise
    # if width_ratio > height_ratio:
    #     padding = (height - resize_height)/2
    #     noise_size = (padding, width, 3)
    #     noise = np.random.randint(0, 255, noise_size).astype('uint8')
    #     image = np.concatenate((noise, image, noise), axis=0)
    # else:
    #     padding = (width - resize_width)/2
    #     noise_size = (height, padding, 3)
    #     noise = np.random.randint(0, 255, noise_size).astype('uint8')
    #     image = np.concatenate((noise, image, noise), axis=1)

    processed = np.zeros((3, width, height), np.float32)

    # Transpose from (height, width, channels) to (channels, height, width)
    #processed = processed.transpose((2,0,1))

    # Channel Swap: RGB -> BGR
    #image = image[(2,1,0),:,:]

    # Subtract Mean, Swap Channels RGB -> BGR, Transpose (H,W,CH) to (CH,H,W)
    #mean_rgb = [104,117,123]
    processed[0,:,:] = (image[:,:,2]-104.0)
    processed[1,:,:] = (image[:,:,1]-117.0)
    processed[2,:,:] = (image[:,:,0]-123.0)

    print("Saving input data in binary format")
    data = np.array(processed)

    print("  shape: %s" % data.shape)
    CH = data.shape[0]
    W = data.shape[1]
    H = data.shape[2]
    pixels = []

    for y in range(H):
        for x in range(W):
            for c in range(CH):
                pixel = data[c,x,y]
                if pixel is None: pixel = 99999
                pixels.append(pixel);

    # Write Pixels to binary file
    print("  write to file %s" % outfile)
    floatstruct = struct.pack('f'*len(pixels), *pixels)
    with open(outfile, "wb") as f:
        f.write(floatstruct)

if __name__ == "__main__":
    path = ""
    width = 256
    height = 256
    outfile = "image.bin"

    if len(sys.argv) == 2:
        path = sys.argv[1]
    elif len(sys.argv) == 4:
        path = sys.argv[1]
        width = int(sys.argv[2])
        height = int(sys.argv[3])
    elif len(sys.argv) == 5:
        path = sys.argv[1]
        width = int(sys.argv[2])
        height = int(sys.argv[3])
        outfile = sys.argv[4]
    else:
        print("usage: %s <inputfile> [width height [outputfile]]" % sys.argv[0])
        exit(-1)

    convert_image(path,width,height,outfile)

divyapraneetha · 2017-03-02T06:31:43Z

Thanks for the source. Worked for different images to be resized to 256x256x3. Now I can work on different Images.

Divya

SapnaBhardwaj1 · 2017-05-24T16:14:33Z

Hi David,

According to the FPGA resources I have, I am downsizing neural network. I would like to know way of generating weights.bin file. I have gone through material available online to extract weights from .caffemodel. The extracted weights need to be considered as float? How to decide on float/int and how accuracy varies on this. And also can you share the reference to generate weights.bin. Thanks in advance for any suggestions.

Thank You

dgschwend · 2017-05-26T14:30:09Z

Can you please open a new issue as this is not related to the input image format?

dgschwend · 2017-06-02T08:48:56Z

for reference, conversions scripts are available under tools/convert_caffemodel

dgschwend mentioned this issue Mar 1, 2017

TestBench Result: FAILURE (on FPGA) #2

Closed

dgschwend mentioned this issue May 24, 2017

Final caffemodel file #8

Closed

dgschwend closed this as completed Jun 2, 2017

dgschwend mentioned this issue Dec 30, 2017

input image for simulation #25

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Input Image format #4

Input Image format #4

dgschwend commented Mar 1, 2017

dgschwend commented Mar 1, 2017 •

edited

Loading

dgschwend commented Mar 1, 2017

divyapraneetha commented Mar 2, 2017

SapnaBhardwaj1 commented May 24, 2017

dgschwend commented May 26, 2017

dgschwend commented Jun 2, 2017

Input Image format #4

Input Image format #4

Comments

dgschwend commented Mar 1, 2017

dgschwend commented Mar 1, 2017 • edited Loading

dgschwend commented Mar 1, 2017

divyapraneetha commented Mar 2, 2017

SapnaBhardwaj1 commented May 24, 2017

dgschwend commented May 26, 2017

dgschwend commented Jun 2, 2017

dgschwend commented Mar 1, 2017 •

edited

Loading