Image processing workflow #806

gheinrich · 2016-06-01T14:23:36Z

This adds data and view extensions to train image processing networks in DIGITS. This may be used for de-noising, super-resolution, segmentation, etc.

The image processing data extension creates datasets for which both the input and the label are images.

The image view extension displays the network output as an image.

Only works with Caffe for now. Torch wrappers need to be updated to deal with image labels (currently only scalar or vector labels are supported).

pansk · 2016-06-02T09:37:36Z

👍 for image labels in torch.
Is it possible also to have multiple image labels, or multiple image sources?

E.G. providing a normal map and a diffuse map and produce a shaded image (multiple input), or provide a single image and produce a super-resolution and a segmentation map (multiple output)

As far as you know, can this be done by creating the DB manually, and adding multi-channel images (for labels, sources, or both)?

gheinrich · 2016-06-02T10:26:07Z

Hi @pansk we are using Caffe Datum objects to store data in LMDB. We may have one LMDB for features (inputs) and one LMDB for labels. That Datum format is used for both Caffe and Torch. We support only one Datum for the features and only one Datum for the labels. Datum objects require your data to be either actual ".png/.jpg/..." images ("encoded" case) or any 3D (Channels x Height x Width) tensor ("unencoded" case).

You can store multiple images in your unencoded Datum if you put them side by side across the channel dimension. For example you can store 2 RGB images by constructing a 6-channel tensor.

Given this Datum limitation what I'm planning to do for the moment in Torch is to accept anything that can be stored in Datum objects. Later on we might consider using HDF5 for those generic datasets and extend support to N-dimensional tensors though that is a longer-term project.

Would that work for you?

lukeyeager · 2016-06-16T17:54:58Z

digits/dataset/generic/test_views.py

 from bs4 import BeautifulSoup
+import json
+import numpy as np


Major nitpick here, but PEP8 likes to separate standard library imports and 3rd party imports:

Imports should be grouped in the following order:

standard library imports

related third party imports

local application/library specific imports
You should put a blank line between each group of imports.
https://www.python.org/dev/peps/pep-0008/#imports

I've been trying to follow that format in our code since #501.

actually I am not sure I get the difference between a standard library import and a 3rd party import. Is it correct to say that numpy and PIL.Image are 3rd party imports and json, os, tempfile are standard library imports?

I think what they're distinguishing between are packages that come with a standard Python install (i.e. apt-get install python) vs. add-on packages (i.e. pip install Flask). At least, that's been my interpretation. I'm open to push-back if you think it's dumb.

import json import os import tempfile from bs4 import BeautifulSoup import numpy as np import PIL.Image from digits import extensions import digits.test_views from digits.utils import constants

I get it, thanks!

To be used with networks where the input and the output are images

gheinrich · 2016-07-28T11:43:24Z

Rebased and updated according to comments:
#830 (comment)
#830 (comment)

m5061125 · 2017-03-09T23:10:44Z

hi, gheinrich.
I am using digits 5 for medical segmentation, so far it is works fine for 2D images, I am wondering that whether if the digits support 3D image classiffication or segmentation. I know that caffe support N-D convolution and pooling by feeding with the hdf5 data format. Pls let me know if digits already support it, thanks.

gheinrich added the enhancement label Jun 1, 2016

gheinrich force-pushed the dev/image-processing branch 2 times, most recently from fb6277d to e51aba6 Compare June 8, 2016 08:17

gheinrich mentioned this pull request Jun 10, 2016

Image segmentation workflow #830

Merged

lukeyeager reviewed Jun 16, 2016
View reviewed changes

lukeyeager mentioned this pull request Jun 16, 2016

Explore generic datasets #822

Merged

gheinrich force-pushed the dev/image-processing branch from e51aba6 to 7a51e51 Compare June 16, 2016 20:40

gheinrich force-pushed the dev/image-processing branch 2 times, most recently from 427dea0 to 2deb27c Compare July 4, 2016 11:55

This was referenced Jul 4, 2016

Support image labels in Torch #880

Merged

Support variable input size #881

Merged

gheinrich force-pushed the dev/image-processing branch 2 times, most recently from f50bc5c to 6f14123 Compare July 28, 2016 10:14

gheinrich added 2 commits July 28, 2016 12:14

Add Image processing data extension

a239d52

To be used with networks where the input and the output are images

Add image output visualization

43d3400

gheinrich force-pushed the dev/image-processing branch from 6f14123 to 09b1854 Compare July 28, 2016 10:14

Add tests for image processing extension

cd5a050

gheinrich force-pushed the dev/image-processing branch from 09b1854 to cd5a050 Compare July 28, 2016 10:40

lukeyeager self-assigned this Aug 2, 2016

gheinrich merged commit cd5a050 into NVIDIA:master Aug 2, 2016

gheinrich deleted the dev/image-processing branch November 30, 2016 16:49

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Image processing workflow #806

Image processing workflow #806

gheinrich commented Jun 1, 2016

pansk commented Jun 2, 2016

gheinrich commented Jun 2, 2016

lukeyeager Jun 16, 2016

gheinrich Jun 16, 2016

lukeyeager Jun 16, 2016

gheinrich Jun 16, 2016

gheinrich commented Jul 28, 2016

m5061125 commented Mar 9, 2017

Image processing workflow #806

Image processing workflow #806

Conversation

gheinrich commented Jun 1, 2016

pansk commented Jun 2, 2016

gheinrich commented Jun 2, 2016

lukeyeager Jun 16, 2016

Choose a reason for hiding this comment

gheinrich Jun 16, 2016

Choose a reason for hiding this comment

lukeyeager Jun 16, 2016

Choose a reason for hiding this comment

gheinrich Jun 16, 2016

Choose a reason for hiding this comment

gheinrich commented Jul 28, 2016

m5061125 commented Mar 9, 2017