How to implement "array label" input via ops.ExternalSource() #123

Closed
wangguangyuan opened this issue Aug 18, 2018 · 10 comments
Labels
enhancement New feature or request external contribution welcome A good place to start contributing to the project
Comments

@wangguangyuan

https://github.com/NVIDIA/DALI/blob/master/dali/benchmark/resnet50_bench.py
The example reads all the images into memory. When the dataset is very large, reading everything into memory is impossible; we need to read only batch_size images at a time. Also, the example passes the images through ops.ExternalSource() but does not pass the corresponding labels, so I cannot match images to labels during training. Is there a way to pass the images and labels together via ops.ExternalSource()?
I am also unsure whether we can achieve multi-threaded processing by overriding iter_setup(self) to feed the external data?

@JanuszL JanuszL added the question Further information is requested label Aug 19, 2018
@JanuszL
Contributor

JanuszL commented Aug 19, 2018

Hi,
That is a good question. In this example we load all the data at the beginning because the input set is small. In a real use case, you should put your loading code inside iter_setup and load one batch of images per iteration. Regarding the labels, you can treat them like any other data and load them through ExternalSource as well; in that case you will have one source for the images and a second one for the labels.
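The per-iteration loading pattern described above can be sketched without any DALI-specific calls. The helper below is a hypothetical illustration (the name make_batch and the loader parameter are assumptions, not part of the DALI API) of the slicing an overridden iter_setup could do before calling feed_input on the image source and the label source:

```python
# Hypothetical helper: slice one batch out of pre-listed file paths and
# labels, reading file contents only for the current batch.
def make_batch(paths, labels, batch_size, iteration,
               loader=lambda p: open(p, 'rb').read()):
    """Return (data, labels) for one batch, wrapping around the dataset."""
    n = len(paths)
    start = (iteration * batch_size) % n
    idx = [(start + i) % n for i in range(batch_size)]
    return [loader(paths[i]) for i in idx], [labels[i] for i in idx]

# Inside a Pipeline subclass, iter_setup would then do, roughly:
#   data, labs = make_batch(self.paths, self.labels, self.batch_size, self.iter)
#   self.feed_input(self.jpegs, data)   # edge returned by the image ExternalSource
#   self.feed_input(self.label, labs)   # edge returned by the label ExternalSource
```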

I am also unsure whether we can achieve multi-threaded processing by overriding iter_setup(self) to feed the external data?

If I understand your question correctly, you can do multi-threaded processing inside iter_setup, but that code needs to feed data to all ExternalSources using feed_input before iter_setup returns. However, you can continue your background processing outside of iter_setup.
@ptrendx I hope I haven't missed anything.
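One way to realize the overlap described above is to prepare batches on a background thread and have iter_setup merely dequeue an already-prepared batch before handing it to feed_input. A minimal stdlib-only sketch, where the class name and structure are illustrative assumptions rather than DALI API:

```python
import queue
import threading

class PrefetchingSource:
    """Prepare batches on a background thread; iter_setup just dequeues.

    The heavy loading happens off the main thread, but each prepared
    batch is still handed over synchronously, so feed_input can be
    called before iter_setup returns.
    """
    def __init__(self, batch_fn, depth=2):
        self._queue = queue.Queue(maxsize=depth)   # bounded prefetch buffer
        self._batch_fn = batch_fn                  # callable: iteration -> batch
        self._iteration = 0
        self._worker = threading.Thread(target=self._fill, daemon=True)
        self._worker.start()

    def _fill(self):
        # Runs forever in the background; blocks when the buffer is full.
        while True:
            self._queue.put(self._batch_fn(self._iteration))
            self._iteration += 1

    def next_batch(self):
        # Called from iter_setup; blocks until the worker has a batch ready.
        return self._queue.get()
```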

@JanuszL JanuszL closed this as completed Aug 19, 2018
@wangguangyuan
Author

wangguangyuan commented Aug 20, 2018

@JanuszL
from nvidia.dali.pipeline import Pipeline
import nvidia.dali.ops as ops
import nvidia.dali.types as types
import numpy as np
import tensorflow as tf
import nvidia.dali.plugin.tf as dali_tf

def read_image_path(file_path):
    results, labels, ids = [], [], []
    i = 0
    with open(file_path, 'r') as f:
        for line in f:
            i += 1
            image_path, label = line.strip().split()
            results.append(image_path)
            labels.append(np.array([int(label)], dtype=np.int32))
            ids.append(np.array([i], dtype=np.int32))
    return results, labels, ids

def make_batch(size, iter, images_path, labels, ids):
    # Read only the files belonging to the current batch from disk.
    data = [np.fromstring(open(path, 'rb').read(), dtype=np.uint8)
            for path in images_path[iter * size:size * (iter + 1)]]
    return (data,
            labels[iter * size:size * (iter + 1)],
            ids[iter * size:size * (iter + 1)])

class C2Pipe(Pipeline):
    def __init__(self, batch_size, num_threads, device_id, file_path,
                 pipelined=True, async=True):
        super(C2Pipe, self).__init__(batch_size,
                                     num_threads,
                                     device_id,
                                     exec_pipelined=pipelined,
                                     exec_async=async)

        self.images_path, self.labels, self.ids = read_image_path(file_path)
        self.input = ops.ExternalSource()
        self.label_input = ops.ExternalSource()
        self.id_input = ops.ExternalSource()

        self.decode = ops.HostDecoder(output_type=types.RGB)
        self.rcm = ops.FastResizeCropMirror(crop=[224, 224])
        self.np = ops.NormalizePermute(device="gpu",
                                       output_dtype=types.FLOAT,
                                       mean=[128., 128., 128.],
                                       std=[1., 1., 1.],
                                       height=224,
                                       width=224,
                                       image_type=types.RGB)

        self.uniform = ops.Uniform(range=(0., 1.))
        self.resize_uniform = ops.Uniform(range=(256., 480.))
        self.mirror = ops.CoinFlip(probability=0.5)
        self.cast = ops.Cast(device="cpu", dtype=types.INT32)
        self.iter = 0

    def define_graph(self):
        self.jpegs = self.input(name='image')
        self.label = self.label_input(name='label')
        self.id = self.id_input(name='id')

        images = self.decode(self.jpegs)
        resized = self.rcm(images, crop_pos_x=self.uniform(),
                           crop_pos_y=self.uniform(),
                           mirror=self.mirror(),
                           resize_shorter=self.resize_uniform())

        output = self.np(resized.gpu())
        return output, self.label, self.id

    def iter_setup(self):
        raw_data, raw_label, raw_id = make_batch(self.batch_size, self.iter,
                                                 self.images_path, self.labels,
                                                 self.ids)
        self.feed_input(self.jpegs, raw_data)
        self.feed_input(self.label, raw_label)
        self.feed_input(self.id, raw_id)
        self.iter += 1

device_id = 0
pipe = C2Pipe(batch_size=32, num_threads=2, device_id=device_id, file_path='./test.txt')
serialized_pipes = pipe.serialize()
daliop_t = dali_tf.DALIIterator()
with tf.device('/gpu:%i' % device_id):
    image, label = daliop_t(serialized_pipeline=serialized_pipes,
                            shape=[32, 224, 224, 3],
                            image_type=tf.float32,
                            label_type=tf.int32,
                            num_threads=2,
                            device_id=device_id)

Traceback (most recent call last):
File "/share5/public/guangyuan/workplace/horovod_project/dali_from_raw_data_test.py", line 135, in
serialized_pipes = pipe.serialize()
File "/home/guangyuan/.local/lib/python2.7/site-packages/nvidia/dali/pipeline.py", line 271, in serialize
return self._pipe.SerializeToProtobuf()
RuntimeError: CHECK failed: IsInitialized(): Can't serialize message of type "dali_proto.PipelineDef" because it is missing required fields: op[0].name, op[7].name, op[8].name

@JanuszL JanuszL reopened this Aug 20, 2018
@JanuszL
Contributor

JanuszL commented Aug 20, 2018

Registered at DALI-207

@JanuszL JanuszL added the bug Something isn't working label Aug 20, 2018
@wangguangyuan
Author

wangguangyuan commented Aug 21, 2018

@JanuszL
Must the functions used in the DALI serialization process be provided by DALI?
Is it not possible to serialize an external function?
If so, is there another solution?

@JanuszL
Contributor

JanuszL commented Aug 24, 2018

Hi,
We haven't tested this scenario. We will check whether this is a DALI bug or a limitation of how DALI and TF are integrated.
Putting it on our ToDo list.

@JanuszL
Contributor

JanuszL commented Aug 29, 2018

Hi,
Currently ExternalSource is not serialized and doesn't work with TF. The real reason is that DALI cannot call Python code from the inside; in this case iter_setup is not callable (with the current design) and there is no easy way to feed data into ExternalSource. It should work with PyTorch as well as MXNet.
We plan to develop such functionality in the future, but it is hard to tell when.
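Where the Python Pipeline object is driven directly (so iter_setup is still invoked from Python, as with PyTorch or MXNet at the time), a stateful batch provider can sit behind ExternalSource. The class below is a hypothetical, stdlib-only sketch of such a provider, not a DALI interface:

```python
class EpochSource:
    """Stateful batch provider, meant to be called once per iter_setup.

    Cycles through a dataset indefinitely; each call returns one batch
    of samples and their labels as two parallel lists.
    """
    def __init__(self, samples, labels, batch_size):
        assert len(samples) == len(labels)
        self._samples, self._labels = samples, labels
        self._batch_size = batch_size
        self._pos = 0   # position of the next batch within the dataset

    def __call__(self):
        n = len(self._samples)
        idx = [(self._pos + i) % n for i in range(self._batch_size)]
        self._pos = (self._pos + self._batch_size) % n
        return ([self._samples[i] for i in idx],
                [self._labels[i] for i in idx])
```

An iter_setup override would then simply call the provider and pass the two lists to feed_input for the image and label ExternalSource edges.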

@wangguangyuan
Author

Hi, thank you very much. Is there a document about the DALI architecture? Some of the features I need may have to be added at the source level.

@JanuszL
Contributor

JanuszL commented Aug 31, 2018

Hi,
Please look into https://github.com/NVIDIA/DALI#additional-resources first. As for detailed documentation describing the design of each class, we don't have such a document and it is probably not going to be written anytime soon. You can still use the Doxygen file from the DALI source to generate development docs.

@JanuszL JanuszL added enhancement New feature or request and removed bug Something isn't working labels Sep 6, 2018
@JanuszL JanuszL added external contribution welcome A good place to start contributing to the project and removed question Further information is requested labels Jan 21, 2020
@JanuszL JanuszL added this to ToDo in Users requests via automation Jan 21, 2020
@klecki klecki modified the milestones: Release_1.4.0, Release_1.5.0 Aug 9, 2021
@klecki
Contributor

klecki commented Aug 9, 2021

Hi,
the support for ExternalSource callbacks/iterators in TensorFlow via the tf.data.Dataset-compatible API was merged and will be present in DALI 1.5.

DALI 1.4 already supports inputs from other tf.data.Datasets in the experimental.DALIDatasetWithInputs.

You can see more in the documentation: https://docs.nvidia.com/deeplearning/dali/main-user-guide/docs/plugins/tensorflow_plugin_api.html#experimental

Tutorial is under review in: #3212

@JanuszL JanuszL closed this as completed Aug 9, 2021
Users requests automation moved this from ToDo to Done Aug 9, 2021
@JanuszL
Contributor

JanuszL commented Aug 30, 2021

Hi,
DALI 1.5 is out and can be used to test the new functionality.
