You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Following offline discussion with @lamblin and @vdumoulin, we agreed that the most important kind of type checking to perform in the data processing pipeline is probably the semantics of the axes of the data. Data streams should provide information about what each axis of the input and output represents e.g.
An image: (channel, height, width)
A batch of images: `(batch, channel, height, width)
A sentence (sequence of indices): (features) (maybe a labels role?) or before going into Blocks: (time, batch, features)
A set of n-grams from a sentence: (batch, features)
The behaviour of data streams regarding these labels should be configurable, so they can either ignore, warn or raise errors if the data input is not what they expected.
Some things that need to be thought about:
Do we just use strings, or do use singletons (allowing us to create a class hierarchy)?
Do we want to add dimensionality e.g. each axis has a dimensionality (or can be variable)? This could be useful to check that e.g. an image has exactly 3 colour channels.
Longer term, this would also allow for the kind of checking that Pylearn2 performs (e.g. make sure that the data dimension is the same as the input layer/brick).
Following offline discussion with @lamblin and @vdumoulin, we agreed that the most important kind of type checking to perform in the data processing pipeline is probably the semantics of the axes of the data. Data streams should provide information about what each axis of the input and output represents e.g.
(channel, height, width)
(features)
(maybe alabels
role?) or before going into Blocks:(time, batch, features)
(batch, features)
The behaviour of data streams regarding these labels should be configurable, so they can either ignore, warn or raise errors if the data input is not what they expected.
Some things that need to be thought about:
This is closely related to mila-iqia/blocks#30
The text was updated successfully, but these errors were encountered: