Separate the reusable data processors from the data layers #244

kloudkl · 2014-03-19T17:30:52Z

This PR will resolve #148 and take over #196.

kloudkl · 2014-03-19T18:30:37Z

Welcome comments on the API.

jeffdonahue · 2014-03-19T18:38:29Z

include/caffe/data/data_sink.hpp

+  explicit MemoryDataSink(const DataSinkParameter& param);
+  virtual ~MemoryDataSink();
+
+  void SaveNextbatch(const shared_ptr<Blob<Dtype> > data) = 0;


SaveNextBatch (capitalize B, and change other occurrences as well)

jeffdonahue · 2014-03-19T18:40:35Z

This looks great! Could you update imagenet.prototxt with how you anticipate that looking with the new data source/processing/sink API?

sguada · 2014-03-19T19:53:29Z

@kloudkl I'm also working in #148, will create a PR soon, and then we can compare both.

I think data_sinks, should be separated of data_sources.

@Yangqing I don't know how protobuf handles changing enum types over time. I'm not sure if we should use enum type for the different kinds of data_sources and data_processing. What would happen if we add a new one. Would the new protobuf compatible with the old one?

jeffdonahue · 2014-03-19T20:01:14Z

@sguada, I believe the enum type will work fine (I'm also using it for the new LayerTypes in #219), we just can't change any of the ID numbers once we release. So we should always use the "next available" ID number in the enum (increment by 1 the largest ID ever used in the enum) and never reuse a number even if we decide to retire a particular data source or processing type.

sguada · 2014-03-21T00:48:51Z

examples/new_data_layer/imagenet_train.prototxt

+    data_blob {
+      data_source {
+        name: "ilvsrc12_train_leveldb"
+        leveldb {


I would add param with type: leveldb. In that case is clear that only one type is possible

kloudkl · 2014-03-21T03:54:16Z

There is no need to re-invent the data layers into data source. Only the data processors need to be extracted. A base class InputLayer will be added to simplify integration of the processors with the data layers. There will also be OutputLayer correspondingly.

kloudkl · 2014-08-27T11:17:45Z

Solved by #954.

[TravisCI] google/protobuf renamed the 3.0 branch

jeffdonahue reviewed Mar 19, 2014
View reviewed changes

sguada reviewed Mar 21, 2014
View reviewed changes

kloudkl changed the title ~~New data layer~~ Separate the resuable data processors from the data layers Mar 21, 2014

shelhamer changed the title ~~Separate the resuable data processors from the data layers~~ Separate the reusable data processors from the data layers Mar 22, 2014

shelhamer added the work in progress label Mar 22, 2014

This was referenced Mar 23, 2014

Add WARPLossLayer and gradient check test cases #126

Closed

Within-channel LRN layer #273

Merged

shelhamer mentioned this pull request Apr 3, 2014

How to convert binaryproto to npy (like ilsvrc_2012_mean.npy)? #290

Closed

kloudkl added 7 commits April 4, 2014 20:23

Add the data processors API

4be8cc7

Add CroppingDataProcessor adapted from DataLayerPrefetch and test it

cd21852

Test CroppingDataProcessor in CPU & GPU modes during TRAIN & TEST phase

21433e3

Add and test MirroringDataProcessor adapted from DataLayerPrefetch

42ec06c

Add and test MeanSubtractionDataProcessor following DataLayerPrefetch

b76d1f2

Add and test ScalingDataProcessor

d00ef28

Implement and test MeanZeroingDataProcessor

375eeff

kloudkl mentioned this pull request Apr 10, 2014

Implement the RBM layer to learn binary codes for large scale image retrieval #274

Closed

sguada mentioned this pull request Jun 28, 2014

Allow images of different sizes as inputs #557

Closed

kloudkl mentioned this pull request Jul 1, 2014

Transform layers [DON'T MERGE] #569

Closed

kloudkl closed this Aug 27, 2014

jmerkow mentioned this pull request Jul 16, 2015

Incorporating cuda unified memory into caffe #2775

Closed

lukeyeager added a commit to lukeyeager/caffe that referenced this pull request Oct 3, 2016

Merge pull request BVLC#244 from lukeyeager/nvidia/travis-protobuf3-url

cd968cc

[TravisCI] google/protobuf renamed the 3.0 branch

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Separate the reusable data processors from the data layers #244

Separate the reusable data processors from the data layers #244

kloudkl commented Mar 19, 2014

kloudkl commented Mar 19, 2014

jeffdonahue Mar 19, 2014

jeffdonahue commented Mar 19, 2014

sguada commented Mar 19, 2014

jeffdonahue commented Mar 19, 2014

sguada Mar 21, 2014

kloudkl Mar 21, 2014

kloudkl commented Mar 21, 2014

kloudkl commented Aug 27, 2014

Separate the reusable data processors from the data layers #244

Separate the reusable data processors from the data layers #244

Conversation

kloudkl commented Mar 19, 2014

kloudkl commented Mar 19, 2014

jeffdonahue Mar 19, 2014

Choose a reason for hiding this comment

jeffdonahue commented Mar 19, 2014

sguada commented Mar 19, 2014

jeffdonahue commented Mar 19, 2014

sguada Mar 21, 2014

Choose a reason for hiding this comment

kloudkl Mar 21, 2014

Choose a reason for hiding this comment

kloudkl commented Mar 21, 2014

kloudkl commented Aug 27, 2014