
Trainer2 #1285

Merged
merged 250 commits into master from trainer2 on Jul 11, 2016

Conversation

beam2d
Member

@beam2d beam2d commented Jun 16, 2016

Fixes #914. I wrote an updated version of the training loop abstraction. It includes many improvements based on the feedback on the old version (#958). For example:

  • The design around the dataset abstraction is improved. Most conceivable applications are supported with some customization effort.
  • The new Trainer supports multiple datasets and multiple optimizers.
  • Evaluation reporting is abstracted into the Reporter object. This makes it easy to collect many observed values such as loss/accuracy, activation statistics, etc.

I have not updated the tutorial document yet; this should be done before merging.
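To illustrate the Reporter idea described above, here is a minimal toy sketch (hypothetical; this is not the actual Chainer implementation, and the method names are assumptions): a scoped object that collects named observations reported from anywhere inside the training loop.

```python
from contextlib import contextmanager

class Reporter:
    """Toy sketch of the Reporter concept: collects named observations
    (loss, accuracy, activation statistics, ...) reported from anywhere
    inside its scope."""

    _current = None  # the active reporter, if any

    def __init__(self):
        self.observation = {}

    @contextmanager
    def scope(self):
        # Make this reporter the active one for the duration of the block.
        prev, Reporter._current = Reporter._current, self
        try:
            yield self
        finally:
            Reporter._current = prev

    @staticmethod
    def report(values):
        # Record the given name -> value pairs on the active reporter.
        if Reporter._current is not None:
            Reporter._current.observation.update(values)

reporter = Reporter()
with reporter.scope():
    Reporter.report({'main/loss': 0.25, 'main/accuracy': 0.9})

print(reporter.observation)  # {'main/loss': 0.25, 'main/accuracy': 0.9}
```

Because reporting goes through the active scope rather than an explicit argument, model code can report values without the training loop threading a reporter through every call.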

also useful when the order of examples is important and should not be broken.

Args:
Contributor

Consider adding a "shuffle" (bool) argument, with default value False. If set to True, it takes on the same behavior as ShuffledIterator. There seems to be quite a bit of code duplication between these two classes. If this change is implemented, ShuffledIterator could then be removed.

Member Author

Thank you, I agree. I updated the code and merged these iterators into SerialIterator.
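The merged behavior can be sketched as follows (a hypothetical illustration of the `shuffle` flag design, not Chainer's actual SerialIterator):

```python
import random

class SerialIterator:
    """Toy sketch: yields fixed-size minibatches from a dataset in the
    main thread. shuffle=True randomizes the visiting order, which
    subsumes the behavior of the old ShuffledIterator."""

    def __init__(self, dataset, batch_size, shuffle=False, seed=0):
        self.dataset = list(dataset)
        self.batch_size = batch_size
        self.shuffle = shuffle
        self._rng = random.Random(seed)

    def __iter__(self):
        order = list(range(len(self.dataset)))
        if self.shuffle:
            self._rng.shuffle(order)  # randomize only when requested
        for i in range(0, len(order), self.batch_size):
            yield [self.dataset[j] for j in order[i:i + self.batch_size]]

it = SerialIterator(range(6), batch_size=2)  # default: preserves order
print(list(it))  # [[0, 1], [2, 3], [4, 5]]
```

With the default `shuffle=False`, example order is preserved, which matters for ordered data; setting `shuffle=True` covers the shuffled case without a second class.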

@delta2323
Copy link
Member

@beam2d I checked ExponentialShift. It seems OK.
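For context, ExponentialShift scales an optimizer hyperparameter by a constant factor each time it fires, giving an exponential schedule. A minimal standalone sketch (hypothetical; not the reviewed code):

```python
class ExponentialShift:
    """Toy sketch: multiplies an optimizer attribute (e.g. 'lr') by
    `rate` on every call, yielding exponential decay (rate < 1) or
    growth (rate > 1)."""

    def __init__(self, attr, rate, init):
        self.attr = attr
        self.rate = rate
        self.value = init

    def __call__(self, optimizer):
        self.value *= self.rate
        setattr(optimizer, self.attr, self.value)

class Opt:
    """Stand-in for an optimizer exposing an `lr` attribute."""
    lr = 0.1

opt = Opt()
shift = ExponentialShift('lr', rate=0.5, init=0.1)
for _ in range(3):  # three firings: 0.1 * 0.5**3
    shift(opt)
print(round(opt.lr, 6))  # 0.0125
```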


Chainer provides some iterators that implement typical strategies to create minibatches by iterating over datasets.
:class:`SerialIterator` is the simplest one, which extracts mini-batches in the main thread.
:class:`MultiprocessIterator` is a parallelized version of :class:`ShuffledIterator`. It maintains worker subprocesses to load the next mini batch in parallel.
Member

ShuffledIterator no longer exists.

@delta2323
Member

LGTM 👍

@delta2323 delta2323 merged commit 26b380d into master Jul 11, 2016
@beam2d beam2d added this to the v1.11.0 milestone Jul 12, 2016
@beam2d beam2d deleted the trainer2 branch July 25, 2016 01:40
Labels
cat:feature Implementation that introduces new interfaces.
Projects
None yet
Development

Successfully merging this pull request may close these issues.

Make it easy to write a training code
6 participants