[WIP] Keep track of loaded shards #1826
Conversation
@@ -706,6 +706,32 @@ def __iter__(self):
        return


class Tracker(object):
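The diff above only shows the first line of the new `Tracker` class. A minimal sketch of what a shard-progress tracker along these lines might look like (the method names and attributes here are assumptions for illustration, not the PR's actual implementation):

```python
class DataProgressTracker(object):
    """Records which dataset shard was most recently loaded, so that
    training can resume from (approximately) the same point in the data."""

    def __init__(self):
        self.last_shard = 0

    def update(self, shard_index):
        # Called each time a new shard is opened during iteration.
        self.last_shard = shard_index

    def state_dict(self):
        # Serialisable form, suitable for saving inside a checkpoint.
        return {"last_shard": self.last_shard}

    def load_state_dict(self, state):
        # Restore progress from a previously saved checkpoint.
        self.last_shard = state.get("last_shard", 0)
```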
Perhaps Tracker is a bit of a vague name? Suggestions: DataTracker or DataProgressTracker.
if tracker_queue is not None:
    data_tracker = tracker_queue.get()
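For context on the pattern the snippet uses: a locally initialised tracker is replaced by one pulled off a queue when available. A minimal self-contained sketch of that hand-off, with `queue.Queue` standing in for whatever inter-process queue the PR actually uses (all names here are assumptions):

```python
import queue

# Locally initialised default, as in the code above.
data_tracker = {"last_shard": 0}

# In the PR, this queue would be filled by another process (e.g. a data
# loader) sending back up-to-date tracker state.
tracker_queue = queue.Queue()
tracker_queue.put({"last_shard": 7})

if tracker_queue is not None and not tracker_queue.empty():
    # The queued state supersedes the default-initialised tracker.
    data_tracker = tracker_queue.get()

print(data_tracker["last_shard"])  # 7
```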
Doesn't this conflict with the data_tracker initialised above? (not sure) In which case would a data_tracker not be initialised locally but only be available in the queue?
Even though I am not entirely sure how this works, it seems good to me. You seem to track the latest shard that has been used during training and also save it in the checkpoints. When continuing training, the latest shard can then be used. Smart! I wrote some questions inline, but those are not critiques, rather questions to better understand what is going on.
This would allow restarting from (approximately) the same point in the data when continuing some training, instead of restarting from shard 0 each time.
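To make the resume behaviour described above concrete, here is a hedged sketch of saving the last-used shard index in a checkpoint and skipping already-consumed shards on restart. The checkpoint layout and helper names are assumptions for illustration, not the PR's actual format:

```python
import os
import pickle
import tempfile

def save_checkpoint(path, model_state, last_shard):
    # Store the shard index alongside the model weights.
    with open(path, "wb") as f:
        pickle.dump({"model": model_state, "last_shard": last_shard}, f)

def resume_shards(path, shards):
    # On restart, begin from the recorded shard rather than shards[0].
    with open(path, "rb") as f:
        ckpt = pickle.load(f)
    start = ckpt.get("last_shard", 0)
    return shards[start:]

path = os.path.join(tempfile.mkdtemp(), "ckpt.pkl")
save_checkpoint(path, {"w": [0.1]}, last_shard=2)
print(resume_shards(path, ["shard0", "shard1", "shard2", "shard3"]))
# ['shard2', 'shard3']
```

Note that this is only approximate resumption: progress within the current shard is lost, so training restarts at the beginning of the last recorded shard.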