Commit 11095ab
committed
refactor(model): remove episode long RNN state tracking
R2D2 and friends use this to great effect, but either I don't train long enough or I messed up the implementation. Remove it for now to simplify model usage, and let LSTM only operate on the current batch of observations (which are a rolling window of timesteps)
BREAKING CHANGE: this removes long-term RNN state tracking across episodes. Tracking the state was a significant amount of code and it wasn't clear that it made the model substantially better at any given task.
The overhead associated with keeping lots of hidden states in memory and calculating state histories was not insignificant on CPU training setups as well.1 parent 28afa2c commit 11095ab
File tree
33 files changed
+63
-642
lines changed- libraries
- mathy_python
- mathy
- agents
- a3c
- muzero
- zero
- envs/gym
- tests
- website
- docs
- api
- ml
- snippets/ml
- tests/ml
33 files changed
+63
-642
lines changed| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
37 | 37 | | |
38 | 38 | | |
39 | 39 | | |
40 | | - | |
| 40 | + | |
41 | 41 | | |
42 | 42 | | |
43 | 43 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
221 | 221 | | |
222 | 222 | | |
223 | 223 | | |
224 | | - | |
| 224 | + | |
225 | 225 | | |
226 | 226 | | |
227 | 227 | | |
228 | 228 | | |
229 | 229 | | |
230 | 230 | | |
231 | | - | |
232 | | - | |
233 | | - | |
234 | | - | |
235 | | - | |
236 | | - | |
237 | | - | |
238 | | - | |
239 | | - | |
240 | | - | |
241 | | - | |
242 | | - | |
243 | | - | |
244 | | - | |
245 | | - | |
246 | | - | |
247 | | - | |
248 | 231 | | |
249 | 232 | | |
250 | 233 | | |
251 | | - | |
252 | | - | |
253 | | - | |
254 | | - | |
255 | | - | |
256 | | - | |
257 | | - | |
258 | | - | |
259 | | - | |
260 | | - | |
261 | | - | |
262 | | - | |
263 | | - | |
264 | | - | |
265 | | - | |
266 | | - | |
267 | | - | |
268 | | - | |
269 | | - | |
270 | | - | |
271 | 234 | | |
272 | 235 | | |
273 | 236 | | |
| |||
277 | 240 | | |
278 | 241 | | |
279 | 242 | | |
280 | | - | |
281 | 243 | | |
282 | 244 | | |
283 | 245 | | |
284 | 246 | | |
285 | 247 | | |
286 | 248 | | |
287 | 249 | | |
288 | | - | |
289 | | - | |
290 | | - | |
291 | | - | |
292 | | - | |
293 | | - | |
294 | | - | |
295 | | - | |
296 | | - | |
297 | 250 | | |
298 | 251 | | |
299 | 252 | | |
300 | 253 | | |
301 | 254 | | |
302 | 255 | | |
303 | 256 | | |
304 | | - | |
305 | | - | |
306 | | - | |
307 | 257 | | |
308 | 258 | | |
309 | 259 | | |
| |||
315 | 265 | | |
316 | 266 | | |
317 | 267 | | |
318 | | - | |
319 | 268 | | |
320 | 269 | | |
321 | 270 | | |
322 | 271 | | |
323 | 272 | | |
324 | 273 | | |
325 | | - | |
326 | | - | |
327 | | - | |
328 | | - | |
329 | | - | |
330 | | - | |
331 | | - | |
332 | | - | |
333 | | - | |
334 | | - | |
335 | | - | |
336 | | - | |
337 | | - | |
338 | | - | |
339 | | - | |
340 | | - | |
341 | | - | |
342 | | - | |
343 | | - | |
344 | | - | |
345 | | - | |
346 | | - | |
347 | | - | |
348 | | - | |
349 | | - | |
350 | 274 | | |
351 | 275 | | |
352 | 276 | | |
| |||
406 | 330 | | |
407 | 331 | | |
408 | 332 | | |
409 | | - | |
410 | | - | |
411 | | - | |
412 | | - | |
413 | | - | |
414 | | - | |
415 | | - | |
416 | | - | |
417 | | - | |
418 | | - | |
419 | | - | |
420 | 333 | | |
421 | 334 | | |
422 | 335 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
26 | 26 | | |
27 | 27 | | |
28 | 28 | | |
29 | | - | |
30 | 29 | | |
31 | 30 | | |
32 | 31 | | |
| |||
46 | 45 | | |
47 | 46 | | |
48 | 47 | | |
49 | | - | |
50 | 48 | | |
51 | 49 | | |
52 | 50 | | |
| |||
85 | 83 | | |
86 | 84 | | |
87 | 85 | | |
88 | | - | |
89 | 86 | | |
90 | | - | |
| 87 | + | |
91 | 88 | | |
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
13 | 13 | | |
14 | 14 | | |
15 | 15 | | |
16 | | - | |
17 | | - | |
18 | | - | |
19 | | - | |
20 | | - | |
21 | | - | |
22 | | - | |
| 16 | + | |
23 | 17 | | |
24 | 18 | | |
25 | | - | |
26 | | - | |
27 | | - | |
28 | 19 | | |
29 | 20 | | |
30 | 21 | | |
| |||
69 | 60 | | |
70 | 61 | | |
71 | 62 | | |
72 | | - | |
73 | | - | |
74 | | - | |
75 | | - | |
76 | | - | |
77 | | - | |
78 | | - | |
79 | | - | |
80 | | - | |
81 | | - | |
82 | | - | |
83 | | - | |
84 | | - | |
85 | | - | |
86 | | - | |
87 | | - | |
88 | | - | |
89 | | - | |
| 63 | + | |
| 64 | + | |
| 65 | + | |
| 66 | + | |
| 67 | + | |
| 68 | + | |
| 69 | + | |
| 70 | + | |
| 71 | + | |
90 | 72 | | |
91 | 73 | | |
92 | 74 | | |
| |||
98 | 80 | | |
99 | 81 | | |
100 | 82 | | |
101 | | - | |
102 | | - | |
103 | | - | |
104 | | - | |
105 | | - | |
106 | | - | |
107 | | - | |
108 | | - | |
109 | | - | |
110 | | - | |
111 | | - | |
112 | | - | |
113 | | - | |
114 | | - | |
115 | | - | |
116 | | - | |
117 | | - | |
118 | | - | |
119 | | - | |
120 | | - | |
121 | | - | |
122 | 83 | | |
123 | 84 | | |
124 | 85 | | |
| |||
128 | 89 | | |
129 | 90 | | |
130 | 91 | | |
131 | | - | |
132 | | - | |
133 | | - | |
134 | | - | |
135 | 92 | | |
136 | 93 | | |
137 | 94 | | |
| |||
159 | 116 | | |
160 | 117 | | |
161 | 118 | | |
162 | | - | |
163 | | - | |
164 | 119 | | |
165 | 120 | | |
166 | 121 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
7 | 7 | | |
8 | 8 | | |
9 | 9 | | |
10 | | - | |
11 | 10 | | |
12 | 11 | | |
13 | 12 | | |
14 | | - | |
15 | | - | |
16 | | - | |
17 | | - | |
18 | | - | |
19 | | - | |
20 | | - | |
21 | | - | |
22 | | - | |
23 | | - | |
24 | | - | |
25 | | - | |
26 | | - | |
27 | | - | |
28 | | - | |
29 | | - | |
30 | | - | |
31 | | - | |
32 | | - | |
33 | | - | |
34 | | - | |
35 | | - | |
36 | | - | |
37 | | - | |
38 | 13 | | |
39 | 14 | | |
40 | 15 | | |
| |||
44 | 19 | | |
45 | 20 | | |
46 | 21 | | |
47 | | - | |
48 | | - | |
49 | 22 | | |
50 | 23 | | |
51 | 24 | | |
| |||
55 | 28 | | |
56 | 29 | | |
57 | 30 | | |
58 | | - | |
59 | 31 | | |
60 | 32 | | |
61 | 33 | | |
| |||
74 | 46 | | |
75 | 47 | | |
76 | 48 | | |
77 | | - | |
78 | 49 | | |
79 | 50 | | |
80 | 51 | | |
81 | 52 | | |
82 | 53 | | |
83 | 54 | | |
84 | | - | |
85 | | - | |
86 | | - | |
87 | | - | |
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
10 | 10 | | |
11 | 11 | | |
12 | 12 | | |
13 | | - | |
| 13 | + | |
14 | 14 | | |
0 commit comments