Hi authors,
It seems that the current codebase has no built-in epoch summary showing that Q updates are broadly taking effect, so the distribution of Q values across the memories is unknown: how many of them have a Q value of zero, and how many fall in the long tail.
It is quite possible that many memories are never retrieved again and therefore remain at their initial Q value (set to zero according to the paper).
So my question is: have you ever tracked this Q value distribution?
For example, monitoring metrics such as:
- total memory count
- number and ratio of nonzero-Q memories
- number and ratio of positive / zero / negative Q memories
- mean / median / min / max Q
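
To make the request concrete, here is a rough sketch of the kind of summary I have in mind. It assumes the Q values can be pulled out of the memory store as a flat list of floats; the function name `summarize_q_values` and the `eps` tolerance are my own placeholders, not anything from the codebase.

```python
from statistics import mean, median
from typing import Dict, Sequence


def summarize_q_values(q_values: Sequence[float], eps: float = 1e-12) -> Dict[str, float]:
    """Hypothetical helper: summarize per-memory Q values for one epoch.

    Values with |q| <= eps are counted as zero (assumed tolerance).
    """
    total = len(q_values)
    if total == 0:
        # Nothing to summarize; avoid dividing by zero below.
        return {"total": 0}

    positive = sum(1 for q in q_values if q > eps)
    negative = sum(1 for q in q_values if q < -eps)
    zero = total - positive - negative
    nonzero = positive + negative

    return {
        "total": total,
        "nonzero": nonzero,
        "nonzero_ratio": nonzero / total,
        "positive": positive,
        "positive_ratio": positive / total,
        "zero": zero,
        "zero_ratio": zero / total,
        "negative": negative,
        "negative_ratio": negative / total,
        "mean": mean(q_values),
        "median": median(q_values),
        "min": min(q_values),
        "max": max(q_values),
    }


if __name__ == "__main__":
    # Toy values only, not real results.
    print(summarize_q_values([0.0, 0.0, 0.5, -0.25, 1.0]))
```

Logging something like this once per epoch would show whether the zero-Q share shrinks over time or stays flat.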
Thanks.