question about the distribution of Q values at each epoch

Hi authors,

It seems that the current codebase has no built-in per-epoch summary showing that Q updates are broadly taking effect. As a result, the distribution of Q values across the memories is unknown: how many of them have a Q value of zero, and how many have long-tail Q values.

Logically, it is quite possible that many memories are never retrieved again and therefore remain at their initial Q value (set to zero according to the paper).

So my question is: have you ever tracked this Q-value distribution? For example, with monitoring metrics such as:

  - total memory count
  - number and ratio of nonzero-Q memories
  - number and ratio of positive / zero / negative Q memories
  - mean / median / min / max Q
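
To make the request concrete, here is a minimal sketch of the kind of per-epoch summary I have in mind. It assumes the Q values of all memories can be collected into a plain list of floats at the end of an epoch; the function name `summarize_q_values` and the `eps` tolerance are hypothetical and not taken from the codebase.

```python
from statistics import mean, median


def summarize_q_values(q_values, eps=1e-12):
    """Summarize a list of per-memory Q values (hypothetical helper).

    Values with |q| <= eps are counted as zero, i.e. likely never updated
    from the initial value.
    """
    total = len(q_values)
    if total == 0:
        # Nothing to summarize; avoid division by zero below.
        return {"total": 0}

    positive = sum(1 for q in q_values if q > eps)
    negative = sum(1 for q in q_values if q < -eps)
    zero = total - positive - negative
    nonzero = positive + negative

    return {
        "total": total,
        "nonzero": nonzero,
        "nonzero_ratio": nonzero / total,
        "positive": positive,
        "positive_ratio": positive / total,
        "zero": zero,
        "zero_ratio": zero / total,
        "negative": negative,
        "negative_ratio": negative / total,
        "mean": mean(q_values),
        "median": median(q_values),
        "min": min(q_values),
        "max": max(q_values),
    }


if __name__ == "__main__":
    # Toy values for illustration only, not real results.
    print(summarize_q_values([0.0, 0.0, 0.5, -0.25]))
```

Logging this dictionary once per epoch would be enough to see whether the share of zero-Q memories shrinks over training.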

Thanks.

question about the distribution of Q values at each epoch #7