You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
I'm curious about how well they act generally over a long time window. GPT-3 was much better than the metrics suggested, simply by virtue of its flexibility during direct interactions. Are there any videos I can watch to see how "generally good" these models are?
Thanks!
The text was updated successfully, but these errors were encountered:
I have done a few multi hour survival recordings, including several 12+ hour marathons in which neither death nor success would free the agent from the world.
While I lost my longest attempt due to an unfortunate windows update corrupting the video I am converting and uploading what I have now. (might be a few hours, these are a little hefty, and my internet isn't top notch.)
Sorry, I got distracted after recording by the release of the datasets/other scripts for BASALT.
Do these include the foundation / early game model? I'm curious whether some of the pathologies of the diamond getter (like running into lava) were caused by the RL training.
I'm curious about how well they act generally over a long time window. GPT-3 was much better than the metrics suggested, simply by virtue of its flexibility during direct interactions. Are there any videos I can watch to see how "generally good" these models are?
Thanks!
The text was updated successfully, but these errors were encountered: