GitHub - holi-lab/POISE: Official code release for "Your Language Model is Its Own Critic: Reinforcement Learning with Value Estimation from Actor’s Internal States"

Name		Name	Last commit message	Last commit date
Latest commit History 16 Commits
assets		assets
index.html		index.html
robots.txt		robots.txt
sitemap.xml		sitemap.xml

About

Official code release for "Your Language Model is Its Own Critic: Reinforcement Learning with Value Estimation from Actor’s Internal States"

No releases published