-
[TMLR] RLTF: Reinforcement Learning from Unit Test Feedback. , 2023.07
Jiate Liu, Yiqin Zhu, Kaiwen Xiao, Qiang Fu, Xiao Han, Wei Yang, Deheng Ye
-
[Preprint] InterCode: Standardizing and Benchmarking Interactive Coding with Execution Feedback , 2023.06
John Yang, Akshara Prabhakar, Karthik Narasimhan, Shunyu Yao
-
[Preprint] Coarse-Tuning Models of Code with Reinforcement Learning Feedback , 2023.05
Abhinav Jain, Chima Adiole, Swarat Chaudhuri, Thomas Reps, Chris Jermaine
-
[TMLR]
PPOCoder
Execution-based Code Generation using Deep Reinforcement Learning. , 2023.01Parshin Shojaee, Aneesh Jain, Sindhu Tipirneni, Chandan K. Reddy
-
[NIPS2022] CodeRL: Mastering Code Generation through Pretrained Models and Deep Reinforcement Learning. , 2022.07
Hung Le, Yue Wang, Akhilesh Deepak Gotmare, Silvio Savarese, Steven C.H. Hoi
-
[ACL2022]
COMPCODER
Compilable Neural Code Generation with Compiler Feedback. , 2022.03Xin Wang, Yasheng Wang, Yao Wan, Fei Mi, Yitong Li, Pingyi Zhou, Jin Liu, Hao Wu, Xin Jiang, Qun Liu