Skip to content

Latest commit

 

History

History
31 lines (13 loc) · 1.92 KB

RL-with-CodeLMs.md

File metadata and controls

31 lines (13 loc) · 1.92 KB

Paper Collection for Reinforcement Learning with CodeLMs

  1. [TMLR] RLTF: Reinforcement Learning from Unit Test Feedback. arXiv, 2023.07

    Jiate Liu, Yiqin Zhu, Kaiwen Xiao, Qiang Fu, Xiao Han, Wei Yang, Deheng Ye

  2. [Preprint] InterCode: Standardizing and Benchmarking Interactive Coding with Execution Feedback arXiv, 2023.06

    John Yang, Akshara Prabhakar, Karthik Narasimhan, Shunyu Yao

  3. [Preprint] Coarse-Tuning Models of Code with Reinforcement Learning Feedback arXiv, 2023.05

    Abhinav Jain, Chima Adiole, Swarat Chaudhuri, Thomas Reps, Chris Jermaine

  4. [TMLR] PPOCoder Execution-based Code Generation using Deep Reinforcement Learning. arXiv, 2023.01

    Parshin Shojaee, Aneesh Jain, Sindhu Tipirneni, Chandan K. Reddy

  5. [NIPS2022] CodeRL: Mastering Code Generation through Pretrained Models and Deep Reinforcement Learning. arXiv, 2022.07

    Hung Le, Yue Wang, Akhilesh Deepak Gotmare, Silvio Savarese, Steven C.H. Hoi

  6. [ACL2022] COMPCODER Compilable Neural Code Generation with Compiler Feedback. arXiv, 2022.03

    Xin Wang, Yasheng Wang, Yao Wan, Fei Mi, Yitong Li, Pingyi Zhou, Jin Liu, Hao Wu, Xin Jiang, Qun Liu