The official Repo of the paper MHPP: Exploring the Capabilities and Limitations of Language Models Beyond Basic Code Generation 📄✨
Welcome to the official repository of our paper! 🎉
The dataset can be found in the data
directory, which includes details of each question, such as the question's challenge type. The raw questions can be accessed using the 'question' field. We also provide a version of the prompt with a prefix for users, as mentioned in the paper. Creating this dataset was labor-intensive. If you find any errors, please submit an issue to provide feedback. Thank you very much! 🙏
The leaderboard is available at MHPP Leaderboard. 🏆
- Run your model on our dataset in the
/data/
folder. You can either take the "prompt" field as input (as we do in the paper) or create a new input by adding more user instructions before or after the "question" field. 💻 - Save the result in JSONL format. Remember to include the question ID, function name, and model response in each line. 📝
- Click "File a request" on the leaderboard page to create an issue, and upload the JSONL file in the "Model Response" part. 📤
- Fill in other necessary information and submit. 📄
- Wait. ⏳
- Get your model result with a detailed report. 📊
Thank you for your interest and participation! 😊
If you use MHPP in your research, please consider citing us:
@article{dai2024mhpp,
title={MHPP: Exploring the Capabilities and Limitations of Language Models Beyond Basic Code Generation},
author={Dai, Jianbo and Lu, Jianqiao and Feng, Yunlong and Ruan, Rongju and Cheng, Ming and Tan, Haochen and Guo, Zhijiang},
journal={arXiv preprint arXiv:2405.11430},
year={2024}
}