This repository contains source code and data based on the original article, adapted to take into account the peculiarities of the Russian language.
Please make sure that all required Python packages are installed:
pip3 install -r requirements.txt
Note
The release version of the dataset has changed to ruIFEval_v0.1. Some prompts have been improved in this version.
You need to create a JSONL file in which each line is a JSON object with two fields: prompt and response.
Then, call evaluation_main. For example:
# Content of `--input_response_data` should be like:
# {"prompt": "Напиши 300+ слов ...", "response": "PUT YOUR MODEL RESPONSE HERE"}
# {"prompt": "Я планирую отправиться в путешествие ...", "response": "PUT YOUR MODEL RESPONSE HERE"}
# ...
python3 -m evaluation_main \
--input_data=./data/ruIFEval_v0.1.jsonl \
--input_response_data=./data/response_gemini_pro_v0.1.jsonl \
--output_dir=./data/
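The response file passed to `--input_response_data` could be produced with a short script along the following lines. This is a minimal sketch, not part of the repository: `generate_response` and `build_response_file` are hypothetical names, and the code assumes that each line of the input dataset is a JSON object with a `prompt` field.

```python
import json
from pathlib import Path


def generate_response(prompt: str) -> str:
    """Hypothetical placeholder: replace the body with a call to your model."""
    return "PUT YOUR MODEL RESPONSE HERE"


def build_response_file(input_path: str, output_path: str) -> int:
    """Write one {"prompt", "response"} object per line and return the record count."""
    count = 0
    with Path(input_path).open(encoding="utf-8") as src, \
            Path(output_path).open("w", encoding="utf-8") as dst:
        for line in src:
            line = line.strip()
            if not line:
                continue  # skip blank lines
            # Assumption: every dataset record carries a "prompt" field.
            prompt = json.loads(line)["prompt"]
            record = {"prompt": prompt, "response": generate_response(prompt)}
            # ensure_ascii=False keeps Cyrillic text readable in the output file.
            dst.write(json.dumps(record, ensure_ascii=False) + "\n")
            count += 1
    return count
```

For instance, `build_response_file("./data/ruIFEval_v0.1.jsonl", "./data/my_responses.jsonl")` would write one record per prompt; the output file name here is only an illustration.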