code_repair task evaluation #2

Dear Authors,
Thanks for the great work! I'm planning to build on this dataset and have run into a small problem:
Following the README, I ran the code_repair inference for several open-source LLMs, and the results are saved to `CodeScope/code_repair/inference/result/code_repair_eval_{model_name}.jsonl` by default.
I'm confused about the next step to take. 
1. The jsonl file looks like this:
![image](https://github.com/WeixiangYAN/CodeScope/assets/51442540/c6e79520-7c62-4566-a1d8-f9c59e655729)
If I'm not mistaken, `source_code` is the code the LLM is asked to debug, and `code_repairing_0` is the LLM's output, which contains code plus some explanation.
2. However, the next step suggests that the Evaluator needs an input jsonl in a different format:
![623901717054517_ pic](https://github.com/WeixiangYAN/CodeScope/assets/51442540/bbb4130a-0cd8-4509-b554-25e64458f656)

The question is how to convert the jsonl in 1 into the input format required in 2. Specifically:
1. Should the "source_code" in 2 be replaced by the code extracted from the model's output? Is there any code that performs this conversion?
2. For "lang_cluster", "lang", etc., should I replace the value with "{model_name}" as in the README? Or is that just a placeholder, so that I should keep the original values from `code_repair_eval_{model_name}.jsonl` unchanged?
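
To make question 1 concrete, here is roughly the conversion I would write myself if no script exists. It is only a sketch under my own assumptions: that the model wraps its fix in a markdown code fence inside `code_repairing_0`, that the Evaluator reads the repaired code from `source_code`, and that all other fields should be carried over unchanged. The helper names (`extract_code`, `convert_record`, `convert_file`) are hypothetical and not part of CodeScope.

```python
import json
import re

# Build the fence marker programmatically so this snippet can itself sit inside a fence.
FENCE = "`" * 3
# Assumption: the model's answer contains a fenced block, optionally tagged with a language.
FENCE_RE = re.compile(FENCE + r"[^\n]*\n(.*?)" + FENCE, re.DOTALL)


def extract_code(model_output: str) -> str:
    """Return the first fenced code block, or the whole text if no fence is found."""
    match = FENCE_RE.search(model_output)
    return match.group(1).strip() if match else model_output.strip()


def convert_record(record: dict) -> dict:
    """Copy a record and overwrite `source_code` with the extracted repair (my guess)."""
    converted = dict(record)  # "lang_cluster", "lang", etc. are kept as they are
    converted["source_code"] = extract_code(record["code_repairing_0"])
    return converted


def convert_file(src_path: str, dst_path: str) -> None:
    """Convert an inference-result jsonl file line by line."""
    with open(src_path, encoding="utf-8") as src, \
            open(dst_path, "w", encoding="utf-8") as dst:
        for line in src:
            if line.strip():
                dst.write(json.dumps(convert_record(json.loads(line))) + "\n")
```

If the intended pipeline is different (for example, if the Evaluator expects the repaired code under another key), please let me know and I will adjust accordingly.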

Thanks for your help!
