Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

MultiPL-E Integration #12

Closed
loubnabnl opened this issue Oct 19, 2022 · 4 comments
Closed

MultiPL-E Integration #12

loubnabnl opened this issue Oct 19, 2022 · 4 comments

Comments

@loubnabnl
Copy link
Collaborator

As part of the integration of MultiPL-E benchmark create Dockerfile/Docker image with all dependencies required to execute the code generations for different programming languages

@arjunguha
Copy link
Contributor

arjunguha commented Oct 19, 2022

Tagging @ytzi and @canders1. Let's do this together. I actually have a little evaluation container here in progress, that only supports Racket (never mind why!)

Here my high-level sketch:

  1. We build a container that runs with --network none and supports all 18+ languages. (Let's also get some of the others to work that are in the Stack.)
  2. It receives the name of a directory/file that is read-only that contains JSON completions. ([example file])(https://github.com/nuprl/MultiPL-E/blob/mbpp/experiments/go-davinci-0.2-keep/HumanEval_0_has_close_elements.json)
  3. It receives the name of a directory (writable) where it emits the result of a run. ([example file])(https://github.com/nuprl/MultiPL-E/blob/mbpp/experiments/go-davinci-0.2-keep/HumanEval_0_has_close_elements.results.json)

I think, Docker/Podman will allow both the directories above to be the same directory if desired.

I think we should do this in a separate repository that omits the analysis scripts, inference running, datasets, and results. BigCode will have their own analyses/inference approach.

@loubnabnl
Copy link
Collaborator Author

loubnabnl commented Oct 19, 2022

Yes it makes sense to do this in a separate repository (maybe use the original MultiPL-E repo?), we can use this issue to discuss and track the progress of the task.

In any case we will need to add the metrics to evaluate and to this repo, and then add the Dockerfile when it's ready to the setup instructions

@arjunguha
Copy link
Contributor

This is now in progress in here:

https://github.com/nuprl/MultiPL-E/tree/only_code/evaluation

When it is a little more mature, we will upload a container image. For now, you have to build it yourself. The makefile has commands to build the container and to test:

https://github.com/nuprl/MultiPL-E/blob/only_code/evaluation/Makefile#L5

(You can replace podman with docker, either should work.)

@arjunguha arjunguha changed the title MultiPL-E Integration - create Docker image MultiPL-E Integration Jan 12, 2023
@loubnabnl
Copy link
Collaborator Author

closing this issue as MultiPL-E was integrated in #44 🥳

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
Status: Done
Development

No branches or pull requests

2 participants