Make `main.py` compatible with OpenAI compatible APIs #189

hmellor · 2024-01-23T15:27:36Z

Solves #161 and #148 and is an alternative to #179.

Follows the DRY principle by changing only how the Evaluator class is created in main.py and the generation.parallel_generations function, so there is no need to maintain multiple Evaluator classes in parallel.

Using completions instead of chat.completions was a deliberate design choice: it avoids the errors and confusion that come from additional chat templating taking place behind the API.
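
To illustrate the difference, here is a minimal sketch of what a request to the completions endpoint looks like. It is not the code from this PR: the helper name build_completion_request, the served model name, the "EMPTY" placeholder key, and the default sampling values are all assumptions made for the example. It only builds the request and does not send it. The point is that the body carries a raw prompt string and no messages list, so no chat template is applied on the server side.

```python
import json
import urllib.request


def build_completion_request(base_url, model, prompt, api_key="EMPTY",
                             max_tokens=512, temperature=0.2, n=1):
    """Build (but do not send) a POST request for the text-completions endpoint.

    Hypothetical helper for illustration; the defaults are assumptions.
    """
    payload = {
        "model": model,
        "prompt": prompt,            # raw prompt string, no chat template involved
        "max_tokens": max_tokens,
        "temperature": temperature,
        "n": n,                      # number of completions requested per prompt
    }
    return urllib.request.Request(
        url=base_url.rstrip("/") + "/completions",
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Content-Type": "application/json",
            "Authorization": f"Bearer {api_key}",
        },
        method="POST",
    )


req = build_completion_request(
    "http://localhost:8000/v1", "my-served-model", "def fib(n):\n"
)
body = json.loads(req.data)
print(req.full_url)
print(sorted(body))
```

A chat.completions request would instead go to the chat/completions path and require a messages list, which is where server-side templating comes into play.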

If you want to evaluate a model running behind an OpenAI compatible API, you can use base_url to send all generation requests to that URL.

If you are self-hosting an OpenAI compatible API:
- Set base_url to the URL you are hosting at (e.g. http://localhost:8000/v1).
- Set model to the served name of your model.

If you are using OpenAI's API:
- Set the environment variable OPENAI_API_KEY.
- Set base_url to https://api.openai.com/v1.
- Set model to the name of the OpenAI model you want to use (e.g. gpt-3.5-turbo-1106).
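
The two setups above could be resolved into client settings roughly as follows. This is a sketch under stated assumptions, not the PR's implementation: resolve_api_settings and make_client are hypothetical helpers, and the "EMPTY" placeholder assumes the self-hosted server does not validate the API key. The openai import is kept inside make_client so the package is only needed when an API is actually used.

```python
import os


def resolve_api_settings(base_url, model, env=None):
    """Return the settings needed to talk to an OpenAI compatible API.

    Hypothetical helper: reads OPENAI_API_KEY from the environment and
    falls back to a placeholder key for self-hosted servers.
    """
    env = os.environ if env is None else env
    is_openai = base_url.rstrip("/") == "https://api.openai.com/v1"
    api_key = env.get("OPENAI_API_KEY")
    if api_key is None:
        if is_openai:
            raise ValueError("OPENAI_API_KEY must be set to use OpenAI's API")
        # Assumption: the self-hosted server ignores the key.
        api_key = "EMPTY"
    return {"base_url": base_url, "api_key": api_key, "model": model}


def make_client(settings):
    """Create an async client; requires the openai package (v1 or later)."""
    from openai import AsyncOpenAI  # imported lazily, only when needed

    return AsyncOpenAI(base_url=settings["base_url"], api_key=settings["api_key"])


# Self-hosted example with an empty environment (no key configured).
local = resolve_api_settings("http://localhost:8000/v1", "my-served-model", env={})
print(local)
```

With the settings resolved, make_client(local) would return a client whose completions.create calls are sent to the configured base_url.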

hmellor · 2024-01-23T15:53:56Z

@loubnabnl, if you have time I'd appreciate a review, thanks!

tshrjn · 2024-02-01T20:11:32Z

Seems like there is an issue with chat format:

    raise self._make_status_error_from_response(err.response) from None
openai.BadRequestError: Error code: 400 - {'error': {'message': "'messages' is a required property", 'type': 'invalid_request_error', 'param': None, 'code': None}}
  0%|                                                                                                                                                      | 0/164 [00:02<?, ?it/s]
Task exception was never retrieved
future: <Task finished name='Task-4' coro=<tqdm_asyncio.gather.<locals>.wrap_awaitable() done, defined at /opt/homebrew/anaconda3/envs/7diamond/lib/python3.10/site-packages/tqdm/asyncio.py:75> exception=BadRequestError('Error code: 400 - {\'error\': {\'message\': "\'messages\' is a required property", \'type\': \'invalid_request_error\', \'param\': None, \'code\': None}}')>
Traceback (most recent call last):
  File "/opt/homebrew/anaconda3/envs/env_name/lib/python3.10/site-packages/tqdm/asyncio.py", line 76, in wrap_awaitable
    return i, await f
  File "/opt/homebrew/anaconda3/envs/env_name/lib/python3.10/site-packages/openai/resources/completions.py", line 1020, in create
    return await self._post(
  File "/opt/homebrew/anaconda3/envs/env_name/lib/python3.10/site-packages/openai/_base_client.py", line 1705, in post
    return await self.request(cast_to, opts, stream=stream, stream_cls=stream_cls)
  File "/opt/homebrew/anaconda3/envs/env_name/lib/python3.10/site-packages/openai/_base_client.py", line 1408, in request
    return await self._request(
  File "/opt/homebrew/anaconda3/envs/7diamond/lib/python3.10/site-packages/openai/_base_client.py", line 1499, in _request
    raise self._make_status_error_from_response(err.response) from None
openai.BadRequestError: Error code: 400 - {'error': {'message': "'messages' is a required property", 'type': 'invalid_request_error', 'param': None, 'code': None}}

hmellor · 2024-02-02T10:39:07Z

@tshrjn you're going to need to provide more context, the word chat doesn't feature in my PR at all.

In the PR description I explicitly state that I am not using the chat endpoint, so I don't know what you did to get a chat error.

nielstron · 2024-06-29T15:07:34Z

I tested this branch and it worked perfectly fine. The only caveat is that it only works with completion models (e.g. babbage, davinci at OpenAI) and not with chat models! But this is expected due to the format of the benchmark.

hmellor added 3 commits January 23, 2024 15:13

Make main.py compatible with OpenAI compatible APIs

24f6f1a

Move import to be conditional

5260fa7

Handle batch_size > 1

7fe3981

Include prompt in generation passed to postprocess_generation

352ace2
