# Aiera Speaker Assignment
This notebook provides a sample execution of the task packaged in `tasks/aiera-speaker-assign`, using OpenAI's `gpt-4-turbo-2024-04-09` model.

Running it requires the environment variables `OPENAI_API_KEY` and `OPENAI_ORG_ID` to be set. See [OpenAI's docs](https://platform.openai.com/docs/api-reference/authentication) for details on how to set these up.
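
Before running the cells below, it can help to confirm that both variables are actually present. The following is a minimal sketch, not part of the task package; the helper name `missing_env_vars` and the `REQUIRED_VARS` tuple are introduced here for illustration only.

```python
import os

# Names taken from the paragraph above; the tuple itself is illustrative.
REQUIRED_VARS = ("OPENAI_API_KEY", "OPENAI_ORG_ID")


def missing_env_vars(names=REQUIRED_VARS, env=None):
    """Return the variable names that are unset or empty in `env`."""
    env = os.environ if env is None else env
    return [name for name in names if not env.get(name)]


# Report anything missing before the OpenAI client is configured.
missing = missing_env_vars()
if missing:
    print("Missing environment variables: " + ", ".join(missing))
```

Passing an explicit `env` mapping keeps the helper easy to check without touching the real environment.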

In [None]:
# import relevant libraries
from lm_eval import simple_evaluate, tasks
from lm_eval.models.openai_completions import OpenaiChatCompletionsLM
import openai
import os

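# Configure the OpenAI client from the environment.
# The module-level attribute for the organization is `organization`, not `org_id`.
openai.api_key = os.environ["OPENAI_API_KEY"]
openai.organization = os.environ["OPENAI_ORG_ID"]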

CWD = os.getcwd()
REPO_ROOT = os.path.abspath(os.path.join(CWD, ".."))

## First set up the model and task manager

In [None]:
model = OpenaiChatCompletionsLM(model="gpt-4-turbo-2024-04-09")

In [None]:
task_manager = tasks.TaskManager(
    include_path=f"{REPO_ROOT}/tasks", include_defaults=False
)
task_manager.all_tasks

## Run the evaluation

In [None]:
results = simple_evaluate(
    model=model,
    tasks=["aiera_speaker_assign"],
    num_fewshot=0,
    task_manager=task_manager,
    write_out=False,
    # limit=2,  # uncomment to limit the number of samples run
)

## Inspect the results

In [None]:
results["results"]
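
The nested dictionary can be hard to scan at a glance. As a rough sketch, assuming `results["results"]` maps each task name to a dictionary of metric names and values, the helper below flattens it into printable rows. The function name `flatten_results` and the `sample` dictionary are hypothetical; the sample is a placeholder, not real evaluation output.

```python
def flatten_results(task_results):
    """Flatten {task: {metric: value}} into sorted (task, metric, value) rows.

    Only numeric values are kept, so string entries such as aliases are skipped.
    """
    rows = []
    for task_name, metrics in sorted(task_results.items()):
        for metric_name, value in sorted(metrics.items()):
            if isinstance(value, (int, float)) and not isinstance(value, bool):
                rows.append((task_name, metric_name, value))
    return rows


# Hypothetical placeholder data for illustration only; not a real score.
sample = {
    "aiera_speaker_assign": {
        "alias": "aiera_speaker_assign",
        "example_metric": 0.5,
    }
}

for task_name, metric_name, value in flatten_results(sample):
    print(f"{task_name}\t{metric_name}\t{value}")
```

In the notebook, `flatten_results(results["results"])` would be called on the real output in place of `sample`.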