# LLM Experiments Setup

To run the LLM-based experiment, we must first set up a local instance of the LLM. We use the [Ollama project](https://github.com/ollama/ollama) to run an instance of the Llama 2 (7B) model.

## Environment Setup

A few environment variables are required to initialize the model. First, make a local copy of the environment file by running `cp .env.example .env`. Then, change the variable values to point to your local directories. The Ollama API is available on local port 11434, and the GUI on local port 3000. We use the [python-dotenv](https://pypi.org/project/python-dotenv/) library to load the defined environment variables into this notebook. Note that you must restart the kernel every time you update the `.env` file for the changes to take effect.
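
Before moving on, it can help to confirm that the variables actually reached the process. The following is a minimal sketch using only the standard library. The names in `REQUIRED_VARS` and the helper `missing_vars` are hypothetical and not part of this repository; substitute the names listed in `.env.example`.

```python
import os
from typing import Iterable, List, Mapping, Optional

# Hypothetical variable names for illustration only; replace them with
# the names actually listed in .env.example.
REQUIRED_VARS = ("MODELS_DIR", "DATA_DIR")


def missing_vars(
    required: Iterable[str],
    env: Optional[Mapping[str, str]] = None,
) -> List[str]:
    """Return the required variable names that are unset or empty."""
    env = os.environ if env is None else env
    return [name for name in required if not env.get(name)]


if __name__ == "__main__":
    missing = missing_vars(REQUIRED_VARS)
    if missing:
        print("Missing environment variables:", ", ".join(missing))
    else:
        print("All required environment variables are set.")
```

Running this after the `.env` file has been loaded gives an early, readable failure instead of an error deep inside model initialization.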

## Docker and Docker Compose

We have provided a Docker Compose configuration for a more streamlined setup process. First, make sure you have [Docker installed](https://docs.docker.com/engine/install/). Then, from the root of this repository, it suffices to run `docker compose up`.
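
Once the containers are up, a quick reachability check can save debugging time later. The sketch below uses only the standard library. It assumes the API is exposed at `http://localhost:11434` (the port mentioned above) and that the root path answers with HTTP 200, as a stock Ollama server does. `ollama_is_up` is a hypothetical helper, not part of this repository.

```python
import http.client
import urllib.request


def ollama_is_up(base_url: str = "http://localhost:11434", timeout: float = 2.0) -> bool:
    """Return True if base_url answers with HTTP 200, False otherwise."""
    try:
        with urllib.request.urlopen(base_url, timeout=timeout) as resp:
            return resp.status == 200
    except (OSError, http.client.HTTPException):
        # Covers connection refused, timeouts, DNS failures, HTTP error
        # statuses, and malformed replies.
        return False


if __name__ == "__main__":
    print("Ollama reachable:", ollama_is_up())
```

If this prints `False`, the container may still be starting, or the port mapping in your Compose configuration may differ from the default.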

## Configuration

The config values specified here are used throughout the codebase and should work with their defaults if you are running Ollama via the provided Docker configuration. Don't forget to update the `.env` path if it differs from the default one.

In [1]:
from dotenv import load_dotenv

from ced.tools.llm import LLMConfig, LLMGateway

In [2]:
_ = load_dotenv(dotenv_path="./.env")

## Create and Test the Model

We create the model using our custom prompt and configuration variables. Note that the first run might take some time, as the model weights must be downloaded (~3.8 GB).

In [3]:
config = LLMConfig()
gateway = LLMGateway(config=config)

In [24]:
assert gateway.pull()
assert gateway.create(config=LLMConfig.modelfile())

In [11]:
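# Send three consecutive observations. Each call receives the context
# returned by the previous one, so the model sees them as one conversation.
r1 = gateway.generate(prompt="obs: A1 respawn; A2 respawn;")
r2 = gateway.generate(prompt="obs: A1 (PINK GREEN); A2 (PINK YELLOW);", context=r1.context)
r3 = gateway.generate(prompt="obs: A1 has PINK; A2 has PINK;", context=r2.context)
print(r1.response, r2.response, r3.response)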
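
For readers who want to see what the context chaining above might look like at the HTTP level, here is a minimal sketch against Ollama's `/api/generate` endpoint, which accepts `model`, `prompt`, `stream`, and an optional `context` field and returns `response` and `context` in its JSON reply. That `LLMGateway` forwards to this endpoint is an assumption and has not been checked against its source. The helper names and the model name `my-model` are hypothetical.

```python
import json
import urllib.request
from typing import Any, Dict, List, Optional


def build_generate_payload(
    model: str,
    prompt: str,
    context: Optional[List[int]] = None,
) -> Dict[str, Any]:
    """Build the JSON body for a non-streaming /api/generate request."""
    payload: Dict[str, Any] = {"model": model, "prompt": prompt, "stream": False}
    if context is not None:
        # Token context returned by a previous call; omitted on the first turn.
        payload["context"] = context
    return payload


def generate(
    model: str,
    prompt: str,
    context: Optional[List[int]] = None,
    base_url: str = "http://localhost:11434",
    timeout: float = 120.0,
) -> Dict[str, Any]:
    """POST the payload to Ollama and return the decoded JSON reply."""
    body = json.dumps(build_generate_payload(model, prompt, context)).encode("utf-8")
    request = urllib.request.Request(
        f"{base_url}/api/generate",
        data=body,  # supplying data makes this a POST request
        headers={"Content-Type": "application/json"},
    )
    with urllib.request.urlopen(request, timeout=timeout) as resp:
        return json.loads(resp.read().decode("utf-8"))


# Example usage (requires a running Ollama server; "my-model" is a
# hypothetical model name):
# r1 = generate("my-model", "obs: A1 respawn; A2 respawn;")
# r2 = generate("my-model", "obs: A1 (PINK GREEN); A2 (PINK YELLOW);",
#               context=r1["context"])
# print(r1["response"], r2["response"])
```

Keeping the payload construction in its own function makes the chaining logic easy to inspect without a running server.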