# Reviewing Model-Generated Answers

In this notebook we'll load and review answers generated by both the base and the fine-tuned models. There is only enough GPU memory to hold one of these models at a time, so we need to clear memory between model runs!
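
The clean-up between runs can be wrapped in a small helper. The sketch below is illustrative and not part of the notebook's `utilities` module: `release_gpu_memory` is a hypothetical name, and it assumes PyTorch is the framework holding the GPU memory. It reports what PyTorch's allocator still holds after clean-up, which complements an external tool such as `gpustat`.

```python
import gc

try:
    import torch
except ImportError:  # lets the sketch run on a machine without PyTorch
    torch = None


def release_gpu_memory():
    """Collect garbage, empty PyTorch's CUDA cache, and report what is left.

    Returns None when PyTorch or a CUDA device is unavailable.
    """
    gc.collect()  # reclaim unreachable Python objects, including reference cycles
    if torch is None or not torch.cuda.is_available():
        return None
    torch.cuda.empty_cache()  # hand cached, unused blocks back to the driver
    mib = 1024 ** 2
    return {
        "allocated_mb": torch.cuda.memory_allocated() / mib,
        "reserved_mb": torch.cuda.memory_reserved() / mib,
    }


# Hypothetical usage: drop the references first, then call the helper.
tokenizer, model, generator = None, None, None
print(release_gpu_memory())
```

The helper can only free memory that no live Python object still refers to, which is why the references are set to `None` before it is called.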

# Set Base Directories

In [2]:
DATA_DIRECTORY = "../data"
MODEL_DIRECTORY = "../model"
MODEL_NAME = "meta-llama/Llama-2-7b-chat-hf"

# Load Train/Test Data Frame

In [3]:
import pandas as pd

path = f"{DATA_DIRECTORY}/train-test-df.csv"
df = pd.read_csv(path, na_filter=False)
print(f"Loaded {df.shape[0]:,d} Train/Test records.")
#print(df.fold.value_counts())
df.sample(n=1)

Loaded 1,050 Train/Test records.


| Unnamed: 0 | fold | excerpt | question | answer | hashID |
|---|---|---|---|---|---|
| 299 | 7 | Birkenstock files for U.S. IPO as listings rec... | What factors have contributed to Birkenstock's... | Birkenstock's decision to file for an IPO in t... | bb48045a4245b64edb69b48849381b59 |


# Load Evaluation Prompt Prefix

In [4]:
path = f"{DATA_DIRECTORY}/level-1-prompt-prefix.txt"
with open(path) as ifp:
    prefix = ifp.read()

print(prefix)

Carefully read the excerpt below and then provide a clear concise answer to the follow-up question.
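
How the prefix is combined with a record is handled inside `display_sample`, which is not shown here. As a rough sketch only, one plausible way to assemble a prompt from the prefix and a record's `excerpt` and `question` fields is given below. The name `build_prompt` and the `EXCERPT:` / `QUESTION:` / `ANSWER:` layout are assumptions, not the notebook's actual template.

```python
def build_prompt(prefix: str, excerpt: str, question: str) -> str:
    """Join the instruction prefix, excerpt, and question into one prompt string."""
    parts = [
        prefix.strip(),
        f"EXCERPT:\n{excerpt.strip()}",
        f"QUESTION:\n{question.strip()}",
        "ANSWER:",  # assumed cue for the model to start its answer
    ]
    return "\n\n".join(parts)


# Placeholder values standing in for `prefix` and one row of `df`.
example = build_prompt(
    "Carefully read the excerpt below and then provide a clear concise answer "
    "to the follow-up question.\n",
    "An example excerpt.",
    "An example question?",
)
print(example)
```

Stripping each part before joining keeps the spacing predictable even when the prefix file ends with a trailing newline.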


# Load Model, Tokenizer and Generator

In [5]:
import torch
from utilities import display_sample, load_base_model, load_lora

# Clear memory between different model runs!

In [6]:
import gc
# Drop every reference to the previous model, then release cached GPU memory.
tokenizer, model, generator = None, None, None
gc.collect()
torch.cuda.empty_cache()
!gpustat

ip-172-25-5-124  Thu Oct  5 12:00:18 2023  535.54.03
[0] NVIDIA A10G | 25'C,   0 % |   309 / 23028 MB |


## Base Model

In [None]:
tokenizer, model, generator = load_base_model(MODEL_NAME)

## Fine-tuned Model

In [7]:
fold = 10
directory = f"{MODEL_DIRECTORY}/Llama-2-7b-qa-{fold:02d}"
tokenizer, model, generator = load_lora(MODEL_NAME, directory)

Loading checkpoint shards:   0%|          | 0/2 [00:00<?, ?it/s]
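
`load_lora` comes from the local `utilities` module and its body is not shown. For orientation, a loader of this kind might look roughly like the sketch below. It assumes the Hugging Face `transformers` and `peft` libraries (plus `accelerate` for `device_map="auto"`) and assumes that the directory holds a LoRA adapter saved with `peft`. The name `load_lora_sketch`, the half-precision dtype, and the pipeline settings are illustrative choices, not the author's implementation.

```python
def load_lora_sketch(model_name: str, adapter_directory: str):
    """Load a base causal LM, attach a LoRA adapter, and build a text generator."""
    # Imported lazily so that defining the function does not require a GPU stack.
    import torch
    from peft import PeftModel
    from transformers import AutoModelForCausalLM, AutoTokenizer, pipeline

    tokenizer = AutoTokenizer.from_pretrained(model_name)
    base_model = AutoModelForCausalLM.from_pretrained(
        model_name,
        torch_dtype=torch.float16,  # assumption: half precision to save GPU memory
        device_map="auto",          # requires the `accelerate` package
    )
    # Wrap the base model with the adapter weights stored in `adapter_directory`.
    model = PeftModel.from_pretrained(base_model, adapter_directory)
    model.eval()
    generator = pipeline("text-generation", model=model, tokenizer=tokenizer)
    return tokenizer, model, generator
```

Returning the same `(tokenizer, model, generator)` triple as the notebook's loaders means the memory-clearing cell can reset all three names regardless of which model was loaded.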

## Sample Predictions

In [10]:
x = df.sample(n=1).iloc[0]
display_sample(x, prefix, generator, max_new_tokens=256)

INSTRUCTIONS:
+--------------------------------------------------------------------------------------------------------------------------+
| Carefully read the excerpt below and then provide a clear concise answer to the follow-up question.                      |
+--------------------------------------------------------------------------------------------------------------------------+

EXCERPT:
+--------------------------------------------------------------------------------------------------------------------------+
| "Upskilling of local communities to make them eligible for the jobs clean energy offers is crucial" for a just           |
| transition, said NC Thirumalai, sector head, strategic studies at the Bangalore-based think tank Center for Study of     |
| Science, Technology and Policy.  He said federal job training plans, such as the Skill India programme, can be pivoted   |
| to ready workers for clean energy jobs. People across the auto industry for example – from manufact