### Creating a synthetic dataset
The plan is to write a good prompt for a Llama 2 chat model (`meta-llama/Llama-2-13b-chat-hf` in the code below) and run inference over the 63k prompts to build an emotion-reason dataset. With that dataset, we can try to train an LLM that is able to explain emotions and, as a result, may make better predictions.
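
As a rough illustration of the prompt-creation step, the sketch below wraps one text and its emotion label in the Llama 2 chat layout (`[INST]`, `<<SYS>>`). The system prompt wording, the `build_prompt` helper, and the two example rows are assumptions made for illustration; they are not taken from the dataset or from the prompt that was finally used.

```python
# Hypothetical prompt wording and helper name; only the [INST] / <<SYS>>
# layout follows the published Llama 2 chat format.
SYSTEM_PROMPT = (
    "You are an assistant that explains emotions. Given a text and its "
    "emotion label, explain in one or two sentences why the text expresses "
    "that emotion."
)


def build_prompt(text: str, emotion: str, system_prompt: str = SYSTEM_PROMPT) -> str:
    """Wrap one (text, emotion) pair in the Llama 2 chat template.

    The BOS token is not written here because the Llama tokenizer
    normally prepends it when the prompt is tokenized.
    """
    user_message = f"Text: {text.strip()}\nEmotion: {emotion.strip()}\nReason:"
    return f"[INST] <<SYS>>\n{system_prompt}\n<</SYS>>\n\n{user_message} [/INST]"


# Made-up rows standing in for the real 63k examples.
examples = [
    {"text": "I finally passed my driving test!", "emotion": "joy"},
    {"text": "Nobody came to my birthday party.", "emotion": "sadness"},
]

prompts = [build_prompt(ex["text"], ex["emotion"]) for ex in examples]
print(prompts[0])
```

Each resulting string can then be passed to the model for generation, and the generated reason stored next to the original text and label.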


In [1]:
import os
import torch
from datasets import load_dataset
from transformers import (
    AutoModelForCausalLM,
    AutoTokenizer,
    BitsAndBytesConfig,
    HfArgumentParser,
    TrainingArguments,
    pipeline,
    logging,
)
from peft import LoraConfig, PeftModel
from trl import SFTTrainer



[2023-08-23 07:55:14,341] [INFO] [real_accelerator.py:133:get_accelerator] Setting ds_accelerator to cuda (auto detect)


In [2]:
# Read the access token from the HF_TOKEN environment variable instead of hard-coding it
!huggingface-cli login --token $HF_TOKEN

Token will not been saved to git credential helper. Pass `add_to_git_credential=True` if you want to set the git credential as well.
Token is valid (permission: write).
Your token has been saved to /root/.cache/huggingface/token
Login successful


In [3]:
# The model that you want to train from the Hugging Face hub
model_name = "meta-llama/Llama-2-13b-chat-hf"

# The instruction dataset to use
dataset_name = "RikoteMaster/Emotion_Recognition_4_llama2_chat"

# Fine-tuned model name
new_model = "llama-2-7b-sentiment-analyzer"

device_map = {"": 0}

################################################################################
# QLoRA parameters
################################################################################

# LoRA attention dimension
lora_r = 64

# Alpha parameter for LoRA scaling
lora_alpha = 16

# Dropout probability for LoRA layers
lora_dropout = 0.1

################################################################################
# bitsandbytes parameters
################################################################################

# Activate 4-bit precision base model loading
use_4bit = True

# Compute dtype for 4-bit base models
bnb_4bit_compute_dtype = "float16"

# Quantization type (fp4 or nf4)
bnb_4bit_quant_type = "nf4"

# Activate nested quantization for 4-bit base models (double quantization)
use_nested_quant = False

################################################################################
# Loading model
################################################################################

# Load dataset (you can process it here)
dataset = load_dataset(dataset_name, split="train")

# Load tokenizer and model with QLoRA configuration
compute_dtype = getattr(torch, bnb_4bit_compute_dtype)

bnb_config = BitsAndBytesConfig(
    load_in_4bit=use_4bit,
    bnb_4bit_quant_type=bnb_4bit_quant_type,
    bnb_4bit_compute_dtype=compute_dtype,
    bnb_4bit_use_double_quant=use_nested_quant,
)

# Check GPU compatibility with bfloat16
if compute_dtype == torch.float16 and use_4bit:
    major, _ = torch.cuda.get_device_capability()
    if major >= 8:
        print("=" * 80)
        print("Your GPU supports bfloat16: accelerate training with bf16=True")
        print("=" * 80)

# Load base model
model = AutoModelForCausalLM.from_pretrained(
    model_name,
    quantization_config=bnb_config,
    device_map=device_map
)
model.config.use_cache = False
model.config.pretraining_tp = 1

# Load LLaMA tokenizer
tokenizer = AutoTokenizer.from_pretrained(model_name, trust_remote_code=True)
tokenizer.pad_token = tokenizer.eos_token
tokenizer.padding_side = "right" # Fix weird overflow issue with fp16 training

# Load LoRA configuration
peft_config = LoraConfig(
    lora_alpha=lora_alpha,
    lora_dropout=lora_dropout,
    r=lora_r,
    bias="none",
    task_type="CAUSAL_LM",
)

Your GPU supports bfloat16: accelerate training with bf16=True
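
To make the QLoRA settings above more concrete, here is a small back-of-the-envelope sketch. It computes the LoRA scaling factor implied by `lora_alpha` and `lora_r` (standard LoRA scales the low-rank update by `alpha / r`) and a rough size for the model weights at 16-bit versus 4-bit precision. The 13e9 parameter count is a nominal assumption for a 13B model, and the estimate ignores quantization constants, the LoRA adapter weights, activations, and the KV cache.

```python
# Values copied from the configuration cell above.
lora_r = 64
lora_alpha = 16

# Standard LoRA scales the low-rank update by alpha / r.
lora_scaling = lora_alpha / lora_r


def weight_memory_gb(n_params: float, bits_per_param: float) -> float:
    """Approximate memory for the weights alone, in gigabytes (1e9 bytes)."""
    return n_params * bits_per_param / 8 / 1e9


n_params = 13e9  # assumption: nominal parameter count of a 13B model
fp16_gb = weight_memory_gb(n_params, 16)
nf4_gb = weight_memory_gb(n_params, 4)

print(f"LoRA scaling factor: {lora_scaling}")
print(f"fp16 weights: ~{fp16_gb:.1f} GB, 4-bit weights: ~{nf4_gb:.1f} GB")
```

Under these assumptions the 4-bit weights take roughly a quarter of the fp16 footprint, which is what makes loading the model on a single GPU (`device_map = {"": 0}`) plausible.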


Downloading (…)of-00003.safetensors:  28%|████████████▎                               | 1.73G/6.18G [00:18<00:41, 108MB/s][A
Downloading (…)of-00003.safetensors:  28%|████████████▏                              | 1.75G/6.18G [00:19<01:47, 41.1MB/s][A
Downloading (…)of-00003.safetensors:  29%|████████████▎                              | 1.77G/6.18G [00:19<01:27, 50.1MB/s][A
Downloading (…)of-00003.safetensors:  29%|████████████▍                              | 1.79G/6.18G [00:19<01:13, 59.5MB/s][A
Downloading (…)of-00003.safetensors:  29%|████████████▌                              | 1.81G/6.18G [00:20<01:03, 68.9MB/s][A
Downloading (…)of-00003.safetensors:  30%|████████████▊                              | 1.84G/6.18G [00:20<00:55, 77.9M

Downloading (…)of-00003.safetensors:  48%|█████████████████████                       | 2.96G/6.18G [00:33<00:29, 108MB/s][A
Downloading (…)of-00003.safetensors:  48%|█████████████████████▏                      | 2.98G/6.18G [00:33<00:29, 108MB/s][A
Downloading (…)of-00003.safetensors:  49%|█████████████████████▎                      | 3.00G/6.18G [00:33<00:29, 109MB/s][A
Downloading (…)of-00003.safetensors:  49%|█████████████████████▌                      | 3.02G/6.18G [00:33<00:31, 102MB/s][A
Downloading (…)of-00003.safetensors:  49%|█████████████████████                      | 3.03G/6.18G [00:34<00:32, 96.3MB/s][A
Downloading (…)of-00003.safetensors:  49%|█████████████████████▏                     | 3.04G/6.18G [00:34<00:35, 87.7MB/s][A
Downloading (…)of-00003.safetensors:  49%|█████████████████████▏                     | 3.05G/6.18G [00:34<00:37, 83.1MB/s][A
Downloading (…)of-00003.safetensors:  50%|█████████████████████▎                     | 3.06G/6.18G [00:34<00:42, 73.8M

Downloading (…)of-00003.safetensors:  67%|████████████████████████████▌              | 4.11G/6.18G [00:46<00:23, 89.0MB/s][A
Downloading (…)of-00003.safetensors:  67%|████████████████████████████▋              | 4.12G/6.18G [00:46<00:23, 86.4MB/s][A
Downloading (…)of-00003.safetensors:  67%|████████████████████████████▊              | 4.13G/6.18G [00:46<00:24, 82.9MB/s][A
Downloading (…)of-00003.safetensors:  67%|████████████████████████████▊              | 4.14G/6.18G [00:46<00:23, 87.7MB/s][A
Downloading (…)of-00003.safetensors:  67%|████████████████████████████▉              | 4.16G/6.18G [00:46<00:20, 97.8MB/s][A
Downloading (…)of-00003.safetensors:  68%|█████████████████████████████▊              | 4.18G/6.18G [00:47<00:19, 103MB/s][A
Downloading (…)of-00003.safetensors:  68%|█████████████████████████████▉              | 4.20G/6.18G [00:47<00:18, 106MB/s][A
Downloading (…)of-00003.safetensors:  68%|██████████████████████████████              | 4.23G/6.18G [00:47<00:17, 109M

Downloading (…)of-00003.safetensors:  87%|██████████████████████████████████████▍     | 5.39G/6.18G [00:59<00:07, 110MB/s][A
Downloading (…)of-00003.safetensors:  88%|█████████████████████████████████████▋     | 5.41G/6.18G [01:01<00:18, 42.3MB/s][A
Downloading (…)of-00003.safetensors:  88%|█████████████████████████████████████▋     | 5.42G/6.18G [01:01<00:16, 47.1MB/s][A
Downloading (…)of-00003.safetensors:  88%|█████████████████████████████████████▊     | 5.43G/6.18G [01:01<00:14, 53.0MB/s][A
Downloading (…)of-00003.safetensors:  88%|█████████████████████████████████████▉     | 5.45G/6.18G [01:01<00:10, 66.0MB/s][A
Downloading (…)of-00003.safetensors:  89%|██████████████████████████████████████     | 5.47G/6.18G [01:01<00:09, 77.9MB/s][A
Downloading (…)of-00003.safetensors:  89%|██████████████████████████████████████▏    | 5.49G/6.18G [01:01<00:07, 86.2MB/s][A
Downloading (…)of-00003.safetensors:  89%|██████████████████████████████████████▍    | 5.52G/6.18G [01:02<00:07, 92.8M

In [43]:
from datasets import load_dataset
import pandas as pd

ds = load_dataset("RikoteMaster/Emotion_Recognition_4_llama2_chat")
ds = ds['train']
ds = pd.DataFrame.from_dict(ds)

print(ds)

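def bigger_formatting(text, label):
    # Wrap a sentence and its gold emotion label in a Llama-2 chat prompt that asks for an explanation.
    # Note: the "Please answer only..." part sits after [/INST], so the model reads it as the start of its own reply.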
    prompt = f"""<s>[INST] In this task, you will be performing a classification exercise aimed at identifying the underlying emotion conveyed by a given sentence. The emotions to consider are as follows:

    Anger, Joy, Sadness, Guilt, Shame, fear or disgust. 
    
    Sentence: {text} Emotion: {label} [/INST] Please answer only with the explanation of the Emotion. For example, in the input Sentence: I feel sad when my mother leaves home. Emotion: Sadness. You should answer EXPLANATION: In this sentence the feeling of sadness is due to the person is not going to see her mother in a period of time. """
    return prompt

# Build the prompt column in one pass instead of assigning row by row with .loc
ds['text'] = [bigger_formatting(t, e) for t, e in zip(ds['Text_processed'], ds['Emotion'])]

print(ds['text'][0])


                                          Text_processed  Emotion  Augmented  \
0      when i was on holiday with my former boyfriend...    shame      False   
1      news of a neighbors girl going missing accompa...     fear       True   
2      i experienced this feeling when the doctors fo...     fear      False   
3      i experienced a strong surge of displeasure to...    shame       True   
4      when the realization dawned upon me that i was...  sadness       True   
...                                                  ...      ...        ...   
61458  This game is dying because toxic people like y...    anger      False   
61459  I would say they do because it was a horrible ...  sadness      False   
61460                      Nah rest is pretty cringe too    shame      False   
61461                        You re an awful human being  disgust      False   
61462  Oh ok so we re agreeing I didn t see that last...    shame      False   
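
The prompt built by `bigger_formatting` places the "Please answer only..." instruction after `[/INST]`, so in the Llama-2 chat format it is read as the beginning of the model's reply rather than as part of the user turn. A minimal sketch of a variant that keeps the whole instruction inside the `[INST] ... [/INST]` block is shown here. The function name `build_explanation_prompt` and the exact wording are assumptions for illustration, not the prompt used to produce the outputs in this notebook.

```python
def build_explanation_prompt(text: str, label: str) -> str:
    """Hypothetical variant of bigger_formatting with the instruction inside [INST]."""
    instruction = (
        "In this task you are given a sentence and the emotion it conveys. "
        "The possible emotions are: anger, joy, sadness, guilt, shame, fear or disgust. "
        "Answer only with an explanation of why the sentence conveys that emotion, "
        "starting your answer with 'EXPLANATION:'.\n\n"
        f"Sentence: {text}\nEmotion: {label}"
    )
    # Everything the user asks goes before [/INST]; the model writes what comes after it
    return f"<s>[INST] {instruction} [/INST]"


example = build_explanation_prompt("I feel sad when my mother leaves home.", "sadness")
print(example)
```

Whether this variant gives better explanations than the original prompt would have to be checked on a small sample before running it over the full dataset.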

                                       

In [53]:
from tqdm import tqdm

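# Build the generation pipeline once; re-creating it for every row adds avoidable overhead
pipe = pipeline(task="text-generation", model=model, tokenizer=tokenizer, max_length=700)

def prediction(text):
    # Return the prompt plus the generated continuation for a single input
    result = pipe(text)
    return result[0]['generated_text']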

predictions = []

# Set the total number of iterations (progress total)
total_iterations = len(ds)

# Create a tqdm progress bar
for index, row in tqdm(ds.iterrows(), total=total_iterations, desc="Generating Predictions"):
    prediction_aux = prediction(row['text'])
    predictions.append(prediction_aux)
    
from numpy import asarray
from numpy import save, load

#save data
save('data.npy', asarray(predictions))

predicted = load('data.npy')
predicted[0]


Generating Predictions:   0%|                                                        | 1/61463 [00:05<92:57:36,  5.44s/it]


KeyboardInterrupt: 
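
At roughly 5.4 s per prompt the full run is estimated at more than 90 hours, and the loop above only writes `data.npy` after the last row, so an interruption like this one loses everything that was not saved by hand. A minimal sketch of a resumable loop follows: it appends each result to a JSON Lines file and skips rows that an earlier run already wrote. The helper name `generate_with_checkpoints`, its parameters, and the file format are assumptions for illustration, not part of the original notebook.

```python
import json
import os
from typing import Callable, Iterable, List


def generate_with_checkpoints(
    prompts: Iterable[str],
    generate_fn: Callable[[str], str],
    path: str,
) -> List[str]:
    """Run generate_fn over prompts, appending each result to a JSON Lines file."""
    results: List[str] = []
    # Reload whatever an earlier, interrupted run already wrote
    if os.path.exists(path):
        with open(path, "r", encoding="utf-8") as f:
            results = [json.loads(line)["generated_text"] for line in f if line.strip()]

    with open(path, "a", encoding="utf-8") as f:
        for i, prompt in enumerate(prompts):
            if i < len(results):
                continue  # already generated in a previous run
            text = generate_fn(prompt)
            f.write(json.dumps({"index": i, "generated_text": text}) + "\n")
            f.flush()  # hand the line to the OS before the next slow call
            results.append(text)
    return results
```

In this notebook it could be called as `generate_with_checkpoints(ds['text'], prediction, 'predictions.jsonl')`, where the file name is a placeholder. The sketch assumes the prompts are passed in the same order on every run.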

In [52]:
# save numpy array as npy file
from numpy import asarray
from numpy import save, load

#save data
save('data.npy', asarray(predictions))

predicted = load('data.npy')
predicted[0]

'<s>[INST] In this task, you will be performing a classification exercise aimed at identifying the underlying emotion conveyed by a given sentence. The emotions to consider are as follows:\n\n    Anger, Joy, Sadness, Guilt, Shame, fear or disgust. \n    \n    Sentence: when i was on holiday with my former boyfriend he hit me and  behaved in a very evil and ugly manner in front of the hotel  personnel Emotion: shame [/INST] Please answer only with the explanation of the Emotion. For example, in the input Sentence: I feel sad when my mother leaves home. Emotion: Sadness. You should answer EXPLANATION: In this sentence the feeling of sadness is due to the person is not going to see her mother in a period of time. \n\nEXPLANATION: In this sentence, the feeling of shame is due to the fact that the person was hit by their former boyfriend in front of hotel personnel, which is a public setting and could be considered embarrassing and humiliating. The person may feel ashamed of being treated i
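
As the output above shows, each generated string repeats the full prompt before the model's answer, and the prompt itself already contains the word `EXPLANATION:` in its example. To build the emotion-reason dataset, only the model's own explanation is needed. A minimal sketch of that post-processing step follows, assuming the generated text starts with the exact prompt that was sent; the helper name `extract_explanation` is hypothetical.

```python
def extract_explanation(generated_text: str, prompt: str, marker: str = "EXPLANATION:") -> str:
    """Return only the model's explanation from a prompt-plus-completion string."""
    # Drop the echoed prompt, if present, so only the model's continuation remains
    if generated_text.startswith(prompt):
        completion = generated_text[len(prompt):]
    else:
        completion = generated_text
    # If the model repeated the marker, keep only what follows its last occurrence
    if marker in completion:
        completion = completion.rsplit(marker, 1)[1]
    return completion.strip()


demo_prompt = "Sentence: ... You should answer EXPLANATION: example text. "
demo_output = demo_prompt + "\n\nEXPLANATION: In this sentence, the feeling of shame is due to X."
print(extract_explanation(demo_output, demo_prompt))
```

An alternative that avoids the string handling is to pass `return_full_text=False` when calling the text-generation pipeline, which returns only the continuation.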