# Data labeling example
This notebook shows how to use EDSL to automate data labeling and content analysis. Using a set of mock customer service tickets as an example, we design the labeling task as a survey of questions about each ticket, administer the survey to a language model, and collect the responses as a labeled dataset.

The steps are:
1. Technical setup
2. Identifying data for review
3. Constructing questions about the data
4. Delivering the questions to language models
5. Analyzing the responses

*View this notebook at the Coop: https://www.expectedparrot.com/content/c1188684-2b19-4fd3-af77-cfa63536919b*

[![Open In Colab](https://colab.research.google.com/assets/colab-badge.svg)](https://colab.research.google.com/github/expectedparrot/edsl/blob/main/docs/notebooks/data_labeling_example.ipynb)

## Technical setup
Before running the code below, please see the instructions on:

* [Installing the EDSL package](https://docs.expectedparrot.com/en/latest/installation.html)
* Activating [Remote Inference](https://docs.expectedparrot.com/en/latest/remote_inference.html) at your [Coop](https://docs.expectedparrot.com/en/latest/coop.html) account or storing [API Keys](https://docs.expectedparrot.com/en/latest/api_keys.html) for the language models that you want to use with EDSL

We also have a [Starter Tutorial](https://docs.expectedparrot.com/en/latest/starter_tutorial.html) about EDSL basics.

In [1]:
# ! pip install edsl

## Selecting data for review
First we identify some data for review. Data can be created using the EDSL tools or imported from other sources. For the purposes of this demo we define a list of hypothetical customer tickets for a transportation app:

In [2]:
tickets = [
    "I just realized I left my phone in the car on my last ride. Can you help me get it back?",
    "I'm unhappy with my recent experience. The driver was very rude and unprofessional.",
    "I was charged more than the estimated fare for my trip yesterday. Can you explain why?",
    "The car seat provided was not properly installed, and I felt my child was at risk. Please ensure driver training.",
    "My driver took a longer route than necessary, resulting in a higher fare. I request a fare adjustment.",
    "I had a great experience with my driver today! Very friendly and efficient service.",
    "I'm concerned about the vehicle's cleanliness. It was not up to the standard I expect.",
    "The app keeps crashing every time I try to book a ride. Please fix this issue.",
    "My driver was exceptional - safe driving, polite, and the car was spotless. Kudos!",
    "I felt unsafe during my ride due to the driver's erratic behavior. This needs to be addressed immediately.",
    "The driver refused to follow my preferred route, which is shorter. I'm not satisfied with the service.",
    "Impressed with the quick response to my ride request and the driver's professionalism.",
    "I was charged for a ride I never took. Please refund me as soon as possible.",
    "The promo code I tried to use didn't work. Can you assist with this?",
    "There was a suspicious smell in the car, and I'm worried about hygiene standards.",
    "My driver was very considerate, especially helping me with my luggage. Appreciate the great service!",
    "The app's GPS seems inaccurate. It directed the driver to the wrong pick-up location.",
    "I want to compliment my driver's excellent navigation and time management during rush hour.",
    "The vehicle didn't match the description in the app. It was confusing and concerning.",
    "I faced an issue with payment processing after my last ride. Can you look into this?",
]

## Constructing questions about the data
Next we create some questions about the data. EDSL provides classes for a variety of common question types, including multiple choice, free text, checkbox, linear scale and many others. We import the question type classes that we want to use and then construct questions in the relevant templates. We use a <b><i>{{ placeholder }}</i></b> in each question text in order to parameterize the questions with the individual ticket contents. For more details about constructing questions, please see the <a href="https://docs.expectedparrot.com/en/latest/questions.html">Questions</a> section of the docs.

In [3]:
from edsl.questions import (
    QuestionMultipleChoice,
    QuestionCheckBox,
    QuestionFreeText,
    QuestionList,
    QuestionYesNo,
    QuestionLinearScale,
)

In [4]:
question_issues = QuestionCheckBox(
    question_name="issues",
    question_text="Check all of the issues mentioned in this ticket: {{ ticket }}",
    question_options=[
        "safety",
        "cleanliness",
        "driver performance",
        "GPS/route",
        "lost item",
        "other",
    ],
)

In [5]:
question_primary_issue = QuestionFreeText(
    question_name="primary_issue",
    question_text="What is the primary issue in this ticket? Ticket: {{ ticket }}",
)

In [6]:
question_accident = QuestionMultipleChoice(
    question_name="accident",
    question_text="If the primary issue in this ticket is safety, was there an accident where someone was hurt? Ticket: {{ ticket }}",
    question_options=["Yes", "No", "Not applicable"],
)

In [7]:
question_sentiment = QuestionMultipleChoice(
    question_name="sentiment",
    question_text="What is the sentiment of this ticket? Ticket: {{ ticket }}",
    question_options=[
        "Very positive",
        "Somewhat positive",
        "Neutral",
        "Somewhat negative",
        "Very negative",
    ],
)

In [8]:
question_refund = QuestionYesNo(
    question_name="refund",
    question_text="Does the customer ask for a refund in this ticket? Ticket: {{ ticket }}",
)

In [9]:
question_priority = QuestionLinearScale(
    question_name="priority",
    question_text="On a scale from 0 to 5, what is the priority level of this ticket? Ticket: {{ ticket }}",
    question_options=[0, 1, 2, 3, 4, 5],
    option_labels={0: "Lowest", 5: "Highest"},
)
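To make the role of the `{{ ticket }}` placeholder concrete, here is a minimal sketch of the substitution using only the Python standard library. The `render` helper is hypothetical and written for illustration; it is not EDSL's own templating code, which may handle placeholders differently.

```python
import re


def render(template: str, values: dict) -> str:
    """Fill each {{ name }} placeholder with values[name] (illustration only)."""

    def substitute(match: re.Match) -> str:
        # group(1) is the placeholder name without braces or padding
        return str(values[match.group(1)])

    return re.sub(r"\{\{\s*(\w+)\s*\}\}", substitute, template)


question_text = "What is the primary issue in this ticket? Ticket: {{ ticket }}"

# Two tickets copied from the list above
sample_tickets = [
    "The promo code I tried to use didn't work. Can you assist with this?",
    "I was charged for a ride I never took. Please refund me as soon as possible.",
]

# One rendered question text per ticket
prompts = [render(question_text, {"ticket": t}) for t in sample_tickets]
for prompt in prompts:
    print(prompt)
```

Each ticket yields its own version of the question text, which is why a single question definition can be reused across the whole dataset.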

## Building a survey
We combine the questions into a survey in order to administer them together:

In [10]:
from edsl import Survey

survey = Survey(
    questions=[
        question_issues,
        question_primary_issue,
        question_accident,
        question_sentiment,
        question_refund,
        question_priority,
    ]
)

(Survey questions are administered asynchronously by default. To learn about adding conditional logic such as skip/stop rules and targeted memory, please see the <a href="https://docs.expectedparrot.com/en/latest/surveys.html">Surveys</a> section of the docs.)

We can review our questions in a readable format, or export them as a survey to use with human respondents or on other survey platforms:

In [11]:
survey

## Designing AI agents
A key feature of EDSL is the ability to create personas for AI agents that the language models are prompted to use in generating responses to the questions. This is done by passing a dictionary of traits to Agent objects:

In [12]:
from edsl import Agent

agent = Agent(
    traits={
        "persona": "You are an expert customer service agent.",
        "years_experience": 15,
    }
)

## Selecting language models
EDSL allows us to select the language models to use in generating results. To see all available models:

In [13]:
from edsl import Model

Model.available()

[['01-ai/Yi-34B-Chat', 'deep_infra', 0],
 ['Austism/chronos-hermes-13b-v2', 'deep_infra', 1],
 ['Gryphe/MythoMax-L2-13b', 'deep_infra', 2],
 ['Gryphe/MythoMax-L2-13b-turbo', 'deep_infra', 3],
 ['HuggingFaceH4/zephyr-orpo-141b-A35b-v0.1', 'deep_infra', 4],
 ['Phind/Phind-CodeLlama-34B-v2', 'deep_infra', 5],
 ['Qwen/Qwen2-72B-Instruct', 'deep_infra', 6],
 ['Qwen/Qwen2-7B-Instruct', 'deep_infra', 7],
 ['Sao10K/L3-70B-Euryale-v2.1', 'deep_infra', 8],
 ['bigcode/starcoder2-15b', 'deep_infra', 9],
 ['bigcode/starcoder2-15b-instruct-v0.1', 'deep_infra', 10],
 ['claude-3-5-sonnet-20240620', 'anthropic', 11],
 ['claude-3-haiku-20240307', 'anthropic', 12],
 ['claude-3-opus-20240229', 'anthropic', 13],
 ['claude-3-sonnet-20240229', 'anthropic', 14],
 ['codellama/CodeLlama-34b-Instruct-hf', 'deep_infra', 15],
 ['codellama/CodeLlama-70b-Instruct-hf', 'deep_infra', 16],
 ['cognitivecomputations/dolphin-2.6-mixtral-8x7b', 'deep_infra', 17],
 ['cognitivecomputations/dolphin-2.9.1-llama-3-70b', 'deep_i

Here we select a GPT-4 model (which is also used by default if no model is specified for a survey):

In [14]:
model = Model("gpt-4-1106-preview")

## Conducting the analysis
With our data and questions we're now ready to package our survey and deliver it to an AI. We do this by inserting the contents of each ticket into each question as an independent "scenario" for review, and then running the survey.

In [15]:
from edsl import Scenario

scenarios = [Scenario({"ticket": t}) for t in tickets]

We run the survey by chaining the components with the <i>by</i> method and then calling the <i>run</i> method:

In [16]:
results = survey.by(scenarios).by(agent).by(model).run()
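As a rough way to anticipate the size of a job before running it, we can count the combinations that the chained `by` calls describe. This sketch assumes one result per scenario, agent and model combination, with every question answered once per result; the identifiers below are stand-ins and not EDSL objects.

```python
from itertools import product

n_tickets = 20  # number of tickets in the list defined earlier

# Stand-in labels for the components passed to `by` (hypothetical names)
scenario_ids = [f"ticket_{i}" for i in range(n_tickets)]
agent_ids = ["customer_service_expert"]
model_ids = ["gpt-4-1106-preview"]
question_names = [
    "issues",
    "primary_issue",
    "accident",
    "sentiment",
    "refund",
    "priority",
]

# Every scenario is paired with every agent and every model
combinations = list(product(scenario_ids, agent_ids, model_ids))
n_results = len(combinations)

# Each combination answers all of the survey questions
n_answers = n_results * len(question_names)

print(f"{n_results} results, {n_answers} answers")
```

Adding a second agent or model would double both numbers, which is worth keeping in mind when estimating cost.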

## Inspecting the results
EDSL comes with built-in methods for analyzing results. For more details on working with results, please see the <a href="https://docs.expectedparrot.com/en/latest/results.html">Results</a> section of the docs.

To inspect the components of results:

In [17]:
results.columns

['agent.agent_instruction',
 'agent.agent_name',
 'agent.persona',
 'agent.years_experience',
 'answer.accident',
 'answer.issues',
 'answer.primary_issue',
 'answer.priority',
 'answer.refund',
 'answer.sentiment',
 'comment.accident_comment',
 'comment.issues_comment',
 'comment.priority_comment',
 'comment.refund_comment',
 'comment.sentiment_comment',
 'iteration.iteration',
 'model.frequency_penalty',
 'model.logprobs',
 'model.max_tokens',
 'model.model',
 'model.presence_penalty',
 'model.temperature',
 'model.top_logprobs',
 'model.top_p',
 'prompt.accident_system_prompt',
 'prompt.accident_user_prompt',
 'prompt.issues_system_prompt',
 'prompt.issues_user_prompt',
 'prompt.primary_issue_system_prompt',
 'prompt.primary_issue_user_prompt',
 'prompt.priority_system_prompt',
 'prompt.priority_user_prompt',
 'prompt.refund_system_prompt',
 'prompt.refund_user_prompt',
 'prompt.sentiment_system_prompt',
 'prompt.sentiment_user_prompt',
 'question_options.accident_question_options',

We can select just the responses to the questions to display:

In [18]:
results.select(
    "ticket", "issues", "primary_issue", "accident", "sentiment", "refund", "priority"
).print()

scenario.ticket,answer.issues,answer.primary_issue,answer.accident,answer.sentiment,answer.refund,answer.priority
I just realized I left my phone in the car on my last ride. Can you help me get it back?,['lost item'],The primary issue is that the customer has left their phone in the car during their last ride and is seeking assistance to retrieve it.,No,Somewhat negative,No,4
I'm unhappy with my recent experience. The driver was very rude and unprofessional.,['driver performance'],The primary issue in this ticket is that the customer experienced rudeness and unprofessional behavior from a driver.,No,Very negative,No,4
I was charged more than the estimated fare for my trip yesterday. Can you explain why?,['other'],The primary issue in this ticket is that the customer was charged an amount exceeding the estimated fare for their trip and is seeking an explanation for the discrepancy.,Not applicable,Somewhat negative,No,3
"The car seat provided was not properly installed, and I felt my child was at risk. Please ensure driver training.","['safety', 'driver performance']","The primary issue in this ticket is that the car seat provided was not installed correctly, compromising the child's safety. The customer is requesting enhanced driver training to prevent future safety risks.",No,Somewhat negative,No,5
"My driver took a longer route than necessary, resulting in a higher fare. I request a fare adjustment.","['driver performance', 'GPS/route']","The primary issue in this ticket is that the customer's driver took a longer route than was necessary, which led to the customer being charged a higher fare than expected. The customer is requesting a fare adjustment to compensate for the unnecessary overcharge.",No,Somewhat negative,No,3
I had a great experience with my driver today! Very friendly and efficient service.,['driver performance'],There is no issue reported in this ticket. It contains positive feedback about a great experience with a friendly and efficient driver.,Not applicable,Very positive,No,0
I'm concerned about the vehicle's cleanliness. It was not up to the standard I expect.,['cleanliness'],"The primary issue in this ticket is the customer's dissatisfaction with the cleanliness of the vehicle, which did not meet their expected standard.",Not applicable,Somewhat negative,No,3
The app keeps crashing every time I try to book a ride. Please fix this issue.,['other'],The primary issue reported in the ticket is that the application is crashing every time the customer attempts to book a ride.,Not applicable,Somewhat negative,No,4
I felt unsafe during my ride due to the driver's erratic behavior. This needs to be addressed immediately.,"['safety', 'driver performance']",The primary issue in the ticket is that the customer felt unsafe due to the driver's erratic behavior during the ride.,Not applicable,Very negative,No,5
"The driver refused to follow my preferred route, which is shorter. I'm not satisfied with the service.","['driver performance', 'GPS/route']","The primary issue in this ticket is that the driver refused to follow the customer's preferred route, which the customer believes is shorter, leading to dissatisfaction with the service provided.",No,Somewhat negative,No,2


We can apply some labels to our table:

In [19]:
(
    results.select(
        "ticket",
        "issues",
        "primary_issue",
        "accident",
        "sentiment",
        "refund",
        "priority",
    ).print(
        pretty_labels={
            "scenario.ticket": "Ticket",
            "answer.issues": "Issues",
            "answer.primary_issue": "Primary issue",
            "answer.accident": "Accident",
            "answer.sentiment": "Sentiment",
            "answer.refund": "Refund request",
            "answer.priority": "Priority",
        }
    )
)

Ticket,Issues,Primary issue,Accident,Sentiment,Refund request,Priority
I just realized I left my phone in the car on my last ride. Can you help me get it back?,['lost item'],The primary issue is that the customer has left their phone in the car during their last ride and is seeking assistance to retrieve it.,No,Somewhat negative,No,4
I'm unhappy with my recent experience. The driver was very rude and unprofessional.,['driver performance'],The primary issue in this ticket is that the customer experienced rudeness and unprofessional behavior from a driver.,No,Very negative,No,4
I was charged more than the estimated fare for my trip yesterday. Can you explain why?,['other'],The primary issue in this ticket is that the customer was charged an amount exceeding the estimated fare for their trip and is seeking an explanation for the discrepancy.,Not applicable,Somewhat negative,No,3
"The car seat provided was not properly installed, and I felt my child was at risk. Please ensure driver training.","['safety', 'driver performance']","The primary issue in this ticket is that the car seat provided was not installed correctly, compromising the child's safety. The customer is requesting enhanced driver training to prevent future safety risks.",No,Somewhat negative,No,5
"My driver took a longer route than necessary, resulting in a higher fare. I request a fare adjustment.","['driver performance', 'GPS/route']","The primary issue in this ticket is that the customer's driver took a longer route than was necessary, which led to the customer being charged a higher fare than expected. The customer is requesting a fare adjustment to compensate for the unnecessary overcharge.",No,Somewhat negative,No,3
I had a great experience with my driver today! Very friendly and efficient service.,['driver performance'],There is no issue reported in this ticket. It contains positive feedback about a great experience with a friendly and efficient driver.,Not applicable,Very positive,No,0
I'm concerned about the vehicle's cleanliness. It was not up to the standard I expect.,['cleanliness'],"The primary issue in this ticket is the customer's dissatisfaction with the cleanliness of the vehicle, which did not meet their expected standard.",Not applicable,Somewhat negative,No,3
The app keeps crashing every time I try to book a ride. Please fix this issue.,['other'],The primary issue reported in the ticket is that the application is crashing every time the customer attempts to book a ride.,Not applicable,Somewhat negative,No,4
I felt unsafe during my ride due to the driver's erratic behavior. This needs to be addressed immediately.,"['safety', 'driver performance']",The primary issue in the ticket is that the customer felt unsafe due to the driver's erratic behavior during the ride.,Not applicable,Very negative,No,5
"The driver refused to follow my preferred route, which is shorter. I'm not satisfied with the service.","['driver performance', 'GPS/route']","The primary issue in this ticket is that the driver refused to follow the customer's preferred route, which the customer believes is shorter, leading to dissatisfaction with the service provided.",No,Somewhat negative,No,2


EDSL also comes with methods for accessing results as a pandas dataframe or a SQL table. To get a dataframe:

In [20]:
df = (
    results
    .select(
        "issues",
        "primary_issue",
        "accident",
        "sentiment",
        "refund",
        "priority"
    )
    .to_pandas(remove_prefix=True)
)
df

Unnamed: 0,accident,issues,primary_issue,priority,refund,sentiment
0,No,['lost item'],The primary issue is that the customer has lef...,4,No,Somewhat negative
1,No,['driver performance'],The primary issue in this ticket is that the c...,4,No,Very negative
2,Not applicable,['other'],The primary issue in this ticket is that the c...,3,No,Somewhat negative
3,No,"['safety', 'driver performance']",The primary issue in this ticket is that the c...,5,No,Somewhat negative
4,No,"['driver performance', 'GPS/route']",The primary issue in this ticket is that the c...,3,No,Somewhat negative
5,Not applicable,['driver performance'],There is no issue reported in this ticket. It ...,0,No,Very positive
6,Not applicable,['cleanliness'],The primary issue in this ticket is the custom...,3,No,Somewhat negative
7,Not applicable,['other'],The primary issue reported in the ticket is th...,4,No,Somewhat negative
8,Not applicable,"['safety', 'driver performance']",The primary issue in the ticket is that the cu...,5,No,Very negative
9,No,"['driver performance', 'GPS/route']",The primary issue in this ticket is that the d...,2,No,Somewhat negative
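Once the labels are in tabular form, simple summaries are easy to compute. As an illustration, the sketch below tallies how often each issue was checked, using the `issues` values copied by hand from the ten rows displayed above rather than read from the `results` object.

```python
from collections import Counter

# `issues` answers copied from the ten rows shown in the table above
issues_per_ticket = [
    ["lost item"],
    ["driver performance"],
    ["other"],
    ["safety", "driver performance"],
    ["driver performance", "GPS/route"],
    ["driver performance"],
    ["cleanliness"],
    ["other"],
    ["safety", "driver performance"],
    ["driver performance", "GPS/route"],
]

# Flatten the per-ticket lists and count each issue label
issue_counts = Counter(
    issue for issues in issues_per_ticket for issue in issues
)

for issue, count in issue_counts.most_common():
    print(f"{issue}: {count}")
```

Because a checkbox question can return several options per ticket, the counts add up to more than the number of tickets.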


We can also access results as a SQL table:

In [21]:
results.sql("""
select ticket, issues, primary_issue, accident, sentiment, refund, priority 
from self
""", shape="wide")

Unnamed: 0,ticket,issues,primary_issue,accident,sentiment,refund,priority
0,I just realized I left my phone in the car on ...,['lost item'],The primary issue is that the customer has lef...,No,Somewhat negative,No,4
1,I'm unhappy with my recent experience. The dri...,['driver performance'],The primary issue in this ticket is that the c...,No,Very negative,No,4
2,I was charged more than the estimated fare for...,['other'],The primary issue in this ticket is that the c...,Not applicable,Somewhat negative,No,3
3,The car seat provided was not properly install...,"['safety', 'driver performance']",The primary issue in this ticket is that the c...,No,Somewhat negative,No,5
4,"My driver took a longer route than necessary, ...","['driver performance', 'GPS/route']",The primary issue in this ticket is that the c...,No,Somewhat negative,No,3
5,I had a great experience with my driver today!...,['driver performance'],There is no issue reported in this ticket. It ...,Not applicable,Very positive,No,0
6,I'm concerned about the vehicle's cleanliness....,['cleanliness'],The primary issue in this ticket is the custom...,Not applicable,Somewhat negative,No,3
7,The app keeps crashing every time I try to boo...,['other'],The primary issue reported in the ticket is th...,Not applicable,Somewhat negative,No,4
8,I felt unsafe during my ride due to the driver...,"['safety', 'driver performance']",The primary issue in the ticket is that the cu...,Not applicable,Very negative,No,5
9,The driver refused to follow my preferred rout...,"['driver performance', 'GPS/route']",The primary issue in this ticket is that the d...,No,Somewhat negative,No,2
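SQL is convenient for aggregating the labels. The sketch below uses the standard library `sqlite3` module on a small in-memory table named `self`, filled with the sentiment and priority values from the ten rows displayed above. It only illustrates the kind of query one might write; whether `results.sql` accepts exactly the same clauses is an assumption to check against the docs.

```python
import sqlite3

# (sentiment, priority) pairs copied from the ten rows shown above
rows = [
    ("Somewhat negative", 4),
    ("Very negative", 4),
    ("Somewhat negative", 3),
    ("Somewhat negative", 5),
    ("Somewhat negative", 3),
    ("Very positive", 0),
    ("Somewhat negative", 3),
    ("Somewhat negative", 4),
    ("Very negative", 5),
    ("Somewhat negative", 2),
]

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE self (sentiment TEXT, priority INTEGER)")
conn.executemany("INSERT INTO self VALUES (?, ?)", rows)

# Number of tickets and average priority for each sentiment label
summary = conn.execute(
    """
    SELECT sentiment, COUNT(*) AS n_tickets, AVG(priority) AS avg_priority
    FROM self
    GROUP BY sentiment
    ORDER BY avg_priority DESC
    """
).fetchall()
conn.close()

for sentiment, n_tickets, avg_priority in summary:
    print(f"{sentiment}: {n_tickets} tickets, average priority {avg_priority:.2f}")
```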


To export results to a CSV file:

In [22]:
results.to_csv("data_labeling_example.csv")
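When reading an exported file back in, list-valued answers such as `issues` may arrive as strings. The sketch below assumes the CSV stores them as Python-style list literals under headers like `answer.issues`, as in the tables printed above; both the layout and the header names are assumptions, so inspect your own file first. It parses a small in-memory sample rather than the real export.

```python
import ast
import csv
import io

# A two-row sample in the assumed export layout (not the real file)
csv_text = """answer.issues,answer.priority
"['safety', 'driver performance']",5
['cleanliness'],3
"""

parsed = []
for row in csv.DictReader(io.StringIO(csv_text)):
    parsed.append(
        {
            # literal_eval safely turns the list literal back into a list
            "issues": ast.literal_eval(row["answer.issues"]),
            "priority": int(row["answer.priority"]),
        }
    )

for record in parsed:
    print(record)
```

Using `ast.literal_eval` instead of `eval` keeps the parsing limited to plain Python literals.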

## Posting content to the Coop

In [23]:
results.push(visibility="public")

{'description': None,
 'object_type': 'results',
 'url': 'https://www.expectedparrot.com/content/15f024f0-e669-4b67-95eb-393974ef55c3',
 'uuid': '15f024f0-e669-4b67-95eb-393974ef55c3',
 'version': '0.1.30.dev3',
 'visibility': 'public'}

In [24]:
survey.push(visibility="public")

{'description': None,
 'object_type': 'survey',
 'url': 'https://www.expectedparrot.com/content/5fd2002c-f189-468a-b8c9-af92fb69fd10',
 'uuid': '5fd2002c-f189-468a-b8c9-af92fb69fd10',
 'version': '0.1.30.dev3',
 'visibility': 'public'}

In [25]:
from edsl import Coop, Notebook

coop = Coop()

notebook = Notebook(path="data_labeling_example.ipynb")

coop.create(notebook, description="Data Labeling Example", visibility="public") 

{'description': None,
 'object_type': 'notebook',
 'url': 'https://www.expectedparrot.com/content/c1188684-2b19-4fd3-af77-cfa63536919b',
 'uuid': 'c1188684-2b19-4fd3-af77-cfa63536919b',
 'version': '0.1.30.dev3',
 'visibility': 'public'}