Cookbook for Helicone #625

Merged · 2 commits · Mar 12, 2024
Binary file added docs/assets/helicone1.webp
Binary file added docs/assets/helicone2.webp
Binary file added docs/assets/langfuse.webp
202 changes: 202 additions & 0 deletions docs/integrations/observation-tools/helicone.mdx
@@ -0,0 +1,202 @@
---
title: Helicone
---
[Helicone](https://www.helicone.ai/) provides monitoring tools that help you understand how your LLM application is performing. In this guide, we will walk you through using Helicone to monitor your application and log your UpTrain evaluations to Helicone dashboards.

## How to integrate?

### Prerequisites

```python
%pip install openai uptrain -qU
```

<Steps>
<Step title = "Define Your OpenAI and Helicone Key">
You can get your Helicone API key [here](https://www.helicone.ai/developer) and OpenAI API key [here](https://platform.openai.com/api-keys)
```python
import os
HELICONE_API_KEY = os.environ["HELICONE_API_KEY"]
OPENAI_API_KEY = os.environ["OPENAI_API_KEY"]
```
</Step>

<Step title = "Define OpenAI client">
```python
import uuid
import requests
from openai import OpenAI

# Headers for Helicone's REST API, used later to attach evaluation
# scores as custom properties
update_headers = {
    'Authorization': f'Bearer {HELICONE_API_KEY}',
    'Content-Type': 'application/json',
}

client = OpenAI(
    api_key=OPENAI_API_KEY,  # Your OpenAI API key
    base_url="https://oai.hconeai.com/v1",  # Route requests through the Helicone proxy
    default_headers={  # Optionally set default headers, or set them per request (see below)
        "Helicone-Auth": f"Bearer {HELICONE_API_KEY}",
    }
)
```
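
With this client, every completion request is routed through Helicone and logged automatically. As a quick sanity check (a minimal sketch; the model and prompt here are illustrative), you can send a test request and confirm it shows up in your Helicone dashboard:

```python
ping = client.chat.completions.create(
    model="gpt-4",
    messages=[{"role": "user", "content": "Reply with the word 'pong'."}],
)
print(ping.choices[0].message.content)  # This call should now appear in Helicone
```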
</Step>

<Step title = "Let's define our dataset">
```python
data = [
    {
        "question": "What causes diabetes?",
        "context": "Diabetes is a metabolic disorder characterized by high blood sugar levels. It is primarily caused by a combination of genetic and environmental factors, including obesity and lack of physical activity.",
    },
    {
        "question": "What is the capital of France?",
        "context": "Paris is the capital of France. It is a place where people speak French and enjoy baguettes. I once heard that the Eiffel Tower was built by aliens, but don't quote me on that.",
    },
    {
        "question": "How is pneumonia treated?",
        "context": "Pneumonia is an infection that inflames the air sacs in one or both lungs. It is typically treated with antibiotics, rest, and supportive care. The choice of antibiotics depends on the type of pneumonia and its severity.",
    }
]
```
</Step>

<Step title = "Define your prompt">
```python
def create_prompt(question, context):
    prompt = f"""
Context information is below.
---------------------
{context}
---------------------
Given the context information and not prior knowledge, answer the query.
Query: {question}
Answer:
"""
    return prompt
```
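
For example, you can preview the prompt generated for the first record of our dataset:

```python
print(create_prompt(data[0]["question"], data[0]["context"]))
```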
</Step>

<Step title = "Define funtion to generate responses">
```python
def generate_responses(prompt, helicone_request_id):
    # Tag the request with a unique ID so we can attach
    # evaluation scores to it later via Helicone's API
    response = client.chat.completions.create(
        model="gpt-4",
        messages=[
            {"role": "user", "content": prompt}
        ],
        extra_headers={  # Headers can also be attached per request
            "Helicone-Auth": f"Bearer {HELICONE_API_KEY}",
            "Helicone-Request-Id": helicone_request_id,
        },
    ).choices[0].message.content
    return response
```
</Step>

<Step title = "Define UpTrain Function to run Evaluations">
```python
from uptrain import EvalLLM, Evals

eval_llm = EvalLLM(openai_api_key=OPENAI_API_KEY)
```
We will use the following four metrics from UpTrain's library:

1. [Response Conciseness](https://docs.uptrain.ai/predefined-evaluations/response-quality/response-conciseness): Evaluates how concise the generated response is and whether it includes any irrelevant information for the question asked.

2. [Factual Accuracy](https://docs.uptrain.ai/predefined-evaluations/context-awareness/factual-accuracy): Evaluates whether the response generated is factually correct and grounded by the provided context.

3. [Context Utilization](https://docs.uptrain.ai/predefined-evaluations/context-awareness/context-utilization): Evaluates how complete the generated response is for the question specified, given the information provided in the context. Also known as Response Completeness with respect to context.

4. [Response Relevance](https://docs.uptrain.ai/predefined-evaluations/response-quality/response-relevance): Evaluates how relevant the generated response was to the question specified.

Each score has a value between 0 and 1.

You can look at the complete list of UpTrain's supported metrics [here](https://docs.uptrain.ai/predefined-evaluations/overview).
```python
def uptrain_evaluate(item):
    res = eval_llm.evaluate(
        project_name="Helicone-Demo",
        data=item,
        checks=[
            Evals.RESPONSE_CONCISENESS,
            Evals.RESPONSE_RELEVANCE,
            Evals.RESPONSE_COMPLETENESS_WRT_CONTEXT,
            Evals.FACTUAL_ACCURACY,
        ]
    )
    return res
```
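
The logging loop in the next step assumes that `evaluate` returns one dictionary per record, with scores and explanations stored under keys prefixed `score_` and `explanation_`. As a quick sketch to verify those key names on a single record (the `response` string below is made up for illustration):

```python
sample = uptrain_evaluate([{
    "question": data[0]["question"],
    "context": data[0]["context"],
    "response": "Diabetes is caused by a mix of genetic and lifestyle factors.",  # made-up answer
}])
print([k for k in sample[0].keys() if k.startswith(("score", "explanation"))])
```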
</Step>

<Step title = "Run the evaluations and log the data to Helicone t">
```python
results = []
for index in range(len(data)):
    question = data[index]['question']
    context = data[index]['context']

    prompt = create_prompt(question, context)

    # Generate a fresh request ID so the evaluation scores can be
    # attached to this specific LLM call in Helicone
    my_helicone_request_id = str(uuid.uuid4())

    response = generate_responses(prompt, my_helicone_request_id)

    eval_data = [
        {
            'question': question,
            'context': context,
            'response': response,
        }
    ]

    result = uptrain_evaluate(eval_data)
    results.append(result)

    # Push each score and explanation back to Helicone as a custom
    # property on the request we tagged above
    for i in result[0].keys():
        if i.startswith('score') or i.startswith('explanation'):
            json_data = {
                'key': i,
                'value': str(result[0][i]),
            }
            status = requests.put(f'https://api.hconeai.com/v1/request/{my_helicone_request_id}/property', headers=update_headers, json=json_data)
```
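
Before heading to the dashboard, you can spot-check the aggregated results locally; you can also confirm the final `requests.put` call succeeded via `status.ok`. A minimal sketch (the exact keys depend on the checks you ran):

```python
import json

# Print the first record's evaluation output for a quick visual check
print(json.dumps(results[0], indent=2, default=str))
```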
</Step>
</Steps>

### Visualize Results in Helicone Dashboards

You can log into [Helicone Dashboards](https://www.helicone.ai/dashboard) to monitor your LLM application's cost, token usage, and latency:
<Frame>
<img src="/assets/helicone1.webp" />
</Frame>

You can also inspect individual records:

<Frame>
<img src="/assets/helicone2.webp" />
</Frame>


<CardGroup cols={2}>
<Card
title="Tutorial"
href="https://github.com/uptrain-ai/uptrain/blob/main/examples/integrations/observation_tools/helicone.ipynb"
icon="github"
color="#808080"
>
Open this tutorial in GitHub
</Card>
<Card
title="Have Questions?"
href="https://join.slack.com/t/uptraincommunity/shared_invite/zt-1yih3aojn-CEoR_gAh6PDSknhFmuaJeg"
icon="slack"
color="#808080"
>
Join our community for any questions or requests
</Card>
</CardGroup>
7 changes: 6 additions & 1 deletion docs/integrations/observation-tools/langfuse.mdx
@@ -11,7 +11,7 @@ Scores can be used in a variety of ways in Langfuse:
3. Fine Tuning: Filter and [export](https://langfuse.com/docs/export-and-fine-tuning) by scores as .csv or .JSONL for fine-tuning
4. Analytics: Detailed score reporting and dashboards with drill downs into use cases and user segments

-This notebook demonstrates how to use Langfuse to create traces and evaluate using UpTrain
+In this guide, we will walk you through using Langfuse to create traces and evaluate them using UpTrain.

## How to integrate?
### Setup
@@ -226,6 +226,11 @@ for _, row in df.iterrows():
df.head()
```

You can visualize these results in Langfuse:
<Frame>
<img src="/assets/langfuse.webp" />
</Frame>

<CardGroup cols={2}>
<Card
title="Tutorial"
Expand Down
3 changes: 2 additions & 1 deletion docs/mint.json
@@ -189,7 +189,8 @@
{
"group": "Observation Tools",
"pages": [
"integrations/observation-tools/langfuse"
"integrations/observation-tools/langfuse",
"integrations/observation-tools/helicone"
]
},
{