---
sidebar_label: Qualcomm Inference Suite
---

# ChatQIS

This guide will help you get started with Qualcomm Inference Suite [chat models](/docs/concepts/chat_models). For detailed documentation of all ChatQIS features and configurations, head to the [API reference](https://python.langchain.com/api_reference/langchain_qualcomm_inference_suite/chat_models/langchain_qualcomm_inference_suite.chat_models.ChatQIS.html).

## Overview
### Integration details

| Class                                                                                                                                                            | Package                                                                                                              | Local | Serializable | JS support |                                                 Package downloads                                                  |                                                 Package latest                                                  |
|:-----------------------------------------------------------------------------------------------------------------------------------------------------------------|:---------------------------------------------------------------------------------------------------------------------|:-----:|:------------:|:----------:|:------------------------------------------------------------------------------------------------------------------:|:---------------------------------------------------------------------------------------------------------------:|
| [ChatQIS](https://python.langchain.com/api_reference/langchain_qualcomm_inference_suite/chat_models/langchain_qualcomm_inference_suite.chat_models.ChatQIS.html) | [langchain-qualcomm-inference-suite](https://python.langchain.com/api_reference/langchain_qualcomm_inference_suite/) |   ❌   |      ❌       |     ❌      | ![PyPI - Downloads](https://img.shields.io/pypi/dm/langchain-qualcomm-inference-suite?style=flat-square&label=%20) | ![PyPI - Version](https://img.shields.io/pypi/v/langchain-qualcomm-inference-suite?style=flat-square&label=%20) |

### Model features
| [Tool calling](/docs/how_to/tool_calling) | [Structured output](/docs/how_to/structured_output/) | JSON mode | [Image input](/docs/how_to/multimodal_inputs/) | Audio input | Video input | [Token-level streaming](/docs/how_to/chat_streaming/) | Native async | [Token usage](/docs/how_to/chat_token_usage_tracking/) | [Logprobs](/docs/how_to/logprobs/) |
|:-----------------------------------------:|:----------------------------------------------------:|:---------:|:----------------------------------------------:|:-----------:|:-----------:|:-----------------------------------------------------:|:------------:|:------------------------------------------------------:|:----------------------------------:|
|                     ❌                     |                          ❌                           |     ❌     |                       ❌                        |      ❌      |      ❌      |                           ✅                           |      ✅       |                           ✅                            |                 ❌                  |

## Setup

To access Qualcomm Inference Suite models, contact your Qualcomm Inference Suite service provider, who will supply an API key and an API endpoint. You can then install and use the `langchain-qualcomm-inference-suite` integration package.

### Credentials

Once your Qualcomm Inference Suite service provider has given you an API key and an API endpoint, set the `IMAGINE_API_KEY` and `IMAGINE_API_ENDPOINT` environment variables:

In [1]:
import getpass
import os

if not os.getenv("IMAGINE_API_KEY"):
    os.environ["IMAGINE_API_KEY"] = getpass.getpass("Enter your Qualcomm Inference Suite API key: ")
if not os.getenv("IMAGINE_API_ENDPOINT"):
    os.environ["IMAGINE_API_ENDPOINT"] = input("Enter your Qualcomm Inference Suite API endpoint: ")
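
In a script or CI job there is no interactive prompt, so it can help to fail early when a credential is missing. The following is a minimal sketch of such a check. The helper name `missing_credentials` is hypothetical and not part of the integration package; only the two environment variable names come from this page.

```python
import os

# The two variable names are the ones named in the Credentials section.
REQUIRED_ENV_VARS = ("IMAGINE_API_KEY", "IMAGINE_API_ENDPOINT")


def missing_credentials(env=None):
    """Return the required variable names that are unset or empty.

    Hypothetical helper: `env` defaults to os.environ, and passing a
    plain dict instead makes the check easy to test.
    """
    env = os.environ if env is None else env
    return [name for name in REQUIRED_ENV_VARS if not env.get(name)]


missing = missing_credentials()
if missing:
    print("Missing credentials:", ", ".join(missing))
else:
    print("Both Qualcomm Inference Suite credentials are set.")
```

Treating an empty string as missing mirrors the `if not os.getenv(...)` checks above.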

### Installation

The LangChain Qualcomm Inference Suite integration lives in the `langchain-qualcomm-inference-suite` package:

In [None]:
%pip install -qU langchain-qualcomm-inference-suite

## Instantiation

Now we can instantiate our model object and generate chat completions:

In [2]:
from langchain_qualcomm_inference_suite import ChatQIS

llm = ChatQIS(
    model="Llama-3.1-8B",
    temperature=0,
    max_tokens=None,
    timeout=None,
    max_retries=2,
    # other params...
)

## Invocation


In [4]:
messages = [
    (
        "system",
        "You are a helpful assistant that translates English to French. Translate the user sentence.",
    ),
    ("human", "I love programming."),
]
ai_msg = llm.invoke(messages)
ai_msg

AIMessage(content='The translation of "I love programming" to French is:\n\n"J\'adore le programmation."', additional_kwargs={}, response_metadata={'token_usage': {'prompt_tokens': 36, 'total_tokens': 58, 'completion_tokens': 22}, 'model_name': 'Llama-3.1-8B', 'system_fingerprint': '', 'finish_reason': <FinishReason.stop: 'stop'>}, id='run-984263a5-9ead-408f-8954-3659d3671d22-0', usage_metadata={'input_tokens': 36, 'output_tokens': 22, 'total_tokens': 58})

In [5]:
print(ai_msg.content)

The translation of "I love programming" to French is:

"J'adore le programmation."
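
The `usage_metadata` field in the output above reports input, output, and total token counts for a single call. To track usage across several calls, one option is to keep a running total. The sketch below uses plain dictionaries shaped like the `usage_metadata` values shown on this page; the helper name `add_usage` is hypothetical, and in practice you would pass `ai_msg.usage_metadata` for each response.

```python
def add_usage(total, usage):
    """Add one response's token counts to a running total (returns a new dict)."""
    keys = ("input_tokens", "output_tokens", "total_tokens")
    return {key: total.get(key, 0) + usage.get(key, 0) for key in keys}


# Counts copied from the two example outputs shown on this page.
translation_usage = {"input_tokens": 36, "output_tokens": 22, "total_tokens": 58}
chain_usage = {"input_tokens": 31, "output_tokens": 8, "total_tokens": 39}

running_total = {}
for usage in (translation_usage, chain_usage):
    running_total = add_usage(running_total, usage)

print(running_total)
# {'input_tokens': 67, 'output_tokens': 30, 'total_tokens': 97}
```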


## Chaining

We can [chain](/docs/how_to/sequence/) our model with a prompt template like so:


In [6]:
from langchain_core.prompts import ChatPromptTemplate

prompt = ChatPromptTemplate(
    [
        (
            "system",
            "You are a helpful assistant that translates {input_language} to {output_language}.",
        ),
        ("human", "{input}"),
    ]
)

chain = prompt | llm
chain.invoke(
    {
        "input_language": "English",
        "output_language": "German",
        "input": "I love programming.",
    }
)

AIMessage(content='Das Programmieren ist mein Hobby.', additional_kwargs={}, response_metadata={'token_usage': {'prompt_tokens': 31, 'total_tokens': 39, 'completion_tokens': 8}, 'model_name': 'Llama-3.1-8B', 'system_fingerprint': '', 'finish_reason': <FinishReason.stop: 'stop'>}, id='run-89137506-a3a1-4799-bf8d-564a1a574dee-0', usage_metadata={'input_tokens': 31, 'output_tokens': 8, 'total_tokens': 39})
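
The feature table lists token-level streaming as supported, and LangChain runnables (chat models as well as `prompt | llm` chains) expose a `.stream()` method for it. The helper below is a minimal sketch of consuming such a stream. The name `stream_text` is hypothetical, and the code assumes each chunk carries its text as a string in `.content`.

```python
def stream_text(runnable, inputs):
    """Print each chunk's text as it arrives and return the full string.

    Assumes `runnable.stream(inputs)` yields chunks whose `.content`
    is a string, as is typical for text-only chat model output.
    """
    parts = []
    for chunk in runnable.stream(inputs):
        print(chunk.content, end="", flush=True)
        parts.append(chunk.content)
    print()  # finish the line once the stream ends
    return "".join(parts)
```

With the chain defined above, a call would look like `stream_text(chain, {"input_language": "English", "output_language": "German", "input": "I love programming."})`.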

## API reference

For detailed documentation of all ChatQIS features and configurations, head to the [API reference](https://python.langchain.com/api_reference/langchain_qualcomm_inference_suite/chat_models/langchain_qualcomm_inference_suite.chat_models.ChatQIS.html).