## Using LLMs via Hugging Face Inference Client

Thankfully, Hugging Face has made its new [__Inference Client__](https://huggingface.co/docs/huggingface_hub/en/package_reference/inference_client) free to use, with basic rate limits in place so that nobody can send unlimited requests to its servers.

The best part is that, much like the Inference API, it lets you access 150,000+ deep learning models without worrying about your own infrastructure.

In [1]:
from huggingface_hub import InferenceClient


Feel free to refer to the [documentation](https://huggingface.co/docs/huggingface_hub/en/package_reference/inference_client#huggingface_hub.InferenceClient) at any time as needed for more details on function names, arguments and more.

In [2]:
from dotenv import load_dotenv
import os

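# Load variables from a local .env file into the process environment
load_dotenv()
# Hugging Face access token; this is None if HF_TOKEN is not set
hf_key = os.getenv("HF_TOKEN")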
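
Because `os.getenv` quietly returns `None` when a variable is missing, a forgotten `.env` entry may only surface later as an authentication error. A minimal sketch of a fail-fast check follows; `require_env` is a hypothetical helper written for this example, not part of `dotenv` or `huggingface_hub`.

```python
import os
from typing import Mapping


def require_env(name: str, env: Mapping[str, str] = os.environ) -> str:
    """Return the value of an environment variable, or raise a clear error.

    Hypothetical helper for illustration; `env` is injectable so the
    function can be exercised without touching the real environment.
    """
    value = env.get(name)
    if not value:
        raise RuntimeError(
            f"{name} is not set; add it to your .env file or shell environment."
        )
    return value


# Demonstration with a stand-in mapping instead of the real environment
print(require_env("HF_TOKEN", {"HF_TOKEN": "hf_example_token"}))
```

In the notebook, this could be called as `hf_key = require_env("HF_TOKEN")` right after `load_dotenv()`.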

In [3]:
model_name = "meta-llama/Llama-3.2-1B-Instruct"
client = InferenceClient(model=model_name, api_key=hf_key)

chat = [
    { "role": "user", "content": "Explain what is Generative AI in 2 bullet points" },
]

# Send the conversation to the model; max_tokens caps the length of the generated reply
response = client.chat_completion(chat, max_tokens=1000)
print(response)

ChatCompletionOutput(choices=[ChatCompletionOutputComplete(finish_reason='stop', index=0, message=ChatCompletionOutputMessage(role='assistant', content='Here are two key points about Generative AI:\n\n• **Generative AI refers to the use of artificial intelligence (AI) algorithms that can create new data or content that resembles existing data or content, often with a specific purpose or goal in mind.** This can include tasks such as generating text, images, videos, music, or even entire articles.\n\n• **Generative AI models can be trained on large datasets to learn patterns and relationships, and then use this knowledge to produce new, original content that may not have been created by humans before.** This can result in innovative and often surprising outputs that can be useful for a wide range of applications, from art and design to entertainment and education.', tool_call_id=None, tool_calls=None), logprobs=None, content_filter_results={'hate': {'filtered': False}, 'self_harm': {'fi
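
The printed object nests the reply under `choices`, and each choice carries a `finish_reason` next to its `message`. The sketch below shows one way to check whether a reply ended naturally or was cut off by `max_tokens`. It assumes the common convention that a cut-off reply reports `finish_reason == "length"`, and it uses a hand-built stand-in object rather than a real API response so that it runs offline; `was_truncated` is a hypothetical helper name.

```python
from types import SimpleNamespace


def was_truncated(response) -> bool:
    """Return True if the first choice stopped because it hit the token limit.

    Assumption: a truncated reply reports finish_reason == "length",
    while a naturally finished reply reports "stop" (as in the output above).
    """
    return response.choices[0].finish_reason == "length"


# Stand-in object mimicking only the fields used here (not a real API response)
fake_response = SimpleNamespace(
    choices=[
        SimpleNamespace(
            finish_reason="stop",
            index=0,
            message=SimpleNamespace(role="assistant", content="Hello"),
        )
    ]
)

print(was_truncated(fake_response))
```

If such a check returns `True` for a real response, raising `max_tokens` or asking for a shorter answer are the usual remedies.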

In [4]:
print(response.choices[0].message.content)

Here are two key points about Generative AI:

• **Generative AI refers to the use of artificial intelligence (AI) algorithms that can create new data or content that resembles existing data or content, often with a specific purpose or goal in mind.** This can include tasks such as generating text, images, videos, music, or even entire articles.

• **Generative AI models can be trained on large datasets to learn patterns and relationships, and then use this knowledge to produce new, original content that may not have been created by humans before.** This can result in innovative and often surprising outputs that can be useful for a wide range of applications, from art and design to entertainment and education.
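
The `chat` list used above holds a single user message, but the same list-of-dicts format can also carry a system message that sets the tone of the reply. Here is a small sketch of a helper that builds such a list; `build_chat` and the example system prompt are illustrative assumptions, and how closely a given model follows the system message may vary.

```python
from typing import Dict, List, Optional


def build_chat(user_prompt: str, system_prompt: Optional[str] = None) -> List[Dict[str, str]]:
    """Build a chat message list in the role/content format used above.

    Hypothetical helper: an optional system message is placed first,
    followed by the user message.
    """
    messages: List[Dict[str, str]] = []
    if system_prompt:
        messages.append({"role": "system", "content": system_prompt})
    messages.append({"role": "user", "content": user_prompt})
    return messages


chat = build_chat(
    "Explain what is Generative AI in 2 bullet points",
    system_prompt="You are a concise technical tutor.",
)
print(chat)
```

The resulting list can be passed to `client.chat_completion(chat, max_tokens=1000)` exactly as in the earlier cell.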
