<img src="https://drive.google.com/uc?export=view&id=1wYSMgJtARFdvTt5g7E20mE4NmwUFUuog" width="200">

[![Gen AI Experiments](https://img.shields.io/badge/Gen%20AI%20Experiments-GenAI%20Bootcamp-blue?style=for-the-badge&logo=artificial-intelligence)](https://github.com/buildfastwithai/gen-ai-experiments)
[![Gen AI Experiments GitHub](https://img.shields.io/github/stars/buildfastwithai/gen-ai-experiments?style=for-the-badge&logo=github&color=gold)](http://github.com/buildfastwithai/gen-ai-experiments)


[![Open In Colab](https://colab.research.google.com/assets/colab-badge.svg)](https://colab.research.google.com/drive/1rNwI_8GcF-9WG4wuzvvvv4KgMlZYNuX7?usp=sharing)





# 🚀 Using Baseten Inference for LLMs
This notebook demonstrates how to use the **Baseten Inference API** to run different Large Language Models (LLMs) with the familiar OpenAI client.

Baseten Model APIs are built for production first, with extraordinary performance and reliability. Baseten provides an OpenAI-compatible API, pre-optimized performance, and secure, highly scalable inference.

## 🔧 Setup
Before starting, install the required library and set up your API key.

In [None]:
!pip install -q openai

## ⚙️ Initialize Client
Store your Baseten API key in Colab Secrets under the name `BASETEN_API_KEY`; the cell below reads it with `userdata.get`. We'll use `moonshotai/Kimi-K2-Instruct` as the model.

In [None]:

from openai import OpenAI
from google.colab import userdata

# Initialize client with Baseten endpoint
client = OpenAI(
    api_key=userdata.get('BASETEN_API_KEY'),  # Setup your Baseten key in Secrets
    base_url="https://inference.baseten.co/v1"
)
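
Outside Colab, `google.colab.userdata` cannot be imported, so the cell above fails. The following is a minimal sketch of a fallback, assuming the key is also exported as an environment variable with the same name. The helper name `load_baseten_key` is hypothetical and not part of any library.

```python
import os


def load_baseten_key(name="BASETEN_API_KEY"):
    """Return the API key from Colab Secrets if available, else from the environment."""
    key = None
    try:
        # Only importable when running inside Google Colab.
        from google.colab import userdata
    except ImportError:
        userdata = None

    if userdata is not None:
        try:
            key = userdata.get(name)
        except Exception:
            # Secret missing or notebook access not granted: fall back to the environment.
            key = None

    if not key:
        key = os.environ.get(name)
    if not key:
        raise RuntimeError(f"No API key found: set {name} in Colab Secrets or the environment")
    return key
```

The result can then be passed as `api_key=load_baseten_key()` when constructing the client.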


## ✨ Use Case 1: Hello World in Python
Let's start with a simple prompt asking the model to generate a Hello World program.

In [None]:

response = client.chat.completions.create(
    model="moonshotai/Kimi-K2-Instruct",
    messages=[{"role": "user", "content": "Write a Hello World program in Python"}],
    max_tokens=100
)

print(response.choices[0].message.content)


```python
# hello_world.py

def main():
    print("Hello, World!")

if __name__ == "__main__":
    main()
```
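
Every use case in this notebook repeats the same request pattern, so a small wrapper can cut the duplication. This is a sketch only: `ask` is a hypothetical helper name, and it assumes `client` is the OpenAI client configured above.

```python
def ask(client, prompt, model="moonshotai/Kimi-K2-Instruct", max_tokens=100, **kwargs):
    """Send a single user prompt and return the text of the first choice.

    Extra keyword arguments (for example temperature) are passed through
    to client.chat.completions.create unchanged.
    """
    response = client.chat.completions.create(
        model=model,
        messages=[{"role": "user", "content": prompt}],
        max_tokens=max_tokens,
        **kwargs,
    )
    return response.choices[0].message.content
```

With this in place, the cell above could be written as `print(ask(client, "Write a Hello World program in Python"))`.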


## ✨ Use Case 2: Text Summarization
We can ask the model to summarize a piece of text into a short paragraph.

In [None]:

long_text = """Artificial Intelligence is transforming the way we interact with technology.
From natural language processing to computer vision, AI models are being applied
across industries to improve efficiency, accuracy, and decision-making."""

response = client.chat.completions.create(
    model="moonshotai/Kimi-K2-Instruct",
    messages=[{"role": "user", "content": f"Summarize the following text:\n{long_text}"}],
    max_tokens=150
)

print(response.choices[0].message.content)


AI is revolutionizing technology interactions, enhancing efficiency and decision-making across industries through applications like natural language processing and computer vision.
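
To control the summary length more directly than `max_tokens` does, the instruction can be moved into a system message. The sketch below only builds the `messages` list. `build_summary_messages` and its `max_sentences` parameter are hypothetical names, and how closely the model follows the sentence limit is an assumption, not a guarantee.

```python
def build_summary_messages(text, max_sentences=1):
    """Build a chat message list that asks for a summary of bounded length."""
    if max_sentences < 1:
        raise ValueError("max_sentences must be at least 1")
    system = (
        "You are a concise summarizer. "
        f"Reply with at most {max_sentences} sentence(s) and no preamble."
    )
    return [
        {"role": "system", "content": system},
        {"role": "user", "content": f"Summarize the following text:\n{text}"},
    ]
```

The returned list can be passed as `messages=build_summary_messages(long_text)` in the request above.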


## ✨ Use Case 3: Creative Writing
Finally, let’s generate a short creative story using the model.

In [None]:

response = client.chat.completions.create(
    model="moonshotai/Kimi-K2-Instruct",
    messages=[{"role": "user", "content": "Write a short science fiction story about humans living on Mars."}],
    max_tokens=200,
    temperature=0.9
)

print(response.choices[0].message.content)


**Red Horizon**  
*Sol 1,247, Ares Calendar*

The dust storm had been raging for six days when the sky finally cleared, revealing the twin moons of Mars hanging like pale lanterns above the Valles Marineris. Dr. Lila Moreau pressed her gloved hand against the viewport of Hab-3, watching the last red clouds unravel into nothing. Somewhere out there, beneath the regolith, the *Ariadne* was waiting—an alien artifact no human had yet managed to wake.

“Storm’s over,” she said into the comm. “Time to dig.”

The response crackled back from the rover: “Copy that, Doc. Bringing the coffee.”

Lila smiled despite herself. Coffee on Mars was a ritual, a tether to Earth. She sealed her helmet and stepped into the airlock, boots clanking against the grated floor. Outside, the temperature was -80°C, but her suit kept her warm. The horizon
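
The story above stops mid-sentence, which is consistent with the request hitting its `max_tokens=200` cap. A minimal sketch for detecting this, assuming the response follows the OpenAI chat completion shape, where `finish_reason` is `"length"` when the token limit cut the output short. `was_truncated` is a hypothetical helper name.

```python
def was_truncated(response):
    """Return True if the first choice ended because the token limit was reached."""
    return response.choices[0].finish_reason == "length"
```

If `was_truncated(response)` returns `True`, re-running the request with a larger `max_tokens` is one way to get a complete story.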


## ✅ Wrap-Up
In this notebook, we explored how to:
- Connect to Baseten Inference with the OpenAI client
- Run prompts for code generation, summarization, and creative writing

You can extend these examples to build chatbots, assistants, or custom LLM apps!
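
As a starting point for a chatbot, the sketch below keeps the conversation history and resends it on every turn, since each chat completion request is stateless. `ChatSession` is a hypothetical class written for illustration, and it assumes `client` is the OpenAI client configured earlier in this notebook.

```python
class ChatSession:
    """Keeps the running message list so each request includes earlier turns."""

    def __init__(self, client, model="moonshotai/Kimi-K2-Instruct", system_prompt=None):
        self.client = client
        self.model = model
        self.messages = []
        if system_prompt:
            self.messages.append({"role": "system", "content": system_prompt})

    def send(self, user_text, max_tokens=200):
        """Append the user turn, call the model, store and return its reply."""
        self.messages.append({"role": "user", "content": user_text})
        response = self.client.chat.completions.create(
            model=self.model,
            messages=self.messages,
            max_tokens=max_tokens,
        )
        reply = response.choices[0].message.content
        self.messages.append({"role": "assistant", "content": reply})
        return reply
```

A session would be created once with `chat = ChatSession(client)` and then used with repeated `chat.send("...")` calls. The history grows with every turn, so a long-running chat would eventually need trimming or summarizing of older messages.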