# Detailed Thinking Mode with Llama 3.3 Nemotron Super 49B

In this notebook, we'll explore how simple it is to turn detailed thinking mode on and off with the Llama 3.3 Nemotron Super 49B NIM.

If you'd like to learn more about this model, please check out our [blog](https://developer.nvidia.com/blog/build-enterprise-ai-agents-with-advanced-open-nvidia-llama-nemotron-reasoning-models/), which explains how the model was produced.

> NOTE: Before moving forward in this notebook, please ensure you've followed the instructions in the [README.md](./README.md).

### Detailed Thinking Mode On

For the first example, we'll run the model with detailed thinking "on". With the system prompt set to `detailed thinking on`, the model behaves as a long-thinking reasoning model, which is most effective for complex reasoning tasks. Before producing its final response, the model generates a number of reasoning tokens enclosed in `<think>` tags.
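Because the reasoning is delimited by tags, it can be separated from the final answer once the full response has been collected. The following is a minimal sketch, not part of the original notebook: `split_thinking` is a hypothetical helper, and it assumes the reasoning is closed with a matching `</think>` tag.

```python
import re
from typing import Tuple

# Assumption: the reasoning is wrapped in <think>...</think>.
THINK_PATTERN = re.compile(r"<think>(.*?)</think>", re.DOTALL)


def split_thinking(text: str) -> Tuple[str, str]:
    """Return (reasoning, answer) extracted from a full response string."""
    match = THINK_PATTERN.search(text)
    if match is None:
        # No complete thinking block found: treat everything as the answer.
        return "", text.strip()
    reasoning = match.group(1).strip()
    # The answer is whatever remains once the thinking block is removed.
    answer = (text[:match.start()] + text[match.end():]).strip()
    return reasoning, answer


# Hypothetical sample text, written by hand for illustration.
sample = (
    "<think>\nsin(x) = -2 is impossible, so only x = 0 remains.\n</think>\n"
    "The only solution is x = 0."
)
reasoning, answer = split_thinking(sample)
print(answer)
```

When streaming, the chunks would first need to be joined into one string before calling a helper like this.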

In [None]:
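from openai import OpenAI

# Point the OpenAI client at the locally running NIM endpoint.
# The client requires an api_key value, but it is not used here.
client = OpenAI(
  base_url = "http://0.0.0.0:8000/v1",
  api_key = "not used"
)

# The system prompt "detailed thinking on" enables thinking mode.
completion = client.chat.completions.create(
  model="nvidia/llama-3.3-nemotron-super-49b-v1",
  messages=[
    {"role":"system","content":"detailed thinking on"},
    {"role":"user","content":"Solve x*(sin(x)+2)=0"}
  ],
  temperature=0.6,
  top_p=0.95,
  max_tokens=32768,
  frequency_penalty=0,
  presence_penalty=0,
  stream=True
)

# Print each streamed chunk as it arrives, skipping chunks with no text.
for chunk in completion:
  if chunk.choices[0].delta.content is not None:
    print(chunk.choices[0].delta.content, end="")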

<think>
Okay, so I need to solve the equation x*(sin(x) + 2) = 0. Hmm, let me think. I remember from algebra that if a product of two things is zero, then at least one of them has to be zero. That's the zero product property. So, in this case, either x = 0 or sin(x) + 2 = 0. Let me write that down:

1. x = 0
2. sin(x) + 2 = 0

Alright, starting with the first one, x = 0. That seems straightforward. If x is 0, then plugging back into the original equation: 0*(sin(0) + 2) = 0*(0 + 2) = 0*2 = 0. Yep, that works. So x = 0 is definitely a solution.

Now, the second part: sin(x) + 2 = 0. Let me solve for sin(x). Subtract 2 from both sides: sin(x) = -2. Wait a minute, sin(x) equals -2? But the sine function has a range of [-1, 1]. That means sin(x) can never be less than -1 or greater than 1. So sin(x) = -2 is impossible. Therefore, there are no solutions from the second equation.

So, putting it all together, the only solution is x = 0. Let me double-check. If x is 0, then the equation holds

### Detailed Thinking Mode Off

For our second example, we'll run the model with detailed thinking "off". With the system prompt set to `detailed thinking off`, the model behaves as a typical instruction-tuned model: it begins generating the final response immediately and produces no thinking tokens. This mode is most effective for tool calling, chat applications, and other use cases where a direct response is preferred.
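Since the two modes differ only in the system prompt and the sampling settings, the switch can be captured in one place. The sketch below is an illustration rather than part of the original notebook: `build_request` is a hypothetical helper, and the per-mode sampling values simply mirror the ones used in this notebook's two cells.

```python
from typing import Any, Dict

MODEL = "nvidia/llama-3.3-nemotron-super-49b-v1"


def build_request(prompt: str, thinking: bool) -> Dict[str, Any]:
    """Build keyword arguments for a chat completion in the chosen mode."""
    system = "detailed thinking on" if thinking else "detailed thinking off"
    params: Dict[str, Any] = {
        "model": MODEL,
        "messages": [
            {"role": "system", "content": system},
            {"role": "user", "content": prompt},
        ],
        "max_tokens": 32768,
        "stream": True,
    }
    if thinking:
        # Sampling values taken from the thinking-on cell of this notebook.
        params.update(temperature=0.6, top_p=0.95)
    else:
        # Value taken from the thinking-off cell of this notebook.
        params["temperature"] = 0
    return params


request = build_request("What is NVIDIA?", thinking=False)
print(request["messages"][0]["content"])
```

With a client configured as in the cells of this notebook, the result could be passed along as `client.chat.completions.create(**request)`.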

In [9]:
from openai import OpenAI

# Point the OpenAI client at the locally running NIM endpoint.
client = OpenAI(
  base_url = "http://0.0.0.0:8000/v1",
  api_key = "not used"
)

# The system prompt "detailed thinking off" disables thinking mode.
completion = client.chat.completions.create(
  model="nvidia/llama-3.3-nemotron-super-49b-v1",
  messages=[
    {"role":"system","content":"detailed thinking off"},
    {"role":"user","content":"What is NVIDIA?"}
  ],
  temperature=0,
  max_tokens=32768,
  frequency_penalty=0,
  presence_penalty=0,
  stream=True
)

# Print each streamed chunk as it arrives, skipping chunks with no text.
for chunk in completion:
  if chunk.choices[0].delta.content is not None:
    print(chunk.choices[0].delta.content, end="")

NVIDIA is a leading American technology company known for designing and manufacturing a wide range of products, but most notably for its graphics processing units (GPUs), which have become indispensable in various fields. Here's a breakdown of what NVIDIA is and what it does across its main areas of focus:

### 1. **Graphics Processing Units (GPUs) for Gaming**
- **Primary Use**: Enhancing gaming experiences by accelerating graphics rendering.
- **Products**: GeForce series (e.g., GeForce RTX 30 series) for consumers and enthusiasts.
- **Key Features**: High-resolution gaming, ray tracing, artificial intelligence (AI) enhanced graphics, and more.

### 2. **Professional Graphics (Quadro)**
- **Primary Use**: For professionals requiring high-end graphics capabilities (e.g., 3D modeling, video editing, engineering).
- **Products**: Quadro series, designed for reliability and performance in professional applications.

### 3. **Datacenter and AI Computing**
- **Primary Use**: Accelerating c