Fix `observability_evaluation_and_profiling` example notebook by dagardner-nv · Pull Request #1874 · NVIDIA/NeMo-Agent-Toolkit

dagardner-nv · 2026-04-16T18:31:11Z

Description

- This notebook was installing the `nvidia-nat-profiling` package, which was dropped in v1.5, causing the notebook to install nat v1.3.
- Replace the model with a nano model to avoid being rate limited during the eval steps.
- Update `migration-guide.md` to fix the profiler installation instructions.
- Replace broken documentation links (unrelated link-check errors found in CI).
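
A silent downgrade like the one described above can be caught with a version check at the top of the notebook. The following is a minimal sketch and not part of this PR: it assumes the distribution is named `nvidia-nat` (as in the install commands) and that versions are plain dotted numbers. The helper names `parse_version`, `is_at_least`, and `check_installed` are hypothetical.

```python
from importlib import metadata


def parse_version(version: str) -> tuple[int, ...]:
    """Turn '1.5.0' into (1, 5, 0), ignoring any non-numeric suffix such as 'rc1'."""
    parts = []
    for piece in version.split("."):
        digits = ""
        for ch in piece:
            if not ch.isdigit():
                break
            digits += ch
        if not digits:
            break
        parts.append(int(digits))
    return tuple(parts)


def is_at_least(installed: str, minimum: str) -> bool:
    """Return True when `installed` is the same as or newer than `minimum`."""
    return parse_version(installed) >= parse_version(minimum)


def check_installed(dist_name: str = "nvidia-nat", minimum: str = "1.5") -> str:
    """Describe whether the installed distribution meets the minimum version."""
    try:
        installed = metadata.version(dist_name)
    except metadata.PackageNotFoundError:
        return f"{dist_name} is not installed"
    if is_at_least(installed, minimum):
        return f"{dist_name} {installed} is OK"
    return f"{dist_name} {installed} is older than {minimum}; check the requested extras"


print(is_at_least("1.3.0", "1.5"))  # prints False
print(check_installed())
```

The comparison deliberately ignores pre-release suffixes, so it is only a coarse guard against resolving to an old major/minor release.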

By Submitting this PR I confirm:

I am familiar with the Contributing Guidelines.
We require that all contributors "sign-off" on their commits. This certifies that the contribution is your original work, or you have rights to submit it under the same license, or a compatible license.
- Any contribution which contains commits that are not Signed-Off will not be accepted.
When the PR is ready for review, new or existing tests cover these changes.
When the PR is ready for review, the documentation is up to date with these changes.

Summary by CodeRabbit

Documentation
- Updated NeMo Customizer links in the finetuning guide; revised packaging/install guidance to replace the profiling extra with profiler and document the eval/profiler split.
Examples
- Updated Microservices setup link and cleaned README whitespace/formatting.
Notebooks
- Switched profiling extra name to profiler and added eval where applicable; updated generated model/config defaults (model selection and token limits) and bumped notebook kernel/Python metadata.

review-notebook-app · 2026-04-16T18:31:17Z

Check out this pull request on ReviewNB

See visual diffs & provide feedback on Jupyter Notebooks.

Powered by ReviewNB

coderabbitai · 2026-04-16T18:31:26Z

Note

Reviews paused

It looks like this branch is under active development. To avoid overwhelming you with review comments due to an influx of new commits, CodeRabbit has automatically paused this review. You can configure this behavior by changing the reviews.auto_review.auto_pause_after_reviewed_commits setting.

Use the following commands to manage reviews:

@coderabbitai resume to resume automatic reviews.
@coderabbitai review to trigger a single review.

No actionable comments were generated in the recent review. 🎉

ℹ️ Recent review info

⚙️ Run configuration

Configuration used: Path: .coderabbit.yaml

Review profile: CHILL

Plan: Pro Plus

Run ID: fc64ed86-7c98-4480-8e64-dccdd02f305f

📥 Commits

Reviewing files that changed from the base of the PR and between 892c4c5 and 58ee904.

📒 Files selected for processing (1)

examples/notebooks/observability_evaluation_and_profiling.ipynb

🚧 Files skipped from review as they are similar to previous changes (1)

examples/notebooks/observability_evaluation_and_profiling.ipynb

Walkthrough

Documentation and notebooks updated: NeMo Customizer links now target the customizer landing page; NeMo Microservices setup link adjusted; profiling extra/subpackage renamed from profiling → profiler across docs and notebooks; one notebook’s LLM workflow YAML and notebook metadata were updated; minor README whitespace fixes.

Changes

| Cohort / File(s) | Summary |
| --- | --- |
| NeMo Customizer & Microservices links: `docs/source/improve-workflows/finetuning/dpo_with_nemo_customizer.md`, `examples/finetuning/dpo_tic_tac_toe/README.md` | Updated NeMo Customizer references to point to the customizer landing page; adjusted NeMo Microservices setup link to `latest/index.html`; small README whitespace standardization. |
| Profiling extra rename & notebooks/migration guide: `docs/source/resources/migration-guide.md`, `examples/notebooks/optimize_model_selection.ipynb`, `examples/notebooks/observability_evaluation_and_profiling.ipynb` | Replaced `profiling` with `profiler` in extras/package names, install examples, and notebook install checks; documented eval/profiler split and changed combined install examples to `nvidia-nat[eval, profiler]`. |
| Notebook LLM config & metadata: `examples/notebooks/observability_evaluation_and_profiling.ipynb` | Updated generated workflow YAML: `llms.nim_llm.model_name` → `nvidia/nemotron-3-nano-30b-a3b`, `max_tokens` 2048 → 16384, removed `context_window`; added `kernelspec` (`python3`) and bumped `language_info.version` 3.12.9 → 3.13.2. |

Estimated code review effort

🎯 3 (Moderate) | ⏱️ ~20 minutes

Important

Pre-merge checks failed

Please resolve all errors before merging. Addressing warnings is optional.

❌ Failed checks (1 inconclusive)

| Check name | Status | Explanation | Resolution |
| --- | --- | --- | --- |
| Title check | ❓ Inconclusive | The title is partially related to the changeset; it focuses on fixing the observability_evaluation_and_profiling notebook but does not convey the broader scope of updates including package renaming, link fixes, and migration guide corrections. | Consider updating the title to better reflect the main changes, such as 'Update nvidia-nat package references from profiling to profiler' or 'Fix observability notebook and update profiler package naming'. |

✅ Passed checks (2 passed)

| Check name | Status | Explanation |
| --- | --- | --- |
| Description Check | ✅ Passed | Check skipped - CodeRabbit’s high-level summary is enabled. |
| Docstring Coverage | ✅ Passed | No functions found in the changed files to evaluate docstring coverage. Skipping docstring coverage check. |

Warning

There were issues while running some tools. Please review the errors and either fix the tool's configuration or disable the tool if it's a critical failure.

🔧 Ruff (0.15.10)

examples/notebooks/observability_evaluation_and_profiling.ipynb

Unexpected end of JSON input

Comment `@coderabbitai help` to get the list of available commands and usage tips.

coderabbitai

Actionable comments posted: 1

🧹 Nitpick comments (2)

examples/notebooks/observability_evaluation_and_profiling.ipynb (2)
1883-1887: Use a portable kernelspec display name.

Line 1884 uses a local-environment label (.venv (3.13.2)), which can cause noisy notebook diffs and confusion on other machines. Prefer a generic display name (for example, Python 3).
Proposed metadata tweak
  "kernelspec": {
-   "display_name": ".venv (3.13.2)",
+   "display_name": "Python 3",
    "language": "python",
    "name": "python3"
  },
Also applies to: 1898-1898
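
The metadata tweak proposed above could also be applied programmatically before committing a notebook. This is only a sketch of that idea, not something the PR adds: it relies on the standard notebook layout (`metadata.kernelspec.display_name`), and the helper name `normalize_kernelspec` is hypothetical.

```python
import json


def normalize_kernelspec(notebook_json: str, display_name: str = "Python 3") -> str:
    """Return notebook JSON with a generic kernelspec display name.

    Only `display_name` is touched; `language` and `name` are left as-is.
    """
    notebook = json.loads(notebook_json)
    kernelspec = notebook.get("metadata", {}).get("kernelspec")
    if kernelspec is not None:
        kernelspec["display_name"] = display_name
    # indent=1 and ensure_ascii=False mirror how Jupyter usually writes notebooks.
    return json.dumps(notebook, indent=1, ensure_ascii=False) + "\n"


example = json.dumps({
    "cells": [],
    "metadata": {
        "kernelspec": {
            "display_name": ".venv (3.13.2)",
            "language": "python",
            "name": "python3",
        }
    },
    "nbformat": 4,
    "nbformat_minor": 5,
})
cleaned = json.loads(normalize_kernelspec(example))
print(cleaned["metadata"]["kernelspec"]["display_name"])  # prints "Python 3"
```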
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.

In `@examples/notebooks/observability_evaluation_and_profiling.ipynb` around lines
1883 - 1887, Replace the local-environment kernelspec display name with a
portable, generic one: update the "kernelspec" -> "display_name" value currently
set to ".venv (3.13.2)" to something like "Python 3" (and make the same change
for the other occurrence at the second kernelspec entry), leaving "language" and
"name" unchanged so the notebook metadata is stable across machines.
1104-1106: Lower max_tokens default for this evaluation workflow configuration.

The max_tokens: 16384 setting is significantly higher than recommended defaults (512–1024) for evaluation and interactive workloads. The evaluation dataset's longest response (Ark S12 Ultra tablet specifications) requires far fewer tokens. Reduce to 1024 or 2048 to align with best practices and minimize unnecessary token overhead during evaluation runs, then add a comment documenting when to increase this value for longer outputs.
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.

In `@examples/notebooks/observability_evaluation_and_profiling.ipynb` around lines
1104 - 1106, Reduce the max_tokens value in the evaluation workflow
configuration: change the current "max_tokens: 16384" to a lower default such as
1024 (or 2048 if you prefer), and add an inline comment next to the "max_tokens"
setting explaining that this default suits most evaluation/interactive workloads
and that it should be increased only for datasets or prompts that require much
longer outputs (e.g., known long-document generation). Reference the existing
keys "model_name", "temperature", and "max_tokens" to locate and update the
setting.
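
If this suggestion were adopted, the change amounts to rewriting one line of the generated YAML and attaching a comment. The sketch below illustrates that with a plain-text substitution. It assumes the config text uses a simple `max_tokens: <int>` line; the `temperature` value and the helper name `set_max_tokens` are placeholders, not taken from the notebook.

```python
import re


def set_max_tokens(yaml_text: str, value: int, note: str) -> str:
    """Rewrite every `max_tokens:` line to `value`, appending `note` as a YAML comment."""
    pattern = re.compile(r"^([ \t]*max_tokens:[ \t]*)\d+.*$", flags=re.MULTILINE)
    return pattern.sub(lambda m: f"{m.group(1)}{value}  # {note}", yaml_text)


# Hypothetical config fragment mirroring the keys named in the review comment.
config = """\
llms:
  nim_llm:
    model_name: nvidia/nemotron-3-nano-30b-a3b
    temperature: 0.0
    max_tokens: 16384
"""

updated = set_max_tokens(config, 1024, "raise for known long outputs")
print(updated)
```

A text substitution keeps the rest of the file byte-for-byte identical, which avoids the reformatting a full YAML round trip can introduce.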

🤖 Prompt for all review comments with AI agents

Verify each finding against the current code and only fix it if needed.

Inline comments:
In `@docs/source/improve-workflows/finetuning/dpo_with_nemo_customizer.md`:
- Line 22: Replace the incorrect NeMo Customizer URL
"embedding-customization-job.html" with the DPO-specific tutorial URL
"fine-tune/tutorials/dpo-customization-job.html" wherever it appears in the
document (e.g., the sentence starting "This guide covers Direct Preference
Optimization (DPO) training..." that currently links to
embedding-customization-job.html); also update the other occurrence noted (the
one referenced as "Also applies to: 950") to the same DPO URL so all links point
to the DPO customization job tutorial.

ℹ️ Review info

⚙️ Run configuration

Configuration used: Path: .coderabbit.yaml

Review profile: CHILL

Plan: Pro Plus

Run ID: 48c28db0-448d-421b-864e-757a37832f26

📥 Commits

Reviewing files that changed from the base of the PR and between b685539 and 958cf03.

📒 Files selected for processing (3)

docs/source/improve-workflows/finetuning/dpo_with_nemo_customizer.md
examples/finetuning/dpo_tic_tac_toe/README.md
examples/notebooks/observability_evaluation_and_profiling.ipynb

mnajafian-nv

Great work, conditional approval upon reviewing the inline suggestions :)

coderabbitai

🧹 Nitpick comments (1)

examples/notebooks/optimize_model_selection.ipynb (1)
179-184: Fix stale setup text to match the new profiler extra.

The install cell now uses nvidia-nat[langchain,profiler], but the setup bullet still says nvidia-nat[profiling] (Line 162). Please align the prose with the command to avoid copy/paste confusion.
✏️ Proposed doc update
-* The `nvidia-nat[profiling]` subpackage contains components for profiling and performance analysis.
+* The `nvidia-nat[profiler]` subpackage contains components for profiling and performance analysis.
As per coding guidelines: "Verify that documentation and comments are clear and comprehensive" and "Keep documentation in sync with code".
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.

In `@examples/notebooks/optimize_model_selection.ipynb` around lines 179 - 184,
The setup bullet text is stale and says "nvidia-nat[profiling]" while the
install cell uses "nvidia-nat[langchain,profiler]"; update the prose to match
the command by replacing the outdated extra name with
"nvidia-nat[langchain,profiler]" (or otherwise reflect both extras), ensuring
the documentation and the install cell (the pip command using
nvidia-nat[langchain,profiler]) are consistent.
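
Stale references like this one can be found mechanically by scanning every cell of a notebook for the old extra name. The following is a rough sketch, not part of the PR: it relies on the standard notebook JSON layout (`cells[*].source` as a string or list of lines), and the helper name `find_stale_extra` is hypothetical.

```python
import json


def find_stale_extra(notebook_json: str, stale: str = "nvidia-nat[profiling]") -> list[tuple[int, str]]:
    """Return (cell_index, cell_type) for every cell whose source mentions `stale`."""
    notebook = json.loads(notebook_json)
    hits = []
    for index, cell in enumerate(notebook.get("cells", [])):
        source = cell.get("source", "")
        # nbformat stores source either as one string or as a list of lines.
        text = source if isinstance(source, str) else "".join(source)
        if stale in text:
            hits.append((index, cell.get("cell_type", "unknown")))
    return hits


# Tiny stand-in notebook: stale prose in a markdown cell, updated install command in a code cell.
example = json.dumps({
    "cells": [
        {"cell_type": "markdown", "source": ["* The `nvidia-nat[profiling]` subpackage ...\n"]},
        {"cell_type": "code", "source": 'uv pip install "nvidia-nat[langchain,profiler]"'},
    ],
    "metadata": {},
})
print(find_stale_extra(example))  # prints [(0, 'markdown')]
```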

ℹ️ Review info

⚙️ Run configuration

Configuration used: Path: .coderabbit.yaml

Review profile: CHILL

Plan: Pro Plus

Run ID: 659ac785-5177-4b3c-8379-d1b44b465d04

📥 Commits

Reviewing files that changed from the base of the PR and between 958cf03 and 2262c0f.

📒 Files selected for processing (2)

docs/source/resources/migration-guide.md
examples/notebooks/optimize_model_selection.ipynb

coderabbitai

Actionable comments posted: 1

🤖 Prompt for all review comments with AI agents

Verify each finding against the current code and only fix it if needed.

Inline comments:
In `@examples/notebooks/observability_evaluation_and_profiling.ipynb`:
- Around line 600-603: The install line currently installs
"nvidia-nat[langchain,llama-index,phoenix,profiler]" but the notebook runs `nat
eval`; update the pip install invocation in the notebook cell (the string
containing uv pip install "nvidia-nat[langchain,llama-index,phoenix,profiler]")
to include the eval extra for evaluation workflows—use
"nvidia-nat[eval,profiler]" (or add eval alongside the existing extras if you
need langchain/llama-index/phoenix too), and update the accompanying echo
message to reflect the chosen extras so the notebook installs the runtime
dependency required for `nat eval`.
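
The check behind this comment, whether an install string declares the extras a notebook relies on, can be sketched in a few lines. This is illustrative only: it inspects the declared extras in the requirement string and knows nothing about transitive dependencies. The helper names `extras_of` and `missing_extras` are hypothetical.

```python
import re


def extras_of(requirement: str) -> set[str]:
    """Extract the extras from a requirement such as 'nvidia-nat[eval,profiler]'."""
    match = re.search(r"\[([^\]]*)\]", requirement)
    if match is None:
        return set()
    return {name.strip() for name in match.group(1).split(",") if name.strip()}


def missing_extras(requirement: str, required: set[str]) -> set[str]:
    """Return the required extras that the requirement string does not declare."""
    return required - extras_of(requirement)


install_target = "nvidia-nat[langchain,llama-index,phoenix,profiler]"
print(sorted(missing_extras(install_target, {"eval", "profiler"})))  # prints ['eval']
```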

ℹ️ Review info

⚙️ Run configuration

Configuration used: Path: .coderabbit.yaml

Review profile: CHILL

Plan: Pro Plus

Run ID: 25ec6843-b65c-444d-8710-1e1de6863f1d

📥 Commits

Reviewing files that changed from the base of the PR and between 2262c0f and 27fac7e.

📒 Files selected for processing (1)

examples/notebooks/observability_evaluation_and_profiling.ipynb

mnajafian-nv

LGTM!

dagardner-nv · 2026-04-16T21:19:47Z

/merge

…#1874)

- This notebook was installing the `nvidia-nat-profiling` package which was dropped in v1.5, causing the notebook to install nat v1.3
- Replace the model with a nano model to avoid being rate limited during the eval steps.
- Update `migration-guide.md` to fix profiler installation instructions.
- Replace broken documentation links (unrelated link check errors found in CI)

- I am familiar with the [Contributing Guidelines](https://github.com/NVIDIA/NeMo-Agent-Toolkit/blob/develop/docs/source/resources/contributing/index.md).
- We require that all contributors "sign-off" on their commits. This certifies that the contribution is your original work, or you have rights to submit it under the same license, or a compatible license.
- Any contribution which contains commits that are not Signed-Off will not be accepted.
- When the PR is ready for review, new or existing tests cover these changes.
- When the PR is ready for review, the documentation is up to date with these changes.

- **Documentation**
  - Updated NeMo Customizer links in the finetuning guide; revised packaging/install guidance to replace the profiling extra with profiler and document the eval/profiler split.
- **Examples**
  - Updated Microservices setup link and cleaned README whitespace/formatting.
- **Notebooks**
  - Switched profiling extra name to profiler and added eval where applicable; updated generated model/config defaults (model selection and token limits) and bumped notebook kernel/Python metadata.

Authors:
- David Gardner (https://github.com/dagardner-nv)

Approvers:
- https://github.com/mnajafian-nv
- Bryan Bednarski (https://github.com/bbednarski9)

URL: NVIDIA#1874

Signed-off-by: Yuchen Zhang <yuchenz@nvidia.com>

dagardner-nv added 5 commits April 16, 2026 10:39

Replace out-dated nvidia-nat-profiling package with nvidia-nat-profiler

7ed9eff

Signed-off-by: David Gardner <dagardner@nvidia.com>

Replace model

e881173

Signed-off-by: David Gardner <dagardner@nvidia.com>

Revert "Replace model"

2f38c5f

This reverts commit e881173. Signed-off-by: David Gardner <dagardner@nvidia.com>

Fix model configs

d26df9f

Signed-off-by: David Gardner <dagardner@nvidia.com>

Fix broken links

958cf03

Signed-off-by: David Gardner <dagardner@nvidia.com>

dagardner-nv self-assigned this Apr 16, 2026

dagardner-nv requested a review from a team as a code owner April 16, 2026 18:31

dagardner-nv added the bug (Something isn't working) and non-breaking (Non-breaking change) labels Apr 16, 2026

coderabbitai Bot reviewed Apr 16, 2026

Comment thread docs/source/improve-workflows/finetuning/dpo_with_nemo_customizer.md Outdated

dagardner-nv marked this pull request as draft April 16, 2026 18:45

dagardner-nv added 2 commits April 16, 2026 11:49

profiling is not a dependency group/extra for nvidia-nat, nor is it a…

fd6769b

… group/extra for nvidia-nat-eval Signed-off-by: David Gardner <dagardner@nvidia.com>

Update profiler package

2262c0f

Signed-off-by: David Gardner <dagardner@nvidia.com>

dagardner-nv marked this pull request as ready for review April 16, 2026 18:53

mnajafian-nv approved these changes Apr 16, 2026

Comment thread docs/source/improve-workflows/finetuning/dpo_with_nemo_customizer.md Outdated

Comment thread examples/notebooks/observability_evaluation_and_profiling.ipynb

Comment thread examples/notebooks/observability_evaluation_and_profiling.ipynb Outdated

coderabbitai Bot reviewed Apr 16, 2026

bbednarski9 reviewed Apr 16, 2026

Comment thread docs/source/improve-workflows/finetuning/dpo_with_nemo_customizer.md Outdated

Update examples/notebooks/observability_evaluation_and_profiling.ipynb

27fac7e

Co-authored-by: mnajafian-nv <mnajafian@nvidia.com> Signed-off-by: David Gardner <96306125+dagardner-nv@users.noreply.github.com>

coderabbitai Bot reviewed Apr 16, 2026

Comment thread examples/notebooks/observability_evaluation_and_profiling.ipynb Outdated

dagardner-nv added 3 commits April 16, 2026 13:18

Update nemo documentation links

099c13f

Signed-off-by: David Gardner <dagardner@nvidia.com>

Merge branch 'david-observe-eval-notebook' of github.com:dagardner-nv…

72e91d4

…/NeMo-Agent-Toolkit into david-observe-eval-notebook Signed-off-by: David Gardner <dagardner@nvidia.com>

Formatting

892c4c5

Signed-off-by: David Gardner <dagardner@nvidia.com>

bbednarski9 approved these changes Apr 16, 2026

Add missing eval package (previously pulled in as a transitive dep fr…

58ee904

…om profiler, but still good to declare it explicitly Signed-off-by: David Gardner <dagardner@nvidia.com>

dagardner-nv requested a review from mnajafian-nv April 16, 2026 20:34

mnajafian-nv approved these changes Apr 16, 2026

rapids-bot Bot merged commit 3f159aa into NVIDIA:develop Apr 16, 2026
17 checks passed

dagardner-nv deleted the david-observe-eval-notebook branch April 16, 2026 21:20
