Skip to content

Add E2E test for RagaAI Catalyst#1194

Merged
rapids-bot[bot] merged 16 commits intoNVIDIA:developfrom
dagardner-nv:david-test-ragaai-catalyst
Nov 20, 2025
Merged

Add E2E test for RagaAI Catalyst#1194
rapids-bot[bot] merged 16 commits intoNVIDIA:developfrom
dagardner-nv:david-test-ragaai-catalyst

Conversation

@dagardner-nv
Copy link
Contributor

@dagardner-nv dagardner-nv commented Nov 20, 2025

Description

  • Document the need to set NAT_SPAN_PREFIX=aiq
  • Update the documentation to reflect Catalyst UI changes, and the need to create the project in the Catalyst UI prior to running the workflow.
  • Remove the endpoint entry from config-catalyst.yml, CatalystTelemetryExporter has a reasonable default value for this, and overriding this with an environment variable shouldn't be required
  • Replace calls to print with logging calls
  • Add E2E test

By Submitting this PR I confirm:

  • I am familiar with the Contributing Guidelines.
  • We require that all contributors "sign-off" on their commits. This certifies that the contribution is your original work, or you have rights to submit it under the same license, or a compatible license.
    • Any contribution which contains commits that are not Signed-Off will not be accepted.
  • When the PR is ready for review, new or existing tests cover these changes.
  • When the PR is ready for review, the documentation is up to date with these changes.

Summary by CodeRabbit

  • Documentation

    • Expanded Catalyst setup with combined credentials/project step, step-by-step API key and project guidance, optional endpoint note, NAT_SPAN_PREFIX instructions, workflow run steps, and dashboard/dataset trace viewing.
  • New Features

    • Added optional debug mode for controlling local trace file writes.
    • Made Catalyst endpoint optional and documented customization.
  • Bug Fixes

    • Improved error logging to include exception context during trace export.
  • Tests

    • Added environment-aware fixtures and full Catalyst workflow integration tests with span-prefix compatibility and dataset polling/cleanup.

✏️ Tip: You can customize this high-level summary in your review settings.

…ault, update instructions on generating an API key

Signed-off-by: David Gardner <dagardner@nvidia.com>
Signed-off-by: David Gardner <dagardner@nvidia.com>
Signed-off-by: David Gardner <dagardner@nvidia.com>
Signed-off-by: David Gardner <dagardner@nvidia.com>
Signed-off-by: David Gardner <dagardner@nvidia.com>
Signed-off-by: David Gardner <dagardner@nvidia.com>
Signed-off-by: David Gardner <dagardner@nvidia.com>
Signed-off-by: David Gardner <dagardner@nvidia.com>
This reverts commit 275d000.

Signed-off-by: David Gardner <dagardner@nvidia.com>
This reverts commit d7b8dbc.

Signed-off-by: David Gardner <dagardner@nvidia.com>
…eep the traces won't be uploaded before the dataset is

Signed-off-by: David Gardner <dagardner@nvidia.com>
@dagardner-nv dagardner-nv self-assigned this Nov 20, 2025
@dagardner-nv dagardner-nv added improvement Improvement to existing functionality non-breaking Non-breaking change labels Nov 20, 2025
@coderabbitai
Copy link

coderabbitai bot commented Nov 20, 2025

Walkthrough

Updates add detailed Catalyst credential/project setup and a NAT_SPAN_PREFIX environment variable in the README, remove the Catalyst endpoint from config, introduce Catalyst-related test fixtures and an integration test (with a duplicated insertion), replace print/error logs with logger.exception, and add a debug_mode parameter to RAGATraceExporterOptWrite.__init__.

Changes

Cohort / File(s) Summary
Documentation
examples/observability/simple_calculator_observability/README.md
Reorganized Catalyst setup: merged credential/project steps, added substeps for API key creation, optional custom endpoint note, documented NAT_SPAN_PREFIX (set to aiq) and new workflow/run steps including returning to Catalyst dashboard.
Configuration
examples/observability/simple_calculator_observability/configs/config-catalyst.yml
Removed endpoint: ${CATALYST_ENDPOINT} from general.telemetry.tracing.catalyst.
Tests
examples/observability/simple_calculator_observability/tests/test_simple_calc_observability.py
Added aiq_compatibility_span_prefix fixture (temporarily sets NAT_SPAN_PREFIX and reloads span module), added importlib and os imports, and added an integration test_catalyst_full_workflow (appears duplicated in the diff) that runs the workflow and polls the Catalyst Dataset API.
Test fixtures (integration helpers)
packages/nvidia_nat_test/src/nat/test/plugin.py
Added session-scoped fixtures catalyst_keys, catalyst_project_name, and catalyst_dataset_name to read Catalyst creds, derive project name, produce dataset names, and teardown datasets if present.
Plugin implementation
packages/nvidia_nat_ragaai/src/nat/plugins/ragaai/mixin/ragaai_catalyst_mixin.py
Replaced print/error calls with logger.exception in trace preparation/export error paths and added debug_mode: bool = False parameter to RAGATraceExporterOptWrite.__init__ to control optional local trace file writing.

Sequence Diagram(s)

sequenceDiagram
  participant Test as Integration Test
  participant Exporter as RAGATraceExporterOptWrite
  participant Catalyst as Catalyst API / Dataset
  participant FS as Local File (optional)

  Note over Test,Exporter: Test runs workflow and emits OTEL spans
  Test->>Exporter: invoke exporter (spans)
  alt debug_mode = true
    Exporter->>FS: write trace file (local debug artifact)
  end
  Exporter->>Catalyst: send spans / create dataset
  Catalyst-->>Exporter: ack / dataset created
  Exporter-->>Test: export result
  Test->>Catalyst: poll Dataset API for dataset presence
  Catalyst-->>Test: dataset found / not found
Loading

Estimated code review effort

🎯 3 (Moderate) | ⏱️ ~25 minutes

  • Focus areas:
    • Remove duplicate test_catalyst_full_workflow insertion.
    • Verify aiq_compatibility_span_prefix correctly restores env and reloads modules without side effects.
    • Ensure RAGATraceExporterOptWrite.__init__ remains backward-compatible and debug_mode is honored where intended.
    • Confirm Catalyst dataset teardown in catalyst_dataset_name fixture is idempotent and error-tolerant.

Pre-merge checks and finishing touches

❌ Failed checks (1 warning)
Check name Status Explanation Resolution
Docstring Coverage ⚠️ Warning Docstring coverage is 41.67% which is insufficient. The required threshold is 80.00%. You can run @coderabbitai generate docstrings to improve docstring coverage.
✅ Passed checks (2 passed)
Check name Status Explanation
Description Check ✅ Passed Check skipped - CodeRabbit’s high-level summary is enabled.
Title check ✅ Passed The title 'Add E2E test for RagaAI Catalyst' is concise, descriptive, uses imperative mood, and accurately reflects the main objective of adding end-to-end test coverage for Catalyst workflows.
✨ Finishing touches
🧪 Generate unit tests (beta)
  • Create PR with unit tests
  • Post copyable unit tests in a comment

📜 Recent review details

Configuration used: Path: .coderabbit.yaml

Review profile: CHILL

Plan: Pro

📥 Commits

Reviewing files that changed from the base of the PR and between 4c757e5 and fc22457.

📒 Files selected for processing (1)
  • examples/observability/simple_calculator_observability/README.md (1 hunks)
🚧 Files skipped from review as they are similar to previous changes (1)
  • examples/observability/simple_calculator_observability/README.md

Thanks for using CodeRabbit! It's free for OSS, and your support helps us grow. If you like it, consider giving us a shout-out.

❤️ Share

Comment @coderabbitai help to get the list of available commands and usage tips.

dagardner-nv and others added 2 commits November 19, 2025 16:21
Signed-off-by: David Gardner <dagardner@nvidia.com>
@dagardner-nv dagardner-nv marked this pull request as ready for review November 20, 2025 00:29
@dagardner-nv dagardner-nv requested a review from a team as a code owner November 20, 2025 00:29
Copy link

@coderabbitai coderabbitai bot left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Actionable comments posted: 1

Caution

Some comments are outside the diff and can’t be posted inline due to platform limitations.

⚠️ Outside diff range comments (1)
packages/nvidia_nat_ragaai/src/nat/plugins/ragaai/mixin/ragaai_catalyst_mixin.py (1)

36-49: Align debug_mode behavior with documented semantics for local trace files

The docstrings for RAGATraceExporterOptWrite (lines 42–43), DynamicTraceExporterOptWrite (lines 163–164), and RagaAICatalystMixin.__init__ (lines 218–219) all consistently state:

  • When debug_mode is False (default) → create local rag_agent_traces.json.
  • When debug_mode is True → skip local file creation for cleaner operation.

However, the implementation at line 142 contradicts this:

if self.debug_mode:
    with open(os.path.join(os.getcwd(), 'rag_agent_traces.json'), 'w', encoding="utf-8") as f:
        json.dump(ragaai_trace, f, cls=TracerJSONEncoder, indent=2)

This means:

  • Default (debug_mode=False) does NOT create the file (contradicts docstring).
  • Setting debug_mode=True DOES enable file creation (contradicts documented intent to skip).

To match the documented behavior, invert the condition:

-                if self.debug_mode:
-                    with open(os.path.join(os.getcwd(), 'rag_agent_traces.json'), 'w', encoding="utf-8") as f:
-                        json.dump(ragaai_trace, f, cls=TracerJSONEncoder, indent=2)
+                if not self.debug_mode:
+                    with open(os.path.join(os.getcwd(), 'rag_agent_traces.json'), 'w', encoding="utf-8") as f:
+                        json.dump(ragaai_trace, f, cls=TracerJSONEncoder, indent=2)

Alternatively, if the implementation intent is correct and debug_mode=True should enable local artifacts, update all three docstrings to reflect that True enables file creation and False disables it.

Also applies to: 163–164, 218–219

🧹 Nitpick comments (4)
packages/nvidia_nat_test/src/nat/test/plugin.py (1)

350-379: Tighten Catalyst fixtures: unused dependency + teardown robustness

  • catalyst_project_name_fixture(catalyst_keys) intentionally depends on catalyst_keys but never uses it, which Ruff flags (ARG001). To keep the dependency while making intent clear, consider renaming the parameter or marking it ignored, e.g.:
-@pytest.fixture(name="catalyst_project_name")
-def catalyst_project_name_fixture(catalyst_keys) -> str:
+@pytest.fixture(name="catalyst_project_name")
+def catalyst_project_name_fixture(_catalyst_keys) -> str:  # noqa: ARG001
     return os.environ.get("NAT_CI_CATALYST_PROJECT_NAME", "nat-e2e")
  • In catalyst_dataset_name_fixture, any ImportError or API failure from ragaai_catalyst.Dataset during teardown will surface as a hard error rather than following the common fail_missing/skip convention used by other integration fixtures in this file (e.g., galileo_project_fixture, weave, langsmith_client). If you want consistent behavior, consider adding a fail_missing: bool parameter and handling missing ragaai_catalyst similarly:
-@pytest.fixture(name="catalyst_dataset_name")
-def catalyst_dataset_name_fixture(catalyst_project_name: str, project_name: str) -> str:
+@pytest.fixture(name="catalyst_dataset_name")
+def catalyst_dataset_name_fixture(catalyst_project_name: str,
+                                  project_name: str,
+                                  fail_missing: bool) -> str:
@@
-    from ragaai_catalyst import Dataset
-    ds = Dataset(catalyst_project_name)
-    if dataset_name in ds.list_datasets():
-        ds.delete_dataset(dataset_name)
+    try:
+        from ragaai_catalyst import Dataset
+        ds = Dataset(catalyst_project_name)
+        if dataset_name in ds.list_datasets():
+            ds.delete_dataset(dataset_name)
+    except ImportError as e:
+        reason = "Catalyst integration tests require the `ragaai_catalyst` package to be installed."
+        if fail_missing:
+            raise RuntimeError(reason) from e
+        pytest.skip(reason=reason)

This keeps CI-friendly cleanup while aligning with the existing integration-fixture patterns.

packages/nvidia_nat_ragaai/src/nat/plugins/ragaai/mixin/ragaai_catalyst_mixin.py (1)

59-152: Simplify logger.exception calls (drop redundant e and exc_info=True)

Functionally, the exception handling is fine and matches the guideline of using logger.exception when not re-raising. However, all of these calls:

logger.exception("Error in convert_json_format function: %s: %s", trace_id, e, exc_info=True)
...
logger.exception("Error converting trace %s: %s", trace_id, str(e), exc_info=True)
...
logger.exception("Error exporting spans: %s", e, exc_info=True)

are more verbose than necessary:

  • logger.exception already sets exc_info=True by default.
  • Including e (or str(e)) in the message is usually redundant, since the stack trace will show the exception type and message.

To address Ruff TRY401 and keep logs clean, you can simplify to:

-            except Exception as e:
-                logger.exception("Error in convert_json_format function: %s: %s", trace_id, e, exc_info=True)
+            except Exception:
+                logger.exception("Error in convert_json_format function for trace %s", trace_id)

and similarly for the other blocks, including the final catch-all in prepare_trace and the one in export_otel_spans.

This keeps full stack traces while avoiding redundant arguments and lint warnings.

Also applies to: 248-248

examples/observability/simple_calculator_observability/README.md (1)

212-237: Tidy ordered list numbering in Catalyst setup section

The new Catalyst instructions read cleanly, but the ordered list now has two 3. items:

  • 3. Set the NAT_SPAN_PREFIX environment variable…
  • 3. Run the workflow:

Markdown will auto-renumber, but for readability (and to avoid docs linters complaining), consider either:

  • Renumbering explicitly:
3. Set the NAT_SPAN_PREFIX environment variable to `aiq` for RagaAI Catalyst compatibility:
4. Run the workflow:

or

  • Making all list items 1. and relying on Markdown’s auto-numbering.

Content-wise, the NAT_SPAN_PREFIX step is great and aligns with the new tests.

examples/observability/simple_calculator_observability/tests/test_simple_calc_observability.py (1)

240-267: Catalyst E2E test looks good; consider minor observability/robustness tweaks

The end-to-end Catalyst test wiring is sound:

  • Uses catalyst_project_name / catalyst_dataset_name fixtures and catalyst_keys to ensure proper context.
  • Waits for ingestion with an initial sleep plus a bounded polling loop.
  • Verifies dataset existence via Dataset.list_datasets().

Two optional improvements you might consider:

  1. Reduce hard-coded initial sleep
    You already have a polling loop with a deadline; you could drop or shorten the initial await asyncio.sleep(5) and rely more on the loop to keep the test a bit snappier when Catalyst is fast.

  2. Handle Dataset import errors consistently
    If you decide to make the catalyst_dataset_name fixture follow a fail_missing/skip pattern (as suggested in plugin.py), you might want to mirror that here as well (wrapping the Dataset import in try/except ImportError and skipping when fail_missing is false), for consistent behavior across all Catalyst integration tests.

If you’re happy with current CI timings and assumptions about Catalyst availability, the current implementation is acceptable as-is.

📜 Review details

Configuration used: Path: .coderabbit.yaml

Review profile: CHILL

Plan: Pro

📥 Commits

Reviewing files that changed from the base of the PR and between b00007e and 98521c4.

📒 Files selected for processing (5)
  • examples/observability/simple_calculator_observability/README.md (1 hunks)
  • examples/observability/simple_calculator_observability/configs/config-catalyst.yml (0 hunks)
  • examples/observability/simple_calculator_observability/tests/test_simple_calc_observability.py (3 hunks)
  • packages/nvidia_nat_ragaai/src/nat/plugins/ragaai/mixin/ragaai_catalyst_mixin.py (5 hunks)
  • packages/nvidia_nat_test/src/nat/test/plugin.py (1 hunks)
💤 Files with no reviewable changes (1)
  • examples/observability/simple_calculator_observability/configs/config-catalyst.yml
🧰 Additional context used
📓 Path-based instructions (3)
**/*

⚙️ CodeRabbit configuration file

**/*: # Code Review Instructions

  • Ensure the code follows best practices and coding standards. - For Python code, follow
    PEP 20 and
    PEP 8 for style guidelines.
  • Check for security vulnerabilities and potential issues. - Python methods should use type hints for all parameters and return values.
    Example:
    def my_function(param1: int, param2: str) -> bool:
        pass
  • For Python exception handling, ensure proper stack trace preservation:
    • When re-raising exceptions: use bare raise statements to maintain the original stack trace,
      and use logger.error() (not logger.exception()) to avoid duplicate stack trace output.
    • When catching and logging exceptions without re-raising: always use logger.exception()
      to capture the full stack trace information.

Documentation Review Instructions - Verify that documentation and comments are clear and comprehensive. - Verify that the documentation doesn't contain any TODOs, FIXMEs or placeholder text like "lorem ipsum". - Verify that the documentation doesn't contain any offensive or outdated terms. - Verify that documentation and comments are free of spelling mistakes, ensure the documentation doesn't contain any

words listed in the ci/vale/styles/config/vocabularies/nat/reject.txt file, words that might appear to be
spelling mistakes but are listed in the ci/vale/styles/config/vocabularies/nat/accept.txt file are OK.

Misc. - All code (except .mdc files that contain Cursor rules) should be licensed under the Apache License 2.0,

and should contain an Apache License 2.0 header comment at the top of each file.

  • Confirm that copyright years are up-to date whenever a file is changed.

Referenced Documentation Contents

ci/vale/styles/config/vocabularies/nat/reject.txt:
Not directly related to PR objectives; no actionable changes for this PR. Reserved for broader policy checks.

...

Files:

  • packages/nvidia_nat_ragaai/src/nat/plugins/ragaai/mixin/ragaai_catalyst_mixin.py
  • examples/observability/simple_calculator_observability/README.md
  • packages/nvidia_nat_test/src/nat/test/plugin.py
  • examples/observability/simple_calculator_observability/tests/test_simple_calc_observability.py
packages/**/*

⚙️ CodeRabbit configuration file

packages/**/*: - This directory contains optional plugin packages for the toolkit, each should contain a pyproject.toml file. - The pyproject.toml file should declare a dependency on nvidia-nat or another package with a name starting
with nvidia-nat-. This dependency should be declared using ~=<version>, and the version should be a two
digit version (ex: ~=1.0).

  • Not all packages contain Python code, if they do they should also contain their own set of tests, in a
    tests/ directory at the same level as the pyproject.toml file.

Files:

  • packages/nvidia_nat_ragaai/src/nat/plugins/ragaai/mixin/ragaai_catalyst_mixin.py
  • packages/nvidia_nat_test/src/nat/test/plugin.py
examples/**/*

⚙️ CodeRabbit configuration file

examples/**/*: - This directory contains example code and usage scenarios for the toolkit, at a minimum an example should
contain a README.md or file README.ipynb.

  • If an example contains Python code, it should be placed in a subdirectory named src/ and should
    contain a pyproject.toml file. Optionally, it might also contain scripts in a scripts/ directory.
  • If an example contains YAML files, they should be placed in a subdirectory named configs/. - If an example contains sample data files, they should be placed in a subdirectory named data/, and should
    be checked into git-lfs.

Files:

  • examples/observability/simple_calculator_observability/README.md
  • examples/observability/simple_calculator_observability/tests/test_simple_calc_observability.py
🪛 Ruff (0.14.5)
packages/nvidia_nat_ragaai/src/nat/plugins/ragaai/mixin/ragaai_catalyst_mixin.py

59-59: Redundant exception object included in logging.exception call

(TRY401)


67-67: Redundant exception object included in logging.exception call

(TRY401)


75-75: Redundant exception object included in logging.exception call

(TRY401)


84-84: Redundant exception object included in logging.exception call

(TRY401)


91-91: Redundant exception object included in logging.exception call

(TRY401)


101-101: Redundant exception object included in logging.exception call

(TRY401)


108-108: Redundant exception object included in logging.exception call

(TRY401)


116-116: Redundant exception object included in logging.exception call

(TRY401)


133-133: Redundant exception object included in logging.exception call

(TRY401)


146-146: Redundant exception object included in logging.exception call

(TRY401)


149-149: Consider moving this statement to an else block

(TRY300)


151-151: Redundant exception object included in logging.exception call

(TRY401)


248-248: Redundant exception object included in logging.exception call

(TRY401)

packages/nvidia_nat_test/src/nat/test/plugin.py

363-363: Unused function argument: catalyst_keys

(ARG001)

@bbednarski9
Copy link
Contributor

It seems that we might be missing otel mixin tests for pycls: RagaAICatalystMixin. Thoughts @dagardner-nv ?

Signed-off-by: David Gardner <dagardner@nvidia.com>
…NeMo-Agent-Toolkit into david-test-ragaai-catalyst

Signed-off-by: David Gardner <dagardner@nvidia.com>
Copy link

@coderabbitai coderabbitai bot left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Actionable comments posted: 1

Caution

Some comments are outside the diff and can’t be posted inline due to platform limitations.

⚠️ Outside diff range comments (1)
examples/observability/simple_calculator_observability/README.md (1)

38-44: Update Prerequisites to include Catalyst.

Line 44 lists supported observability platforms but omits Catalyst, which is now fully documented in the Catalyst Integration section. The Prerequisites should reflect all major integrations to guide users correctly.

Apply this diff to add Catalyst to the Prerequisites:

-3. **Observability platform**: Access to at least one of the supported platforms (Phoenix, Langfuse, LangSmith, Weave, or Patronus)
+3. **Observability platform**: Access to at least one of the supported platforms (Phoenix, Langfuse, LangSmith, Weave, Patronus, or RagaAI Catalyst)
📜 Review details

Configuration used: Path: .coderabbit.yaml

Review profile: CHILL

Plan: Pro

📥 Commits

Reviewing files that changed from the base of the PR and between 98521c4 and 4c757e5.

📒 Files selected for processing (1)
  • examples/observability/simple_calculator_observability/README.md (1 hunks)
🧰 Additional context used
📓 Path-based instructions (2)
**/*

⚙️ CodeRabbit configuration file

**/*: # Code Review Instructions

  • Ensure the code follows best practices and coding standards. - For Python code, follow
    PEP 20 and
    PEP 8 for style guidelines.
  • Check for security vulnerabilities and potential issues. - Python methods should use type hints for all parameters and return values.
    Example:
    def my_function(param1: int, param2: str) -> bool:
        pass
  • For Python exception handling, ensure proper stack trace preservation:
    • When re-raising exceptions: use bare raise statements to maintain the original stack trace,
      and use logger.error() (not logger.exception()) to avoid duplicate stack trace output.
    • When catching and logging exceptions without re-raising: always use logger.exception()
      to capture the full stack trace information.

Documentation Review Instructions - Verify that documentation and comments are clear and comprehensive. - Verify that the documentation doesn't contain any TODOs, FIXMEs or placeholder text like "lorem ipsum". - Verify that the documentation doesn't contain any offensive or outdated terms. - Verify that documentation and comments are free of spelling mistakes, ensure the documentation doesn't contain any

words listed in the ci/vale/styles/config/vocabularies/nat/reject.txt file, words that might appear to be
spelling mistakes but are listed in the ci/vale/styles/config/vocabularies/nat/accept.txt file are OK.

Misc. - All code (except .mdc files that contain Cursor rules) should be licensed under the Apache License 2.0,

and should contain an Apache License 2.0 header comment at the top of each file.

  • Confirm that copyright years are up-to date whenever a file is changed.

Files:

  • examples/observability/simple_calculator_observability/README.md
examples/**/*

⚙️ CodeRabbit configuration file

examples/**/*: - This directory contains example code and usage scenarios for the toolkit, at a minimum an example should
contain a README.md or file README.ipynb.

  • If an example contains Python code, it should be placed in a subdirectory named src/ and should
    contain a pyproject.toml file. Optionally, it might also contain scripts in a scripts/ directory.
  • If an example contains YAML files, they should be placed in a subdirectory named configs/. - If an example contains sample data files, they should be placed in a subdirectory named data/, and should
    be checked into git-lfs.

Files:

  • examples/observability/simple_calculator_observability/README.md
🔇 Additional comments (2)
examples/observability/simple_calculator_observability/README.md (2)

212-217: Verify config-catalyst.yml supports project name configuration.

Line 217 instructs users to update the project name in config-catalyst.yml, but this documentation doesn't clarify the exact config field or structure. Given that the PR removed the endpoint field, ensure the config file documentation aligns with what fields are actually configurable.

Verify that:

  1. The config-catalyst.yml file contains a configurable project name field (or clarify how project name is set via environment variable)
  2. Update the documentation if project name is set via environment variable instead of config file

46-52: Ignore this review comment—the suggested changes are incorrect.

The current Installation section is correct and complete. The single command uv pip install -e examples/observability/simple_calculator_observability automatically installs all required dependencies, including nat_simple_calculator, which is already declared in pyproject.toml.

The suggested diff has two problems:

  1. uv pip install -e ".[ragaai]" — The [ragaai] optional extra does not exist in pyproject.toml. RagaAI Catalyst is optional and only required if users want to use that specific observability platform. The README already provides complete Catalyst setup instructions (including environment variable configuration) in the "RagaAI Catalyst Integration" section.

  2. uv pip install -e examples/getting_started/simple_calculator — This is redundant. The nat_simple_calculator package is already declared as a dependency and will be installed automatically.

Likely an incorrect or invalid review comment.

@dagardner-nv
Copy link
Contributor Author

It seems that we might be missing otel mixin tests for pycls: RagaAICatalystMixin. Thoughts @dagardner-nv ?

The RagaAICatalystExporter class which inherits from both RagaAICatalystMixin and OtelSpanExporter, is being tested here as an E2E test.

This PR is part of an effort to add E2E test coverage for all of our examples, this PR adds that for the Catalyst section of the examples/observability/simple_calculator_observability example.

However it is lacking a unittest, which is outside the scope of this PR (although I did end up having to fix some stuff here and there to get the example working).

Signed-off-by: David Gardner <dagardner@nvidia.com>
@dagardner-nv
Copy link
Contributor Author

/ok to test fc22457

@dagardner-nv
Copy link
Contributor Author

/merge

@rapids-bot rapids-bot bot merged commit b5b4516 into NVIDIA:develop Nov 20, 2025
42 of 47 checks passed
@dagardner-nv dagardner-nv deleted the david-test-ragaai-catalyst branch November 20, 2025 22:45
saglave pushed a commit to snps-scm13/SNPS-NeMo-Agent-Toolkit that referenced this pull request Dec 11, 2025
* Document the need to set `NAT_SPAN_PREFIX=aiq`
* Update the documentation to reflect Catalyst UI changes, and the need to create the project in the Catalyst UI prior to running the workflow.
* Remove the `endpoint` entry from `config-catalyst.yml`, `CatalystTelemetryExporter` has a reasonable default value for this, and overriding this with an environment variable shouldn't be required
* Replace calls to `print` with logging calls
* Add E2E test

## By Submitting this PR I confirm:
- I am familiar with the [Contributing Guidelines](https://github.com/NVIDIA/NeMo-Agent-Toolkit/blob/develop/docs/source/resources/contributing.md).
- We require that all contributors "sign-off" on their commits. This certifies that the contribution is your original work, or you have rights to submit it under the same license, or a compatible license.
  - Any contribution which contains commits that are not Signed-Off will not be accepted.
- When the PR is ready for review, new or existing tests cover these changes.
- When the PR is ready for review, the documentation is up to date with these changes.

## Summary by CodeRabbit

* **Documentation**
  * Expanded Catalyst setup with combined credentials/project step, step-by-step API key and project guidance, optional endpoint note, NAT_SPAN_PREFIX instructions, workflow run steps, and dashboard/dataset trace viewing.

* **New Features**
  * Added optional debug mode for controlling local trace file writes.
  * Made Catalyst endpoint optional and documented customization.

* **Bug Fixes**
  * Improved error logging to include exception context during trace export.

* **Tests**
  * Added environment-aware fixtures and full Catalyst workflow integration tests with span-prefix compatibility and dataset polling/cleanup.

<sub>✏️ Tip: You can customize this high-level summary in your review settings.</sub>

Authors:
  - David Gardner (https://github.com/dagardner-nv)

Approvers:
  - Bryan Bednarski (https://github.com/bbednarski9)

URL: NVIDIA#1194
Signed-off-by: Sangharsh Aglave <aglave@synopsys.com>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

improvement Improvement to existing functionality non-breaking Non-breaking change

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants