Skip to content

v2.3.0

Choose a tag to compare

@shubhadeepd shubhadeepd released this 15 Oct 05:15
· 451 commits to main since this release
ba87c08

Version 2.3.0 (2025-10-14)

This release adds RTX6000 platform support, adds deployment by using NIM operator, improves vector database pluggability with the blueprint, and other changes.

Added

Changed

  • Migrated default LLM model for reflection to llama-3.3-nemotron-super-49b instead of mixtral-8x22b-instruct-v01.
  • Refactored rag-playground code
    • Use React end to end. Next.js dependencies were deprecated.
    • More developer friendly and intuitive look and feel.
    • rag-playground service is renamed to rag-frontend
  • Refactored helm chart support
    • Expanded and reorganized Helm chart configuration, enabling granular control over service components, resource settings, and observability (tracing, metrics).
    • Introduced ConfigMap and service definitions to facilitate improved application deployment flexibility.
    • Implemented refined service account and secret management in Helm templates.
    • Added a new Helm values file for nim-operator to configure LLM model environment and component toggles.

Fixed

  • Fixed support for long audio file ingestion.
  • Fixed support to ingest images without charts/tables.
  • Fixed requirement of rebuilding rag frontend container when LLM model name was changed.

Removed

  • Removed consistency level configuration support for Milvus.
  • Removed EMBEDDING_NIM_ENDPOINT and EMBEDDING_NIM_MODEL_NAME environment variables for nvingest.
  • Removed unused ENABLE_MULTITURN environment variable from rag-server.
  • Removed ENABLE_NEMOTRON_THINKING environment variable from rag-server.