v2.2.0
This release adds B200 platform support, a native Python API, and major enhancements for multimodal and metadata features. It also improves deployment flexibility and customization across the RAG blueprint.
Added
- Support deploying the blueprint on B200 platform.
- Support for native python API
- Refactoring code and directory to support python API
- Better modularization for easier customization
- Moved to
uvas the package manager for this project
- Added support for configurable vector store consistency levels (Bounded/Strong/Session) to optimize retrieval performance vs accuracy trade-offs.
- Capability to add custom metadata for files and metadata based filtering
- Documentation of using Multi Instance GPUs. Reduces minimum GPU requirement for helm charts to 3xH100.
- Multi collection based retrieval support
- Audio files (.mp3 and .wav) support
- Support of using Vision Language Model based generation for charts and images
- Support for generating summaries of uploaded files
- Sample user interface enhancements
- Support for non-blocking file upload
- More efficient error reporting for ingestion failures
- Prompt customization support without rebuilding images
- Added support to enable infographics, which improves accuracy for documents containing text in image format.
- See this guide for details
- New customizations
- How to support non nvingest based ingestion + retrieval
- How to enable CPU based milvus
- How to enable nemoretriever-parse as an alternate PDF parser
- How to use standalone nv-ingest python client to do ingestion
- Nvidia AI Workbench support
Changed
- Changed API schema to support newly added features
- POST /collections to be deprecated in favour of POST /collection for ingestor-server
- New endpoint GET /summary added for rag-server
- Metadata information available as part of GET /collections and GET /documents API
- Check out migration guide for detailed changes at API level
- Optimized batch mode ingestion support to improve perf for multi user concurrent file upload.
Known Issues
Check out this section to understand the known issues present for this release.