Conversation

vinay-raman
Contributor

@vinay-raman vinay-raman commented Oct 14, 2024

NeMo retriever synthetic data generation:

A customer provided the following feedback:
Documents that have preset IDs need to keep their original IDs in the generated results.

Fix:
The rawdoc format accepts an "_id" key that can carry a preset document ID. These IDs are preserved in the generated results.
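
For illustration, a minimal sketch of the behavior described above, not the code in this PR. Only the "_id" key comes from the description; the "text" field, the UUID fallback for documents without a preset ID, the helper names `assign_doc_ids` and `attach_source_ids`, and the shape of the output records are all assumptions made for the example.

```python
import uuid


def assign_doc_ids(raw_docs):
    """Return copies of raw_docs in which every document has an "_id".

    A preset "_id" is kept unchanged. Documents without one get a
    generated ID (assumption: a random UUID hex string).
    """
    docs = []
    for doc in raw_docs:
        doc = dict(doc)  # copy so the caller's input is not mutated
        if not doc.get("_id"):
            doc["_id"] = uuid.uuid4().hex
        docs.append(doc)
    return docs


def attach_source_ids(docs, generate_questions):
    """Tag each generated question with the "_id" of its source document.

    generate_questions is a hypothetical callable that takes the document
    text and returns a list of question strings.
    """
    results = []
    for doc in docs:
        for question in generate_questions(doc["text"]):
            results.append({"question": question, "doc_id": doc["_id"]})
    return results
```

With input such as `[{"_id": "doc-001", "text": "..."}, {"text": "..."}]`, the first document keeps `doc-001` in every generated record, and only the second receives a generated ID.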

@dglogo dglogo self-requested a review October 15, 2024 00:21
@dglogo dglogo merged commit aa76878 into NVIDIA:main Oct 15, 2024
anniesurla pushed a commit to anniesurla/GenerativeAIExamples that referenced this pull request Jun 5, 2025