docs: add retriever docs by akoumpa · Pull Request #1407 · NVIDIA-NeMo/Automodel

akoumpa · 2026-02-27T15:20:08Z

What does this PR do ?

Add a one line overview of what this PR aims to accomplish.

Changelog

Add specific line by line info of high level changes in this PR.

Before your PR is "Ready for review"

Pre checks:

Make sure you read and followed Contributor guidelines
Did you write any new necessary tests?
Did you add or update any necessary documentation?

If you haven't finished some of the above items you can still open "Draft" PR.

Additional Information

Related to # (issue)

Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com>

copy-pr-bot · 2026-02-27T15:20:12Z

This pull request requires additional validation before any workflows can run on NVIDIA's runners.

Pull request vetters can view their responsibilities here.

Contributors can view more details about this message here.

Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com>

akoumpa · 2026-03-04T18:08:44Z

/ok to test 0346ed6

copy-pr-bot · 2026-03-04T18:09:21Z

/ok to test 0346ed6

@akoumpa, there was an error processing your request: E2

See the following link for more information: https://docs.gha-runners.nvidia.com/cpr/e/2/

rnyak · 2026-03-05T14:37:04Z

+INFO:root:step 30 | epoch 0 | val_loss 1.1230 | val_acc1 0.7820 | val_mrr 0.8450
+```
+
+## Using the Fine-Tuned Model


better to change this to transformers usage script. as in here:

https://huggingface.co/nvidia/llama-nemotron-embed-1b-v2#transformers-usage

rnyak · 2026-03-05T14:38:30Z

+  dataset:
+    _target_: nemo_automodel.components.datasets.llm.make_retrieval_dataset
+    data_dir_list:
+      - /path/to/train_data.json


here better to use the predefined path that we added lately to automatically download the example dataset from HF hub.

see the latest yaml file pls: https://github.com/NVIDIA-NeMo/Automodel/blob/main/examples/biencoder/llama3_2_1b_biencoder.yaml#L55

rnyak · 2026-03-05T14:39:53Z

+
+```yaml
+model:
+  _target_: nemo_automodel.NeMoAutoModelBiencoder.from_pretrained


let's not merge this MR yet, we can do once MR 1449 is finalized, we might need to update class names etc.

jgerh

Completed tech pubs review of docs/guides/overview.md and docs/guides/retriever/embedding-finetuning.md and provided a few copyedits. docs/index.md reviewed in #1573.

Co-authored-by: jgerh <163925524+jgerh@users.noreply.github.com>

add retriever docs

a5098a3

Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com>

akoumpa added the docs-only With great power comes great responsibility. label Mar 3, 2026

akoumpa added 2 commits March 3, 2026 11:30

improve

4f090ba

Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com>

remove reranker

17be0e8

Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com>

akoumpa marked this pull request as ready for review March 4, 2026 18:08

akoumpa requested review from HuiyingLi, adil-a, hemildesai and jgerh as code owners March 4, 2026 18:08

akoumpa enabled auto-merge (squash) March 4, 2026 18:08

rnyak reviewed Mar 5, 2026

View reviewed changes

jgerh reviewed Mar 18, 2026

View reviewed changes

akoumpa marked this pull request as draft March 18, 2026 18:50

auto-merge was automatically disabled March 18, 2026 18:50
Pull request was converted to draft

Apply suggestions from code review

edb1b39

Co-authored-by: jgerh <163925524+jgerh@users.noreply.github.com>

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

docs: add retriever docs#1407

docs: add retriever docs#1407
akoumpa wants to merge 4 commits intomainfrom
akoumparouli/docs_add_retriever

akoumpa commented Feb 27, 2026

Uh oh!

copy-pr-bot Bot commented Feb 27, 2026

Uh oh!

akoumpa commented Mar 4, 2026

Uh oh!

copy-pr-bot Bot commented Mar 4, 2026

Uh oh!

rnyak Mar 5, 2026

Uh oh!

rnyak Mar 5, 2026

Uh oh!

rnyak Mar 5, 2026

Uh oh!

jgerh left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

akoumpa commented Feb 27, 2026

What does this PR do ?

Changelog

Before your PR is "Ready for review"

Additional Information

Uh oh!

copy-pr-bot Bot commented Feb 27, 2026

Uh oh!

akoumpa commented Mar 4, 2026

Uh oh!

copy-pr-bot Bot commented Mar 4, 2026

Uh oh!

rnyak Mar 5, 2026

Choose a reason for hiding this comment

Uh oh!

rnyak Mar 5, 2026

Choose a reason for hiding this comment

Uh oh!

rnyak Mar 5, 2026

Choose a reason for hiding this comment

Uh oh!

jgerh left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants