Add validation tests for ESM2 fine-tuning benchmark partial-conv #1010

nvmvle · 2025-07-30T12:01:41Z

Description

This PR adds a comprehensive test validation section to the ESM2 fine-tuning benchmark partial-conv configuration. The tests define expected baseline metrics and ranges for model performance validation during CI runs.

The validation includes:

Exit code validation (expecting successful completion with code 0)
Performance baselines for key metrics:
- consumed_samples
- val_loss
- val_acc
- TFLOPS per GPU
- reduced_train_loss

Type of changes

Bug fix (non-breaking change which fixes an issue)
New feature (non-breaking change which adds functionality)
Refactor
Documentation update
Other (please describe):

CI Pipeline Configuration

Configure CI behavior by applying the relevant labels:

SKIP_CI - Skip all continuous integration tests
INCLUDE_NOTEBOOKS_TESTS - Execute notebook validation tests in pytest
INCLUDE_SLOW_TESTS - Execute tests labelled as slow in pytest for extensive testing

Note

By default, the notebooks validation tests are skipped unless explicitly enabled.

Authorizing CI Runs

We use copy-pr-bot to manage authorization of CI
runs on NVIDIA's compute resources.

If a pull request is opened by a trusted user and contains only trusted changes, the pull request's code will
automatically be copied to a pull-request/ prefixed branch in the source repository (e.g. pull-request/123)
If a pull request is opened by an untrusted user or contains untrusted changes, an NVIDIA org member must leave an
/ok to test comment on the pull request to trigger CI. This will need to be done for each new commit.

Pre-submit Checklist

I have tested these changes locally
I have updated the documentation accordingly
I have added/updated tests as needed
All existing tests pass successfully

Signed-off-by: My Le mvle@nvidia.com

copy-pr-bot · 2025-07-30T12:01:45Z

This pull request requires additional validation before any workflows can run on NVIDIA's runners.

Pull request vetters can view their responsibilities here.

Contributors can view more details about this message here.

…file

jwilber · 2025-08-01T18:49:51Z

/ok to test 8e0042b

nvmvle · 2025-08-04T03:36:04Z

/ok to test dfd6913

jwilber · 2025-08-14T18:24:32Z

/ok to test e734275

Signed-off-by: My Le <mvle@nvidia.com>

nvmvle · 2025-08-15T01:03:25Z

/ok to test 42918f6

jwilber · 2025-08-15T03:10:52Z

/ok to test 42918f6

codecov-commenter · 2025-08-15T04:45:40Z

Codecov Report

✅ All modified and coverable lines are covered by tests.
✅ Project coverage is 81.29%. Comparing base (21b1442) to head (42918f6).
✅ All tests successful. No failed tests found.

Additional details and impacted files

@@           Coverage Diff           @@
##             main    #1010   +/-   ##
=======================================
  Coverage   81.29%   81.29%           
=======================================
  Files         152      152           
  Lines       10271    10271           
=======================================
  Hits         8350     8350           
  Misses       1921     1921

…DIA#1010) ### Description This PR adds a comprehensive test validation section to the ESM2 fine-tuning benchmark partial-conv configuration. The tests define expected baseline metrics and ranges for model performance validation during CI runs. The validation includes: - Exit code validation (expecting successful completion with code 0) - Performance baselines for key metrics: - `consumed_samples` - `val_loss` - `val_acc` - `TFLOPS per GPU` - `reduced_train_loss` ### Type of changes - [ ] Bug fix (non-breaking change which fixes an issue) - [x] New feature (non-breaking change which adds functionality) - [ ] Refactor - [ ] Documentation update - [ ] Other (please describe): ### CI Pipeline Configuration Configure CI behavior by applying the relevant labels: - [SKIP_CI](https://github.com/NVIDIA/bionemo-framework/blob/main/docs/docs/user-guide/contributing/contributing.md#skip_ci) - Skip all continuous integration tests - [INCLUDE_NOTEBOOKS_TESTS](https://github.com/NVIDIA/bionemo-framework/blob/main/docs/docs/user-guide/contributing/contributing.md#include_notebooks_tests) - Execute notebook validation tests in pytest - [INCLUDE_SLOW_TESTS](https://github.com/NVIDIA/bionemo-framework/blob/main/docs/docs/user-guide/contributing/contributing.md#include_slow_tests) - Execute tests labelled as slow in pytest for extensive testing > [!NOTE] > By default, the notebooks validation tests are skipped unless explicitly enabled. #### Authorizing CI Runs We use [copy-pr-bot](https://docs.gha-runners.nvidia.com/apps/copy-pr-bot/#automation) to manage authorization of CI runs on NVIDIA's compute resources. * If a pull request is opened by a trusted user and contains only trusted changes, the pull request's code will automatically be copied to a pull-request/ prefixed branch in the source repository (e.g. pull-request/123) * If a pull request is opened by an untrusted user or contains untrusted changes, an NVIDIA org member must leave an `/ok to test` comment on the pull request to trigger CI. This will need to be done for each new commit. ### Pre-submit Checklist - [x] I have tested these changes locally - [ ] I have updated the documentation accordingly - [ ] I have added/updated tests as needed - [x] All existing tests pass successfully Signed-off-by: My Le <mvle@nvidia.com> --------- Signed-off-by: My Le <mvle@nvidia.com> Co-authored-by: Jared Wilber <jwilber@nvidia.com>

nvmvle requested review from jwilber and trvachov July 30, 2025 12:01

nvmvle requested review from dorotat-nv and pstjohn as code owners July 30, 2025 12:01

pstjohn approved these changes Jul 30, 2025

View reviewed changes

Add static tests for ESM2 finetuning benchmark configuration in YAML …

589324c

…file

nvmvle force-pushed the mvle/jet-test-esm2-finetune branch from 54f15a1 to 589324c Compare July 30, 2025 13:32

jwilber approved these changes Aug 1, 2025

View reviewed changes

Merge branch 'main' into mvle/jet-test-esm2-finetune

8e0042b

jwilber enabled auto-merge August 1, 2025 18:50

Merge branch 'main' into mvle/jet-test-esm2-finetune

dfd6913

trvachov approved these changes Aug 6, 2025

View reviewed changes

nvmvle and others added 3 commits August 7, 2025 12:18

Merge branch 'main' into mvle/jet-test-esm2-finetune

8025525

Merge branch 'main' into mvle/jet-test-esm2-finetune

e08c50a

Merge branch 'main' into mvle/jet-test-esm2-finetune

e734275

fix precommit

42918f6

Signed-off-by: My Le <mvle@nvidia.com>

jwilber added this pull request to the merge queue Aug 15, 2025

Merged via the queue into main with commit db065c1 Aug 15, 2025
14 checks passed

jwilber deleted the mvle/jet-test-esm2-finetune branch August 15, 2025 06:13

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Add validation tests for ESM2 fine-tuning benchmark partial-conv #1010

Add validation tests for ESM2 fine-tuning benchmark partial-conv #1010

Uh oh!

nvmvle commented Jul 30, 2025

Uh oh!

copy-pr-bot bot commented Jul 30, 2025

Uh oh!

jwilber commented Aug 1, 2025

Uh oh!

nvmvle commented Aug 4, 2025

Uh oh!

jwilber commented Aug 14, 2025

Uh oh!

nvmvle commented Aug 15, 2025

Uh oh!

jwilber commented Aug 15, 2025

Uh oh!

codecov-commenter commented Aug 15, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

6 participants

Add validation tests for ESM2 fine-tuning benchmark partial-conv #1010

Add validation tests for ESM2 fine-tuning benchmark partial-conv #1010

Uh oh!

Conversation

nvmvle commented Jul 30, 2025

Description

Type of changes

CI Pipeline Configuration

Authorizing CI Runs

Pre-submit Checklist

Uh oh!

copy-pr-bot bot commented Jul 30, 2025

Uh oh!

jwilber commented Aug 1, 2025

Uh oh!

nvmvle commented Aug 4, 2025

Uh oh!

jwilber commented Aug 14, 2025

Uh oh!

nvmvle commented Aug 15, 2025

Uh oh!

jwilber commented Aug 15, 2025

Uh oh!

codecov-commenter commented Aug 15, 2025

Codecov Report

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

6 participants