Skip to content

Conversation

@nvmvle
Copy link
Collaborator

@nvmvle nvmvle commented Jul 30, 2025

Description

This PR adds a comprehensive test validation section to the ESM2 fine-tuning benchmark partial-conv configuration. The tests define expected baseline metrics and ranges for model performance validation during CI runs.

The validation includes:

  • Exit code validation (expecting successful completion with code 0)
  • Performance baselines for key metrics:
    • consumed_samples
    • val_loss
    • val_acc
    • TFLOPS per GPU
    • reduced_train_loss

Type of changes

  • Bug fix (non-breaking change which fixes an issue)
  • New feature (non-breaking change which adds functionality)
  • Refactor
  • Documentation update
  • Other (please describe):

CI Pipeline Configuration

Configure CI behavior by applying the relevant labels:

Note

By default, the notebooks validation tests are skipped unless explicitly enabled.

Authorizing CI Runs

We use copy-pr-bot to manage authorization of CI
runs on NVIDIA's compute resources.

  • If a pull request is opened by a trusted user and contains only trusted changes, the pull request's code will
    automatically be copied to a pull-request/ prefixed branch in the source repository (e.g. pull-request/123)
  • If a pull request is opened by an untrusted user or contains untrusted changes, an NVIDIA org member must leave an
    /ok to test comment on the pull request to trigger CI. This will need to be done for each new commit.

Pre-submit Checklist

  • I have tested these changes locally
  • I have updated the documentation accordingly
  • I have added/updated tests as needed
  • All existing tests pass successfully

Signed-off-by: My Le mvle@nvidia.com

@nvmvle nvmvle requested review from jwilber and trvachov July 30, 2025 12:01
@copy-pr-bot
Copy link

copy-pr-bot bot commented Jul 30, 2025

This pull request requires additional validation before any workflows can run on NVIDIA's runners.

Pull request vetters can view their responsibilities here.

Contributors can view more details about this message here.

@nvmvle nvmvle force-pushed the mvle/jet-test-esm2-finetune branch from 54f15a1 to 589324c Compare July 30, 2025 13:32
@jwilber
Copy link
Collaborator

jwilber commented Aug 1, 2025

/ok to test 8e0042b

@jwilber jwilber enabled auto-merge August 1, 2025 18:50
@nvmvle
Copy link
Collaborator Author

nvmvle commented Aug 4, 2025

/ok to test dfd6913

@jwilber
Copy link
Collaborator

jwilber commented Aug 14, 2025

/ok to test e734275

Signed-off-by: My Le <mvle@nvidia.com>
@nvmvle
Copy link
Collaborator Author

nvmvle commented Aug 15, 2025

/ok to test 42918f6

@jwilber
Copy link
Collaborator

jwilber commented Aug 15, 2025

/ok to test 42918f6

@jwilber jwilber added this pull request to the merge queue Aug 15, 2025
@codecov-commenter
Copy link

Codecov Report

✅ All modified and coverable lines are covered by tests.
✅ Project coverage is 81.29%. Comparing base (21b1442) to head (42918f6).
✅ All tests successful. No failed tests found.

Additional details and impacted files
@@           Coverage Diff           @@
##             main    #1010   +/-   ##
=======================================
  Coverage   81.29%   81.29%           
=======================================
  Files         152      152           
  Lines       10271    10271           
=======================================
  Hits         8350     8350           
  Misses       1921     1921           

Merged via the queue into main with commit db065c1 Aug 15, 2025
14 checks passed
@jwilber jwilber deleted the mvle/jet-test-esm2-finetune branch August 15, 2025 06:13
pstjohn pushed a commit to pstjohn/bionemo-framework that referenced this pull request Aug 19, 2025
…DIA#1010)

### Description
This PR adds a comprehensive test validation section to the ESM2
fine-tuning benchmark partial-conv configuration. The tests define
expected baseline metrics and ranges for model performance validation
during CI runs.

The validation includes:
- Exit code validation (expecting successful completion with code 0)
- Performance baselines for key metrics:
  - `consumed_samples`
  - `val_loss`
  - `val_acc`
  - `TFLOPS per GPU`
  - `reduced_train_loss`

### Type of changes
- [ ]  Bug fix (non-breaking change which fixes an issue)
- [x]  New feature (non-breaking change which adds functionality)
- [ ]  Refactor
- [ ]  Documentation update
- [ ]  Other (please describe):

### CI Pipeline Configuration
Configure CI behavior by applying the relevant labels:

-
[SKIP_CI](https://github.com/NVIDIA/bionemo-framework/blob/main/docs/docs/user-guide/contributing/contributing.md#skip_ci)
- Skip all continuous integration tests
-
[INCLUDE_NOTEBOOKS_TESTS](https://github.com/NVIDIA/bionemo-framework/blob/main/docs/docs/user-guide/contributing/contributing.md#include_notebooks_tests)
- Execute notebook validation tests in pytest
-
[INCLUDE_SLOW_TESTS](https://github.com/NVIDIA/bionemo-framework/blob/main/docs/docs/user-guide/contributing/contributing.md#include_slow_tests)
- Execute tests labelled as slow in pytest for extensive testing

> [!NOTE]
> By default, the notebooks validation tests are skipped unless
explicitly enabled.

#### Authorizing CI Runs

We use
[copy-pr-bot](https://docs.gha-runners.nvidia.com/apps/copy-pr-bot/#automation)
to manage authorization of CI
runs on NVIDIA's compute resources.

* If a pull request is opened by a trusted user and contains only
trusted changes, the pull request's code will
automatically be copied to a pull-request/ prefixed branch in the source
repository (e.g. pull-request/123)
* If a pull request is opened by an untrusted user or contains untrusted
changes, an NVIDIA org member must leave an
`/ok to test` comment on the pull request to trigger CI. This will need
to be done for each new commit.

### Pre-submit Checklist

 - [x] I have tested these changes locally
 - [ ] I have updated the documentation accordingly
 - [ ] I have added/updated tests as needed
 - [x] All existing tests pass successfully
 
 Signed-off-by: My Le <mvle@nvidia.com>

---------

Signed-off-by: My Le <mvle@nvidia.com>
Co-authored-by: Jared Wilber <jwilber@nvidia.com>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

6 participants