
Conversation

@patrocinio
Contributor

The tutorial was comparing loss on different batches:

  • Pre-training: evaluated on first 64 instances (batch 0)
  • Post-training: evaluated on last batch from training loop

This made the comparison misleading as it wasn't measuring improvement on the same data.

Changes:

  • Save the initial batch (xb_initial, yb_initial) after the first evaluation
  • Use the saved initial batch for the post-training evaluation
  • Add a clarifying comment about the fair comparison
  • Both evaluations now use the same data (the first 64 training instances)

This provides an accurate before/after comparison showing the model's improvement on the same batch of data.
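The corrected pattern can be sketched as below. This is a minimal, self-contained illustration, not the tutorial's actual code: the toy data and linear model stand in for the tutorial's MNIST setup, and only the names xb_initial, yb_initial, loss_func, and bs are taken from the PR description; everything else is assumed for the sketch.

```python
import torch
import torch.nn.functional as F
from torch import nn, optim

torch.manual_seed(0)

# Toy stand-in for the tutorial's MNIST data: 784-dim inputs whose label is
# the argmax of the first 10 features, so a linear model can learn it.
bs = 64  # batch size, as in the tutorial
x_train = torch.randn(256, 784)
y_train = x_train[:, :10].argmax(dim=1)

model = nn.Linear(784, 10)
loss_func = F.cross_entropy
opt = optim.SGD(model.parameters(), lr=0.1)

# Save the first batch so the before/after comparison uses the same data.
xb_initial, yb_initial = x_train[0:bs], y_train[0:bs]
loss_before = loss_func(model(xb_initial), yb_initial).item()

# Minimal training loop over all batches.
for epoch in range(3):
    for i in range(0, len(x_train), bs):
        xb, yb = x_train[i : i + bs], y_train[i : i + bs]
        loss = loss_func(model(xb), yb)
        loss.backward()
        opt.step()
        opt.zero_grad()

# Evaluate on the *saved* initial batch, not on the last batch of the loop.
loss_after = loss_func(model(xb_initial), yb_initial).item()
print(loss_before, loss_after)
```

Evaluating `loss_after` on the last `(xb, yb)` left over from the loop, as the tutorial previously did, would compare losses on different data; reusing the saved batch makes the before/after numbers directly comparable.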

Fixes #3666

Description

Checklist

  • The issue being fixed is referenced in the description (see "Fixes #ISSUE_NUMBER" above)
  • Only one issue is addressed in this pull request
  • Labels from the issue that this PR is fixing are added to this pull request
  • No unnecessary issues are included in this pull request.

@pytorch-bot

pytorch-bot bot commented Nov 27, 2025

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/tutorials/3667


@meta-cla bot added the cla signed label Nov 27, 2025


Successfully merging this pull request may close these issues.

Feedback about What is torch.nn really?
