feat: dataset versioning by max-braintrust · Pull Request #1837 · braintrustdata/braintrust-sdk-javascript

max-braintrust · 2026-04-15T18:53:43Z

Summary

This PR adds dataset snapshot and environment tag support to the JS SDK. See feature spec here: braintrustdata/braintrust-spec#14

Background

This change adds two friendlier ways to reference dataset versions:

- Snapshots, which are stable human-readable names for a specific dataset version
- Environment tags, which are movable aliases like ppe or production that can be repointed over time

These are still just ways of referring to a concrete dataset version (xact_id). The SDK resolves snapshot names and environment tags down to the underlying xact_id before experiment or eval registration, so we keep the existing reproducibility guarantees while making version selection much easier to use.

This PR adds:

- SDK support for initializing datasets by:
  - explicit version (xact_id)
  - snapshot name
  - environment tag
- Resolution of snapshot and environment selectors to a concrete dataset version internally before eval / experiment registration
- SDK helpers for dataset snapshots, including:
  - create
  - list
  - update via register/upsert for the current dataset version
  - patch snapshot metadata by id
  - delete
  - restore and restore/preview to return the dataset head to the state at a particular version
- Dev server support for forwarding dataset version and environment when resolving datasets for remote evals
- Tests and example coverage for the new version-selection paths
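
As a rough illustration of the resolution step described above, the sketch below maps each kind of selector down to a concrete xact_id. The `DatasetSelector` and `VersionIndex` types and the `resolveXactId` function are hypothetical names invented for this example, and the in-memory maps stand in for whatever backend lookups the SDK actually performs.

```typescript
// Hypothetical selector shapes for illustration; these are not the SDK's actual types.
type DatasetSelector =
  | { kind: "version"; xactId: string }
  | { kind: "snapshot"; name: string }
  | { kind: "environment"; tag: string };

// In-memory stand-ins (an assumption) for the backend lookups.
interface VersionIndex {
  snapshots: Map<string, string>; // snapshot name -> xact_id
  environments: Map<string, string>; // environment tag -> xact_id
}

// Resolve any selector to the concrete xact_id it refers to.
function resolveXactId(selector: DatasetSelector, index: VersionIndex): string {
  switch (selector.kind) {
    case "version":
      // Already a concrete version; nothing to look up.
      return selector.xactId;
    case "snapshot": {
      const xactId = index.snapshots.get(selector.name);
      if (xactId === undefined) {
        throw new Error(`Unknown snapshot: ${selector.name}`);
      }
      return xactId;
    }
    case "environment": {
      const xactId = index.environments.get(selector.tag);
      if (xactId === undefined) {
        throw new Error(`Unknown environment tag: ${selector.tag}`);
      }
      return xactId;
    }
  }
}

// Example data: one named snapshot and one environment tag.
const exampleIndex: VersionIndex = {
  snapshots: new Map([["baseline", "1001"]]),
  environments: new Map([["production", "1001"]]),
};

const pinned = resolveXactId({ kind: "environment", tag: "production" }, exampleIndex);

// Repointing the tag changes what later resolutions return, while
// the previously resolved `pinned` value stays fixed.
exampleIndex.environments.set("production", "1002");
const repointed = resolveXactId({ kind: "environment", tag: "production" }, exampleIndex);
```

Because the experiment would record the resolved value rather than the tag, repointing `production` later does not change what an earlier run was evaluated against.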

max-braintrust · 2026-04-21T05:47:34Z

  const newNormalized = normalizeClass(newClass);

  // Check if normalized versions are similar (one contains significant portion of the other)
  const similarityThreshold = Math.min(500, oldNormalized.length * 0.5);


This heuristic fails when you add substantially to the method bodies of a class, so I added validation through the TS parser as a fallback for when it fails. Since the heuristic should be faster, I'm keeping it for the normal case.
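
The fast-path-then-fallback pattern described in this comment could look roughly like the sketch below. Only the threshold expression comes from the diff; the whitespace-collapsing `normalizeClass`, the shared-prefix similarity measure, and the injected `slowCompare` callback (standing in for the TS parser check) are all assumptions made for illustration.

```typescript
// Hypothetical normalizer: collapse whitespace so formatting-only changes are ignored.
function normalizeClass(source: string): string {
  return source.replace(/\s+/g, " ").trim();
}

// Length of the common prefix of two strings (an assumed similarity measure).
function sharedPrefixLength(a: string, b: string): number {
  const limit = Math.min(a.length, b.length);
  let i = 0;
  while (i < limit && a[i] === b[i]) i++;
  return i;
}

// Stand-in for the slower parser-based comparison.
type SignatureComparer = (oldClass: string, newClass: string) => boolean;

function classesLookCompatible(
  oldClass: string,
  newClass: string,
  slowCompare: SignatureComparer,
): boolean {
  const oldNormalized = normalizeClass(oldClass);
  const newNormalized = normalizeClass(newClass);
  // Same threshold expression as in the diff above.
  const similarityThreshold = Math.min(500, oldNormalized.length * 0.5);

  // Fast path: cheap string comparison covers the common case.
  if (sharedPrefixLength(oldNormalized, newNormalized) >= similarityThreshold) {
    return true;
  }
  // Slow path: only reached when the cheap heuristic is inconclusive.
  return slowCompare(oldClass, newClass);
}
```

The slow comparison is injected so that the cheap check stays free of parser dependencies and the fallback runs only when the heuristic does not match.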

Abhijeet Prasad (AbhiPrasad) · 2026-04-22T17:28:35Z

Luca Forstner (@lforst) we really need to split this file up 😭

Abhijeet Prasad (AbhiPrasad) · 2026-04-22T17:30:00Z

+        });
+        args["dataset_id"] = datasetSelection.datasetId;
+        if (datasetSelection.datasetVersion !== undefined) {
+          args["dataset_version"] = datasetSelection.datasetVersion;


This is a change to how this worked before because we had the } else { branch that would do args["dataset_version"] = await (dataset as AnyDataset).version();. Is that intentional? I guess we do save on having to do the dataset.version() call.

No, that should probably get fixed so this still works for subclasses and for cases where the version is pinned manually. I updated serializeDatasetForExperiment() so it always calls .version() if we don't resolve through one of the other selections.
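
A minimal sketch of the fallback behavior discussed in this thread, assuming simplified shapes: `DatasetSelection`, `VersionedDataset`, and `buildDatasetArgs` are hypothetical names for this example and do not reproduce the actual serializeDatasetForExperiment() implementation.

```typescript
// Hypothetical shapes for illustration only.
interface DatasetSelection {
  datasetId: string;
  datasetVersion?: string; // set when a version, snapshot, or tag was resolved
}

interface VersionedDataset {
  version(): Promise<string | undefined>;
}

async function buildDatasetArgs(
  selection: DatasetSelection,
  dataset: VersionedDataset,
): Promise<Record<string, string>> {
  const args: Record<string, string> = { dataset_id: selection.datasetId };

  // Prefer the resolved selection; otherwise ask the dataset itself, so that
  // subclasses or manually pinned datasets still report their version.
  const version = selection.datasetVersion ?? (await dataset.version());
  if (version !== undefined) {
    args["dataset_version"] = version;
  }
  return args;
}
```

The `??` operator only evaluates its right-hand side when the resolved version is missing, which keeps the saving the reviewer noted: no `.version()` call when a selection has already been resolved.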

Abhijeet Prasad (AbhiPrasad) · 2026-04-22T17:35:17Z

+  const snapshots = await getDatasetSnapshots({ state, datasetId });
+  const match = snapshots.find((snapshot) => snapshot.name === snapshotName);


Can we add a backend endpoint to do this instead? It feels like a lot to scan through all the datasets client-side.

Yes, this should already be supported. I pulled apart listSnapshots() and getSnapshot() so this now makes proper use of that.

Abhijeet Prasad (AbhiPrasad) · 2026-04-22T17:39:59Z

+    dataset_id: string;
+    dataset_version?: string;
+    dataset_environment?: string;
+    dataset_snapshot_name?: string;


Do we need to pass dataset_snapshot_name into the remote evals created in braintrust-sdk-javascript/js/dev/server.ts (line 312 in 343634f)?

async function getDataset(

(I lack a lot of context w/ remote evals so lmk if I'm off the mark!)

Discussed a bit offline: the remote eval path needs more API changes before it gets added to the SDK.

Abhijeet Prasad (AbhiPrasad) · 2026-04-23T15:09:33Z

+  xactId: string;
+};
+
+type DatasetSnapshotLookup =


Can we export this? It's used by the public async getSnapshot.

Abhijeet Prasad (AbhiPrasad) · 2026-04-23T15:10:04Z

+---
+
+- (feat) Add dataset snapshot/environment selection support to `init()` and `initDataset()`, including snapshot CRUD helpers and `DatasetSnapshot` type exports.
+- (feat) Update `braintrust/dev` to respect `dataset_version` and `dataset_environment` when resolving datasets for evals.


feel free to also add a little extra detail here, like an example code snippet!

max-braintrust added 7 commits April 15, 2026 11:48

better handle Dataset extension

524ac9b

add changeset

ad475af

Merge branch 'main' into dataset-tags-refactor

1ee10e9

refactor sdk change and fix api compat tests

32ba437

Merge branch 'main' into dataset-tags-refactor

ba02958

fix bridge

cc82c66

support updates

cff7d47

max-braintrust marked this pull request as ready for review April 20, 2026 20:46

add restore

f40c059

Abhijeet Prasad (AbhiPrasad) requested review from Abhijeet Prasad (AbhiPrasad) and Luca Forstner (lforst) April 20, 2026 22:16

max-braintrust added 2 commits April 20, 2026 15:32

fix mock

bd98856

Merge branch 'main' into dataset-tags-refactor

e70b7ed

max-braintrust changed the title ~~Support dataset versioning~~ feat: dataset versioning Apr 21, 2026

max-braintrust added 2 commits April 20, 2026 22:23

update ordering

fccc636

fix class signature tests

b1fe3df

max-braintrust commented Apr 21, 2026

use ast parse as fallback

343634f

Abhijeet Prasad (AbhiPrasad) reviewed Apr 22, 2026

max-braintrust added 2 commits April 22, 2026 11:43

pr feedback

d4c6faa

Merge branch 'main' into dataset-tags-refactor

a472b9b

max-braintrust requested a review from Abhijeet Prasad (AbhiPrasad) April 22, 2026 21:35

Abhijeet Prasad (AbhiPrasad) approved these changes Apr 23, 2026

Abhijeet Prasad (AbhiPrasad) reviewed Apr 23, 2026

max-braintrust added 2 commits April 23, 2026 10:08

add example snippits

aa23190

Merge branch 'main' into dataset-tags-refactor

6a7effc

max-braintrust merged commit 3500ec2 into main Apr 23, 2026
52 of 54 checks passed

max-braintrust deleted the dataset-tags-refactor branch April 23, 2026 23:43

max-braintrust merged 17 commits into main from dataset-tags-refactor
