Skip to content

fix: ng prepare data metrics conflict#738

Merged
bxyu-nvidia merged 1 commit intomainfrom
cmunley1/data-prep
Feb 24, 2026
Merged

fix: ng prepare data metrics conflict#738
bxyu-nvidia merged 1 commit intomainfrom
cmunley1/data-prep

Conversation

@cmunley1
Copy link
Contributor

issue described in #736

collate_samples writes metrics without the dataset config metadata eg name type jsonl_fpath while validate_samples_and_aggregate_emtrics wrote those with metadata , causing a conflict. this mirrors what the validate step does to merge the metadata before the conflict check

Signed-off-by: cmunley1 <cmunley@nvidia.com>
@copy-pr-bot
Copy link

copy-pr-bot bot commented Feb 20, 2026

This pull request requires additional validation before any workflows can run on NVIDIA's runners.

Pull request vetters can view their responsibilities here.

Contributors can view more details about this message here.

@cwing-nvidia cwing-nvidia linked an issue Feb 20, 2026 that may be closed by this pull request
@bxyu-nvidia bxyu-nvidia merged commit f1d19a8 into main Feb 24, 2026
5 checks passed
@bxyu-nvidia bxyu-nvidia deleted the cmunley1/data-prep branch February 24, 2026 17:43
fsiino-nvidia pushed a commit that referenced this pull request Feb 26, 2026
issue described in #736

collate_samples writes metrics without the dataset config metadata eg
name type jsonl_fpath while validate_samples_and_aggregate_emtrics wrote
those with metadata , causing a conflict. this mirrors what the validate
step does to merge the metadata before the conflict check

Signed-off-by: cmunley1 <cmunley@nvidia.com>
fsiino-nvidia pushed a commit that referenced this pull request Feb 26, 2026
issue described in #736

collate_samples writes metrics without the dataset config metadata eg
name type jsonl_fpath while validate_samples_and_aggregate_emtrics wrote
those with metadata , causing a conflict. this mirrors what the validate
step does to merge the metadata before the conflict check

Signed-off-by: cmunley1 <cmunley@nvidia.com>
abubakaria56 pushed a commit to abubakaria56/Gym that referenced this pull request Mar 2, 2026
issue described in NVIDIA-NeMo#736

collate_samples writes metrics without the dataset config metadata eg
name type jsonl_fpath while validate_samples_and_aggregate_emtrics wrote
those with metadata , causing a conflict. this mirrors what the validate
step does to merge the metadata before the conflict check

Signed-off-by: cmunley1 <cmunley@nvidia.com>
abubakaria56 pushed a commit to abubakaria56/Gym that referenced this pull request Mar 2, 2026
issue described in NVIDIA-NeMo#736

collate_samples writes metrics without the dataset config metadata eg
name type jsonl_fpath while validate_samples_and_aggregate_emtrics wrote
those with metadata , causing a conflict. this mirrors what the validate
step does to merge the metadata before the conflict check

Signed-off-by: cmunley1 <cmunley@nvidia.com>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

bug: ng_prepare_data metrics conflict

2 participants