Skip to content

185 Add dataset hash#187

Merged
oleschwen merged 9 commits intomainfrom
185-add-checksum-of-data-used-in-training
Feb 9, 2026
Merged

185 Add dataset hash#187
oleschwen merged 9 commits intomainfrom
185-add-checksum-of-data-used-in-training

Conversation

@oleschwen
Copy link
Copy Markdown
Collaborator

  • Log a hash of the dataset used
  • Log whether duplicate UIDs or image data appear in training/validation data (no IDs logged)

@oleschwen oleschwen linked an issue Jan 30, 2026 that may be closed by this pull request
2 tasks
@oleschwen oleschwen changed the title 185 Add dataset hash WIP 185 Add dataset hash Jan 30, 2026
@oleschwen oleschwen changed the title WIP 185 Add dataset hash 185 Add dataset hash Feb 9, 2026
@oleschwen oleschwen marked this pull request as ready for review February 9, 2026 09:25
@oleschwen oleschwen self-assigned this Feb 9, 2026
Copy link
Copy Markdown
Contributor

@Ultimate-Storm Ultimate-Storm left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

approved

@oleschwen oleschwen merged commit 66eac87 into main Feb 9, 2026
5 checks passed
@oleschwen oleschwen deleted the 185-add-checksum-of-data-used-in-training branch February 9, 2026 13:19
deboraJ1 pushed a commit to deboraJ1/MediSwarm that referenced this pull request Mar 4, 2026
…ata-used-in-training

185 Add dataset hash
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

Add checksum of data used in training

2 participants