Skip to content

feat: base model switched to Roberta#324

Closed
hanneshapke wants to merge 2 commits intomainfrom
feat/quant-roberta-model
Closed

feat: base model switched to Roberta#324
hanneshapke wants to merge 2 commits intomainfrom
feat/quant-roberta-model

Conversation

@hanneshapke
Copy link
Copy Markdown
Collaborator

updated training setup

updated training setup

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
# Path to audit_allowlist.txt to filter training samples (empty = no filtering)
# Generate with: uv run python model/dataset/audit_dataset.py --samples-dir <dir>
audit_allowlist = ""
audit_allowlist = "/home/hannes/kiji-proxy/model/dataset/data_samples/training_samples/audit_ledger.tsv"
Copy link
Copy Markdown
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

@Davidnet Watch out the path should be relative!

HF Hub API rejects binary files, requiring Xet/LFS storage via git push.
Added _upload_binary_via_git helper that clones the repo, tracks the file
extension with LFS, and pushes.

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
@hanneshapke
Copy link
Copy Markdown
Collaborator Author

Closed in favor of #325

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant