Skip to content

Conversation

SamMalayek
Copy link

This adds support for a new --embd-output-format raw option, which outputs embeddings as plain space-separated floats — without JSON formatting or embedding N: prefixes.

This is useful for downstream vector pipelines and scripting, e.g. when piping directly into NumPy or other vector processing tools.

  • Existing formats (json, json+, etc.) remain unchanged.
  • Default behavior is unaffected.

This new option outputs embeddings as raw space-separated floats, without JSON or 'embedding N:' prefixes. Useful for downstream vector pipelines and scripting.
@SamMalayek SamMalayek force-pushed the feature/raw-embedding-output branch from 0d10ee4 to cd96be7 Compare October 12, 2025 21:44
@SamMalayek SamMalayek requested a review from danbev October 13, 2025 23:21
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants