chore(release): Release v1.0.2 #358
Merged
647c1d3 to 846d226
skylarmb approved these changes on May 11, 2026
Release v1.0.2
Version: 1.0.2
Changelog
[1.0.2] - 2026-05-11
Fixed
Experiments: client-side evaluator scores now flow end-to-end in evaluate()
Evaluator scores from evaluate() are now written to the correct user-function chain span via enrich_span(metrics=…), exported over OTLP, and surfaced in the per-event metric_deltas of server-side run comparison. Previously, evaluator scores were duplicated into run.metadata.evaluator_metrics and onto the per-datapoint session event, but neither path populated the per-event surface that run comparison requires, so client-side evaluator scoring and comparison did not work end-to-end.
API Client: event_id typed as required on POST /events responses
PostEventResponse.event_id was incorrectly typed as Optional[str] in the OpenAPI spec; the backend always returns the field on a 2xx response. The spec was corrected and the generated SDK now reflects the runtime guarantee, removing the need for defensive if event.event_id: guards. There is no runtime behavior change, but strictly typed codebases (mypy with warn_unreachable = True, or pyright with reportUnnecessaryComparison enabled) may produce "unnecessary comparison" or "redundant Optional guard" warnings, depending on exact usage patterns.
Added
FilterFieldType.DATETIME enum value
Corresponds to the "datetime" wire value supported by the backend. Additive; no behavior change for existing callers.
Deprecated
FilterFieldType.ID
ID = "id" was never a valid backend wire value and has always been rejected on the wire. Use "string" for UUID-shaped fields such as session_id. The alias is preserved so existing imports continue to resolve, and it will be removed in the next major version.
Removed
honeyhive Python CLI entry point removed from pyproject.toml
The honeyhive console script was non-functional (dead code) and shadowed the official TypeScript CLI on $PATH. CLI functionality is now provided by the official honeyhive TypeScript CLI; removing the Python script entry point lets honeyhive resolve correctly when both packages are installed globally.
This PR was automatically generated.
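To illustrate the event_id typing fix listed under Fixed, here is a minimal sketch of what the change means for caller code. The two dataclasses are hypothetical stand-ins for the generated PostEventResponse model before and after the spec correction, and the describe_* helpers are invented for illustration; none of this is the SDK's actual source.

```python
from dataclasses import dataclass
from typing import Optional


# Hypothetical stand-in for the model generated from the old spec.
@dataclass
class PostEventResponseBefore:
    event_id: Optional[str] = None  # typed as optional, although always returned


# Hypothetical stand-in for the model generated from the corrected spec.
@dataclass
class PostEventResponseAfter:
    event_id: str  # required: always present on a 2xx response


def describe_before(resp: PostEventResponseBefore) -> str:
    # Under the old typing, a guard was needed to narrow Optional[str] to str.
    if resp.event_id is not None:
        return "event " + resp.event_id
    raise ValueError("response carried no event_id")


def describe_after(resp: PostEventResponseAfter) -> str:
    # Under the corrected typing, event_id is a plain str and needs no guard.
    return "event " + resp.event_id
```

If a guard like the one in describe_before were left in place against the corrected model, it would be the kind of check that strict type-checker settings may report as unnecessary.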
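The FilterFieldType entries under Added and Deprecated can be pictured with a small sketch. The enum below is a hypothetical stand-in that contains only the wire values named in the changelog ("string", "datetime", "id"); the member name STRING and the wire_type helper are assumptions made for illustration and are not part of the SDK.

```python
import warnings
from enum import Enum


class FilterFieldType(str, Enum):
    """Hypothetical stand-in, limited to the values named in the changelog."""

    STRING = "string"
    DATETIME = "datetime"  # added in 1.0.2
    ID = "id"  # deprecated alias: never a valid backend wire value


def wire_type(field_type: FilterFieldType) -> str:
    """Return a wire value, mapping the deprecated ID alias to "string"."""
    if field_type is FilterFieldType.ID:
        warnings.warn(
            'FilterFieldType.ID is deprecated; use "string" for UUID-shaped fields',
            DeprecationWarning,
            stacklevel=2,
        )
        return FilterFieldType.STRING.value
    return field_type.value
```

A caller-side shim like this is one possible way to migrate filters on fields such as session_id before the alias is removed; replacing the deprecated member directly at each call site works just as well.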