Summary
Design the use of LLM confidence/uncertainty signals (logprobs, token probabilities) for decision-making within pipelines. This is NOT IN SPEC and requires spec authoring.
Parent issue: #105 — Missing Modality F
Why
Modern LLM APIs expose logprobs. Research on selective prediction shows you can use confidence scores to decide whether to accept output, retry with more context, or escalate to a better model. This ties directly into model cascading — low-confidence responses trigger escalation to a more capable (and expensive) model.
Dependencies
Design Decisions Needed
Spec Work Required
New spec section needed. Depends on native LLM provider support (#128) being designed first.
Acceptance Criteria
Summary
Design the use of LLM confidence/uncertainty signals (logprobs, token probabilities) for decision-making within pipelines. This is NOT IN SPEC and requires spec authoring.
Parent issue: #105 — Missing Modality F
Why
Modern LLM APIs expose logprobs. Research on selective prediction shows you can use confidence scores to decide whether to accept output, retry with more context, or escalate to a better model. This ties directly into model cascading — low-confidence responses trigger escalation to a more capable (and expensive) model.
Dependencies
Design Decisions Needed
{{ step.<id>.confidence }}? Structured metadata?Spec Work Required
New spec section needed. Depends on native LLM provider support (#128) being designed first.
Acceptance Criteria