Validator rejects valid eval input when role is missing (short-form input)

## Context

PR [WiseTechGlobal/WTG.AI.Prompts#490](https://github.com/WiseTechGlobal/WTG.AI.Prompts/pull/490) fails eval validation in CI:

```
✗ evals/arch-prc/functional-evidence-review.eval.yaml
  ✗ [input[0].role] Invalid role 'undefined'. Must be one of: system, user, assistant
  ✗ [input[0].content] Missing or invalid 'content' field (must be a string, array, or object)
```

[Failed run](https://github.com/WiseTechGlobal/WTG.AI.Prompts/actions/runs/23885805981/job/69648296894)

### Root cause (CI version mismatch)

The PR branch is missing `agentv` from `devDependencies` in `package.json` (main has `"agentv": "^4.3.4"`; the PR branch does not). Because agentv is therefore not installed locally when CI runs `bunx agentv validate`, `bunx` downloads a version on demand. That version may differ from the one main resolves, which causes inconsistent validation behavior.

**Short-term fix**: A workflow change has been prepared for WTG.AI.Prompts to support an `AGENTV_VERSION` repository variable, allowing the version to be pinned via Settings > Variables without a code push.

### Underlying issue: validator is stricter than runtime for input arrays

The validator (`eval-validator.ts` `validateMessages()`) requires every item in an `input` array to be a message object with both `role` and `content`. When `role` is missing, it hard-errors:

```typescript
// eval-validator.ts:489-498
const role = message.role;
const validRoles = ['system', 'user', 'assistant'];
if (!validRoles.includes(role as string)) {
  errors.push({
    severity: 'error',
    ...
    message: `Invalid role '${role}'. Must be one of: ${validRoles.join(', ')}`,
  });
}
```

But the runtime (`shorthand-expansion.ts:34-37`) silently filters out items that don't match `isTestMessage()` instead of raising an error:

```typescript
// shorthand-expansion.ts:34-37
if (Array.isArray(value)) {
  const messages = value.filter((msg): msg is TestMessage => isTestMessage(msg));
  return messages.length > 0 ? messages : undefined;
}
```

This means the validator rejects input that the runtime would accept.
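
The mismatch can be reproduced in isolation. The sketch below is a simplified model, not the agentv source: `TestMessage`, `isTestMessage()`, `validatorErrors()`, and `runtimeMessages()` are hypothetical stand-ins written for this issue, and the real type guard may check more or less than this one does.

```typescript
// Hypothetical stand-ins: the real TestMessage and isTestMessage() live in agentv
// and may differ from these simplified versions.
type TestMessage = { role: 'system' | 'user' | 'assistant'; content: unknown };

const VALID_ROLES = ['system', 'user', 'assistant'];

function isTestMessage(value: unknown): value is TestMessage {
  if (typeof value !== 'object' || value === null) return false;
  const candidate = value as Record<string, unknown>;
  return (
    typeof candidate.role === 'string' &&
    VALID_ROLES.includes(candidate.role) &&
    candidate.content !== undefined
  );
}

// Validator-style check: any item without a valid role is a hard error.
function validatorErrors(input: unknown[]): string[] {
  const errors: string[] = [];
  input.forEach((item, index) => {
    const role = (item as Record<string, unknown> | null)?.role;
    if (!VALID_ROLES.includes(role as string)) {
      errors.push(`[input[${index}].role] Invalid role '${String(role)}'`);
    }
  });
  return errors;
}

// Runtime-style handling: non-message items are dropped without any diagnostic.
function runtimeMessages(input: unknown[]): TestMessage[] | undefined {
  const messages = input.filter((msg): msg is TestMessage => isTestMessage(msg));
  return messages.length > 0 ? messages : undefined;
}

// Illustrative input: one content object followed by one well-formed message.
const sampleInput: unknown[] = [
  { type: 'file', value: 'design.md' },
  { role: 'user', content: 'Review the evidence.' },
];

const sampleErrors = validatorErrors(sampleInput);
const sampleMessages = runtimeMessages(sampleInput);
console.log(sampleErrors);   // one error, reported for input[0]
console.log(sampleMessages); // only the well-formed user message remains
```

With this input the validator-style check fails the file, while the runtime-style filter proceeds with the remaining message and reports nothing.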

## Proposed fix

Make the validator accept content objects (items with a `type` field such as `file`, `text`, or `image`) as implicit user messages in input arrays, or at minimum downgrade the failure from an error to a warning. Either option would align the validator with the runtime's lenient handling.

Specifically, in `validateMessages()`, before checking `role`, check if the item looks like a content object rather than a message:

```typescript
// If item looks like a content object ({type: "file", value: ...}), 
// treat as valid — runtime wraps these implicitly
if (isObject(message) && 'type' in message && !('role' in message)) {
  // Warn or accept silently
  continue;
}
```

## Two-part fix summary

| Repo | Change | Status |
|------|--------|--------|
| WTG.AI.Prompts | Add `AGENTV_VERSION` env var to `validate.yml` workflow so version can be overridden via repo variable | Prepared locally |
| EntityProcess/agentv | Make `validateMessages()` accept content objects without `role` in input arrays | TODO |

---
_devbox2-wtg-ai-prompts-allagents_
