Skip to content

fix: Update inference tests for WatsonX model deprecations and API changes#1969

Merged
yoavkatz merged 2 commits into
mainfrom
fix/inference-tests-model-updates
May 21, 2026
Merged

fix: Update inference tests for WatsonX model deprecations and API changes#1969
yoavkatz merged 2 commits into
mainfrom
fix/inference-tests-model-updates

Conversation

@yoavkatz
Copy link
Copy Markdown
Member

@yoavkatz yoavkatz commented May 20, 2026

Summary

  • Replace deprecated ibm/granite-3-8b-instruct with ibm/granite-4-h-small in WatsonX inference tests
  • Fix double-encoded tool call arguments in WMLInferenceEngineChat._send_requests (API now returns arguments as a JSON string rather than a dict)
  • Remove logprobs section from vision model test (meta-llama/llama-3-2-11b-vision-instruct no longer supports top_logprobs)
  • Replace WatsonX-based option-selecting test with HF engine (the /ml/v1/text/generation API and return_options.input_tokens/token_logprobs are deprecated platform-wide)
  • Update expected log prob values for test_log_prob_scoring_inference_engine (library version drift)
  • Fix pytest incorrectly collecting test_metric utility function as a standalone test by renaming the import

Test plan

  • All 28 tests in tests/inference/ pass locally
  • CI passes

yoavkatz and others added 2 commits May 20, 2026 18:23
…anges

- Replace deprecated ibm/granite-3-8b-instruct with ibm/granite-4-h-small
- Fix double-encoded tool call arguments in WMLInferenceEngineChat
- Remove logprobs test for vision model (no longer supported)
- Replace WatsonX option-selecting test with HF engine (generation API deprecated)
- Update log prob expected values for library version drift
- Fix pytest collecting test_metric as standalone test function

Signed-off-by: Yoav Katz <katz@il.ibm.com>
@yoavkatz yoavkatz merged commit 876b22a into main May 21, 2026
21 checks passed
@yoavkatz yoavkatz deleted the fix/inference-tests-model-updates branch May 21, 2026 09:40
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant