Added tests and refactored fix for long prompt issue #2256
base: master
Conversation
Force-pushed from 5fe1cf3 to 7729110.
r'the limit\.\n'
with pytest.raises(RuntimeError, match=exception_pattern):
    pipe.generate(prompt_2, max_new_tokens=2, ignore_eos=True)
# pipe.finish_chat()
It's up to you whether to call finish_chat(), since pipe is destroyed right after. But don't introduce commented-out code.
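For illustration, a minimal sketch of the two acceptable shapes of that test tail, assuming the openvino_genai LLMPipeline chat API and reusing the prompt_2 / exception_pattern names from the hunk above; the helper name is made up:

```python
import pytest

def check_long_prompt_in_chat(pipe, prompt_2, exception_pattern):
    # Hypothetical helper; prompt_2 and exception_pattern come from the hunk above.
    pipe.start_chat()
    with pytest.raises(RuntimeError, match=exception_pattern):
        pipe.generate(prompt_2, max_new_tokens=2, ignore_eos=True)
    # Either close the chat explicitly...
    pipe.finish_chat()
    # ...or drop the call entirely, since the pipeline object is destroyed
    # right after the test; just don't leave it behind as commented-out code.
```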
Is it really worth renaming? I think the stateful pipeline can still be used for other devices; we just don't use it by default. If the renaming is needed, fix the labeler, because it affects smart CI:

openvino.genai/.github/labeler.yml, line 27 in b7e8993:
- 'tests/python_tests/test_llm_pipeline_static.py'
Pull Request Overview
This PR adds new tests and refactors error handling for the limit on prompt lengths in NPU pipelines.
- Updates test cases to ensure that long input prompts correctly trigger errors.
- Refactors C++ code to improve error message consistency for static and stateful pipelines on NPU.
- Adds new tests in LLMPipeline to cover tokenized, tensor, and string input scenarios.
Reviewed Changes
Copilot reviewed 4 out of 4 changed files in this pull request and generated 2 comments.
| File | Description |
|---|---|
| tests/python_tests/test_vlm_pipeline.py | Updated model reference in one test and added a new test for termination by long input embeddings. |
| tests/python_tests/test_llm_pipeline_npu.py | Added multiple tests for handling overly long tokenized, tensor, and string inputs. |
| src/cpp/src/llm/pipeline_static.cpp | Updated error message to include NPU context. |
| src/cpp/src/llm/pipeline_stateful.cpp | Introduced a helper function to validate NPU prompt length and refactored assertions accordingly. |
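To make the overview above concrete, here is a hedged sketch of how the three input kinds could each be driven into the prompt-length error. The model_path fixture, the test name, the repetition count, and the PROMPT_LIMIT_PATTERN constant are illustrative assumptions; only the error-regex fragment and the encode(...).input_ids pattern are taken from the hunks in this PR.

```python
# Illustrative sketch only; names and the repetition count are assumptions,
# not the code added by this PR.
import pytest
from openvino import Tensor
from openvino_genai import LLMPipeline, Tokenizer

PROMPT_LIMIT_PATTERN = r'the limit\.\n'  # regex fragment from the hunk shown earlier

@pytest.mark.parametrize("input_kind", ["string", "tokenized", "tensor"])
def test_npu_rejects_overlong_prompt(model_path, input_kind):
    # Repeat the prompt enough times to exceed the pipeline's configured
    # prompt-length limit (the multiplier here is arbitrary).
    long_prompt = 'The Sun is yellow because' * 2000
    pipe = LLMPipeline(model_path, "NPU")

    if input_kind == "string":
        inputs = long_prompt
    else:
        tokenized = Tokenizer(model_path).encode(long_prompt)
        assert isinstance(tokenized.input_ids, Tensor)
        inputs = tokenized if input_kind == "tokenized" else tokenized.input_ids

    with pytest.raises(RuntimeError, match=PROMPT_LIMIT_PATTERN):
        pipe.generate(inputs, max_new_tokens=2, ignore_eos=True)
```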
def test_vlm_npu_terminate_by_long_input_embeds(backend):
    models_path = get_ov_model(model_ids[0])
In test_vlm_npu_terminate_by_long_input_embeds, the use of 'model_ids[0]' is inconsistent with the other test, which uses 'model_id'. Verify that 'model_ids' is defined and intended, or change it to 'model_id' for consistency.
Suggested change:
-def test_vlm_npu_terminate_by_long_input_embeds(backend):
-    models_path = get_ov_model(model_ids[0])
+def test_vlm_npu_terminate_by_long_input_embeds(model_id, backend):
+    models_path = get_ov_model(model_id)
long_prompt = 'The Sun is yellow because' * 20
tokenizer = Tokenizer(model_path)
tensor_input = tokenizer.encode(long_prompt).input_ids
assert(isinstance(input_ids, Tensor))
The variable 'input_ids' is not defined; it appears the intent was to assert that the result of tokenizer.encode(long_prompt).input_ids is an instance of Tensor. Replace 'input_ids' with 'tensor_input'.
Suggested change:
-assert(isinstance(input_ids, Tensor))
+assert(isinstance(tensor_input, Tensor))
Follow-up for #2242