[fix] Fix/oc2ov test stability by kaisongli · Pull Request #1669 · volcengine/OpenViking

kaisongli · 2026-04-23T09:00:44Z

Description

Related Issue

Type of Change

Bug fix (non-breaking change that fixes an issue)
New feature (non-breaking change that adds functionality)
Breaking change (fix or feature that would cause existing functionality to not work as expected)
Documentation update
Refactoring (no functional changes)
Performance improvement
Test update

Changes Made

Testing

I have added tests that prove my fix is effective or that my feature works
New and existing unit tests pass locally with my changes
I have tested this on the following platforms:
- Linux
- macOS
- Windows

Checklist

My code follows the project's coding style
I have performed a self-review of my code
I have commented my code, particularly in hard-to-understand areas
I have made corresponding changes to the documentation
My changes generate no new warnings
Any dependent changes have been merged and published

Screenshots (if applicable)

Additional Notes

- oc2ov_test.yml: 定时任务改为一天两次(10:00/18:00北京时间)，超时改为100分钟 - api_test_effect.yml: 已有限制只有主仓库运行定时任务 - TestMemoryRead: 查询措辞优化+关键词容错，兼容旧记忆文件残留

Change workflow job conditions from exclusion-based to inclusion-based: - oc2ov_test.yml: only workflow_dispatch or main repo can run - api_test_effect.yml: only workflow_dispatch or main repo can run - api_test.yml: only workflow_dispatch, pull_request, or main repo can run This prevents fork repos from triggering resource-consuming scheduled and release workflows while still allowing manual dispatch and PRs.

github-actions · 2026-04-23T09:04:27Z

PR Reviewer Guide 🔍

Here are some key observations to aid the review process:

⏱️ Estimated effort to review: 2 🔵🔵⚪⚪⚪
🏅 Score: 85
🧪 No relevant tests
🔒 No security concerns identified
✅ No TODO sections
🔀 Multiple PR themes Sub-PR theme: Improve test stability with retries and unstable response handling Relevant files: tests/oc2ov_test/tests/base_cli_test.py tests/oc2ov_test/tests/p0/test_context_engine.py tests/oc2ov_test/tests/p0/test_memory_crud.py Sub-PR theme: Adjust CI workflow triggers and timeouts Relevant files: .github/workflows/api_test.yml .github/workflows/api_test_effect.yml .github/workflows/oc2ov_test.yml
⚡ Recommended focus areas for review Workflow Condition Change The updated 'if' condition may skip push events to main in forked repositories, which was previously allowed. This could reduce test coverage for fork contributors. if: github.event_name == 'workflow_dispatch' \|\| github.event_name == 'pull_request' \|\| github.repository == 'volcengine/OpenViking' Workflow Condition Change The updated 'if' condition may skip push events in forked repositories, which was previously allowed. This could reduce test coverage for fork contributors. if: github.event_name == 'workflow_dispatch' \|\| github.repository == 'volcengine/OpenViking' Assertion Skipping Assertion methods now skip checks when LLM responses are unstable, which improves stability but could mask real test failures. Consider adding a strict mode or logging the skipped assertions more prominently. if self._is_llm_unstable_response(response): self.logger.warning(f"LLM 不稳定，跳过关键词断言: {keywords}") return Duplicate Unstable Check Logic Unstable response detection logic is duplicated between `smart_wait_for_sync` and `_is_llm_unstable_response`. Consolidating this would reduce maintenance overhead and ensure consistency. unstable = ( not text or any( ind in text.lower() for ind in [ "idle timeout", "couldn't generate", "please try again", "no_reply", ] ) or (text.startswith('[{"name"') and len(text) < 300) or text == "}]" )

github-actions · 2026-04-23T09:05:17Z

PR Code Suggestions ✨

Explore these optional code suggestions:

Category	Suggestion	Impact
General	Use shared unstable check method Remove the duplicated unstable response check in `check_response()` and use the newly added `_is_llm_unstable_response()` method instead. This ensures consistent unstable response detection across the codebase and reduces maintenance overhead. tests/oc2ov_test/tests/base_cli_test.py [155-173] response = self.client.send_message(check_message, session_id=target_session_id) -text = self.assertion.extract_response_text(response).strip() -unstable = ( - not text - or any( - ind in text.lower() - for ind in [ - "idle timeout", - "couldn't generate", - "please try again", - "no_reply", - ] - ) - or (text.startswith('[{"name"') and len(text) < 300) - or text == "}]" -) -if unstable: +if self._is_llm_unstable_response(response): self.logger.warning("LLM 不稳定，跳过 smart_wait 关键词检查") return True Suggestion importance[1-10]: 6 __ Why: The suggestion correctly identifies duplicated unstable response detection logic in `check_response()` and proposes using the new shared `_is_llm_unstable_response()` method instead, which improves consistency and maintainability.	Low

kaisongli added 2 commits April 23, 2026 16:48

github-project-automation Bot added this to OpenViking project Apr 23, 2026

github-project-automation Bot moved this to Backlog in OpenViking project Apr 23, 2026

qin-ctx approved these changes Apr 23, 2026

View reviewed changes

qin-ctx merged commit cadf027 into volcengine:main Apr 23, 2026
6 checks passed

github-project-automation Bot moved this from Backlog to Done in OpenViking project Apr 23, 2026

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[fix] Fix/oc2ov test stability#1669

[fix] Fix/oc2ov test stability#1669
qin-ctx merged 2 commits intovolcengine:mainfrom
kaisongli:fix/oc2ov-test-stability

kaisongli commented Apr 23, 2026

Uh oh!

github-actions Bot commented Apr 23, 2026

Uh oh!

github-actions Bot commented Apr 23, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

kaisongli commented Apr 23, 2026

Description

Related Issue

Type of Change

Changes Made

Testing

Checklist

Screenshots (if applicable)

Additional Notes

Uh oh!

github-actions Bot commented Apr 23, 2026

PR Reviewer Guide 🔍

Uh oh!

github-actions Bot commented Apr 23, 2026

PR Code Suggestions ✨

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants