-
Notifications
You must be signed in to change notification settings - Fork 76
Fix run examples workflow failed on schedule run & use parallel execution with pytest #1229
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conversation
|
"## 🔄 Running Examples with |
This reverts commit 4e998b7.
|
Looks like there are a few issues preventing this PR from being merged!
If you'd like me to help, just leave a comment, like Feel free to include any additional details that might help me get this PR into a better state. You can manage your notification settings |
🔄 Running Examples with
|
🔄 Running Examples with
|
| Example | Status | Duration | Cost |
|---|---|---|---|
| 01_standalone_sdk/02_custom_tools.py | ✅ PASS | 29.9s | $0.03 |
| 01_standalone_sdk/03_activate_skill.py | ✅ PASS | 10.3s | $0.01 |
| 01_standalone_sdk/05_use_llm_registry.py | ✅ PASS | 13.7s | $0.01 |
| 01_standalone_sdk/07_mcp_integration.py | ✅ PASS | 49.4s | $0.02 |
| 01_standalone_sdk/09_pause_example.py | ✅ PASS | 17.0s | $0.01 |
| 01_standalone_sdk/10_persistence.py | ✅ PASS | 39.7s | $0.02 |
| 01_standalone_sdk/11_async.py | ✅ PASS | 37.8s | $0.02 |
| 01_standalone_sdk/12_custom_secrets.py | ✅ PASS | 14.0s | $0.01 |
| 01_standalone_sdk/13_get_llm_metrics.py | ✅ PASS | 32.7s | $0.01 |
| 01_standalone_sdk/14_context_condenser.py | ✅ PASS | 3m 15s | $0.37 |
| 01_standalone_sdk/17_image_input.py | ✅ PASS | 17.2s | $0.02 |
| 01_standalone_sdk/18_send_message_while_processing.py | ✅ PASS | 21.5s | $0.01 |
| 01_standalone_sdk/19_llm_routing.py | ✅ PASS | 17.4s | $0.02 |
| 01_standalone_sdk/20_stuck_detector.py | ✅ PASS | 23.9s | $0.02 |
| 01_standalone_sdk/21_generate_extraneous_conversation_costs.py | ✅ PASS | 10.4s | $0.00 |
| 01_standalone_sdk/22_anthropic_thinking.py | ✅ PASS | 13.5s | $0.01 |
| 01_standalone_sdk/23_responses_reasoning.py | ✅ PASS | 39.7s | $0.01 |
| 01_standalone_sdk/24_planning_agent_workflow.py | ✅ PASS | 7m 34s | $0.57 |
| 01_standalone_sdk/25_agent_delegation.py | ✅ PASS | 1m 31s | $0.09 |
| 01_standalone_sdk/26_custom_visualizer.py | ✅ PASS | 25.9s | $0.02 |
| 02_remote_agent_server/01_convo_with_local_agent_server.py | ✅ PASS | 1m 8s | $0.05 |
| 02_remote_agent_server/02_convo_with_docker_sandboxed_server.py | ✅ PASS | 1m 3s | $0.02 |
| 02_remote_agent_server/03_browser_use_with_docker_sandboxed_server.py | ✅ PASS | 2m 17s | $0.04 |
| 02_remote_agent_server/04_convo_with_api_sandboxed_server.py | ✅ PASS | 4m 57s | $0.03 |
✅ All tests passed!
Total: 24 | Passed: 24 | Failed: 0 | Total Cost: $1.42
🔄 Running Examples with
|
| Example | Status | Duration | Cost |
|---|---|---|---|
| 01_standalone_sdk/02_custom_tools.py | ✅ PASS | 38.5s | $0.03 |
| 01_standalone_sdk/03_activate_skill.py | ✅ PASS | 13.6s | $0.01 |
| 01_standalone_sdk/05_use_llm_registry.py | ✅ PASS | 15.1s | $0.01 |
| 01_standalone_sdk/07_mcp_integration.py | ✅ PASS | 51.0s | $0.02 |
| 01_standalone_sdk/09_pause_example.py | ✅ PASS | 18.5s | $0.01 |
| 01_standalone_sdk/10_persistence.py | ✅ PASS | 43.5s | $0.02 |
| 01_standalone_sdk/11_async.py | ✅ PASS | 36.3s | $0.03 |
| 01_standalone_sdk/12_custom_secrets.py | ✅ PASS | 23.0s | $0.01 |
| 01_standalone_sdk/13_get_llm_metrics.py | ✅ PASS | 35.3s | $0.02 |
| 01_standalone_sdk/14_context_condenser.py | ✅ PASS | 2m 43s | $0.30 |
| 01_standalone_sdk/17_image_input.py | ✅ PASS | 17.4s | $0.02 |
| 01_standalone_sdk/18_send_message_while_processing.py | ✅ PASS | 20.7s | $0.01 |
| 01_standalone_sdk/19_llm_routing.py | ✅ PASS | 17.9s | $0.02 |
| 01_standalone_sdk/20_stuck_detector.py | ✅ PASS | 22.0s | $0.02 |
| 01_standalone_sdk/21_generate_extraneous_conversation_costs.py | ✅ PASS | 13.8s | $0.00 |
| 01_standalone_sdk/22_anthropic_thinking.py | ✅ PASS | 22.2s | $0.02 |
| 01_standalone_sdk/23_responses_reasoning.py | ✅ PASS | 41.8s | $0.01 |
| 01_standalone_sdk/24_planning_agent_workflow.py | ✅ PASS | 4m 46s | $0.32 |
| 01_standalone_sdk/25_agent_delegation.py | ✅ PASS | 47.0s | $0.04 |
| 01_standalone_sdk/26_custom_visualizer.py | ✅ PASS | 24.6s | $0.03 |
| 02_remote_agent_server/01_convo_with_local_agent_server.py | ✅ PASS | 56.4s | $0.03 |
| 02_remote_agent_server/02_convo_with_docker_sandboxed_server.py | ✅ PASS | 2m 29s | $0.04 |
| 02_remote_agent_server/03_browser_use_with_docker_sandboxed_server.py | ✅ PASS | 2m 45s | $0.10 |
| 02_remote_agent_server/04_convo_with_api_sandboxed_server.py | ✅ PASS | 1m 40s | $0.03 |
✅ All tests passed!
Total: 24 | Passed: 24 | Failed: 0 | Total Cost: $1.13
xingyaoww
left a comment
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
This is so awesome! Thank you!
🔄 Running Examples with
|
| Example | Status | Duration | Cost |
|---|---|---|---|
| 01_standalone_sdk/02_custom_tools.py | ✅ PASS | 29.1s | $0.03 |
| 01_standalone_sdk/03_activate_skill.py | ✅ PASS | 12.5s | $0.01 |
| 01_standalone_sdk/05_use_llm_registry.py | ✅ PASS | 12.9s | $0.01 |
| 01_standalone_sdk/07_mcp_integration.py | ✅ PASS | 48.3s | $0.02 |
| 01_standalone_sdk/09_pause_example.py | ✅ PASS | 16.6s | $0.01 |
| 01_standalone_sdk/10_persistence.py | ✅ PASS | 39.0s | $0.02 |
| 01_standalone_sdk/11_async.py | ✅ PASS | 37.8s | $0.03 |
| 01_standalone_sdk/12_custom_secrets.py | ✅ PASS | 19.4s | $0.01 |
| 01_standalone_sdk/13_get_llm_metrics.py | ✅ PASS | 32.7s | $0.01 |
| 01_standalone_sdk/14_context_condenser.py | ✅ PASS | 2m 58s | $0.34 |
| 01_standalone_sdk/17_image_input.py | ✅ PASS | 19.0s | $0.02 |
| 01_standalone_sdk/18_send_message_while_processing.py | ✅ PASS | 23.6s | $0.01 |
| 01_standalone_sdk/19_llm_routing.py | ✅ PASS | 24.9s | $0.02 |
| 01_standalone_sdk/20_stuck_detector.py | ✅ PASS | 20.8s | $0.01 |
| 01_standalone_sdk/21_generate_extraneous_conversation_costs.py | ✅ PASS | 11.4s | $0.00 |
| 01_standalone_sdk/22_anthropic_thinking.py | ✅ PASS | 16.6s | $0.01 |
| 01_standalone_sdk/23_responses_reasoning.py | ✅ PASS | 38.2s | $0.01 |
| 01_standalone_sdk/24_planning_agent_workflow.py | ✅ PASS | 3m 41s | $0.21 |
| 01_standalone_sdk/25_agent_delegation.py | ✅ PASS | 1m 42s | $0.22 |
| 01_standalone_sdk/26_custom_visualizer.py | ✅ PASS | 23.0s | $0.02 |
| 02_remote_agent_server/01_convo_with_local_agent_server.py | ✅ PASS | 1m 10s | $0.05 |
| 02_remote_agent_server/02_convo_with_docker_sandboxed_server.py | ✅ PASS | 2m 28s | $0.05 |
| 02_remote_agent_server/03_browser_use_with_docker_sandboxed_server.py | ✅ PASS | 2m 54s | $0.07 |
| 02_remote_agent_server/04_convo_with_api_sandboxed_server.py | ✅ PASS | 1m 37s | $0.03 |
✅ All tests passed!
Total: 24 | Passed: 24 | Failed: 0 | Total Cost: $1.23
This PR is to:
pytest-xdistFix #1058
Agent Server images for this PR
• GHCR package: https://github.com/OpenHands/agent-sdk/pkgs/container/agent-server
Variants & Base Images
eclipse-temurin:17-jdknikolaik/python-nodejs:python3.12-nodejs22golang:1.21-bookwormPull (multi-arch manifest)
# Each variant is a multi-arch manifest supporting both amd64 and arm64 docker pull ghcr.io/openhands/agent-server:30c3275-pythonRun
All tags pushed for this build
About Multi-Architecture Support
30c3275-python) is a multi-arch manifest supporting both amd64 and arm6430c3275-python-amd64) are also available if needed