Add AgenticCodeExecution sample solution - MCP code-execution agents#82
Merged
sgurunat merged 13 commits intoopea-project:mainfrom Apr 24, 2026
Merged
Conversation
f82d439 to
794bb42
Compare
amberjain1
reviewed
Apr 1, 2026
5d44018 to
888d65b
Compare
sgurunat
reviewed
Apr 13, 2026
sgurunat
reviewed
Apr 13, 2026
sgurunat
reviewed
Apr 13, 2026
sgurunat
reviewed
Apr 13, 2026
f144c79 to
30f7c67
Compare
sgurunat
reviewed
Apr 20, 2026
sgurunat
reviewed
Apr 20, 2026
sgurunat
reviewed
Apr 20, 2026
sgurunat
reviewed
Apr 20, 2026
sgurunat
reviewed
Apr 20, 2026
sgurunat
reviewed
Apr 20, 2026
sgurunat
reviewed
Apr 20, 2026
sgurunat
reviewed
Apr 20, 2026
vivekrsintc
reviewed
Apr 20, 2026
Contributor
vivekrsintc
left a comment
There was a problem hiding this comment.
📄 Third-Party Notice Required for τ-bench
This solution uses data and domain concepts from τ-bench (tau2-bench) by Sierra Research, licensed under MIT. A third-party notice file should be added.
Suggestion: Create sample_solutions/AgenticCodeExecution/THIRD_PARTY_NOTICES with the following content:
This project includes or references components from the following third-party projects:
τ-bench (tau2-bench) https://github.com/sierra-research/tau2-bench License: MIT Copyright (c) Sierra Research
sgurunat
previously approved these changes
Apr 23, 2026
amberjain1
previously approved these changes
Apr 23, 2026
MCP-based agentic code execution demo with Flowise, supporting retail, airline, stocks, banking, and triage domains. - Two-server MCP architecture: tools-server + sandbox-server - Flowise 3.0.12 as visual agent UI (docker-compose included) - Auto-download of tau2-bench databases (airline, retail) - LLM deployment guide (EI Helm + standalone Docker) - Comprehensive troubleshooting section - Per-session database isolation for concurrent users Source: https://github.com/intel-sandbox/Agentic-Code-Execution Signed-off-by: Rafal Bogdanowicz <rafal.bogdanowicz@intel.com>
Signed-off-by: Rafal Bogdanowicz <rafal.bogdanowicz@intel.com>
…tead - Remove Flowise service from docker-compose.yml (use plugins/agenticai) - Update README to reference EI agenticai plugin for Flowise deployment - Update Flowise MCP config, troubleshooting sections for K8s context - Remove FLOWISE_PORT from .env Signed-off-by: Rafal Bogdanowicz <rafal.bogdanowicz@intel.com>
…to table Signed-off-by: Rafal Bogdanowicz <rafal.bogdanowicz@intel.com>
Signed-off-by: Rafal Bogdanowicz <rafal.bogdanowicz@intel.com>
Sync with standalone Agentic-Code-Execution repo refactor:
- Replace tools-server/, system-prompts/, data/ with examples/{domain}/ structure
- Each domain (retail, airline, stocks, banking, triage) is self-contained
- Remove start_all.sh (unused)
- Update docker-compose.yml, Dockerfile, .gitignore, README paths
- Update MCP server imports and default DB/session paths
Signed-off-by: Rafal Bogdanowicz <rafal.bogdanowicz@intel.com>
Reference EI deployment guide in Option A section for prerequisites and cluster setup context. Signed-off-by: Rafal Bogdanowicz <rafal.bogdanowicz@intel.com>
Pin all dependencies to exact versions verified on a working installation: - requirements.txt: fastmcp==3.2.3, mcp==1.27.0, pydantic==2.13.0, etc. - examples/requirements.txt: fastmcp==3.2.3, pydantic==2.13.0, uvicorn==0.44.0, starlette==1.0.0 - sandbox-server/requirements.txt: + pydantic-monty==0.0.11 Also fix code-mode git URL (intel-sandbox -> universal-tool-calling-protocol). Signed-off-by: Rafal Bogdanowicz <rafal.bogdanowicz@intel.com>
…pt updates - Retail: port dict returns, direct access, and optimized prompts from tau2-bench - Retail: improvements for Code Execution Retail Agent - Remove <policy> tags from system prompt (confuses agent) - Rename Flowise retail agentflow to agentflow_fast_code_execution_retail.json - Update error_hints.py with improved json.loads guidance - Sandbox server improvements Signed-off-by: Rafal Bogdanowicz <rafal.bogdanowicz@intel.com>
…ise rename - Update retail system prompt with best tau2 score version - Add example retail conversation to README - Revert Flowise retail agentflow rename back to agentflow_code_execution_retail.json Signed-off-by: Rafal Bogdanowicz <rafal.bogdanowicz@intel.com>
- Add Qwen/Qwen3-Coder-30B-A3B-Instruct to EI model menu (CPU option 27) - Add model config in xeon-values.yaml (qwen3_coder parser, 10GB KV cache) - Add deploy/uninstall tasks in deploy-inference-models.yml - README: retail-first copy-paste flow, remove redundant Flowise summary - README: add test user credentials (Mia Garcia, Aarav Anderson) Signed-off-by: Rafal Bogdanowicz <rafal.bogdanowicz@intel.com>
- Remove pre-download model section (EI handles this) - Simplify Option A: single helm install, link to EI docs - Remove Option B (Standalone Docker) — not needed for EI context - Remove EI-handled troubleshooting (OOMKilled, sched_setaffinity, LOGNAME, proxy, ECR) - Add security disclaimer for execute_python sandbox - Add THIRD_PARTY_NOTICES for τ-bench (MIT, Sierra Research) - Add _disclaimer field to bundled banking and stocks db.json - Update Data Attribution with synthetic data notice - Sync with latest Agentic-Code-Execution upstream changes Signed-off-by: Rafal Bogdanowicz <rafal.bogdanowicz@intel.com>
- README: add hardware requirements table, NUMA considerations section, and multi-NUMA warning for EI deployment - docker-compose: add PYTHON_BASE_IMAGE build arg for both services Signed-off-by: Rafal Bogdanowicz <rafal.bogdanowicz@intel.com>
a1168ad to
d74d1dd
Compare
sgurunat
approved these changes
Apr 24, 2026
amberjain1
approved these changes
Apr 24, 2026
Collaborator
|
Merging this as customer is waiting for this. Will ask Rafal to fix those Bandit scan issues and push the changes asap. |
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.This suggestion is invalid because no changes were made to the code.Suggestions cannot be applied while the pull request is closed.Suggestions cannot be applied while viewing a subset of changes.Only one suggestion per line can be applied in a batch.Add this suggestion to a batch that can be applied as a single commit.Applying suggestions on deleted lines is not supported.You must change the existing code in this line in order to create a valid suggestion.Outdated suggestions cannot be applied.This suggestion has been applied or marked resolved.Suggestions cannot be applied from pending reviews.Suggestions cannot be applied on multi-line comments.Suggestions cannot be applied while the pull request is queued to merge.Suggestion cannot be applied right now. Please check back later.
Adds a new sample solution AgenticCodeExecution — a self-contained MCP-based agentic code execution demo supporting retail, airline, stocks, banking, and triage domains.
What's included: