You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
✅ Edge case - future date: handled gracefully (0.6s)
✅ Log file structure validated
Phase 3: Audit Command (3 tests)
✅ Audit successful run: comprehensive report in 56ms
✅ Audit with URL format: works identically to numeric ID (57ms)
✅ Edge case - invalid run ID: graceful error with suggestions (377ms)
Phase 4: Compile Command (2 tests)
✅ Compile single workflow: successful (1.8s)
✅ Compile multiple workflows: successful with graceful handling of missing workflow (1.8s)
Performance Benchmarks
All performance metrics are within expected ranges:
Tool
Operation
Time
Status
logs
Basic download (3 runs)
25.9s
✅ Expected
logs
Engine filter (3 runs)
14.6s
✅ Good
logs
Date filter (3 runs)
11.5s
✅ Good
audit
Successful run
56ms
✅ Excellent
audit
URL-based
57ms
✅ Excellent
audit
Invalid ID
377ms
✅ Good
compile
Single workflow
1.8s
✅ Acceptable
compile
Multiple workflows
1.8s
✅ Acceptable
Issues Found
1. Workflow Name Filter Failure (Medium Severity) ❌
Tool: logs Description: The --workflow-name filter fails to find workflows that exist in the repository Impact: Users cannot filter logs by workflow name, requiring manual filtering of results Workaround: Use other filters (engine, date) and manually filter results Issue: Created (see related issues)
Positive Findings ✨
Excellent Error Handling: All edge cases (future dates, invalid run IDs, missing workflows) are handled gracefully with helpful error messages and suggestions
Strong Performance: Audit command is exceptionally fast (<100ms), logs command performs within expected ranges
Comprehensive Audit Reports: Audit output includes all expected sections:
Flexible Input Formats: Audit accepts both numeric run IDs and full GitHub URLs
Well-Structured Log Storage: Downloaded logs are organized by run ID with clear directory structure
Recommendations
Fix workflow name filtering: Investigate and resolve the workflow name resolution mismatch in the logs tool (issue created)
Documentation: Consider documenting the expected time ranges for logs downloads (10-25s is normal)
Caching: The logs tool caches results effectively - this could be highlighted in documentation as a feature
Conclusion
The agentic-workflows MCP server tools are functioning well overall with only one medium-severity issue identified. The audit and compile commands work flawlessly with excellent performance. The logs command has robust filtering capabilities except for the workflow-name filter. Error handling is exemplary across all tools.
Next Steps:
Address the workflow name filter issue
Continue daily monitoring of tool performance
Document normal performance ranges for user expectations
reacted with thumbs up emoji reacted with thumbs down emoji reacted with laugh emoji reacted with hooray emoji reacted with confused emoji reacted with heart emoji reacted with rocket emoji reacted with eyes emoji
Uh oh!
There was an error while loading. Please reload this page.
-
Testing Session Overview
Date: 2026-05-20
Run ID: §26145567529
Tools Tested: audit, logs, compile (agentic-workflows MCP server)
Total Tests: 12 | Passed: 11 | Failed: 1
Overall Health: 🟢 Healthy (91% success rate)
Test Coverage Summary
Phase 1: Environment Setup ✅
Phase 2: Logs Command (6 tests)
Phase 3: Audit Command (3 tests)
Phase 4: Compile Command (2 tests)
Performance Benchmarks
All performance metrics are within expected ranges:
Issues Found
1. Workflow Name Filter Failure (Medium Severity) ❌
Tool: logs
Description: The
--workflow-namefilter fails to find workflows that exist in the repositoryImpact: Users cannot filter logs by workflow name, requiring manual filtering of results
Workaround: Use other filters (engine, date) and manually filter results
Issue: Created (see related issues)
Positive Findings ✨
Excellent Error Handling: All edge cases (future dates, invalid run IDs, missing workflows) are handled gracefully with helpful error messages and suggestions
Strong Performance: Audit command is exceptionally fast (<100ms), logs command performs within expected ranges
Comprehensive Audit Reports: Audit output includes all expected sections:
Robust Compilation: Compile command validates workflows, provides clear error messages, and generates valid .lock.yml files
Flexible Input Formats: Audit accepts both numeric run IDs and full GitHub URLs
Well-Structured Log Storage: Downloaded logs are organized by run ID with clear directory structure
Recommendations
Fix workflow name filtering: Investigate and resolve the workflow name resolution mismatch in the logs tool (issue created)
Documentation: Consider documenting the expected time ranges for logs downloads (10-25s is normal)
Caching: The logs tool caches results effectively - this could be highlighted in documentation as a feature
Conclusion
The agentic-workflows MCP server tools are functioning well overall with only one medium-severity issue identified. The audit and compile commands work flawlessly with excellent performance. The logs command has robust filtering capabilities except for the workflow-name filter. Error handling is exemplary across all tools.
Next Steps:
Beta Was this translation helpful? Give feedback.
All reactions