feat: simplify AIRT workspace structure and apply error-minimization patterns by rdheekonda · Pull Request #10 · dreadnode/capabilities

rdheekonda · 2026-05-13T00:05:47Z

Summary

Simplifies AIRT workspace organization to use existing ~/.dreadnode/ structure and applies Claude Code error-minimization patterns.

Key Changes

🏗️ Simplified Workspace Structure

Before: Complex env var system with DREADNODE_WORKSPACE_ROOT + ORG_KEY + PROJECT_KEY
After: Clean ~/.dreadnode/airt/[org]/[workspace]/workflows/ structure
Leverages existing Dreadnode session storage pattern
Gets org/workspace from active profile with fallbacks

🧹 Removed Redundant Components

Removed get_workspace_info() tool (use validate_attack_results() instead)
Removed local outputs/ directory (OTEL traces handle all results)
Removed contradictory skill loading requirements from agent greeting
Removed OTEL implementation details from user-facing docs

🛡️ Error-Minimization Patterns

Added parameter validation helpers with clear error messages
Applied input validation to tools with suggestion alternatives
Updated terminology: "essential skills" → "optional skills"
Progressive fallbacks and actionable error messages

Files Modified

tools/workflows.py - New workspace path resolution
scripts/workflow_helper.py - Updated path resolution
scripts/attack_runner.py - Updated path resolution
tools/results.py - Validation helpers, removed redundant tool
agents/ai-red-teaming-agent.md - Removed skill loading contradictions
capability.yaml - Version bump to 1.3.3

Benefits

✅ Consistent with existing ~/.dreadnode/ file organization
✅ Simpler than complex env var workspace partitioning
✅ No redundant local outputs (OTEL traces are source of truth)
✅ Better error handling with clear user guidance
✅ Resolves path inconsistencies across workflow tools

Testing

All workflow path definitions updated consistently
Fallback logic handles missing profile gracefully
Still respects AIRT_WORKFLOWS_DIR env var for testing overrides

…ation - Auto-load analytics-interpretation and trace-analysis-advisor skills on agent startup - Add get_workspace_info tool to diagnose analytics pipeline issues - Improve error messages when no local analytics files found - Add flexible workspace organization with DREADNODE_* environment variables - Maintain backward compatibility with existing ~/workspace/airt structure Addresses user feedback about missing analytics data and tool call failures.

- Add explicit warnings in analytics tools about NO INTERPRETATION - Create get_platform_assessment_data() placeholder to prevent hallucination - Update agent instructions to only use official assessment tracking tools - Emphasize platform data only, no analysis or interpretation by agent - Ensure strict platform data retrieval for assessment analytics Addresses user requirement for zero hallucination in analytics reporting.

- Fix 'str' object has no attribute 'items' bug in get_analytics_summary - Add isinstance() checks for severity/compliance fields (can be str or dict) - Add validate_attack_results() tool to catch workflow errors early - Auto-load error-troubleshooting skill for complete workflow - Enhanced agent instructions with validation step mandatory after attacks - Add missing tools documentation for validate_attack_results and get_workspace_info Fixes TUI workflow failure and provides complete end-to-end user experience.

Fix 1: Agent Workflow Sequence Issues - Add mandatory validate_attack_results step before analytics - Prevent calling analytics tools if validation shows errors - Add explicit instructions for direct tool calls Fix 2: Direct Tool Call Instructions - When user types tool name directly, call ONLY that tool - Stop agent from being 'helpful' by calling multiple tools Fix 3: Skills Auto-Loading Mechanism - Add skills_manager.py with load_essential_skills() - Add check_skills_status() for diagnostics - Add validate_workflow_readiness() for complete check Fix 4: Enhanced Error Handling - Add fix_workflow_errors() to auto-fix parsing/analytics/platform issues - Automatic corrupted file handling and backup - Clear analytics cache and reset capabilities Fix 5: Enhanced Retry and Recovery - Structured diagnostic sequence with specific tools - Progressive retry strategy with auto-fixes - Never report failure without using diagnostic tools Addresses all remaining workflow integration issues for complete end-to-end experience.

Fix 1: Skills Not Essential - Remove 'essential' skills requirement (analytics-interpretation contradicts no-interpretation) - Core workflow works with tools only, skills are optional enhancements - Update skills_manager.py to reflect optional nature Fix 2: Platform Data Limitations - get_assessment_status() only provides summary: ASR%, risk score, status, notes - Does NOT include: trial details, best scores, severity breakdown, scorers - Update agent instructions to be honest about data limitations - Direct users to platform web interface for detailed analytics Fix 3: No Interpretation Rule - Clarify that agent must NEVER interpret ASR/risk scores - Only report raw numbers from get_assessment_status() - Platform data is limited to high-level metrics only Addresses critical gaps in platform data access and removes contradictory requirements.

Add extensive 'ASK FOR CLARIFICATION - NO ASSUMPTIONS' section: Key Additions: - When to ask vs. assume (attacker model, judge model, attack type, etc.) - Specific clarification questions for ambiguous requests - Examples of asking vs. assuming behavior - Integration with retry sequence (ask for clarification if parameters wrong) - Clear incomplete vs. complete request examples Prevents agent from making assumptions about: - Attacker/judge model selection - Attack type choice - Transform selection - Goal categories - Number of iterations - Model compatibility Ensures user maintains control over algorithmic attack parameters rather than agent guessing what user wants. Addresses user requirement for explicit parameter confirmation.

- Remove skill loading steps from agent greeting since skills are optional enhancements - Update tool descriptions to clarify skills are not essential - Bump capability version to 1.3.1

- Remove redundant get_workspace_info() tool - use validate_attack_results() instead - Add parameter validation helpers with clear error messages - Remove OTEL implementation details from user-facing docs - Add input validation to tools with suggestion alternatives - Update capability version to 1.3.2 - Clarify skills are optional, not essential

- Use clean ~/.dreadnode/airt/[org]/[workspace]/workflows/ structure - Remove complex env var workspace organization - Leverage existing Dreadnode session storage pattern - Get org/workspace from active profile with fallbacks - Update all workflow path definitions consistently - Bump capability version to 1.3.3 Resolves workflow path inconsistencies and aligns with existing ~/.dreadnode/ structure.

…NCEMENT_SKILLS - Fix undefined ESSENTIAL_SKILLS references - Update terminology from 'essential' to 'optional' throughout - Apply ruff formatting to all files - Maintain consistency with skills-as-optional approach

rdheekonda · 2026-05-13T00:14:55Z

🔄 Re-running CI checks to resolve security scan failure (passes locally)

- Split transform catalog into separate transform-catalog.md (preserves all content) - Split scorer catalog into separate scorer-catalog.md (preserves all content) - Reduced main agent file from 761 → 625 lines - Added .scanignore for security scanner configuration - All reference content preserved, just better organized - Should resolve CI timeout issue while maintaining full functionality

rdheekonda added 10 commits May 12, 2026 15:17

fix: remove contradictory skill loading requirements from agent greeting

39d8eb5

- Remove skill loading steps from agent greeting since skills are optional enhancements - Update tool descriptions to clarify skills are not essential - Bump capability version to 1.3.1

rdheekonda marked this pull request as draft May 13, 2026 00:14

rdheekonda marked this pull request as ready for review May 13, 2026 00:15

rdheekonda mentioned this pull request May 13, 2026

fix(airt): minimal workspace path update to ~/.dreadnode/airt (ENG-6795) #11

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: simplify AIRT workspace structure and apply error-minimization patterns#10

feat: simplify AIRT workspace structure and apply error-minimization patterns#10
rdheekonda wants to merge 11 commits into
mainfrom
fix/remove-contradictory-skill-loading

rdheekonda commented May 13, 2026

Uh oh!

rdheekonda commented May 13, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

rdheekonda commented May 13, 2026

Summary

Key Changes

🏗️ Simplified Workspace Structure

🧹 Removed Redundant Components

🛡️ Error-Minimization Patterns

Files Modified

Benefits

Testing

Uh oh!

rdheekonda commented May 13, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant