Interactive Tableau dashboard evaluating autonomous AI agent performance.

Dataset: agentic_ai_performance_dataset_20250622.csv

Source: Agentic AI Performance Dataset 2025 (CC0: Public Domain)

| Column Name | Description | Values/Range |
|---|---|---|
| agent_id | Unique identifier for each AI agent | 5,000 unique values (format: AG_XXXXX) |
| agent_type | Category of AI agent | 16 types (Customer Service, Project Manager, Code Assistant, QA Tester, etc.) |
| model_architecture | Underlying AI model | 10 models (GPT-4o, Claude-3.5, LLaMA-3, PaLM-2, Gemini-Pro, InstructGPT, Mixtral-8x7B, Falcon-180B, CodeT5+, Transformer-XL) |
| deployment_environment | Deployment location | 6 environments (Cloud, Server, Edge, Mobile, Desktop, Hybrid) |
| task_category | Type of task performed | 10 categories (Text Processing, Communication, Decision Making, Creative Writing, Planning & Scheduling, Learning & Adaptation, Research & Summarization, Data Analysis, Code Generation, Problem Solving) |
| task_complexity | Complexity level | 1-10 scale |
| autonomy_level | Level of independence | 1-10 scale |
| success_rate | Task completion success rate | 0-1 scale (values in the data: 0.30 to 0.98) |
| accuracy_score | Output accuracy score | 0-1 scale (values in the data: 0.40 to 0.96) |
| efficiency_score | Resource efficiency score | 0-1 scale (values in the data: 0.30 to 0.89) |
| execution_time_seconds | Time taken to complete task | Seconds |
| response_latency_ms | Response time | Milliseconds |
| memory_usage_mb | Memory consumption | Megabytes |
| cpu_usage_percent | CPU utilization | Percentage (0-100) |
| cost_per_task_cents | Cost per task execution | US cents |
| human_intervention_required | Whether human assistance was needed | True/False |
| error_recovery_rate | Rate of successful error recovery | 0-1 |
| multimodal_capability | Ability to process multiple input types | Score or flag |
| edge_compatibility | Compatibility with edge deployment | Score or flag |
| privacy_compliance_score | Privacy/compliance adherence | 0-1 |
| bias_detection_score | Measure of bias in outputs | 0-1 |
| timestamp | When the task was executed | Date/time format |
| data_quality_score | Quality of input/output data | Score |
| performance_index | Composite performance metric | Calculated index |
| cost_efficiency_ratio | Cost vs performance ratio | Ratio |
| autonomous_capability_score | Overall autonomy capability | Score |
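
Before connecting the CSV to Tableau, it can help to check rows against the ranges documented in the table above. The following is a minimal sketch using only the Python standard library, not part of the dataset or the dashboard. The helper names (`validate_row`, `validate_csv`, `in_range`) are hypothetical, the sample rows are made up for illustration, and two details are assumptions: that each `X` in `AG_XXXXX` stands for a digit, and that `human_intervention_required` is stored as the literal strings `True` and `False`. Columns documented only as "Score", "Ratio", or "Score or flag" are left unchecked because the table does not state their ranges.

```python
import csv
import io
import re

# Ranges copied from the data dictionary; columns without a documented
# range are intentionally not validated here.
UNIT_INTERVAL_COLUMNS = (
    "success_rate", "accuracy_score", "efficiency_score",
    "error_recovery_rate", "privacy_compliance_score", "bias_detection_score",
)
SCALE_1_TO_10_COLUMNS = ("task_complexity", "autonomy_level")

# Assumption: each X in the documented AG_XXXXX format is a digit.
AGENT_ID_PATTERN = re.compile(r"^AG_\d{5}$")


def in_range(text, low, high):
    """Return True if text parses as a number within [low, high]."""
    try:
        return low <= float(text) <= high
    except (TypeError, ValueError):
        return False


def validate_row(row):
    """Return a list of problems found in one row (a dict of strings)."""
    problems = []
    if not AGENT_ID_PATTERN.match(row.get("agent_id") or ""):
        problems.append("agent_id does not match AG_XXXXX")
    for name in UNIT_INTERVAL_COLUMNS:
        if name in row and not in_range(row[name], 0.0, 1.0):
            problems.append(f"{name} is outside 0-1")
    for name in SCALE_1_TO_10_COLUMNS:
        if name in row and not in_range(row[name], 1, 10):
            problems.append(f"{name} is outside 1-10")
    if "cpu_usage_percent" in row and not in_range(row["cpu_usage_percent"], 0, 100):
        problems.append("cpu_usage_percent is outside 0-100")
    # Assumption: booleans are stored as the literal strings True / False.
    if "human_intervention_required" in row and \
            row["human_intervention_required"] not in ("True", "False"):
        problems.append("human_intervention_required is not True/False")
    return problems


def validate_csv(handle):
    """Map file line number (header is line 1) to that row's problems."""
    report = {}
    for line_number, row in enumerate(csv.DictReader(handle), start=2):
        problems = validate_row(row)
        if problems:
            report[line_number] = problems
    return report


# Made-up sample rows for illustration; not taken from the real dataset.
SAMPLE = (
    "agent_id,task_complexity,success_rate,cpu_usage_percent,human_intervention_required\n"
    "AG_00001,7,0.82,45.5,False\n"
    "AG_2,11,1.4,45.5,no\n"
)

if __name__ == "__main__":
    for line_number, problems in validate_csv(io.StringIO(SAMPLE)).items():
        print(line_number, problems)
```

To run the same check on the real file, pass an open file handle instead of the sample, for example `validate_csv(open("agentic_ai_performance_dataset_20250622.csv", newline=""))`.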