Skip to content

lisaoliv/agents

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

2 Commits
 
 
 
 

Repository files navigation

agents

Interactive Tableau dashboard evaluating autonomous AI agent performance.

Dataset: agentic_ai_performance_dataset_20250622.csv
Source: Agentic AI Performance Dataset 2025 (CC0: Public Domain)


VARIABLE DESCRIPTIONS

Column Name Description Values/Range
agent_id Unique identifier for each AI agent 5,000 unique values (format: AG_XXXXX)
agent_type Category of AI agent 16 types (Customer Service, Project Manager, Code Assistant, QA Tester, etc.)
model_architecture Underlying AI model 10 models (GPT-4o, Claude-3.5, LLaMA-3, PaLM-2, Gemini-Pro, InstructGPT, Mixtral-8x7B, Falcon-180B, CodeT5+, Transformer-XL)
deployment_environment Deployment location 6 environments (Cloud, Server, Edge, Mobile, Desktop, Hybrid)
task_category Type of task performed 10 categories (Text Processing, Communication, Decision Making, Creative Writing, Planning & Scheduling, Learning & Adaptation, Research & Summarization, Data Analysis, Code Generation, Problem Solving)
task_complexity Complexity level 1-10 scale
autonomy_level Level of independence 1-10 scale
success_rate Task completion success rate 0-1 (0.30 to 0.98)
accuracy_score Output accuracy score 0-1 (0.40 to 0.96)
efficiency_score Resource efficiency score 0-1 (0.30 to 0.89)
execution_time_seconds Time taken to complete task Seconds
response_latency_ms Response time Milliseconds
memory_usage_mb Memory consumption Megabytes
cpu_usage_percent CPU utilization Percentage (0-100)
cost_per_task_cents Cost per task execution US cents
human_intervention_required Whether human assistance was needed True/False
error_recovery_rate Rate of successful error recovery 0-1
multimodal_capability Ability to process multiple input types Score or flag
edge_compatibility Compatibility with edge deployment Score or flag
privacy_compliance_score Privacy/compliance adherence 0-1
bias_detection_score Measure of bias in outputs 0-1
timestamp When the task was executed Date/time format
data_quality_score Quality of input/output data Score
performance_index Composite performance metric Calculated index
cost_efficiency_ratio Cost vs performance ratio Ratio
autonomous_capability_score Overall autonomy capability Score

About

Interactive Tableau dashboard evaluating autonomous AI agent performance (latency, token usage, task success rates, and cost efficiency) across environments and models

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors