Bench'd Independent Benchmark Results for CrewAI Memory #5800

Hi team! We're [Bench'd](https://benchd.ai) — an independent benchmark platform for AI memory systems.

We ran **CrewAI Memory** through LongMemEval (500 questions).

## Results

| Benchmark | Score | Questions | Status |
|-----------|-------|-----------|--------|
| LongMemEval v1.0 | **46.0%** | 500 | Verified |

**Per-dimension:** Recall 74.4% · Temporal 35.5% · Reasoning 29.3%

Full results: [benchd.ai/system/crewai-memory](https://benchd.ai/system/crewai-memory)

The LLM baseline (no memory) scores 57.6%, so CrewAI Memory's 46.0% overall sits 11.6 points below it. Recall is strong at 74.4%, but Temporal (35.5%) and Reasoning (29.3%) pull the overall score down.
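To make the comparison concrete, here is a minimal Python sketch of the arithmetic behind those numbers. It uses only the figures quoted in this issue. The helper name `implied_correct` is hypothetical, and the sketch assumes the overall score is the plain fraction of the 500 questions judged correct, which the results above do not state explicitly.

```python
# Figures quoted in this issue.
OVERALL_PCT = 46.0    # CrewAI Memory, LongMemEval v1.0
BASELINE_PCT = 57.6   # LLM baseline, no memory
QUESTIONS = 500


def implied_correct(pct: float, total: int) -> int:
    """Convert a percentage score into a count of correct answers.

    Assumption: the score is simply correct / total, with no weighting
    across dimensions. This helper is illustrative, not part of the harness.
    """
    return round(pct / 100 * total)


# Gap in percentage points (negative means below the baseline).
gap_points = round(OVERALL_PCT - BASELINE_PCT, 1)

crewai_correct = implied_correct(OVERALL_PCT, QUESTIONS)
baseline_correct = implied_correct(BASELINE_PCT, QUESTIONS)

print(f"gap vs. baseline: {gap_points:+.1f} points")
print(f"implied correct answers: {crewai_correct} vs. {baseline_correct} of {QUESTIONS}")
```

Under that assumption the gap is -11.6 points, or roughly 230 correct answers against 288 for the baseline. The per-dimension scores are left out on purpose, since the number of questions in each dimension is not given here.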

## Run it yourself

```bash
pip install benchd-harness
benchd run -a crewai-memory -b longmemeval-v1 --judge --key ./keys/private.key
```

---
*[Bench'd](https://benchd.ai) — the neutral benchmark standard for AI memory systems.*
