v0.3.0

@chaynabors chaynabors released this 12 Jun 15:11
· 14 commits to main since this release
42601de

What's Changed

  • feat(redteam): add built-in red teaming support by @kevmyung in #184
  • chore: allow importing EvaluationReport from root by @poshinchen in #238
  • chore: added trace-based evaluators into defaults by @poshinchen in #244
  • chore(report): always return flattened report by @poshinchen in #241
  • feat: add chaos resilience evaluators (failure communication, partial completion, recovery strategy) by @ybdarrenwang in #236
  • fix(mappers): join all toolResult.content blocks to fix faithfulness false negatives by @poshinchen in #240
  • chore(version): bumped strands-agents-version to the latest by @poshinchen in #246
  • feat(redteam): add Crescendo multi-turn attack strategy by @yeomjiwonyeom in #245
  • feat(cli): added strands-evals cli by @poshinchen in #243
  • chore(evaluator): added evaluator name and evaluator_type for report by @poshinchen in #249
  • chore(cli): added single case evaluation command by @poshinchen in #252
  • fix(templates): correct doc link and clean up issue/PR templates by @yonib05 in #256
  • docs: add AI contribution guidance to CONTRIBUTING and PR template by @yonib05 in #257
  • docs(agents): add high-quality PR guidance to AGENTS.md by @yonib05 in #258
  • docs(agents): add community and character guidance to AGENTS.md by @yonib05 in #261
  • feat(issue-labeler): add LLM issue labeler for area and type by @yonib05 in #255
  • feat(redteam): add Bad Likert Judge multi-turn attack strategy by @yeomjiwonyeom in #248
  • chore(cli): added generate command for experiment generation by @poshinchen in #260

New Contributors

Full Changelog: v0.2.1...v0.3.0