构建 AI Agent 应用

概览 • 博客文章 • 特性 • 安装 • 使用 • 目录结构 • 示例 • 测试

概览

本仓库提供了一个统一的、多框架的平台，用于设计、实现和评估 AI 驱动的 Agent（智能体）。通过将场景定义与框架特定代码分离，我们实现了：

每个场景的单一规范（位于 src/scenarios/ 下）。
在 LangGraph、LangChain、Autogen（以及更多）中的并行实现。
一个共享的评估工具，用于比较不同框架的输出。
内置的可观测性（Loki 日志记录 & OpenTelemetry/Tempo）。
核心工具和遥测设置的单元测试。

无论您是在构建电子商务支持机器人、IT 支持台助手、语音代理，还是介于两者之间的任何东西，此代码库都能帮助您从原型扩展到生产覆盖——同时保持一致性和可重用性。

博客文章

特性

与框架无关的场景规范 src/scenarios/<scenario_name>/ 下的每个场景包含：
- spec.md：用户旅程和成功标准的通俗英语描述。
- data/：用于快速测试或演示的示例输入/输出 JSON。
- evaluation/：一个共享的 run_eval.py 工具加上一个“黄金”评估集（JSON 或 CSV）。
多框架实现 在以下目录下并行实现每个场景：
- src/frameworks/langgraph/
- src/frameworks/autogen/ （通过遵循相同的文件夹模式轻松添加更多框架。）
内置可观测性
- Loki 日志记录： src/common/observability/loki_logger.py 将结构化日志发布到本地 Loki 端点。
- OpenTelemetry / Tempo： src/common/observability/instrument_tempo.py 设置 OTLP 导出器并将跨度（父级和子级）检测到 Tempo。
核心工具和遥测的单元测试
- 评估工具的测试： tests/evaluation/test_ai_judge.py & test_memory_evaluation.py
- 可观测性代码的测试（monkeypatching 导出器）： tests/observability/test_loki_logger.py & test_instrument_tempo.py

安装

克隆仓库

git clone https://github.com/alanhou/ai-agent.git
cd ai-agent

创建 Conda（或 Virtualenv）环境

# 使用 Conda
conda env create -f python/environment.yml
conda activate agents

安装 Python 依赖项（以及可编辑的 “src” 包）
```
pip install -r python/requirements.txt
pip install -e python/src
```
- pip install -e src 确保 src/ 下的模块（例如 common.*，frameworks.*）是可导入的。

使用

1. 运行场景评估

每个场景都包含一个共享的评估脚本：

# 从仓库根目录：
cd python/src/common/scenarios/ecommerce_customer_support/evaluation

python -m python.src.common.evaluation.batch_evaluation \
  --dataset python/src/common/evaluation/scenarios/ecommerce_customer_support_evaluation_set.json \
  --graph_py python/src/frameworks/langgraph_agents/ecommerce_customer_support/customer_support_agent.py

2. 启动单个框架 Agent

如果您想手动运行电子商务 Agent 的 LangGraph 版本：

python - << 'PYCODE'
from frameworks.langgraph.scenarios.ecommerce_customer_support.implementation import run_ecommerce_support

payload = {
  "order": {"order_id": "A12345", "status": "Delivered", "total": 19.99},
  "messages": [{"type": "human", "content": "My mug arrived broken. Refund?"}]
}

response = run_ecommerce_support(payload)
print(response)
PYCODE

根据其他场景或框架相应地替换 run_ecommerce_support 和 payload 形状。

3. 可观测性

Loki 日志记录 代码中对 log_to_loki(label, message) 的任何调用都会将 JSON payload 发送到：
```
http://localhost:3100/loki/api/v1/push
```
将 Grafana/Loki 指向该端点以实时查看日志。
OpenTelemetry / Tempo
```
from common.observability.instrument_tempo import do_work
do_work()  # 向 OTLP 端点 (localhost:3200) 发出一个父跨度和三个子跨度
```
要检测您自己的函数，请导入 tracer = common.observability.instrument_tempo.tracer 并将代码包装在 with tracer.start_as_current_span("span-name"): 块中。

目录结构

以下是所有内容的组织方式概览：

ai-agent/
├── README.md
├── scenarios/                 
│   ├── ecommerce_customer_support.jsonl
│   └── ...
├── python/                    
│   ├── .gitignore
│   ├── environment.yml
│   ├── requirements.txt
│   ├── conftest.py
│   ├── src/
│   │   ├── common/
│   │   └── frameworks/
│   └── tests/
└── go/                        
    ├── go.mod
    ├── cmd/
    └── internal/

示例

1. 运行 LangChain Agent（电子商务支持）

# 从仓库根目录：
cd python/src/frameworks/langchain/scenarios/ecommerce_customer_support

# 示例用法：
python - << 'PYCODE'
from frameworks.langchain.scenarios.ecommerce_customer_support.implementation import run_ecommerce_support

payload = {
  "order": {"order_id": "A12345", "status": "Delivered", "total": 19.99},
  "messages": [{"type": "human", "content": "My mug arrived broken. Refund?"}]
}

response = run_ecommerce_support(payload)
print(response)
PYCODE

2. 运行 LangGraph Agent（电子商务支持）

# 从仓库根目录：
cd python/src/frameworks/langgraph/scenarios/ecommerce_customer_support

# 示例用法：
python - << 'PYCODE'
from frameworks.langgraph.scenarios/ecommerce_customer_support.implementation import run_ecommerce_support

payload = {
  "order": {"order_id": "A12345", "status": "Delivered", "total": 19.99},
  "messages": [{"type": "human", "content": "My mug arrived broken. Refund?"}]
}

response = run_ecommerce_support(payload)
print(response)
PYCODE

测试

我们使用 pytest 进行所有单元测试：

评估工具测试：
- tests/evaluation/test_ai_judge.py
- tests/evaluation/test_memory_evaluation.py
可观测性测试：
- tests/observability/test_loki_logger.py
- tests/observability/test_instrument_tempo.py

要运行完整的测试套件：

cd /Users/your-user/dev/ai-agent/python
pytest -q

所有测试都应该通过且没有错误。

Name		Name	Last commit message	Last commit date
Latest commit History 9 Commits
examples		examples
go		go
python		python
scenarios		scenarios
.env.example		.env.example
.gitignore		.gitignore
README.md		README.md
README_en.md		README_en.md
go.mod		go.mod
go.sum		go.sum

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

构建 AI Agent 应用

概览

博客文章

特性

安装

使用

1. 运行场景评估

2. 启动单个框架 Agent

3. 可观测性

目录结构

示例

1. 运行 LangChain Agent（电子商务支持）

2. 运行 LangGraph Agent（电子商务支持）

测试

About

Uh oh!

Releases

Packages

Languages

alanhou/ai-agent

Folders and files

Latest commit

History

Repository files navigation

构建 AI Agent 应用

概览

博客文章

特性

安装

使用

1. 运行场景评估

2. 启动单个框架 Agent

3. 可观测性

目录结构

示例

1. 运行 LangChain Agent（电子商务支持）

2. 运行 LangGraph Agent（电子商务支持）

测试

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages