EmbedClaw

Decouple LLM, Tools, Agent, and Channels—then pack them onto a single ESP32-S3.

EmbedClaw is not just “a chatbot on an MCU.”
It’s an Agent Runtime on a microcontroller: messages enter via Channels, the Agent orchestrates, the LLM decides, Tools execute, Memory is persisted, Skills supply task-level knowledge, and results go back out through Channels.

Origins

This project draws on the ideas and direction of:

EmbedClaw keeps the goal of running a full AI Agent on low-power hardware but focuses the architecture on decoupling LLM, Tools, Agent, and Channels.
That means you can add new models, new channels, new tools, or new Skills without rewriting the rest of the system.

Why EmbedClaw

1. Decoupled, not feature-bloated

The main idea is not “it can chat,” but that the parts that usually get tangled are separated:

Channel only handles how messages are received and sent; it doesn’t care how the LLM reasons.
Agent only handles task orchestration, context building, and the tool loop; it doesn’t care about transport.
LLM only adapts model request/response; it doesn’t care whether the message came from Feishu or WebSocket.
Tools only expose capabilities and JSON schema; they don’t care who calls them.
Skills only describe tasks; they don’t depend on internal implementation.

That gives you:

Easier addition of new chat entry points
Lower cost to switch model providers
Fast iteration on Tools and Skills
Agent capabilities that can grow without collapsing the codebase

2. Not a one-off demo—a sustainable Agent core

The repo already has a full loop:

Wi-Fi bring-up
SPIFFS mount
Channel registration and start
Tool registration
Skill install and load
LLM init
Agent loop running
Memory / session persistence

It’s a working “embedded Agent base” you can extend.

Implemented Features

Core

Module	Current implementation	Notes
LLM	Qwen `qwen-plus`	Via Alibaba DashScope OpenAI-compatible API
Web Search	Tavily Search API	For news, weather, and real-time info
Chat Channel	Feishu, WebSocket, QQBot	Feishu long connection, local WebSocket chat, official QQBot gateway
Agent	ReAct tool loop	Model can call tools, read results, then continue
Long-term memory	`/spiffs/memory/MEMORY.md`	User profile, preferences, stable facts
Short-term memory	`/spiffs/session/se_<hash>.jsonl`	Conversation history including tool call traces
Daily notes	`/spiffs/memory/<YYYY-MM-DD>.md`	Recent events and daily context
Skills	SPIFFS pre-installed + runtime	Task instructions as Markdown
Tools	Files, time, search, cron, GPIO	Exposed to LLM via JSON schema

Registered tools

Tool	Purpose
`get_current_time`	Get current time and sync system clock
`web_search`	Web search via Tavily
`read_file`	Read files under `/spiffs`
`write_file`	Write or overwrite files under `/spiffs`
`edit_file`	Find/replace in `/spiffs` files
`list_dir`	List files under `/spiffs`
`cron_add`	Add periodic or one-shot scheduled tasks
`cron_list`	List scheduled tasks
`cron_remove`	Remove scheduled tasks
`gpio_control`	Control ESP32 GPIO pins (on, off, set, toggle, get)

Pre-installed skills

These are pre-installed as Markdown files in spiffs_data/skills/ and deployed with the SPIFFS image:

weather
daily-briefing
skill-creator

You can add more Skills as Markdown under /spiffs/skills/*.md; the Agent picks them up from the system prompt.

Architecture

flowchart LR
    U[User] --> F[Feishu Channel]
    U --> W[WebSocket Channel]
    U --> Qc[QQBot Channel]
    F --> A[Agent Loop]
    W --> A
    Qc --> A
    A --> L[LLM Provider]
    L --> Q[Qwen via DashScope]
    A --> T[Tool Registry]
    T --> S1[Web Search]
    T --> S2[File Tools]
    T --> S3[Time Tool]
    T --> S4[Cron Tool]
    T --> S5[GPIO Tool]
    A --> M1[Session Memory]
    A --> M2[Long-term Memory]
    A --> K[Skill Loader]
    A --> O[Outbound Dispatcher]
    O --> F
    O --> W
    O --> Qc

Directory layout

.
├── main/                         # App entry, Wi-Fi init
├── components/embed_claw/
│   ├── core/                     # Agent, Memory, Session, Skill Loader, Tool Registry
│   ├── llm/                      # LLM provider abstraction and implementations
│   ├── tools/                    # Tool implementations
│   ├── channel/                 # Feishu / WebSocket / QQBot channels
│   ├── embed_claw.c             # System startup entry
│   └── ec_config_internal.h     # Built-in defaults; local overrides live in main/ec_config.h
├── spiffs_data/                 # Default SPIFFS image content
│   ├── config/                  # SOUL.md, USER.md
│   ├── memory/                  # MEMORY.md
│   └── skills/                  # Pre-installed skill files
└── scripts/                     # WebSocket test script and test-app helpers

Runtime flow

After boot, the flow is:

main/main.c inits NVS, SPIFFS, Wi-Fi.
ec_embed_claw_start() registers channels, tools, skills, and inits the LLM.
The Agent loop blocks on inbound messages.
Channels turn incoming data into ec_msg_t.
The Agent loads short-term history, long-term memory, recent notes, and skill summaries into the system prompt.
The LLM decides to reply or call tools.
Tools run and results are fed back to the LLM.
Final text goes to the outbound queue.
The outbound task sends replies back through the right channel.

Storage layout

EmbedClaw uses SPIFFS for persona, user info, sessions, and memory:

Path	Purpose
`/spiffs/config/SOUL.md`	Assistant persona and style
`/spiffs/config/USER.md`	Static user info
`/spiffs/memory/MEMORY.md`	Long-term memory
`/spiffs/memory/<YYYY-MM-DD>.md`	Daily notes
`/spiffs/session/se_<hash>.jsonl`	Session history
`/spiffs/skills/*.md`	Skill files
`/spiffs/cron.json`	Cron snapshot

Notes:

Session history keeps the last 20 messages by default.
The system prompt is built from long-term memory, last 3 days of notes, and skill summaries.
cron.json is written to SPIFFS but cron state is not fully restored on reboot yet.

Quick start

Hardware and environment

You’ll need:

An ESP32-S3 dev board
16 MB Flash (default partition layout assumes 16 MB)
PSRAM (enabled by default in this project)
USB cable
ESP-IDF 5.x installed
Recommended version: ESP-IDF v5.5.2 (current validated baseline)

The default target is esp32s3. The build packs spiffs_data/ with spiffs_create_partition_image.

1. Configure keys and platform

Build-time configuration is layered:

components/embed_claw/ec_config_internal.h provides repo defaults and empty secret placeholders.
Create local main/ec_config.h for project-specific overrides. Define only the macros you want to override. The build injects this header into embed_claw, so sensitive values do not need to live in the shared component tree.

Create main/ec_config.h if needed. The default preset is Qwen:

#define EC_USE_QWEN                1
#define EC_USE_DEEPSEEK            0
#define EC_USE_DOUBAO              0
#define EC_USE_KIMI                0
#define EC_USE_HUNYUAN             0
#define EC_SECRET_SEARCH_KEY        "YOUR_TAVILY_API_KEY"
#define EC_LLM_API_KEY              "YOUR_DASHSCOPE_API_KEY"
#define EC_SECRET_FEISHU_APP_ID     "YOUR_FEISHU_APP_ID"
#define EC_SECRET_FEISHU_APP_SECRET "YOUR_FEISHU_APP_SECRET"

Under the default preset, the effective LLM settings are:

#define EC_LLM_PROVIDER_NAME       "openai"
#define EC_LLM_API_URL             "https://dashscope-intl.aliyuncs.com/compatible-mode/v1/chat/completions"
#define EC_LLM_MODEL               "qwen-plus"

Built-in OpenAI-compatible LLM presets now include:

Qwen: DashScope + qwen-plus
DeepSeek: https://api.deepseek.com/v1/chat/completions + deepseek-chat
Doubao: https://operator.las.cn-beijing.volces.com/api/v1/chat/completions + doubao-seed-1-6-251015
KiMi: https://api.moonshot.cn/v1/chat/completions + kimi-k2.5
Hunyuan: https://api.hunyuan.cloud.tencent.com/v1/chat/completions + hunyuan-turbos-latest

If you want to switch to DeepSeek, use:

#define EC_USE_QWEN                0
#define EC_USE_DEEPSEEK            1
#define EC_LLM_API_KEY              "YOUR_DEEPSEEK_API_KEY"

Notes:

Explicit EC_LLM_PROVIDER_NAME / EC_LLM_API_URL / EC_LLM_MODEL overrides take precedence over preset-derived defaults.
Only one EC_USE_* LLM preset can be 1 at a time.
The Doubao preset defaults to the Volcengine Beijing OpenAI-compatible endpoint; override EC_LLM_API_URL or EC_DOUBAO_LLM_API_URL if your region or endpoint differs.
If you skip Tavily or Feishu for now, you only need the keys for the active LLM preset.

Optional channel toggles:

#define EC_FEISHU_ENABLE 0
#define EC_QQ_ENABLE     1
#define EC_QQ_APP_ID     "YOUR_QQ_APP_ID"
#define EC_QQ_CLIENT_SECRET "YOUR_QQ_CLIENT_SECRET"

QQ uses the official QQBot route in this repo: AppID + ClientSecret -> access_token -> /gateway -> websocket. The device acts as a WebSocket client, so the device itself does not need a public IP.

2. Build

Before building, copy the esp32s3 defaults into sdkconfig.defaults so menuconfig starts from the repository's intended esp32s3 baseline:

cp sdkconfig.defaults.esp32s3 sdkconfig.defaults
idf.py set-target esp32s3
idf.py build

3. Flash and monitor

idf.py -p /dev/ttyACM0 flash monitor

On macOS the serial port is often:

/dev/cu.usbmodemXXXX

4. First-time Wi-Fi

main/wifi_connect.cpp behavior:

If Wi-Fi was saved before, it tries STA and connects.
If not, it starts a provisioning AP.

AP SSID prefix:

#define EMBED_WIFI_SSID_PREFIX "ESP32"

In provisioning mode, connect to the device’s AP and open:

http://192.168.4.1

After configuring Wi-Fi, the device switches back to normal STA mode.

WebSocket chat

WebSocket is the most direct way to talk to EmbedClaw and is ideal for debugging.

Server

Port: 18789
Path: /
Protocol: WebSocket text frames

Quick test

Use the provided script:

scripts/test_ws_client.py

Install dependency:

pip install websocket-client

Connect to the device:

python scripts/test_ws_client.py <DEVICE_IP> 18789

Example:

python scripts/test_ws_client.py 192.168.31.88 18789

Inbound message format

Simple message:

{
  "type": "message",
  "content": "Search for today's tech news"
}

With custom chat_id:

{
  "type": "message",
  "content": "Remember I like mechanical keyboards",
  "chat_id": "my-debug-session"
}

To simulate Feishu from a relay:

{
  "type": "message",
  "content": "Set a reminder for 8am tomorrow",
  "channel": "feishu",
  "chat_type": "open_id",
  "chat_id": "ou_xxx"
}

Outbound message format

Device response:

{
  "type": "response",
  "content": "Here’s today’s tech news summary.",
  "chat_id": "my-debug-session",
  "chat_type": "ws"
}

chat_type is included in outbound responses and follows the current session/inbound routing context.

Feishu (Lark) integration

EmbedClaw includes a Feishu channel that initiates a long-lived connection to Feishu to receive messages. No public IP or Webhook URL is required.

What the Feishu channel does

Uses App ID / App Secret to get tenant_access_token
Calls https://open.feishu.cn/callback/ws/endpoint for the WebSocket URL
Connects to Feishu over WebSocket
Subscribes to and handles im.message.receive_v1
Pushes text messages into the Agent
Sends replies via POST /open-apis/im/v1/messages

Setup

1. Create a Feishu app

Create an enterprise app in the Feishu open platform and note:

App ID
App Secret

2. Enable message permissions

Enable at least “receive messages” and “send messages,” and ensure the bot can be used in your tenant. Exact permission names may vary in the console.

3. Event subscription

Under “Event subscription”:

Choose Use long connection to receive events
Subscribe to im.message.receive_v1

4. Put credentials in the project

Create or edit main/ec_config.h:

#define EC_SECRET_FEISHU_APP_ID     "cli_xxx"
#define EC_SECRET_FEISHU_APP_SECRET "xxxx"

5. Build, flash, and connect

Once the device is online, the Feishu channel starts and connects to Feishu.

6. Chat

DM the bot, or
Add the bot to a group and chat there.

Reply target is chosen automatically with split routing fields:

DMs: chat_type="open_id", chat_id="<open_id>"
Groups: chat_type="chat_id", chat_id="<chat_id>"

Optional: PC relay script

The repo includes scripts/feishu_relay.py for:

Testing the Feishu event flow on a PC
Bridging Feishu messages to the device WebSocket
Debugging Feishu and the device Agent separately

For normal use, the built-in Feishu long-connection implementation is recommended.

QQBot integration

EmbedClaw also includes an official QQBot channel. This implementation follows the same route as the OpenClaw QQBot plugin instead of a OneBot bridge.

What the QQ channel does

Uses EC_QQ_APP_ID and EC_QQ_CLIENT_SECRET
Calls https://bots.qq.com/app/getAppAccessToken
Calls https://api.sgroup.qq.com/gateway
Connects to the QQ gateway over WebSocket
Sends IDENTIFY, keeps heartbeat, and handles dispatch events
Sends replies back over QQ official REST APIs

Supported inbound events

C2C_MESSAGE_CREATE
GROUP_AT_MESSAGE_CREATE
AT_MESSAGE_CREATE

These map to routing fields inside EmbedClaw:

C2C: chat_type="c2c", chat_id="<openid>"
Group: chat_type="group", chat_id="<group_openid>"
Channel: chat_type="channel", chat_id="<channel_id>"

Minimal config

Add to main/ec_config.h:

#define EC_QQ_ENABLE        1
#define EC_QQ_APP_ID        "YOUR_QQ_APP_ID"
#define EC_QQ_CLIENT_SECRET "YOUR_QQ_CLIENT_SECRET"

Optional:

#define EC_QQ_INTENTS       (1 << 25)
#define EC_QQ_RECONNECT_MS  10000

Notes

The current implementation focuses on text messages first.
The device does not expose a Webhook endpoint.
If QQ is enabled but the credentials are invalid, startup logs will show the token/gateway failure path.

See the official entry page: https://q.qq.com/qqbot/openclaw/index.html

Testing

There are two test layers in this repo:

Firmware build check: idf.py build
embed_claw unit-test-app build: ./scripts/run_unit_tests.sh build

Detailed board-side test instructions live in components/embed_claw/test/README.md.

GitHub Actions currently does compile-only checks on both the project firmware and the unit-test-app. It does not run hardware-attached tests in CI.

Persona and memory

These files are preloaded in SPIFFS:

Roles:

SOUL.md: Who the assistant is and how it speaks
USER.md: User profile
MEMORY.md: Long-term knowledge

Each turn, the Agent builds the system prompt from:

Personality
User info
Long-term memory
Recent notes
Available skills
Current turn context

That’s how it keeps continuity and “memory” across turns.

Extending the system

The repo is structured so you can extend it without rewriting core logic.

Adding a tool

Add tools_xxx.c under components/embed_claw/tools/
Define an ec_tools_t with name, description, input_schema_json, and execute
Expose a register function, e.g. esp_err_t ec_tools_xxx(void);
Add EC_TOOLS_REG(xxx) in components/embed_claw/tools/ec_tools_reg.inc

Minimal skeleton:

static esp_err_t ec_tool_demo_execute(const char *input_json, char *output, size_t output_size);

static const ec_tools_t s_demo = {
    .name = "demo_tool",
    .description = "Describe what this tool does.",
    .input_schema_json =
        "{\"type\":\"object\",\"properties\":{},\"required\":[]}",
    .execute = ec_tool_demo_execute,
};

esp_err_t ec_tools_demo(void)
{
    ec_tools_register(&s_demo);
    return ESP_OK;
}

Adding a skill

Skills are Markdown task descriptions, not code. You can:

Write them at runtime via tools to /spiffs/skills/<name>.md
Or put default skills in spiffs_data/skills/ so they’re in the SPIFFS image

Suggested format:

# Translate

Translate text between languages.

## When to use
When the user asks for translation.

## How to use
1. Detect source and target language.
2. Translate directly.
3. If terminology is important, verify with web_search.

Adding a channel

Add ec_channel_xxx.c under components/embed_claw/channel/
Implement start() and send()
Convert incoming messages to ec_msg_t and call ec_agent_inbound()
On outbound, route by msg->channel to select the channel driver
In each channel send(), use msg->chat_type + msg->chat_id to resolve the destination
Register with EC_CHANNEL_REG(xxx) in ec_channel_reg.inc

Minimal skeleton:

static esp_err_t ec_channel_demo_start(void);
static esp_err_t ec_channel_demo_send(const ec_msg_t *msg);

static const ec_channel_t s_driver = {
    .name = "demo",
    .vtable = {
        .start = ec_channel_demo_start,
        .send = ec_channel_demo_send,
    },
};

esp_err_t ec_channel_demo(void)
{
    return ec_channel_register(&s_driver);
}

Adding an LLM provider

Currently used:

EC_USE_QWEN=1
Other built-in OpenAI-compatible presets: EC_USE_DEEPSEEK / EC_USE_DOUBAO / EC_USE_KIMI / EC_USE_HUNYUAN
EC_LLM_PROVIDER_NAME (default: openai)
Qwen preset defaults to DashScope OpenAI-compatible + qwen-plus
The other presets reuse the same OpenAI-compatible provider

To add another provider:

See components/embed_claw/llm/ec_llm_internal.h
Add ec_llm_xxx.c / .h
Export ec_llm_xxx_get_provider(void) from the new provider module
Implement init and chat_tools, and map the response to ec_llm_response_t
Add one branch in ec_llm_init_default() (components/embed_claw/llm/ec_llm.c) to map provider name to getter
Set EC_LLM_PROVIDER_NAME in main/ec_config.h (and set matching URL/model)

The current runtime path is OpenAI-compatible providers (OpenAI, DeepSeek, Moonshot/KiMi, Qwen, Doubao, Hunyuan, etc.). Other provider families are not wired yet.

Possible next steps

With clear boundaries, natural extensions include: TODO.md

Notes

1. Build-time configuration

For open-source distribution, repo defaults keep secret fields empty. Put real keys in local main/ec_config.h rather than editing components/embed_claw/ec_config_internal.h. main/ec_config.h is ignored by Git by default.

Before running, set:

API key for the active LLM preset
EC_LLM_PROVIDER_NAME (default is openai)
Tavily API key
Feishu App ID and App Secret

2. Best use today

This repo is a good fit for:

Experimenting with an embedded AI Agent framework
Building Feishu- or WebSocket-driven edge assistants
Validating tool calling, memory, and skills on real hardware
Using it as a base for productization

License

This project is open source under the MIT License.

You may use, modify, distribute, and use commercially, subject to retaining the original copyright and license notice.

Name		Name	Last commit message	Last commit date
Latest commit History 25 Commits
.github/workflows		.github/workflows
assets		assets
components/embed_claw		components/embed_claw
main		main
scripts		scripts
spiffs_data		spiffs_data
.gitignore		.gitignore
CMakeLists.txt		CMakeLists.txt
LICENSE		LICENSE
README.md		README.md
README_ZH.md		README_ZH.md
TODO.md		TODO.md
TODO_ZH.md		TODO_ZH.md
partitions.csv		partitions.csv
sdkconfig.defaults.esp32s3		sdkconfig.defaults.esp32s3

Folders and files

Latest commit

History

Repository files navigation

EmbedClaw

Origins

Why EmbedClaw

1. Decoupled, not feature-bloated

2. Not a one-off demo—a sustainable Agent core

Implemented Features

Core

Registered tools

Pre-installed skills

Architecture

Directory layout

Runtime flow

Storage layout

Quick start

Hardware and environment

1. Configure keys and platform

2. Build

3. Flash and monitor

4. First-time Wi-Fi

WebSocket chat

Server

Quick test

Inbound message format

Outbound message format

Feishu (Lark) integration

What the Feishu channel does

Setup

1. Create a Feishu app

2. Enable message permissions

3. Event subscription

4. Put credentials in the project

5. Build, flash, and connect

6. Chat

Optional: PC relay script

QQBot integration

What the QQ channel does

Supported inbound events

Minimal config

Notes

Testing

Persona and memory

Extending the system

Adding a tool

Adding a skill

Adding a channel

Adding an LLM provider

Possible next steps

Notes

1. Build-time configuration

2. Best use today

License

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages