Skip to content

myanvoos/ASpec

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

2 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Automated Stateful Specialization for Adaptive Agent Systems.

Published as a conference poster at ICLR 2026. [Paper]

ASpec manages the full lifecycle of specialist agents: autonomously discovering agent archetypes via evolutionary search, then cultivating their expertise through experience, mirroring how human experts learn through practice and reflection.

Overview

The framework has three main components:

  • Architect — an LLM-based generative process that evolves the operator pool and redesigns multi-agent architectures through creation, crossover, and pruning of specialists.
  • Meta-Controller — a neural policy that makes a binary decision per query: retain the current architecture or resample a new one. Trained via REINFORCE to balance performance, cost, and contextual fit.
  • Specialist Operators — agents derived from base operators (CoT, Debate, ReAct, etc.) with learned identities, directives, and persistent memory accumulated through post-execution reflection.

The methodology is split into two stages:

  1. Discovery: the Architect iteratively explores the space of specialist archetypes using evolutionary processes (creation, crossover, pruning), then selects the top-k via a performance-diversity objective.
  2. Cultivation: selected specialists deepen their expertise on a training corpus through execution and reflection, building structured memory entries that are semantically retrieved at inference time.

Setup

pixi install

To get started using OpenRouter, please add an OPENROUTER_API_KEY in .env. Then you can run:

pixi run python -m experiments.scripts.eval_aspec --dataset MATH --split train --api-type openrouter --model [model]

ASpec uses LiteLLM under the hood for compatibility with a wide range of LLM providers.

Citation

@inproceedings{
    vu2026automated,
    title={Automated Stateful Specialization for Adaptive Agent Systems},
    author={Myan Vu and Harrish Ayyanar and PANG JIANG and Anwiketh Reddy and Mayank Goel},
    booktitle={The Fourteenth International Conference on Learning Representations},
    year={2026},
    url={https://openreview.net/forum?id=UESTP6dR1K}
}

About

[ICLR '26] Official repo for "Automated Stateful Specialization for Adaptive Agent Systems"

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors

Languages