SciAgent

A generic framework for building AI-powered scientific data analysis agents.

SciAgent provides the infrastructure — chat UI, CLI, code sandbox, guardrails, docs, MCP server — so you can focus on your domain-specific tools and knowledge.

The idea here is to build more human-in-the-loop scientific coding tools. Landing somewhere in between the basic LLM chat interface, and the end-to-end AI for science tools. The goal of this project is not to do the science for you, but to help you write strong, rigorous, and reproducible research code. Essentially an LLM wrapper but with a few extra tools to make sure the LLM doesn't go off the rails.

The core goal of this repo is to generate a domain-specific scientific agent in one of three output formats:

Fullstack — a complete, runnable Python submodule with CLI and web UI
Copilot / Claude Code — config files that plug directly into VS Code Copilot or Claude Code
Markdown — platform-agnostic spec files you can paste into any LLM

How to use this repo

The way I envision users utilizing this repo is in one of three ways.

Markdown templates for specifying agents - At its core, this repo contains Markdown templates to assist with building your agent. These templates are meant to be downloaded and customized for your domain specific use. The folder /templates/ contains templates for building and defining the agent, and prompts meant to constrain the agent to scientific rigor.
Fullstack custom agent cli/web app - This is a fullstack agent framework built on the copilot-sdk. Essentially this aims to use your custom domain tools in three ways:
1. Direct tool use by the Agent so it can do things like, direct inspect data, get metadata and plan work
2. A custom code execution environment with preloaded packages and tools.
3. Producing reproducible scripts for reuse with other data.
4. The agent also has the ability to ingest the docs of new packages and modify its own codebase, to give itself more power.
A self-assembling wizard - VERY WIP Built for novice coders. Describe your research domain to the self-assembly wizard and it discovers relevant packages, fetches their documentation, and generates a ready-to-use agent in your chosen format. The wizard now lives in its own package (sciagent-wizard) so framework users aren't affected by wizard-only releases.

Ideally, I was hoping to host a public version of the wizard for open use - however, I can't afford the hosting / llm api fees as a grad student. If you are a company that would be willing to help out, please contact me.

Check out PatchAgent for a real-world example in neurophysiology.

Built on the GitHub Copilot SDK.

Alternate

You may be interested in DAAF, the Data Analyst Augmentation Framework, by Brian Heseung Kim! A framework with much of the same goals and ideas. We accidentally had parallel evolution of our ideas.

Documentation

#	Page	Description
0	README	This document — overview, quick start, and links
1	Installation	Prerequisites, install variants, CLI commands, dev setup
2	Getting Started: Fullstack	Build a runnable Python agent with CLI & web UI
3	Getting Started: Copilot / Claude Code	Generate VS Code & Claude Code config files
4	Getting Started: Markdown	Generate platform-agnostic spec files for any LLM
5	API / Programmatic Usage	`AgentConfig`, `BaseScientificAgent`, tools, guardrails API
6	Architecture	System diagram, module reference, guardrails pipeline
7	Copilot Agents & Skills Reference	Agent/skill file formats, roster, handoff workflow
8	Showcase: PatchAgent	Real-world walkthrough in neurophysiology

Quick Start

pip install "sciagent[all] @ git+https://github.com/smestern/sciagent.git"       # install core framework + CLI + web UI
pip install "sciagent-wizard @ git+https://github.com/smestern/sciagent-wizard.git"  # install the wizard (optional)
sciagent wizard                 # launch the wizard (only available when sciagent-wizard is installed)

The wizard walks you through a conversation — describe your research domain, confirm discovered packages, and choose an output mode. A ready-to-use agent drops out the other end.

Three Output Streams

SciAgent generates a domain-specific scientific agent in one of three formats. Pick the one that fits your workflow:

1. Fullstack — Python agent with CLI & web UI

A complete, runnable Python submodule you can install and launch immediately. Includes sandboxed code execution, guardrails, Rich terminal REPL, and browser-based chat UI.

sciagent wizard -m fullstack

Full setup guide →

2. Copilot / Claude Code — IDE config files

Markdown-based agent definitions that plug directly into VS Code Copilot Chat or Claude Code. No Python runtime needed at the endpoint.

sciagent wizard -m copilot_agent

Full setup guide →

3. Markdown — platform-agnostic spec files

A self-contained set of Markdown files defining persona, tools, data handling, guardrails, and workflow. Paste into any LLM — ChatGPT, Gemini, Claude, local models, etc.

sciagent wizard -m markdown

Full setup guide →

Default Scientific Agents

SciAgent ships 5 ready-to-use agents in templates/agents/ that implement common scientific workflow roles:

Agent	Role
`analysis-planner`	Design the analysis roadmap (read-only)
`data-qc`	Check data quality before analysis
`rigor-reviewer`	Audit results for scientific rigor (read-only)
`report-writer`	Generate structured reports
`code-reviewer`	Review scripts for correctness (read-only)

These agents support handoff workflows: planner → QC → your agent → rigor review → report. See Copilot Agents & Skills Reference for details.

Guardrails

SciAgent enforces scientific rigor through a 5-layer system:

System prompt principles — embedded scientific best practices
Tool priority hierarchy — load real data before analysis
Code scanner — regex patterns block synthetic data generation, result fabrication
Data validator — checks for NaN, Inf, zero variance, suspicious smoothness
Bounds checker — domain-specific value range warnings

All layers are configurable and extensible. See Architecture for the full pipeline diagram.

Installation

# Neither package is on PyPI yet — install from GitHub
pip install "sciagent @ git+https://github.com/smestern/sciagent.git"            # core framework only
pip install "sciagent[cli] @ git+https://github.com/smestern/sciagent.git"       # + terminal REPL (Rich, Typer)
pip install "sciagent[web] @ git+https://github.com/smestern/sciagent.git"       # + browser chat UI (Quart)
pip install "sciagent[all] @ git+https://github.com/smestern/sciagent.git"       # core + CLI + web

# Self-assembly wizard (separate package)
pip install "sciagent-wizard @ git+https://github.com/smestern/sciagent-wizard.git"

The wizard is a separate package (sciagent-wizard) that registers itself as a plugin via the sciagent.plugins entry-point group. Installing it automatically adds the sciagent wizard CLI command, web blueprints (/wizard/, /public/, /ingestor/), and the ingest_library_docs tool — no configuration needed.

See Installation for prerequisites, dev setup, and verification steps.

Plugin Architecture

SciAgent uses a plugin system based on setuptools entry points. Any installed package can extend the framework by registering under the sciagent.plugins group:

# In your package's pyproject.toml
[project.entry-points."sciagent.plugins"]
my-plugin = "my_package:register_plugin"

The register_plugin() callable returns a PluginRegistration declaring:

Field	Type	Purpose
`register_web`	`(app, **ctx) → None`	Register Quart blueprints and auth middleware
`register_cli`	`(app) → None`	Add Typer sub-commands
`get_auth_token`	`() → str	None`
`supported_models`	`dict`	Declare LLM models for routing/billing
`tool_providers`	`dict[str, callable]`	Lazy-load tool functions

The wizard (sciagent-wizard) is the first plugin built on this system. See src/sciagent/plugins.py for the full API.

License

MIT

Name		Name	Last commit message	Last commit date
Latest commit History 61 Commits
.github/workflows		.github/workflows
build/plugin/sciagent		build/plugin/sciagent
docs		docs
examples/csv_analyst		examples/csv_analyst
scripts		scripts
src/sciagent		src/sciagent
templates		templates
tests		tests
.dockerignore		.dockerignore
.gitignore		.gitignore
README.md		README.md
logo.svg		logo.svg
logo128.png		logo128.png
logo16.png		logo16.png
logo48.png		logo48.png
pyproject.toml		pyproject.toml
sciagent.code-workspace		sciagent.code-workspace

Variable	Required	Description
`GITHUB_OAUTH_CLIENT_ID`	Yes	OAuth App client ID
`GITHUB_OAUTH_CLIENT_SECRET`	Yes	OAuth App client secret
`SCIAGENT_SESSION_SECRET`	Recommended	Session cookie signing key (random string)
`SCIAGENT_SESSION_SECURE`	No	Set to `1` for HTTPS-only session cookies
`SCIAGENT_ALLOWED_ORIGINS`	No	Restrict CORS origins (default: `*`)

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

SciAgent

Alternate

Documentation

Quick Start

Three Output Streams

1. Fullstack — Python agent with CLI & web UI

2. Copilot / Claude Code — IDE config files

3. Markdown — platform-agnostic spec files

Default Scientific Agents

Guardrails

Installation

Plugin Architecture

See Also

Setup

How it works

Security

License

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

SciAgent

Alternate

Documentation

Quick Start

Three Output Streams

1. Fullstack — Python agent with CLI & web UI

2. Copilot / Claude Code — IDE config files

3. Markdown — platform-agnostic spec files

Default Scientific Agents

Guardrails

Installation

Plugin Architecture

See Also

Setup

How it works

Security

License

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages