Plot My Data - using R with AI agents

The R software environment is an open-source platform for statistics, data analysis and visualization. To help R users solve real-world problems, AI agents need access to tools and guidance about their usage.

The aim of this project is to build an intuitive conversational interface to powerful plotting functions. To do this, we are engineering an ecosystem of AI agents and tools that can run R code and make plots.

A test-driven approach to AI development using known-good traces as evaluation cases ensures that the system works as expected. It's made with industry-standard components, supporting different models and scalable deployment options.

Note: In this example, the agent uses the Hide tool to modify R variables without returning the results. This way the LLM isn't flooded with thousands of tokens representing random numbers.

Features

Multiple data sources: Upload a file, provide a URL, or use built-in R datasets
Data awareness: Uploaded files are automatically summarized for the LLM
- This lets you describe a plot without knowing the exact variable names
Code generation: The LLM writes R code based on its internal knowledge
Code execution: Tools are provided for making plots with base R graphics (default) and ggplot2
- To use ggplot2, just mention "ggplot" or "ggplot2" in your message
Instant visualization: Plots are shown in the chat interface and downloadable as artifacts
Interactive analysis: The agent uses an R session so the environment persists across chat messages and tool calls

Running the app

The app can be run with or without a container.

Containerless

Install R and run install.packages(c("ellmer", "mcptools", "readr", "ggplot2"))
Install Python with packages listed in requirements.txt
Put your OpenAI API key in a file named secret.openai-api-key
Execute run_web.sh to start an R session and launch the ADK web UI

Containerized

First, build the project. This creates a plotmydata Docker Compose project and a plotmydata-app image.

docker compose build

Now run the project. This uses your OpenAI API key (sk-proj-...) from secret.openai-api-key.

docker compose up

Press w to start watching file changes. Alternatively, use this command so changes to the R and Python code on the host computer are reflected in the running project.

docker compose watch

Changing the model

The remote LLM is gpt-4o-mini. If you want to use a different one, change it in entrypoint.sh.

To use a local LLM running on your GPU, install Docker Model Runner before running this command.

docker compose -f compose.yaml -f model-runner.yaml up

The local LLM is Gemma 3; this can be changed in model-runner.yaml.

Examples

Plot data: Plot radius_worst (y) vs radius_mean (x) from https://github.com/jedick/plotmydata/raw/refs/heads/main/evals/data/breast-cancer.csv. Add a blue 1:1 line and title "Breast Cancer Wisconsin (Diagnostic)".

Plot functions: Plot a Sierpiński Triangle

Chat with AI agent to plot Sierpiński Triangle

Interactive analysis: [click to open]

Save 100 random numbers from a normal distribution in x
Run y = x^2
Plot a histogram of y

Evals

Accuracy = fraction of correct plots. Plot correctness is currently judged by a human.

Eval set	Size	Prompt set	Accuracy	Notes
01	27	bb4eead	0.41	Mainly base graphics: barplot, boxplot, cdplot, coplot, contour, dotchart, filled.contour, grid
01	27	e9180aa	0.52	Add help tools to get R documentation
02	37	e9180aa	0.49	More base graphics: hist, image, lines, matplot, mosaicplot, pairs, rug, spineplot, plot.window
03	40	30c22a1	0.50	Handle uploaded CSV files
03	40	b8e5f8c	0.38	Add agent for loading and summarizing data

Evals management

The repo tracks both evaluation sets and prompt sets. For example, the evals/01 directory contains all results for the first evaluation set using different prompt sets. The file name uses the short commit hash for the prompt set used for evaluation.

Each eval consists of a query and reference code and image. Because of their size, reference and generated images are not stored in this repo.

To run evals, copy the latest eval CSV file to evals/evals.csv. Then use e.g. run_eval.sh 1 to run the first eval. This script: 1) saves the tool calls, generated code, and current date to the CSV file and 2) saves the generated image to the evals/generated directory.

After running evals, change to the evals directory and run streamlit run edit_evals.py to edit the eval CSV file. This app allows:

Choosing an eval to edit
Viewing the reference and generated images side-by-side
Indicating whether the generated plot is correct (True or False)
Editing other eval data (e.g. query, file name for data upload, reference code, notes)
Adding new evals

Under the hood

Model Context Protocol (MCP) allows AI agents to interact with external tools in a client-server setup
We connect an Agent Development Kit client to an MCP server from the mcptools R package
For access from R, file uploads are saved as artifacts using an ADK plugin, then as temporary files using a callback function
PlotMyData/__init__.py has code to reduce log verbosity and is modified from docker/compose-for-agents

Container notes:

The Docker image is based on rocker/r-ver and adds R packages and a Python installation
Docker Compose is used for port mapping, secrets, and watching file changes with Docker Watch
The ADK web UI is run with the --reload_agents flag so that changes to agent.py on the host system are reflected in the running container

Licenses

This code in repo is licensed under MIT
Some examples used in evals are taken from R and are licensed under GPL-2|GPL-3
breast-cancer.csv (the CSV version from Kaggle) is licensed under CC0; the original dataset (from the UCI Machine Learning Repository) is licensed under CC BY 4.0

Name		Name	Last commit message	Last commit date
Latest commit History 49 Commits
PlotMyData		PlotMyData
evals		evals
.gitignore		.gitignore
Dockerfile		Dockerfile
LICENSE		LICENSE
README.md		README.md
compose.yaml		compose.yaml
data_summary.R		data_summary.R
entrypoint.sh		entrypoint.sh
model-runner.yaml		model-runner.yaml
prompts.R		prompts.R
prompts.py		prompts.py
requirements.txt		requirements.txt
run_eval.py		run_eval.py
run_eval.sh		run_eval.sh
run_web.sh		run_web.sh
server.R		server.R

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Plot My Data - using R with AI agents

Features

Running the app

Examples

Evals

Under the hood

Licenses

About

Uh oh!

Releases

Packages

Languages

License

jedick/plotmydata

Folders and files

Latest commit

History

Repository files navigation

Plot My Data - using R with AI agents

Features

Running the app

Examples

Evals

Under the hood

Licenses

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages