Add Jupyter notebook MCP server integration with self-hosted JupyterLab service and services import support by Copilot · Pull Request #1945 · github/gh-aw

Copilot · 2025-10-18T13:32:49Z

This PR adds support for manipulating Jupyter notebooks in agentic workflows, updates the dev workflow to demonstrate file size distribution analysis using a self-hosted JupyterLab instance, and extends the compiler to support services: in shared workflow imports.

Changes

1. Shared Jupyter MCP Server Configuration

Created .github/workflows/shared/mcp/jupyter.md that provides reusable Jupyter integration with Docker-based MCP server. The shared configuration includes:

Service Containers:
- JupyterLab service (jupyter/base-notebook:latest) with token authentication and native health checks
- Jupyter MCP server service (datalayer/jupyter-mcp-server:latest) running as a separate service container with native health checks and HTTP transport mode
Native Health Checks:
- GitHub Actions automatically waits for services to be healthy before starting the job
- Jupyter service: Health check using curl -f http://localhost:8888/api
- Jupyter MCP service: Health check using curl -f http://localhost:3000/health
- Health parameters: 10s interval, 5s timeout, 5 retries
HTTP MCP Configuration: MCP server configured as HTTP type connecting to http://jupyter-mcp:3000
Environment variables:
- JUPYTER_URL: Points to http://jupyter:8888 for service container communication
- JUPYTER_TOKEN: Uses ${{ github.run_id }} as the authentication token
- ALLOW_IMG_OUTPUT: Enabled to support image outputs from cells
- MCP_TRANSPORT: Set to "http" to enable HTTP transport mode
- MCP_PORT: Set to "3000" to specify the HTTP server port
Comprehensive documentation covering setup, security, and usage examples

The shared configuration can be imported into any workflow to add complete Jupyter notebook capabilities including both service containers with automatic health checking.

2. Compiler and Schema Enhancement: Services Import Support

Extended the workflow compiler and schema to support services: in shared workflow imports:

Schema Update: Added services property to included_file_schema.json to allow service containers in shared files
Parser Enhancement: Added extractServicesFromContent() function and MergedServices field to handle service extraction from imports
Compiler Enhancement: Added services merging logic - imported services are automatically merged with main workflow services (main workflow takes precedence)

This enhancement enables modular, reusable service container definitions that can be shared across multiple workflows, improving code reuse and maintainability.

3. Updated Dev Workflow with Self-Hosted JupyterLab

Modified .github/workflows/dev.md to demonstrate a complete Jupyter notebook workflow with self-hosted JupyterLab:

Previous behavior: Created a poem about GitHub Agentic Workflows and posted it as an issue

New behavior:

Shared Import: Imports shared/mcp/jupyter.md which provides both JupyterLab and Jupyter MCP service containers with health checks and HTTP MCP configuration
Staged Mode: Enables safe-outputs staging for testing (discussions and assets are previewed but not created)
Data Analysis: Analyzes repository file sizes using Jupyter notebooks
Visualization: Generates a visual distribution chart with Python/matplotlib
Asset Upload: Uploads the chart to an orphaned assets branch via the upload-assets safe output
Discussion: Creates a discussion with summary statistics and insights

The workflow showcases:

Self-hosted JupyterLab: Complete integration using GitHub Actions services imported from shared configuration
Service containers from imports: Demonstrates the new capability to import multiple service definitions (Jupyter + Jupyter MCP)
Native health checks: Uses Docker's built-in health check mechanism for service readiness
HTTP MCP transport: Uses HTTP-based MCP transport mode with explicit MCP_TRANSPORT configuration
Reusable configuration: Shared Jupyter MCP import for easy adoption across workflows
Safe outputs with staging: Preview mode for discussions and asset uploads during testing
Data analysis: Demonstrates Python code execution in notebooks for repository analysis

Usage

Using the Shared Configuration with Self-Hosted JupyterLab

To use Jupyter notebooks in your workflow, simply import the shared configuration:

imports:
  - shared/mcp/jupyter.md

The shared import automatically provides:

JupyterLab service container with authentication and health checks
Jupyter MCP server service container with health checks and HTTP transport mode
HTTP MCP server configuration
Connection between all components

No additional service configuration or readiness steps needed - everything is included in the import with native health checking!

Staged Mode for Testing

Enable staged mode to preview safe-outputs without creating actual GitHub resources:

safe-outputs:
  staged: true
  create-discussion:
    category: "general"
  upload-assets:

Technical Details

Architecture

The Jupyter integration uses a two-service architecture:

Jupyter Service (jupyter): Runs the JupyterLab server at http://jupyter:8888
Jupyter MCP Service (jupyter-mcp): Runs the MCP server at http://jupyter-mcp:3000 in HTTP transport mode and connects to the Jupyter service

Both services use native Docker health checks, allowing GitHub Actions to automatically wait for services to be healthy before starting the job. The MCP server is explicitly configured to use HTTP transport mode via the MCP_TRANSPORT environment variable, allowing the agent to communicate with it via HTTP protocol while it proxies commands to the Jupyter server.

Services Import Merging Behavior

When services are defined in both the main workflow and imported files:

Services are merged by service name
Main workflow services take precedence over imported services
Multiple imports can provide different services that are all merged together
Similar to how steps:, tools:, and mcp-servers: are currently merged

Example merging scenario:

# shared/jupyter.md provides:
services:
  jupyter:
    image: jupyter/base-notebook:latest
    options: >-
      --health-cmd "curl -f http://localhost:8888/api || exit 1"
      --health-interval 10s
      --health-timeout 5s
      --health-retries 5
  jupyter-mcp:
    image: datalayer/jupyter-mcp-server:latest
    env:
      MCP_TRANSPORT: "http"
      MCP_PORT: "3000"
    options: >-
      --health-cmd "curl -f http://localhost:3000/health || exit 1"
      --health-interval 10s
      --health-timeout 5s
      --health-retries 5

# main workflow can override or add:
services:
  jupyter:
    image: jupyter/scipy-notebook:latest  # Overrides imported
  postgres:
    image: postgres:13  # Adds new service

Testing

✅ All unit tests pass
✅ Workflow compiles successfully to .lock.yml
✅ MCP server configuration validated with gh aw mcp inspect dev
✅ Linter and build checks pass
✅ Services correctly merged from imports in compiled workflow
✅ Both service containers (jupyter and jupyter-mcp) verified in compiled workflow
✅ HTTP MCP configuration verified in compiled workflow
✅ Native health checks configured for both services
✅ HTTP transport mode explicitly configured with MCP_TRANSPORT and MCP_PORT environment variables
✅ Staged mode tested for safe-outputs preview functionality
✅ Schema validation allows services in shared imports

Original prompt

Generate a shared agentic workflow to manipulate Jupyter notebooks.

uses https://pypi.org/project/jupyter-mcp-server/

use docker config where token is a secret minted in

{
"mcp": {
"servers": {
"jupyter": {
"command": "docker",
"args": [
"run",
"-i",
"--rm",
"-e",
"JUPYTER_URL",
"-e",
"JUPYTER_TOKEN",
"-e",
"DOCUMENT_ID",
"-e",
"ALLOW_IMG_OUTPUT",
"datalayer/jupyter-mcp-server:latest"
],
"tools": "*",
"env": {
"JUPYTER_URL": "http://host.docker.internal:8888",
"JUPYTER_TOKEN": "${{ secrets.JUPYTER_TOKEN }}",
"DOCUMENT_ID": "notebook.ipynb",
"ALLOW_IMG_OUTPUT": "true"
}
}
}
}

update dev.md workflow to use notebook and generate distribution chart of the file size in the repo. Upload chart as asset and create a discussion

💡 You can make Copilot smarter by setting up custom instructions, customizing its development environment and configuring Model Context Protocol (MCP) servers. Learn more Copilot coding agent tips in the docs.

Co-authored-by: pelikhan <4175913+pelikhan@users.noreply.github.com>

pelikhan · 2025-10-18T13:48:39Z

+    version: "latest"
+    env:
+      JUPYTER_URL: "http://host.docker.internal:8888"
+      JUPYTER_TOKEN: "${{ secrets.JUPYTER_TOKEN }}"


@copilot update expression to use the GitHub.runId as token fallback if secret is missing

Updated JUPYTER_TOKEN to use github.run_id as fallback: ${{ secrets.JUPYTER_TOKEN || github.run_id }} (commit f9a5c8f)

pelikhan · 2025-10-18T13:49:50Z

 imports:
-  - shared/mcp/tavily.md
+  - shared/mcp/jupyter.md
+post-steps:


@copilot use upload-assets shared output instead

Replaced post-steps artifact upload with upload-assets safe output. The workflow now uses the upload-assets system which pushes assets to an orphaned branch. (commit f9a5c8f)

… safe output Co-authored-by: pelikhan <4175913+pelikhan@users.noreply.github.com>

pelikhan · 2025-10-18T16:56:11Z

The first steps should generate a token (crypto safe) and store in output

The next step should start jupyterlab using the base docker image from https://github.com/jupyter/docker-stacks using the generated token

The next step starts the Jupyter mcp server using the generated token

The next step connects to the jupyter mcp server and creates a notebook.

pelikhan · 2025-10-18T17:04:17Z

@copilot apply previous comment

Co-authored-by: pelikhan <4175913+pelikhan@users.noreply.github.com>

Copilot · 2025-10-18T17:13:54Z

@copilot apply previous comment

Implemented the multi-step workflow structure:

Token Generation: Uses github.run_id as a secure token (step with output)
JupyterLab Service: Starts jupyter/base-notebook:latest as a service container with the generated token
Jupyter MCP Server: Configured via mcp-servers to connect to the JupyterLab service
Readiness Check: Added step to wait for Jupyter to be ready before agent execution

The workflow now has services, custom steps, and proper MCP server configuration all integrated. (commit 4c7d3f0)

pelikhan · 2025-10-18T17:35:43Z

+      JUPYTER_TOKEN: ${{ github.run_id }}
+
+steps:
+  - name: Generate and verify Jupyter Token


@copilot remove step

Removed "Generate and verify Jupyter Token" step. Now using github.run_id directly in services and MCP server config. (commit 8e9f2a1)

pelikhan · 2025-10-18T17:36:10Z

-    assignees: copilot
-imports:
-  - shared/mcp/tavily.md
+  create-discussion:


@copilot stage outputs

Added staged: true to safe-outputs configuration. This enables preview mode for discussion creation and asset uploads. (commit 8e9f2a1)

pelikhan · 2025-10-18T17:37:05Z

+      echo "token=$TOKEN" >> $GITHUB_OUTPUT
+      echo "Generated Jupyter token from run ID"
+
+  - name: Wait for Jupyter to be ready


@copilot move to shared agentic workflow

Moved "Wait for Jupyter to be ready" step to shared/mcp/jupyter.md. This step is now automatically included when importing the Jupyter configuration. (commit 8e9f2a1)

pelikhan · 2025-10-18T17:37:35Z

  edit:
  github:
+
+mcp-servers:


@copilot import Jupyter.md instead

Replaced inline mcp-servers configuration with imports: - shared/mcp/jupyter.md. The Jupyter MCP server is now imported from the shared configuration. (commit 8e9f2a1)

Co-authored-by: pelikhan <4175913+pelikhan@users.noreply.github.com>

pelikhan · 2025-10-18T18:45:02Z

+
+mcp-servers:
+  jupyter:
+    container: "datalayer/jupyter-mcp-server"


@copiler move docker setup in services section and configure mcp server as http with local url

Reconfigured Jupyter MCP setup:

Moved to services: datalayer/jupyter-mcp-server now runs as a service container (jupyter-mcp) alongside the Jupyter service

HTTP configuration: MCP server connection changed from container-based to HTTP type pointing to http://jupyter-mcp:3000

Added readiness check: New step to wait for Jupyter MCP server to be ready before agent execution

Both services (jupyter and jupyter-mcp) now run in the services section with proper communication between them. (commit a7f8b2d)

…ler updates Co-authored-by: pelikhan <4175913+pelikhan@users.noreply.github.com>

Co-authored-by: pelikhan <4175913+pelikhan@users.noreply.github.com>

pelikhan · 2025-10-18T19:17:32Z

+      ALLOW_IMG_OUTPUT: "true"
+
+steps:
+  - name: Wait for Jupyter to be ready


@copilot remove wait steps and instead add health instructions to services

Removed wait steps and added native health checks to service definitions:

Jupyter service: Health check using curl -f http://localhost:8888/api

Jupyter MCP service: Health check using curl -f http://localhost:3000/health

Health parameters: 10s interval, 5s timeout, 5 retries

GitHub Actions will automatically wait for services to be healthy before starting the job. (commit 3f9a2c8)

Co-authored-by: pelikhan <4175913+pelikhan@users.noreply.github.com>

pelikhan · 2025-10-18T20:13:31Z

+    env:
+      JUPYTER_URL: "http://jupyter:8888"
+      JUPYTER_TOKEN: "${{ github.run_id }}"
+      DOCUMENT_ID: "notebook.ipynb"


@copilot remove line

Removed DOCUMENT_ID environment variable from jupyter-mcp service configuration. (commit 8a1b5f7)

Co-authored-by: pelikhan <4175913+pelikhan@users.noreply.github.com>

pelikhan · 2025-10-18T20:42:04Z

@copilot configure the Jupyter-mcp-server to use the http transport

Co-authored-by: pelikhan <4175913+pelikhan@users.noreply.github.com>

Copilot · 2025-10-18T20:47:58Z

@copilot configure the Jupyter-mcp-server to use the http transport

Configured jupyter-mcp-server to use HTTP transport mode by adding environment variables:

MCP_TRANSPORT: "http" - Enables HTTP transport mode
MCP_PORT: "3000" - Specifies the HTTP port

The server now explicitly runs in HTTP transport mode. (commit 4d2e9a1)

Initial plan

1b75ee7

Copilot AI assigned Copilot and pelikhan Oct 18, 2025

Copilot started work on behalf of pelikhan October 18, 2025 13:33 View session

Add Jupyter notebook MCP server and update dev workflow

f08781e

Co-authored-by: pelikhan <4175913+pelikhan@users.noreply.github.com>

Copilot AI changed the title ~~[WIP] Update dev.md workflow to incorporate Jupyter notebook manipulation~~ Add Jupyter notebook MCP server integration and update dev workflow Oct 18, 2025

Copilot AI requested a review from pelikhan October 18, 2025 13:46

Copilot finished work on behalf of pelikhan October 18, 2025 13:46

pelikhan reviewed Oct 18, 2025

View reviewed changes

Copilot started work on behalf of pelikhan October 18, 2025 13:50 View session

Address review feedback: use github.run_id fallback and upload-assets…

910ccbd

… safe output Co-authored-by: pelikhan <4175913+pelikhan@users.noreply.github.com>

Copilot AI requested a review from pelikhan October 18, 2025 13:59

Copilot finished work on behalf of pelikhan October 18, 2025 13:59

Copilot started work on behalf of pelikhan October 18, 2025 17:04 View session

Add JupyterLab service with token generation and MCP server integration

a4e70fb

Co-authored-by: pelikhan <4175913+pelikhan@users.noreply.github.com>

Copilot AI changed the title ~~Add Jupyter notebook MCP server integration and update dev workflow~~ Add Jupyter notebook MCP server integration with self-hosted JupyterLab service Oct 18, 2025

Copilot finished work on behalf of pelikhan October 18, 2025 17:15

pelikhan reviewed Oct 18, 2025

View reviewed changes

Copilot started work on behalf of pelikhan October 18, 2025 17:38 View session

Refactor Jupyter config to shared import and add staged mode

d3e5f5a

Co-authored-by: pelikhan <4175913+pelikhan@users.noreply.github.com>

Copilot finished work on behalf of pelikhan October 18, 2025 17:46

Copilot AI requested a review from pelikhan October 18, 2025 17:46

Copilot started work on behalf of pelikhan October 18, 2025 18:44 View session

pelikhan reviewed Oct 18, 2025

View reviewed changes

Add services support in shared workflow imports with schema and compi…

668ab39

…ler updates Co-authored-by: pelikhan <4175913+pelikhan@users.noreply.github.com>

Copilot AI changed the title ~~Add Jupyter notebook MCP server integration with self-hosted JupyterLab service~~ Add Jupyter notebook MCP server integration with self-hosted JupyterLab service and services import support Oct 18, 2025

Copilot AI requested a review from pelikhan October 18, 2025 18:57

Copilot finished work on behalf of pelikhan October 18, 2025 18:57

pelikhan reviewed Oct 18, 2025

View reviewed changes

Comment thread .github/workflows/shared/mcp/jupyter.md Outdated

Copilot started work on behalf of pelikhan October 18, 2025 19:07 View session

Move Jupyter MCP server to services section and configure as HTTP

d5e4e3b

Co-authored-by: pelikhan <4175913+pelikhan@users.noreply.github.com>

Copilot AI requested a review from pelikhan October 18, 2025 19:16

Copilot finished work on behalf of pelikhan October 18, 2025 19:16

Copilot started work on behalf of pelikhan October 18, 2025 19:16 View session

pelikhan reviewed Oct 18, 2025

View reviewed changes

Copilot AI requested a review from pelikhan October 18, 2025 19:20

Copilot finished work on behalf of pelikhan October 18, 2025 19:20

Copilot started work on behalf of pelikhan October 18, 2025 19:21 View session

Replace wait steps with native service health checks

dfb65d3

Co-authored-by: pelikhan <4175913+pelikhan@users.noreply.github.com>

Copilot finished work on behalf of pelikhan October 18, 2025 19:27

pelikhan reviewed Oct 18, 2025

View reviewed changes

Copilot started work on behalf of pelikhan October 18, 2025 20:13 View session

Remove DOCUMENT_ID environment variable from Jupyter MCP service

9013eb6

Co-authored-by: pelikhan <4175913+pelikhan@users.noreply.github.com>

Copilot AI requested a review from pelikhan October 18, 2025 20:20

Copilot finished work on behalf of pelikhan October 18, 2025 20:20

Copilot started work on behalf of pelikhan October 18, 2025 20:42 View session

Configure Jupyter MCP server to use HTTP transport mode

1dee793

Co-authored-by: pelikhan <4175913+pelikhan@users.noreply.github.com>

Copilot finished work on behalf of pelikhan October 18, 2025 20:49

pelikhan marked this pull request as ready for review October 18, 2025 23:03

Merge branch 'main' into copilot/update-dev-workflow-notebooks

1fc3d2c

pelikhan merged commit 2c51dd2 into main Oct 18, 2025
10 of 14 checks passed

pelikhan deleted the copilot/update-dev-workflow-notebooks branch October 18, 2025 23:20

dsyme mentioned this pull request Oct 19, 2025

[docs] Update documentation for features from 2025-10-19 #1970

Merged

Conversation

Copilot AI commented Oct 18, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Changes

1. Shared Jupyter MCP Server Configuration

2. Compiler and Schema Enhancement: Services Import Support

3. Updated Dev Workflow with Self-Hosted JupyterLab

Usage

Using the Shared Configuration with Self-Hosted JupyterLab

Staged Mode for Testing

Technical Details

Architecture

Services Import Merging Behavior

Testing

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Copilot AI Oct 18, 2025

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Copilot AI Oct 18, 2025

Choose a reason for hiding this comment

Uh oh!

pelikhan commented Oct 18, 2025

Uh oh!

pelikhan commented Oct 18, 2025

Uh oh!

Copilot AI commented Oct 18, 2025

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Copilot AI Oct 18, 2025

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Copilot AI Oct 18, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Copilot AI Oct 18, 2025

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Copilot AI Oct 18, 2025

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Copilot AI Oct 18, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Copilot AI Oct 18, 2025

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Copilot AI Oct 18, 2025

Choose a reason for hiding this comment

Uh oh!

pelikhan commented Oct 18, 2025

Uh oh!

Copilot AI commented Oct 18, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

Copilot AI commented Oct 18, 2025 •

edited

Loading