Add comprehensive Copilot instructions for PyTensor repository onboarding #1634

Copilot · 2025-10-06T13:58:03Z

Overview

This PR adds a comprehensive .github/copilot-instructions.md file to help coding agents work more efficiently with the PyTensor codebase. The instructions were created by thoroughly exploring the repository structure, documentation, CI/CD workflows, and build processes to provide validated, actionable guidance.

What's Included

The instructions document provides:

1. Repository Overview

High-level description of PyTensor's purpose and architecture
Repository statistics (size, languages, test count, Python version requirements)

2. Environment Setup

Critical emphasis on conda/mamba: Documents why pip-only installations fail and how conda provides necessary dependencies (MKL, compilers)
Step-by-step setup with verification commands
Pre-commit hook installation (required before committing)

3. Build & Testing

Cython compilation details: Explains scan_perform.pyx compilation process
Complete testing strategies: Quick tests, module-specific tests, CI-like configurations
Test timing estimates: Helps set expectations (5-15 min per job, 30-60 min full matrix)
PYTENSOR_FLAGS usage: Documents environment flags for different test modes
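As a rough illustration of the PYTENSOR_FLAGS item above, the sketch below builds the comma-separated `key=value` string and pairs it with a pytest command. The helper names and the test path are illustrative assumptions, not taken from the instructions file. `floatX` and `mode` are ordinary PyTensor config options, but the specific values here are only examples.

```python
import os
import shlex


def build_pytensor_flags(flags: dict[str, str]) -> str:
    """Join settings into the comma-separated key=value form used by PYTENSOR_FLAGS."""
    return ",".join(f"{key}={value}" for key, value in flags.items())


def pytest_invocation(test_path: str, flags: dict[str, str]):
    """Return (command, env) for a pytest run; nothing is executed here."""
    env = dict(os.environ)
    env["PYTENSOR_FLAGS"] = build_pytensor_flags(flags)
    command = ["python", "-m", "pytest", test_path, "-x"]
    return command, env


# Hypothetical example values; adjust the path and flags to the module under test.
command, env = pytest_invocation(
    "tests/tensor/test_basic.py",
    {"floatX": "float32", "mode": "FAST_COMPILE"},
)
print(f"PYTENSOR_FLAGS={env['PYTENSOR_FLAGS']} {shlex.join(command)}")
```

The pair can be passed to `subprocess.run(command, env=env)` if the run should be launched from Python rather than a shell.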

4. Development Workflow

Linting with ruff (auto-fix capabilities)
Type checking with mypy (explains known-failing files approach)
Documentation building with Sphinx
Pre-commit hook details
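The "known-failing files" approach mentioned for mypy can be pictured roughly as follows. This is a minimal sketch under the assumption that the check compares files with type errors against an allowlist. The function and file names are hypothetical, and the repository's actual script may differ.

```python
def classify_mypy_results(
    files_with_errors: set[str],
    known_failing: set[str],
    checked_files: set[str],
) -> tuple[list[str], list[str]]:
    """Split results into unexpected failures and unexpected passes."""
    # Files that report errors but are not on the allowlist should fail the check.
    unexpected_failures = sorted(files_with_errors - known_failing)
    # Allowlisted files that now pass can be removed from the allowlist.
    unexpected_passes = sorted((known_failing & checked_files) - files_with_errors)
    return unexpected_failures, unexpected_passes


# Hypothetical file names, for illustration only.
failures, passes = classify_mypy_results(
    files_with_errors={"pkg/new_module.py", "pkg/legacy.py"},
    known_failing={"pkg/legacy.py", "pkg/fixed.py"},
    checked_files={"pkg/new_module.py", "pkg/legacy.py", "pkg/fixed.py", "pkg/clean.py"},
)
print("unexpected failures:", failures)
print("unexpected passes:", passes)
```

Under this scheme a change only breaks the check when it introduces errors in a file that was previously clean, which lets a partially typed codebase tighten its allowlist over time.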

5. Project Structure

Detailed directory layout highlighting key modules (tensor, link, graph, scan)
Backend explanations (C, JAX, Numba, PyTorch)
Configuration file locations and purposes

6. CI/CD Understanding

Breakdown of all three workflows (test.yml, mypy.yml, pypi.yml)
Test matrix explanation (OS, Python versions, NumPy versions, flags)
Timing information for each CI stage
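To make the test-matrix idea concrete, the sketch below expands a set of axes into individual jobs and drops excluded combinations. The axis names and values are hypothetical placeholders, not the contents of test.yml.

```python
from itertools import product


def expand_matrix(axes: dict[str, list[str]], exclude=()) -> list[dict[str, str]]:
    """Return one job per combination of axis values, minus excluded ones."""
    keys = list(axes)
    jobs = [dict(zip(keys, combo)) for combo in product(*(axes[k] for k in keys))]

    def is_excluded(job: dict[str, str]) -> bool:
        # A job is excluded when every key/value pair of some rule matches it.
        return any(all(job.get(k) == v for k, v in rule.items()) for rule in exclude)

    return [job for job in jobs if not is_excluded(job)]


# Hypothetical axes; the real workflow defines its own values.
axes = {
    "os": ["ubuntu-latest", "macos-latest"],
    "python-version": ["3.11", "3.13"],
    "flags": ["default", "floatX=float32"],
}
jobs = expand_matrix(axes, exclude=[{"os": "macos-latest", "python-version": "3.11"}])
for job in jobs:
    print(job)
```

Listing the jobs this way shows why the full matrix takes much longer than a single job: each added axis value multiplies the number of combinations.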

7. Common Pitfalls & Solutions

Installation issues and their conda-based solutions
Test failures (BLAS flags, optional dependencies)
Pre-commit issues (debug statements, format failures)
Testing patterns specific to PyTensor

8. Quality Standards

PR requirements checklist
Code quality expectations
Commit message standards

Why This Helps

These instructions significantly reduce the likelihood of:

CI failures from incorrect environment setup or missing pre-commit runs
Build errors from pip-only installations missing MKL/compilers
Wasted time exploring the codebase for information readily available here
Rejected PRs due to not following project conventions

The document was validated against actual repository files, workflow configurations, and documentation to ensure accuracy. Commands can be copy-pasted directly and will work as documented.

Validation

All information has been:

Cross-referenced with official documentation in doc/dev_start_guide.rst
Verified against actual workflow files in .github/workflows/
Checked against pyproject.toml, environment.yml, and other config files
Structured to be actionable rather than just informational

Size: 326 lines (~1473 words) - well within the 2-page limit specified in requirements.

Original prompt

Your task is to "onboard" this repository to Copilot coding agent by adding a .github/copilot-instructions.md file in the repository that contains information describing how a coding agent seeing it for the first time can work most efficiently.

You will do this task only one time per repository and doing a good job can SIGNIFICANTLY improve the quality of the agent's work, so take your time, think carefully, and search thoroughly before writing the instructions.

- Reduce the likelihood of a coding agent pull request getting rejected by the user due to generating code that fails the continuous integration build, fails a validation pipeline, or having misbehavior.
- Minimize bash command and build failures.
- Allow the agent to complete its task more quickly by minimizing the need for exploration using grep, find, str_replace_editor, and code search tools.
- Instructions must be no longer than 2 pages.
- Instructions must not be task specific.

Add the following high level details about the codebase to reduce the amount of searching the agent has to do to understand the codebase each time:

A summary of what the repository does.
High level repository information, such as the size of the repo, the type of the project, the languages, frameworks, or target runtimes in use.

Add information about how to build and validate changes so the agent does not need to search and find it each time.

For each of bootstrap, build, test, run, lint, and any other scripted step, document the sequence of steps to take to run it successfully as well as the versions of any runtime or build tools used.
Each command should be validated by running it to ensure that it works correctly as well as any preconditions and postconditions.
Try cleaning the repo and environment and running commands in different orders and document errors and misbehavior observed as well as any steps used to mitigate the problem.
Run the tests and document the order of steps required to run the tests.
Make a change to the codebase. Document any unexpected build issues as well as the workarounds.
Document environment setup steps that seem optional but that you have validated are actually required.
Document the time required for commands that failed due to timing out.
When you find a sequence of commands that work for a particular purpose, document them in detail.
Use language to indicate when something should always be done. For example: "always run npm install before building".
Record any validation steps from documentation.

List key facts about the layout and architecture of the codebase to help the agent find where to make changes with minimal searching.

A description of the major architectural elements of the project, including the relative paths to the main project files, the location of configuration files for linting, compilation, testing, and preferences.
A description of the checks run prior to check in, including any GitHub workflows, continuous integration builds, or other validation pipelines.
Document the steps so that the agent can replicate these itself.
Any explicit validation steps that the agent can consider to have further confidence in its changes.
Dependencies that aren't obvious from the layout or file structure.
Finally, fill in any remaining space with detailed lists of the following, in order of priority: the list of files in the repo root, the contents of the README, the contents of any key source files, the list of files in the next level down of directories, giving priority to the more structurally important and snippets of code from key source files, such as the one containing the main method.

- Perform a comprehensive inventory of the codebase. Search for and view:
  - README.md, CONTRIBUTING.md, and all other documentation files.
  - Search the codebase for build steps and indications of workarounds like 'HACK', 'TODO', etc.
  - All scripts, particularly those pertaining to build and repo or environment setup.
  - All build and actions pipelines.
  - All project files.
  - All configuration and linting files.
- For each file:
  - think: are the contents or the existence of the file information that the coding agent will need to implement, build, test, validate, or demo a code change?
  - If yes:
    - Document the command or information in detail.
    - Explicitly indicate which commands work and which do not and the order in which commands should be run.
    - Document any errors encountered as well as the steps taken to workaround them.
    - Document any other steps or information that the agent can use to reduce time spent exploring or trying and failing to run bash commands.
- Finally, explicitly instruct the agent to trust the instructions and only perform a search if the information in the instruction...


Co-authored-by: ricardoV94 <28983449+ricardoV94@users.noreply.github.com>

ricardoV94 · 2025-10-06T20:51:12Z

This should be updated after #1635

ricardoV94 · 2025-10-09T08:55:44Z

Superseded by #1643

Initial plan

59dadc8

Copilot AI assigned Copilot and ricardoV94 Oct 6, 2025

Copilot started work on behalf of ricardoV94 October 6, 2025 13:58

ricardoV94 added GitHub CI/CD no releasenotes labels Oct 6, 2025

Add comprehensive Copilot instructions for PyTensor

0c2e2b3

Co-authored-by: ricardoV94 <28983449+ricardoV94@users.noreply.github.com>

Copilot AI requested a review from ricardoV94 October 6, 2025 14:07

Copilot finished work on behalf of ricardoV94 October 6, 2025 14:07

ricardoV94 closed this Oct 9, 2025

ricardoV94 deleted the copilot/fix-9d0694ba-c043-4a81-8e37-c4dfcc818beb branch October 9, 2025 08:55
