feat(ci): single step by paul-nechifor · Pull Request #1756 · dimensionalOS/dimos

paul-nechifor · 2026-04-07T05:21:06Z

Problem

We have a docker image build step as part of the build (even if it is cached). Would be better to just uv sync, pytest, mypy.

Closes DIM-783

Solution

Split CI into two workflows: ci.yml (tests + mypy) and docker-build.yml (image builds only)
Only run coverage on merges (to avoid slow down from instrumentation).

Breaking Changes

None.

How to Test

None.

Contributor License Agreement

I have read and approved the CLA.

greptile-apps · 2026-04-07T05:28:12Z

Greptile Summary

This PR consolidates the previously separate tests.yml workflow into ci.yml as a single checks job, and updates docs/development/docker.md to reference ci.yml. The job runs pytest (without coverage on PRs, with coverage on pushes to main/dev) followed by mypy type-checking.

Confidence Score: 5/5

Safe to merge — the consolidation is clean and the new ci.yml correctly handles both PR and push event types.

No P0 or P1 issues found. The single remaining observation (no workflow_dispatch trigger) is a minor P2 convenience concern that doesn't affect correctness.

No files require special attention.

Important Files Changed

Filename	Overview
.github/workflows/ci.yml	Consolidated tests + mypy into a single 'checks' job; tests run without coverage on PRs, with coverage on push; no workflow_dispatch trigger
.github/workflows/docker-build.yml	Docker build workflow — appears unchanged or minimally changed; multi-stage image build with correct change detection
docs/development/docker.md	Updated CI reference from tests.yml to ci.yml; documentation is accurate

Flowchart

%%{init: {'theme': 'neutral'}}%%
flowchart TD
    A[Push to main/dev\nor Pull Request] --> B{paths-ignore\n**.md?}
    B -- Only .md files --> Z[Skip CI]
    B -- Other files --> C{Draft PR?}
    C -- Yes --> Z
    C -- No --> D[checks job]
    D --> E[actions/checkout]
    E --> F[Fix permissions]
    F --> G[uv sync --all-extras --no-extra dds --frozen]
    G --> H[Remove pydrake stubs]
    H --> I{Event type?}
    I -- pull_request --> J[pytest\nnot tool or mujoco]
    I -- push --> K[coverage run\npytest\nnot tool or mujoco]
    K --> L[coverage combine\n& report]
    J --> M[mypy dimos\nif not cancelled]
    L --> M
    M --> N{push event?}
    N -- Yes --> O[Upload coverage\nartifact]
    N -- No --> P[Done]
    O --> P
    M --> Q{Failure?}
    Q -- Yes --> R[df -h disk check]

_{Reviews (1): Last reviewed commit: "paul(ci): single step" | Re-trigger Greptile}

This reverts commit f9227c0.

leshy

honestly I have no idea how this works, but as is tradition, no one but latest person that worked on CI knows how CI works.

so I'll just ask user level questions:

where is the local cache now?
can I add a docker image that depends on some other docker image we build (g1 from ros-python let's say) - will it intelligently build it only when it has to, rebuild if it's parent is rebuilt, and can I run tests on it? - this is where most complexity of CI came from..

if you say no to 2 - can still approve, we will need this eventually, but not yet - can handle this on a case-by-case basis, until someone writes a custom dimos CI action

.github/workflows/ci.yml

Dreamsorcerer

I'll probably make further suggestions in followup PRs, but overall looks reasonable.

paul-nechifor · 2026-04-08T22:39:26Z

honestly I have no idea how this works, but as is tradition, no one but latest person that worked on CI knows how CI works.

so I'll just ask user level questions:
1. where is the local cache now?

2. can I add a docker image that depends on some other docker image we build (g1 from ros-python let's say) - will it intelligently build it only when it has to, rebuild if it's parent is rebuilt, and can I run tests on it? - this is where most complexity of CI came from..
if you say no to 2 - can still approve, we will need this eventually, but not yet - can handle this on a case-by-case basis, until someone writes a custom dimos CI action

It's stored locally. This checks out new branches without cleaning up previous ones.

      - uses: actions/checkout@v5
        with:
          clean: false

I know what you're thinking: "but that messes up different runs". True, but i'm also running this:

git clean -ffdx -e .venv

That removes every non-tracked file which is not in .venv.

So the end effect is that only the .venv dir is preserved between runs. uv sync automatically updates it if it's slightly off.

leshy · 2026-04-09T01:40:12Z

I know what you're thinking: "but that messes up different runs". True, but i'm also running this:

is there an issue with parallelization then? I assumed we have parallel checkouts if running multiple branches tests?
also do you need to clean data/* files?

paul-nechifor · 2026-04-09T01:48:44Z

I know what you're thinking: "but that messes up different runs". True, but i'm also running this:

is there an issue with parallelization then? I assumed we have parallel checkouts if running multiple branches tests? also do you need to clean data/* files?

I don't know if there are multiple checkouts per worker, but that shouldn't be an issue in either case. It will just have as many "caches" as there are checkouts.

I think data needs to be cleared because the pre-commit script doesn't allow files that are not commited and other branches could polute that.

Essentially what is preserved is only what is in .venv and git files because git and uv are capable of bringing the checkout to a clean state. If I tried to preserve data, I would have to add some logic to handle different cases.

leshy

I'm unsure how this works wrt multiple parallel CI runs but one mind in the torment nexus of CI is enough

paul-nechifor · 2026-04-09T01:54:50Z

can I add a docker image that depends on some other docker image we build (g1 from ros-python let's say) - will it intelligently build it only when it has to, rebuild if it's parent is rebuilt, and can I run tests on it? - this is where most complexity of CI came from..

Sorry, forgot to answer this.

I've kept docker-build.yml the same. The only difference is that the images get built on merges to dev, main, or every Monday morning.

If you add a forth docker image for just g1, then it will be built under those same conditions.

leshy · 2026-04-09T01:56:47Z

If you add a forth docker image for just g1, then it will be built under those same conditions.

but docker images introduce system changes that might be required for running actual tests on that build (imagine DDS for example) so this will cause it to fail tests no?

we can shelf this, this is solving an issue we don't have yet, by sacrificing actual irl performance.

in a month someone might want us to CI build custom docker + run pytest on it that passes only within that system, so should keep that in mind

paul-nechifor · 2026-04-09T01:59:10Z

If you add a forth docker image for just g1, then it will be built under those same conditions.

but docker images introduce system changes that might be required for running actual tests on that build (imagine DDS for example) so this will cause it to fail tests no?

The solution is to have multiple smaller commits. If you need to add a new system dependency, add it a as a separate PR. That will built a new image.

leshy · 2026-04-09T02:01:17Z

The solution is to have multiple smaller commits. If you need to add a new system dependency, add it a as a separate PR. That will built a new image.

yeah that's perfect! we simplify 99.9% usecases and make 0.1% slightly slower

paul(ci): single step

952c7bf

paul-nechifor changed the title ~~paul(ci): single step~~ feat(ci): single step Apr 7, 2026

paul-nechifor added 7 commits April 8, 2026 00:32

simplify macos

da00f19

cache uv install

e0d8692

update actions

bc74f3e

do not install installed pacakges

9852707

remove uv caching

cbdedec

try local cache

ee27453

trigger update

f9227c0

paul-nechifor requested a review from leshy April 8, 2026 02:38

paul-nechifor added 2 commits April 8, 2026 05:42

try different way

84cae13

Revert "trigger update"

d4dcb32

This reverts commit f9227c0.

leshy previously approved these changes Apr 8, 2026

View reviewed changes

Dreamsorcerer reviewed Apr 8, 2026

View reviewed changes

.github/workflows/ci.yml Show resolved Hide resolved

Dreamsorcerer previously approved these changes Apr 8, 2026

View reviewed changes

paul-nechifor enabled auto-merge (squash) April 8, 2026 22:19

rename checks to ci-complete

7e95512

paul-nechifor dismissed stale reviews from Dreamsorcerer and leshy via 7e95512 April 8, 2026 22:44

Merge branch 'dev' into paul/feat/single-ci

9a4a849

leshy approved these changes Apr 9, 2026

View reviewed changes

paul-nechifor merged commit 048d405 into dev Apr 9, 2026
3 of 4 checks passed

paul-nechifor deleted the paul/feat/single-ci branch April 9, 2026 02:03

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat(ci): single step#1756

feat(ci): single step#1756
paul-nechifor merged 12 commits intodevfrom
paul/feat/single-ci

paul-nechifor commented Apr 7, 2026 •

edited

Loading

Uh oh!

greptile-apps bot commented Apr 7, 2026

Uh oh!

leshy left a comment •

edited

Loading

Uh oh!

Uh oh!

Dreamsorcerer left a comment •

edited

Loading

Uh oh!

paul-nechifor commented Apr 8, 2026

Uh oh!

leshy commented Apr 9, 2026

Uh oh!

paul-nechifor commented Apr 9, 2026

Uh oh!

leshy left a comment •

edited

Loading

Uh oh!

paul-nechifor commented Apr 9, 2026

Uh oh!

leshy commented Apr 9, 2026 •

edited

Loading

Uh oh!

paul-nechifor commented Apr 9, 2026

Uh oh!

leshy commented Apr 9, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

paul-nechifor commented Apr 7, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Problem

Solution

Breaking Changes

How to Test

Contributor License Agreement

Uh oh!

greptile-apps bot commented Apr 7, 2026

Greptile Summary

Confidence Score: 5/5

Important Files Changed

Flowchart

Uh oh!

leshy left a comment • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Dreamsorcerer left a comment • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

paul-nechifor commented Apr 8, 2026

Uh oh!

leshy commented Apr 9, 2026

Uh oh!

paul-nechifor commented Apr 9, 2026

Uh oh!

leshy left a comment • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

paul-nechifor commented Apr 9, 2026

Uh oh!

leshy commented Apr 9, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

paul-nechifor commented Apr 9, 2026

Uh oh!

leshy commented Apr 9, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

paul-nechifor commented Apr 7, 2026 •

edited

Loading

leshy left a comment •

edited

Loading

Dreamsorcerer left a comment •

edited

Loading

leshy left a comment •

edited

Loading

leshy commented Apr 9, 2026 •

edited

Loading