refactor: flatten sub-module imports to use parent package re-exports by NJX-njx · Pull Request #996 · inclusionAI/AReaL

NJX-njx · 2026-03-06T11:56:11Z

Description

Add __init__.py with fully lazy re-exports (__getattr__ + __all__) for areal/api, areal/engine, areal/reward, areal/dataset, and areal/workflow, then rewrite all external imports across ~100 files to use the shorter parent-package form.

For example: from areal.api.engine_api import TrainEngine → from areal.api import TrainEngine.

Related Issue

N/A

Type of Change

Bug fix (non-breaking change that fixes an issue)
New feature (non-breaking change that adds functionality)
Breaking change (fix or feature that would cause existing functionality to not work as expected)
Documentation update
Code refactoring (no functional changes)
Performance improvement
Test coverage improvement

Key Changes

Fully lazy __init__.py re-exports: all symbols use __getattr__ — no eager top-level imports, making areal.api completely side-effect free at import time
areal/api (+23 lazy symbols), areal/engine (+11), areal/reward (+3), areal/workflow (+3), areal/dataset (__all__ only)
Flatten imports in areal/trainer/, areal/utils/, areal/models/, areal/engine/, areal/workflow/, areal/infra/, areal/experimental/, tests/, examples/
Preserve cli_args deep imports — dozens of config classes including SchedulingSpec; explicit path is clearer
Preserve intra-package imports — avoids circular dependency issues
Reward submodules use sibling-relative imports (from . import get_math_verify_worker) instead of absolute package imports, eliminating self-import coupling
Preserve non-exported symbols (DeviceRuntimeInfo, HttpRequest, HttpGenerationResult, etc.) that remain as deep imports

What is NOT changed

Category	Reason
`from areal.api.cli_args import ...` (including `SchedulingSpec`)	Dozens of config classes — explicit sub-module path is more readable
`from areal.engine.core.`, `fsdp_utils.`, `megatron_utils.*`	Internal engine utilities, not part of public API
`from areal.api.io_struct import DeviceRuntimeInfo, HttpRequest, ...`	Not added to `__all__` — internal symbols
Intra-package imports (e.g., `areal/api/engine_api.py` importing from `areal/api/io_struct`)	Prevents circular imports

Checklist

I have read the Contributing Guide
I have run formatting tools (pre-commit or manual)
I have run relevant unit tests and they pass
I have added tests for new functionality
I have updated documentation if needed
My branch is up to date with main
This PR introduces breaking changes (if yes, fill out details below)
If this PR changes documentation, I have built and previewed it locally with jb build docs
No critical issues raised by AI reviewers (/gemini review)

Breaking Change Details (if applicable):

N/A — all old import paths still work. This is a purely additive refactoring.

Additional Context

All re-exports use PEP 562 lazy __getattr__ to ensure zero side effects at module load time. Symbols are only resolved when first accessed, eliminating circular import risks.

106 files changed, 425 insertions(+), 265 deletions(-)

chatgpt-codex-connector · 2026-03-06T11:56:17Z

You have reached your Codex usage limits for code reviews. You can see your limits in the Codex usage dashboard.

gemini-code-assist · 2026-03-06T11:56:29Z

Summary of Changes

Hello, I'm Gemini Code Assist¹! I'm currently reviewing this pull request and will post my feedback shortly. In the meantime, here's a summary to help you and other reviewers quickly get up to speed!

This pull request significantly enhances the usability of the areal library by flattening its import structure for the api, workflow, and engine modules. This change allows developers to import key classes and functions directly from the module root (e.g., from areal.workflow import RLVRWorkflow) rather than requiring deep, verbose paths. The implementation utilizes a lazy import pattern to avoid loading heavy dependencies unnecessarily, ensuring performance is not impacted, and maintains full backward compatibility for existing codebases.

Highlights

Flattened Import Structure: The import structure for areal/api, areal/workflow, and areal/engine modules has been flattened, allowing for shorter and more intuitive import paths (e.g., from areal.workflow import RLVRWorkflow).
Lazy Import Pattern: A __getattr__ lazy import pattern was implemented to avoid eagerly loading heavy dependencies (like torch) when modules are imported, improving startup performance.
Clean API Surface: Only the most commonly used public classes are re-exported in the __init__.py files to maintain a clean and focused API surface.
Backward Compatibility: The changes are fully backward compatible, meaning existing deep import paths will continue to function without issues.
Comprehensive Testing: New comprehensive tests (tests/test_flatten_imports.py) were added to validate the new import paths, __all__ definitions, and the lazy loading mechanism.

🧠 New Feature in Public Preview: You can now enable Memory to help Gemini Code Assist learn from your team's feedback. This makes future code reviews more consistent and personalized to your project's style. Click here to enable Memory in your admin console.

Changelog

areal/api/init.py
- Added a new __init__.py file to the areal/api module.
- Configured __init__.py to re-export commonly used API types and abstractions.
- Implemented a __getattr__ lazy import mechanism for these re-exported types.
areal/engine/init.py
- Modified the existing __init__.py file in areal/engine.
- Configured __init__.py to re-export engine implementations like FSDPEngine and MegatronEngine.
- Integrated a __getattr__ lazy import pattern for the engine classes.
areal/workflow/init.py
- Added a new __init__.py file to the areal/workflow module.
- Configured __init__.py to re-export workflow classes such as RLVRWorkflow and MultiTurnWorkflow.
- Implemented a __getattr__ lazy import mechanism for these workflow classes.
tests/test_flatten_imports.py
- Added a new test file to verify the functionality of the flattened import structure.
- Included tests for areal.api, areal.workflow, and areal.engine to ensure correct re-exports and lazy loading.
- Verified that AttributeError is raised for invalid attribute access and that __all__ is correctly defined.
- Added a specific test to confirm that importing the workflow module does not eagerly load torch.

Activity

No human activity has been recorded on this pull request yet.

Using Gemini Code Assist

The full guide for Gemini Code Assist can be found on our documentation page, here are some quick tips.

Invoking Gemini

You can request assistance from Gemini at any point by creating a comment using either /gemini <command> or @gemini-code-assist <command>. Below is a summary of the supported commands on the current page.

Feature	Command	Description
Code Review	`/gemini review`	Performs a code review for the current pull request in its current state.
Pull Request Summary	`/gemini summary`	Provides a summary of the current pull request in its current state.
Comment	@gemini-code-assist	Responds in comments when explicitly tagged, both in pull request comments and review comments.
Help	`/gemini help`	Displays a list of available commands.

Customization

To customize Gemini Code Assist for GitHub experience, repository maintainers can create a configuration file and/or provide a custom code review style guide (such as PEP-8 for Python) by creating and adding files to a .gemini/ folder in the base of the repository. Detailed instructions can be found here.

Limitations & Feedback

Gemini Code Assist may make mistakes. Please leave feedback on any instances where its feedback is incorrect or counter productive. You can react with 👍 and 👎 on @gemini-code-assist comments. If you're interested in giving your feedback about your experience with Gemini Code Assist for Github and other Google products, sign up here.

You can also get AI-powered code generation, chat, as well as code reviews directly in the IDE at no cost with the Gemini Code Assist IDE Extension.

Review the Privacy Notices, Generative AI Prohibited Use Policy, Terms of Service, and learn how to configure Gemini Code Assist in GitHub here. Gemini can make mistakes, so double check it and use code with caution. ↩

gemini-code-assist

Code Review

This pull request introduces a flattened import structure for the api, engine, and workflow modules, which is a great usability improvement. The use of __getattr__ for lazy loading is well-implemented to avoid unnecessary imports of heavy dependencies. My review includes suggestions to improve maintainability by reducing code duplication, enhance developer experience with better support for static analysis tools, and make the new tests more robust.

areal/api/__init__.py

tests/test_flatten_imports.py

Copilot

Pull request overview

This PR implements the "Flatten the import structure of areal modules" item from the 2026 Q1 roadmap (#907). It adds __init__.py files that allow commonly used classes to be imported via short paths (e.g., from areal.api import AllocationMode) instead of requiring deep module paths (e.g., from areal.api.alloc_mode import AllocationMode). The implementation uses PEP 562 module-level __getattr__ for lazy imports to avoid eagerly pulling in heavy dependencies like torch.

Changes:

New areal/api/__init__.py with lazy re-exports for engine, scheduler, allocation, and IO types (with AsyncRewardWrapper and RolloutWorkflow eagerly imported since they don't pull in torch)
New areal/workflow/__init__.py with lazy re-exports for RLVRWorkflow, MultiTurnWorkflow, and VisionRLVRWorkflow
Modified areal/engine/__init__.py (previously empty) with lazy re-exports for all four engine classes
New tests/test_flatten_imports.py with tests for all new import paths

Reviewed changes

Copilot reviewed 4 out of 4 changed files in this pull request and generated 6 comments.

File	Description
`areal/api/__init__.py`	Adds short-path imports for public API types using lazy `__getattr__` pattern
`areal/workflow/__init__.py`	Adds short-path imports for all workflow classes using lazy `__getattr__` pattern
`areal/engine/__init__.py`	Adds short-path imports for all engine classes (previously empty file)
`tests/test_flatten_imports.py`	Tests the new import paths across all three modules

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

tests/test_flatten_imports.py

areal/api/__init__.py

tests/test_flatten_imports.py

Add __init__.py with lazy re-exports (__getattr__ + __all__) for areal/api, areal/engine, areal/reward, areal/workflow, and __all__ for areal/dataset, then rewrite all external imports across the codebase to use the shorter parent-package form (e.g. `from areal.api import TrainEngine`). Key changes: - All re-exports are fully lazy via __getattr__ (no eager imports) - Flatten ~100 files across areal/, tests/, examples/ - Preserve cli_args deep imports (dozens of config classes, including SchedulingSpec which stays in cli_args) - Preserve intra-package relative imports to avoid circular deps - Reward submodules use sibling-relative imports (from . import ...) - Preserve non-exported symbols (DeviceRuntimeInfo, HttpRequest, etc.)

rchardx

LGTM

…onflicts Runner 2.317.0 does not support node24 required by actions/checkout@v6, causing 'Set up job' failures on dynamically provisioned GCP instances. Key changes: - Bump RUNNER_VERSION from 2.317.0 to 2.332.0 in test-areal.yml and build-docker-image.yml - Remove duplicate imports in test_rollout_controller.py from rebase with PR #996

* feat(infra): split Docker image into sglang and vllm variants Build separate sglang and vllm Docker images from a single parameterized Dockerfile. Each variant ships only one inference backend, eliminating dependency conflicts and reducing image size. Key changes: - Dockerfile with ARG BASE_IMAGE/VARIANT for per-variant builds - CI workflows: build-docker-image, test-areal, tag-release, install-test - pyproject.toml: sglang/vllm conflicting extras, cuda renamed to cuda-train - pytest markers and conditional fixtures for backend-specific tests - GRPO tests parametrized over both sglang and vllm inference backends - Docker validation tools auto-detect installed inference variant Refs: #985 * fix(infra): update GCP runner to v2.332.0 and resolve rebase import conflicts Runner 2.317.0 does not support node24 required by actions/checkout@v6, causing 'Set up job' failures on dynamically provisioned GCP instances. Key changes: - Bump RUNNER_VERSION from 2.317.0 to 2.332.0 in test-areal.yml and build-docker-image.yml - Remove duplicate imports in test_rollout_controller.py from rebase with PR #996

…#996) Add __init__.py with lazy re-exports (__getattr__ + __all__) for areal/api, areal/engine, areal/reward, areal/workflow, and __all__ for areal/dataset, then rewrite all external imports across the codebase to use the shorter parent-package form (e.g. `from areal.api import TrainEngine`). Key changes: - All re-exports are fully lazy via __getattr__ (no eager imports) - Flatten ~100 files across areal/, tests/, examples/ - Preserve cli_args deep imports (dozens of config classes, including SchedulingSpec which stays in cli_args) - Preserve intra-package relative imports to avoid circular deps - Reward submodules use sibling-relative imports (from . import ...) - Preserve non-exported symbols (DeviceRuntimeInfo, HttpRequest, etc.) Co-authored-by: Wentai Zhang <zhangwentai.zwt@antgroup.com>

* feat(infra): split Docker image into sglang and vllm variants Build separate sglang and vllm Docker images from a single parameterized Dockerfile. Each variant ships only one inference backend, eliminating dependency conflicts and reducing image size. Key changes: - Dockerfile with ARG BASE_IMAGE/VARIANT for per-variant builds - CI workflows: build-docker-image, test-areal, tag-release, install-test - pyproject.toml: sglang/vllm conflicting extras, cuda renamed to cuda-train - pytest markers and conditional fixtures for backend-specific tests - GRPO tests parametrized over both sglang and vllm inference backends - Docker validation tools auto-detect installed inference variant Refs: #985 * fix(infra): update GCP runner to v2.332.0 and resolve rebase import conflicts Runner 2.317.0 does not support node24 required by actions/checkout@v6, causing 'Set up job' failures on dynamically provisioned GCP instances. Key changes: - Bump RUNNER_VERSION from 2.317.0 to 2.332.0 in test-areal.yml and build-docker-image.yml - Remove duplicate imports in test_rollout_controller.py from rebase with PR #996

Copilot AI review requested due to automatic review settings March 6, 2026 11:56

Copilot started reviewing on behalf of NJX-njx March 6, 2026 11:56 View session

gemini-code-assist bot reviewed Mar 6, 2026

View reviewed changes

areal/api/__init__.py Outdated Show resolved Hide resolved

tests/test_flatten_imports.py Outdated Show resolved Hide resolved

Copilot AI reviewed Mar 6, 2026

View reviewed changes

rchardx force-pushed the feature/flatten-import-structure branch 3 times, most recently from fd268da to 7fdd976 Compare March 6, 2026 18:26

rchardx changed the title ~~feat: flatten import structure for api, workflow, and engine modules~~ refactor: flatten sub-module imports to use parent package re-exports Mar 6, 2026

rchardx added the safe-to-test Ready to run unit-tests in a PR. label Mar 6, 2026

rchardx force-pushed the feature/flatten-import-structure branch from 7fdd976 to 6b215a3 Compare March 6, 2026 18:29

rchardx added safe-to-test Ready to run unit-tests in a PR. and removed safe-to-test Ready to run unit-tests in a PR. labels Mar 6, 2026

rchardx force-pushed the feature/flatten-import-structure branch from 6b215a3 to d4aa152 Compare March 6, 2026 18:36

rchardx removed the safe-to-test Ready to run unit-tests in a PR. label Mar 6, 2026

rchardx force-pushed the feature/flatten-import-structure branch from d4aa152 to 9b1c36d Compare March 6, 2026 18:56

rchardx approved these changes Mar 6, 2026

View reviewed changes

rchardx merged commit 71d2824 into inclusionAI:main Mar 6, 2026
5 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

refactor: flatten sub-module imports to use parent package re-exports#996

refactor: flatten sub-module imports to use parent package re-exports#996
rchardx merged 1 commit intoinclusionAI:mainfrom
NJX-njx:feature/flatten-import-structure

NJX-njx commented Mar 6, 2026 •

edited by rchardx

Loading

Uh oh!

chatgpt-codex-connector bot commented Mar 6, 2026

Uh oh!

gemini-code-assist bot commented Mar 6, 2026

Uh oh!

gemini-code-assist bot left a comment

Uh oh!

Uh oh!

Uh oh!

Copilot AI left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

rchardx left a comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

NJX-njx commented Mar 6, 2026 • edited by rchardx Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Description

Related Issue

Type of Change

Key Changes

What is NOT changed

Checklist

Additional Context

Uh oh!

chatgpt-codex-connector bot commented Mar 6, 2026

Uh oh!

gemini-code-assist bot commented Mar 6, 2026

Summary of Changes

Highlights

Footnotes

Uh oh!

gemini-code-assist bot left a comment

Choose a reason for hiding this comment

Code Review

Uh oh!

Uh oh!

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Reviewed changes

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

rchardx left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

NJX-njx commented Mar 6, 2026 •

edited by rchardx

Loading