
Centralize Turbo export/push config, make quantization defaults deterministic, and prepare v2.1 pre-release#25

Merged
codewithdark-git merged 6 commits into main from copilot/centralize-config-export-push-parameters
Apr 25, 2026

Conversation

Contributor

Copilot AI commented Apr 24, 2026

Previously, export/push behavior required repeating format/quantization settings across multiple calls, which let export() and push() diverge and produce inconsistent outputs. This change introduces a single Turbo-level config for downstream export/push behavior, with explicit defaults, format-safe quantization handling, and v2.1 pre-release documentation/version updates.

  • Shared Turbo-level export/push config

    • Added config support to turbo(...) and TurboModel.from_pretrained(...).
    • Introduced centralized defaults in TurboModel:
      • format: "safetensors"
      • push_format: "safetensors"
      • quantization: "Q4_K_M"
      • push_quantization: None
    • Added internal resolver logic to normalize and store this config once at initialization.
    • Supports nullable overrides for push_quantization (explicit None is preserved).
  • export() now consumes internal config

    • format is optional; when omitted, uses shared config.
    • GGUF quantization defaults come from shared config (Q4_K_M) unless explicitly overridden.
    • Explicit format / quantization args still take precedence.
  • push() now consumes internal config with format-safe quantization

    • format is optional; when omitted, uses push_format.
    • push_quantization is no longer always-on by default.
    • GGUF push still resolves quantization deterministically (falls back to shared GGUF quantization).
    • ONNX/MLX push no longer gets forced GGUF-style quantization when not configured.
  • Docs + examples updated for new usage

    • Updated API/docs/examples to show configuring once at Turbo init, then calling export() / push() without repeating format/quantization in common GGUF flows.
    • Refreshed old usage snippets where the new shared-config API is preferred.
  • Pre-release version update

    • Updated project/docs versioning and visible version strings to v2.1.0rc1 for pre-release preparation.
  • Focused coverage for config propagation

    • Added tests validating:
      • deterministic default resolution,
      • export/push default propagation from shared config,
      • synchronization behavior when only export-side values are provided,
      • nullable push_quantization override behavior,
      • ONNX push does not force quantization by default,
      • portable temp staging paths via tmp_path.
from quantllm import turbo

model = turbo(
    "meta-llama/Llama-3.2-3B",
    config={
        "format": "gguf",
        "quantization": "Q4_K_M",
        "push_format": "gguf",
    },
)

model.export()               # uses gguf + Q4_K_M from shared config
model.push("user/my-model")  # uses gguf defaults from shared config
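The one-time default resolution described above can be sketched roughly as follows. This is an illustrative sketch only: the function name `resolve_export_push_config` and the `DEFAULTS` mapping mirror the defaults listed in this PR, but they are assumptions, not QuantLLM's actual internals.

```python
# Illustrative sketch of resolving the shared export/push config once at
# TurboModel init. Names here are assumptions, not QuantLLM's real API.
DEFAULTS = {
    "format": "safetensors",
    "push_format": "safetensors",
    "quantization": "Q4_K_M",
    "push_quantization": None,
}

def resolve_export_push_config(user_config=None):
    """Merge user overrides onto the defaults, preserving explicit None."""
    user_config = dict(user_config or {})
    unknown = set(user_config) - set(DEFAULTS)
    if unknown:
        raise ValueError(f"unknown export/push config keys: {sorted(unknown)}")
    # Dict merge keeps an explicit None override (nullable push_quantization).
    resolved = {**DEFAULTS, **user_config}
    # Keep push_format in sync when only the export-side format is given.
    if "format" in user_config and "push_format" not in user_config:
        resolved["push_format"] = user_config["format"]
    return resolved
```

With this shape, `resolve_export_push_config({"format": "gguf"})` yields a config whose `push_format` is also `"gguf"`, matching the synchronization behavior the tests cover.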

Copilot AI and others added 2 commits April 24, 2026 20:23
Agent-Logs-Url: https://github.com/codewithdark-git/QuantLLM/sessions/aa78d528-be1d-4467-813d-711a55ade22a

Co-authored-by: codewithdark-git <144595403+codewithdark-git@users.noreply.github.com>
Copilot AI changed the title [WIP] Centralize config for export/push parameters and improve quantization default handling Centralize Turbo export/push config and make quantization defaults deterministic Apr 24, 2026
Copilot AI requested a review from codewithdark-git April 24, 2026 20:29
Owner

@codewithdark-git codewithdark-git left a comment


@copilot The changes are correct, but we need to update all of the documentation, migrate any old code usage to the new API, and bump the pre-release to v2.1.

@codewithdark-git codewithdark-git marked this pull request as ready for review April 25, 2026 06:11
Copilot AI review requested due to automatic review settings April 25, 2026 06:11
Contributor

Copilot AI left a comment


Pull request overview

This PR centralizes export/push configuration at the Turbo/TurboModel level to avoid per-call duplication and to make default export/push behavior deterministic across export() and push().

Changes:

  • Added shared config support to turbo(...) and TurboModel.from_pretrained(...), resolved once into TurboModel.export_push_config.
  • Updated export() and push_to_hub() to consume the resolved internal config when per-call args are omitted.
  • Added tests plus documentation/examples updates to reflect configuring export/push defaults once at initialization.
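The per-call fallback in the second bullet can be sketched as below. The class and attribute names (`TurboModelSketch`, `export_push_config`) are illustrative stand-ins for the real `TurboModel`, not its actual implementation.

```python
# Sketch of export() consuming the resolved shared config when per-call
# arguments are omitted. Names are assumptions, not QuantLLM's real API.
class TurboModelSketch:
    def __init__(self, export_push_config):
        self.export_push_config = export_push_config

    def export(self, format=None, quantization=None):
        cfg = self.export_push_config
        fmt = format or cfg["format"]        # explicit per-call arg wins
        quant = quantization
        if quant is None and fmt == "gguf":  # GGUF default from shared config
            quant = cfg["quantization"]
        return (fmt, quant)
```

Note that a non-GGUF export leaves quantization unset unless the caller passes it explicitly, which is the format-safe behavior the review describes.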

Reviewed changes

Copilot reviewed 5 out of 5 changed files in this pull request and generated 4 comments.

Show a summary per file
File Description
quantllm/core/turbo_model.py Introduces default export/push config, resolver, and uses it to default export()/push_to_hub() behavior.
tests/test_export_push_config.py Adds coverage for deterministic config resolution and propagation into export()/push().
docs/guide/hub-integration.md Updates hub examples to show one-time config at turbo(...) init.
docs/api/turbo.md Documents the new config parameter on turbo(...).
README.md Updates quickstart snippet to use shared config + argument-free export()/push().


Comment thread quantllm/core/turbo_model.py
Comment thread quantllm/core/turbo_model.py Outdated
Comment thread quantllm/core/turbo_model.py Outdated
Comment thread tests/test_export_push_config.py
Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>
@codewithdark-git
Owner

@copilot apply changes based on the comments in this thread

Copilot AI and others added 2 commits April 25, 2026 06:29
Agent-Logs-Url: https://github.com/codewithdark-git/QuantLLM/sessions/4f6f0bc8-c954-444c-afa4-6244c9cf4865

Co-authored-by: codewithdark-git <144595403+codewithdark-git@users.noreply.github.com>
Contributor Author

Copilot AI commented Apr 25, 2026

@copilot The changes are correct, but we need to update all of the documentation, migrate any old code usage to the new API, and bump the pre-release to v2.1.

Done in 1aa2f52 (and follow-up 1c63330). I updated documentation/examples to the new shared config API usage where applicable and bumped the project/docs pre-release versioning to v2.1.0rc1.

@copilot apply changes based on the comments in this thread

Applied the review-thread fixes in 1aa2f52 and 1c63330: ONNX/MLX push no longer gets forced GGUF quantization defaults, nullable push_quantization override is supported, test staging paths now use tmp_path, and added explicit test coverage for nullable override behavior.

Copilot AI changed the title Centralize Turbo export/push config and make quantization defaults deterministic Centralize Turbo export/push config, make quantization defaults deterministic, and prepare v2.1 pre-release Apr 25, 2026
Copilot AI requested a review from codewithdark-git April 25, 2026 06:31
@codewithdark-git codewithdark-git merged commit 4828488 into main Apr 25, 2026
1 check passed


Development

Successfully merging this pull request may close these issues.

Centralize config for export/push parameters and improve quantization default handling

3 participants