feat: add model download cache manager by leehack · Pull Request #129 · leehack/llamadart

leehack · 2026-05-11T01:28:56Z

Summary

Implements the native model download/cache manager for remote model sources, including cache policies, metadata, checksum validation, retries, cancellation, resumable downloads, and cache cleanup APIs.
Wires LlamaEngine.loadModelSource so file-backed/native backends download to a local cached path before loading, while URL-capable backends reject native-cache-only options.
Splits chat-app GGUF model and mmproj handling into independent asset sources so model/projector cache, download, delete, and activation behavior no longer assumes both files come from the same origin.
Hardens web cache identities by persisting digest-based cache markers instead of raw/redacted signed URLs, and rejects unsupported web local-filesystem asset combinations loudly instead of silently succeeding.
Migrates example server and model verification tools away from ad-hoc download logic and updates README/changelog/docs for the full download/cache behavior.
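The retry and cancellation behavior listed above can be pictured with a minimal sketch. It is not the package's implementation: `CancelFlag`, `DownloadCancelled`, and `retryWithBackoff` are hypothetical names, and the attempt count and delays are arbitrary.

```dart
/// Hypothetical cancellation flag, standing in for whatever token the package exposes.
class CancelFlag {
  bool isCancelled = false;
}

/// Thrown when the retry loop notices that the caller cancelled.
class DownloadCancelled implements Exception {}

/// Calls [attempt] until it succeeds or [maxAttempts] is reached,
/// doubling the delay after each failure and checking [cancel] before each try.
Future<T> retryWithBackoff<T>(
  Future<T> Function(int attemptNumber) attempt, {
  int maxAttempts = 3,
  Duration initialDelay = const Duration(milliseconds: 10),
  CancelFlag? cancel,
}) async {
  var delay = initialDelay;
  var attemptNumber = 0;
  while (true) {
    if (cancel != null && cancel.isCancelled) throw DownloadCancelled();
    attemptNumber++;
    try {
      return await attempt(attemptNumber);
    } on Exception {
      // Give up and surface the last error once the budget is spent.
      if (attemptNumber >= maxAttempts) rethrow;
      await Future<void>.delayed(delay);
      delay *= 2;
    }
  }
}

Future<void> main() async {
  var calls = 0;
  final result = await retryWithBackoff<String>((n) async {
    calls++;
    if (n < 3) throw const FormatException('transient failure');
    return 'ok';
  });
  print('$result after $calls attempts'); // ok after 3 attempts
}
```

A real download manager would likely retry only errors it considers transient; this sketch retries every `Exception` to stay short.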

Closes #125

Production-readiness scope

This PR is intended to be merge-ready for the declared feature scope:

Users can load existing local models exactly as before via loadModel(...).
Users can load remote/local structured sources via the additive loadModelSource(...) API.
Native/file-backed users get managed download/cache behavior with progress, retry, resume, cancellation, metadata, checksum validation, and cache maintenance APIs.
URL-capable web users keep direct URL loading for simple supported requests.
Unsupported option/platform combinations fail with explicit errors instead of being treated as successful.
Chat app model and mmproj assets are resolved independently for mixed local/remote source combinations where the platform can actually load them.
No public API breaking changes are intended; existing loadModel(...) and loadModelFromUrl(...) callers are unchanged.
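The routing rule in this scope list can be sketched as a small decision function. All types here are hypothetical stand-ins, not the package's API. Which options count as native-cache-only is an assumption, and `UnsupportedError` stands in for the explicit error the engine raises.

```dart
/// Hypothetical stand-ins for illustration; not the package's real types.
enum BackendKind { fileBacked, urlCapable }

enum LoadPlan { downloadToCacheThenLoadFile, loadUrlDirectly }

class RemoteLoadOptions {
  const RemoteLoadOptions({this.expectedSha256, this.resume = false});

  final String? expectedSha256;
  final bool resume;

  /// Assumption: these options only make sense with a managed local cache.
  bool get needsNativeCache => expectedSha256 != null || resume;
}

/// Decides how a remote source would be loaded on a given backend.
LoadPlan planRemoteLoad(BackendKind backend, RemoteLoadOptions options) {
  if (backend == BackendKind.fileBacked) {
    return LoadPlan.downloadToCacheThenLoadFile;
  }
  if (options.needsNativeCache) {
    // Fail loudly instead of silently ignoring the option.
    throw UnsupportedError(
      'This backend loads URLs directly and cannot honor native-cache-only options.',
    );
  }
  return LoadPlan.loadUrlDirectly;
}

void main() {
  print(planRemoteLoad(BackendKind.fileBacked, const RemoteLoadOptions(resume: true)));
  print(planRemoteLoad(BackendKind.urlCapable, const RemoteLoadOptions()));
  try {
    planRemoteLoad(BackendKind.urlCapable, const RemoteLoadOptions(resume: true));
  } on UnsupportedError catch (e) {
    print('rejected: ${e.message}');
  }
}
```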

Intentionally deferred follow-ups

These are not required for the current feature to work, and are tracked separately to avoid merging incomplete scope into main:

Standardize local-only E2E runners across Dart, Flutter, and Web smoke tests (#130).
chore(process): add a production-readiness checklist for PRs (#131).
feat(chat-app): improve model/mmproj asset-level cache UX (#132).
feat(models): consider source-based multimodal projector loading API (#133).
feat(models): harden cache metadata versioning and recovery policy (#134).
feat(flutter): add model download task/controller helper for apps (#135).
feat(models): improve Hugging Face source ergonomics (#136).
docs(models): clarify local ModelSource option semantics (#137).
feat(models): serialize concurrent downloads for the same cache key (#138).

Test Plan

Review Notes

Final independent pre-commit/read-only reviews found no remaining merge blockers.
A second-pass production-readiness review after the latest PR body update returned PASS for the declared scope. Non-blocking follow-ups were either already tracked or have been filed as #137 and #138, and added to #132.
Review-found blockers were fixed before the latest green CI run:
- Web cache markers no longer use raw/redacted URL strings that could collapse query-only differences; persisted keys use a digest of the canonical identity.
- Web activation/download now rejects local filesystem projector sources instead of treating unsupported combinations as successful.
- Percent-encoded traversal, async transient directory creation, URL-load error redaction, and resume documentation comments were addressed in later commits.
Added regression coverage for cache hits/misses, noCache cleanup, refresh preservation, retry policy, checksum mismatch cleanup, partial resume validators/If-Range, cancellation cleanup, cache list/get/remove/clear/prune, browser stub behavior, engine remote-source wiring, independent model/mmproj asset handling, and local-only real model+projector loading.
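The digest-based cache marker fix can be illustrated with a short sketch. It hashes a canonical identity so that URLs differing only in their query string get different markers, while no raw signed URL is persisted. The canonical form used here (scheme, host, port, path, sorted query parameters) is an assumption, as are the function names. It relies on `package:crypto`, which this PR adds to `pubspec.yaml`.

```dart
import 'dart:convert';

import 'package:crypto/crypto.dart';

/// Builds an assumed canonical identity: scheme, host, port, path,
/// and query parameters sorted by key.
String canonicalIdentity(Uri uri) {
  final params = uri.queryParametersAll.entries.toList()
    ..sort((a, b) => a.key.compareTo(b.key));
  final query = params.map((e) => '${e.key}=${e.value.join(",")}').join('&');
  return '${uri.scheme}://${uri.host}:${uri.port}${uri.path}?$query';
}

/// Hex SHA-256 digest of the canonical identity, safe to persist as a marker.
String cacheMarker(Uri uri) =>
    sha256.convert(utf8.encode(canonicalIdentity(uri))).toString();

void main() {
  final a = cacheMarker(Uri.parse('https://example.com/model.gguf?sig=aaa'));
  final b = cacheMarker(Uri.parse('https://example.com/model.gguf?sig=bbb'));
  print(a == b); // false
  print(a.length); // 64
}
```

Because only the digest is stored, the marker distinguishes query-only differences without leaking the signature value itself.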

Current review-comment status

All Copilot review threads have been resolved after verifying that the comments were addressed by follow-up commits or updated documentation. No unaddressed merge-blocking review comment is known.

Add structured ModelSource, ModelLoadOptions, resolver targets, and download/cache value models for future package-managed GGUF download flows. Wire LlamaEngine.loadModelSource to preserve existing local loading and route remote sources through URL-capable backends while explicitly rejecting unsupported foundation options. Document the additive API surface and add regression coverage for resolver validation, URL redaction, and placeholder download managers.
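URL redaction, as mentioned above, might look roughly like the following sketch. The function name and the `REDACTED` placeholder are assumptions; the package may instead drop the query entirely or use a different format.

```dart
/// Returns a log-safe form of [uri]: user info and fragment are dropped,
/// and every query value is replaced with a placeholder.
String redactUrl(Uri uri) {
  final redactedQuery = <String, String>{
    for (final key in uri.queryParameters.keys) key: 'REDACTED',
  };
  final safe = Uri(
    scheme: uri.scheme,
    host: uri.host,
    port: uri.hasPort ? uri.port : null,
    path: uri.path,
    queryParameters: redactedQuery.isEmpty ? null : redactedQuery,
  );
  return safe.toString();
}

void main() {
  final signed = Uri.parse(
    'https://user:s3cr3t@example.com/models/m.gguf?token=t0k3n',
  );
  print(redactUrl(signed));
}
```

Keeping the parameter names while hiding their values preserves enough context for debugging without exposing credentials in logs or metadata.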

Copilot

Pull request overview

This PR introduces a first-party, package-managed model download + cache system for remote GGUF model sources (HTTP(S) and hf://), and wires it into LlamaEngine.loadModelSource(...) so native/file-backed backends download to a local cached file before loading while URL-capable web backends continue to load directly (with option restrictions). It also migrates example/testing tooling and updates docs/changelog to reflect the new structured model source workflow.
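As a rough illustration of how an `hf://` source could map to a downloadable HTTPS URL, here is a sketch. The `hf://<owner>/<repo>/<file>` shape, the default `main` revision, and the function name are assumptions rather than the package's documented syntax. The `resolve` path follows the Hugging Face Hub's usual file URL layout.

```dart
/// Maps an assumed `hf://<owner>/<repo>/<file>` source to a Hub file URL.
/// The source is split by hand because URI parsing would lowercase the owner.
Uri resolveHfSource(String source, {String revision = 'main'}) {
  const prefix = 'hf://';
  if (!source.startsWith(prefix)) {
    throw FormatException('Expected an hf:// source', source);
  }
  final parts = source
      .substring(prefix.length)
      .split('/')
      .where((p) => p.isNotEmpty)
      .toList();
  if (parts.length < 3) {
    throw FormatException('Expected hf://<owner>/<repo>/<file>', source);
  }
  return Uri(
    scheme: 'https',
    host: 'huggingface.co',
    pathSegments: [parts[0], parts[1], 'resolve', revision, ...parts.sublist(2)],
  );
}

void main() {
  print(resolveHfSource('hf://example-org/example-model/model-q4.gguf'));
}
```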

Changes:

Added new structured model source/value-model APIs (ModelSource, ModelLoadOptions, ModelResolver targets) plus a cross-platform ModelDownloadManager with native IO implementation and browser stub.
Updated LlamaEngine to support loadModelSource(...), including native download/cache integration, URL-backend option rejection, and URL redaction for logs/metadata.
Migrated server and verification tools to use the new download manager and added comprehensive unit coverage for source parsing, cache behavior, resume/retry, and cleanup.

Reviewed changes

Copilot reviewed 27 out of 27 changed files in this pull request and generated 5 comments.

File	Description
website/docs/guides/model-lifecycle.md	Documents `loadModelSource(...)`, cache policies, manager APIs, and mobile download guidance.
website/docs/changelog/recent-releases.md	Adds 0.6.12 release notes for the new model download/cache manager and engine wiring.
tool/testing/verify_recommended_models.dart	Replaces ad-hoc HTTP download logic with `DefaultModelDownloadManager.ensureModel(...)`.
tool/testing/verify_models.dart	Switches model download path to `ensureModel(...)` while keeping legacy local-file behavior.
test/unit/core/models/model_source_test.dart	Adds unit tests for `ModelSource` parsing, redaction, and deterministic keying.
test/unit/core/models/model_resolver_test.dart	Adds tests for resolver targets, defaults, cancellation handling, and remote passthrough.
test/unit/core/models/model_load_options_test.dart	Adds tests for option storage, header immutability, and validation.
test/unit/core/models/download/model_download_manager_test.dart	Validates public export surface of the manager + IO implementation.
test/unit/core/models/download/model_download_manager_stub_test.dart	Ensures browser stub throws the expected unsupported exception.
test/unit/core/models/download/model_download_manager_io_test.dart	Adds end-to-end IO manager tests (cache policies, retry, resume, checksum, cleanup, prune APIs).
test/unit/core/models/download/model_download_manager_base_test.dart	Tests the throwing base manager behavior for unsupported operations.
test/unit/core/models/download/model_cache_entry_test.dart	Adds tests for progress math, cache entry JSON/redaction, and validations.
test/unit/core/engine/engine_test.dart	Adds engine tests for `loadModelSource(...)` routing, progress forwarding, and URL redaction behavior.
test/integration/engine_integration_test.dart	Updates native URL-loading expectation to `LlamaUnsupportedException`.
README.md	Adds usage docs for downloading/caching remote GGUFs via structured sources.
pubspec.yaml	Adds `crypto` dependency for SHA-256 keying/checksum support.
lib/src/core/models/model_source.dart	Introduces `ModelSource` (path/http/hf) parsing, validation, cache keying, and redaction.
lib/src/core/models/model_resolver.dart	Adds resolver interfaces + default resolver and load target value types.
lib/src/core/models/model_load_options.dart	Adds cache policy + load options (headers, bearer token, sha256, resume, retries, cancel).
lib/src/core/models/download/model_download_manager.dart	Adds conditional export for stub vs IO manager implementation.
lib/src/core/models/download/model_download_manager_stub.dart	Adds non-IO stub that throws `LlamaUnsupportedException`.
lib/src/core/models/download/model_download_manager_io.dart	Implements the native download/cache manager (streaming, `.part`, resume/retry, metadata, prune).
lib/src/core/models/download/model_download_manager_base.dart	Adds base API types: progress, cache entry metadata, and throwing manager base.
lib/src/core/engine/engine.dart	Wires structured model loading into the engine and adds URL redaction + option rejection.
lib/llamadart.dart	Exports new public APIs (sources/options/resolver/manager).
example/llamadart_server/lib/src/features/model_management/infrastructure/model_service.dart	Migrates server model acquisition to `ModelSource` + download manager.
CHANGELOG.md	Adds 0.6.12 changelog notes for model source/download/cache manager feature set.
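The `.part` resume flow named in the table can be sketched at the HTTP level. The types and function names are hypothetical and not taken from `model_download_manager_io.dart`. The header and status-code semantics follow standard HTTP range requests: `If-Range` needs a strong validator, `206` means the range was honored, and `416` means it could not be satisfied.

```dart
/// Hypothetical metadata remembered next to a `.part` file.
class PartialDownload {
  const PartialDownload({required this.bytesOnDisk, this.etag, this.lastModified});

  final int bytesOnDisk;
  final String? etag;
  final String? lastModified;
}

/// Picks a validator for If-Range; weak ETags (prefixed with W/) are not allowed there.
String? _validatorFor(PartialDownload part) {
  final etag = part.etag;
  if (etag != null && !etag.startsWith('W/')) return etag;
  return part.lastModified;
}

/// Request headers for resuming; empty when the resume cannot be validated.
Map<String, String> resumeHeaders(PartialDownload part) {
  final validator = _validatorFor(part);
  if (part.bytesOnDisk <= 0 || validator == null) return const {};
  return {
    'Range': 'bytes=${part.bytesOnDisk}-',
    'If-Range': validator,
  };
}

enum ResumeAction { appendToPart, restartFromZero, fail }

/// Interprets the response status of a ranged request.
ResumeAction actionForStatus(int statusCode) {
  if (statusCode == 206) return ResumeAction.appendToPart; // range honored
  if (statusCode == 200) return ResumeAction.restartFromZero; // full body sent
  if (statusCode == 416) return ResumeAction.restartFromZero; // stale .part file
  return ResumeAction.fail;
}

void main() {
  const part = PartialDownload(bytesOnDisk: 1024, etag: '"abc123"');
  print(resumeHeaders(part));
  print(actionForStatus(416));
}
```

Sending `If-Range` means a changed remote file yields a full `200` response instead of a mismatched tail, so the partial file is never stitched to new content.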

codecov-commenter · 2026-05-11T01:36:12Z

Codecov Report

❌ Patch coverage is 89.15816% with 85 lines in your changes missing coverage. Please review.
✅ Project coverage is 77.74%. Comparing base (fc98c90) to head (2bfc7ad).

Files with missing lines	Patch %	Lines
lib/src/platform/io/model_download_manager_io.dart	88.86%	45 Missing ⚠️
lib/src/core/models/model_source.dart	87.42%	22 Missing ⚠️
...e/models/download/model_download_manager_base.dart	91.07%	10 Missing ⚠️
lib/src/core/engine/engine.dart	87.69%	8 Missing ⚠️

Additional details and impacted files

@@            Coverage Diff             @@
##             main     #129      +/-   ##
==========================================
+ Coverage   76.73%   77.74%   +1.00%     
==========================================
  Files          70       75       +5     
  Lines        8734     9512     +778     
==========================================
+ Hits         6702     7395     +693     
- Misses       2032     2117      +85

Flag	Coverage Δ
unittests	`77.74% <89.15%> (+1.00%)`	⬆️
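The figures in this report can be cross-checked with simple arithmetic. The sketch below recomputes the base and head project coverage from the hits and line counts in the diff, and sums the per-file missing lines. The helper name is illustrative only.

```dart
/// Coverage as a percentage of covered lines.
double coveragePercent(int hits, int lines) => 100 * hits / lines;

void main() {
  final base = coveragePercent(6702, 8734);
  final head = coveragePercent(7395, 9512);
  print('base ${base.toStringAsFixed(3)}%, head ${head.toStringAsFixed(3)}%');

  // Per-file missing lines from the report should add up to the reported total.
  final missing = [45, 22, 10, 8].reduce((a, b) => a + b);
  print('missing lines: $missing'); // missing lines: 85
}
```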

Copilot

Pull request overview

Copilot reviewed the pull request ten more times. Each pass reviewed 39 out of 40 changed files and generated, in order, 2, 4, 3, 1, 2, 2, 2, 1, 2, and 1 comments.

leehack added 2 commits May 10, 2026 20:03

feat: add model download cache manager

9f71e21

Copilot AI review requested due to automatic review settings May 11, 2026 01:28

Copilot started reviewing on behalf of leehack May 11, 2026 01:29

Copilot AI reviewed May 11, 2026

leehack added 6 commits May 10, 2026 21:37

docs: move pending changelog notes to unreleased

aea015c

fix: address model download review feedback

6390397

docs: clarify resumable download requirements

a3070c0

fix: keep IO download manager outside core boundary

b21f5e6

fix(chat-app): split model asset source handling

de1bd04

test(chat-app): add local-only model cache e2e

ddcd2d5

leehack requested a review from Copilot May 11, 2026 14:20

Copilot started reviewing on behalf of leehack May 11, 2026 14:20

Copilot AI reviewed May 11, 2026

Comment thread lib/src/core/models/download/model_download_manager_base.dart Outdated

Comment thread lib/src/core/models/model_load_options.dart Outdated

fix: address model source review comments

b7af32b

leehack requested a review from Copilot May 11, 2026 16:04

Copilot started reviewing on behalf of leehack May 11, 2026 16:05

Copilot AI reviewed May 11, 2026

Comment thread lib/src/platform/io/model_download_manager_io.dart Outdated

Comment thread lib/src/platform/io/model_download_manager_io.dart

Comment thread lib/src/platform/io/model_download_manager_io.dart Outdated

Comment thread website/docs/guides/model-lifecycle.md

fix: address model download review comments

5a18b2f

leehack requested a review from Copilot May 11, 2026 17:22

Copilot started reviewing on behalf of leehack May 11, 2026 17:23

Copilot AI reviewed May 11, 2026

Comment thread lib/src/platform/io/model_download_manager_io.dart Outdated

Comment thread lib/src/platform/io/model_download_manager_io.dart Outdated

Comment thread lib/src/core/models/model_load_options.dart Outdated

fix: address follow-up download review comments

1d88f96

leehack requested a review from Copilot May 11, 2026 17:40

Copilot started reviewing on behalf of leehack May 11, 2026 17:41

fix: clean up failed transient downloads

d354322

leehack requested a review from Copilot May 11, 2026 21:26

Copilot started reviewing on behalf of leehack May 11, 2026 21:27

Copilot AI reviewed May 11, 2026

Comment thread lib/src/platform/io/model_download_manager_io.dart

fix: restart downloads after unsatisfiable ranges

849db7f

leehack requested a review from Copilot May 11, 2026 21:38

Copilot started reviewing on behalf of leehack May 11, 2026 21:39

Copilot AI reviewed May 11, 2026

Comment thread lib/src/platform/io/model_download_manager_io.dart Outdated

Comment thread lib/src/core/models/model_source.dart Outdated

fix: validate cache file inputs

80b4892

leehack requested a review from Copilot May 11, 2026 21:54

Copilot started reviewing on behalf of leehack May 11, 2026 21:55

Copilot AI reviewed May 11, 2026

Comment thread lib/src/platform/io/model_download_manager_io.dart Outdated

Comment thread example/chat_app/lib/screens/manage_models_screen.dart

fix: guard redirects and unresolved model paths

2c21c7c

leehack requested a review from Copilot May 11, 2026 22:06

Copilot started reviewing on behalf of leehack May 11, 2026 22:07

Copilot AI reviewed May 11, 2026

Comment thread lib/src/platform/io/model_download_manager_io.dart Outdated

Comment thread lib/src/core/models/model_resolver.dart Outdated

fix: address cache metadata review comments

4e0ecd7

leehack requested a review from Copilot May 11, 2026 22:18

Copilot started reviewing on behalf of leehack May 11, 2026 22:19

Copilot AI reviewed May 11, 2026

Comment thread lib/src/core/engine/engine.dart

fix: preserve resolved source cache identity

bc41eac

leehack requested a review from Copilot May 11, 2026 22:30

Copilot started reviewing on behalf of leehack May 11, 2026 22:31

Copilot AI reviewed May 11, 2026

Comment thread lib/src/core/models/download/model_download_manager_base.dart

Comment thread lib/src/core/engine/engine.dart Outdated

fix: honor resolved local paths and redact cache keys

2bfc7ad

leehack requested a review from Copilot May 11, 2026 22:41

Copilot started reviewing on behalf of leehack May 11, 2026 22:42

Copilot AI reviewed May 11, 2026

Comment thread lib/src/core/engine/engine.dart

leehack merged commit dec0d63 into main May 11, 2026
10 checks passed

leehack deleted the feat/model-source-resolver branch May 11, 2026 23:32
