feat(webgpu): harden large-model web loading readiness by leehack · Pull Request #73 · leehack/llamadart

leehack · 2026-02-27T01:41:33Z

Summary

Improve web large-model loading resilience in WebGpuLlamaBackend with smarter fallback/retry behavior, wasm64/fetch-backend controls, and clearer runtime diagnostics for memory/COI/thread constraints.
Add chat app web cache prefetch + eviction wiring and UX polish for large-model flows (progress labels, warning-only messaging, reduced synthetic progress dominance).
Pin bridge assets to leehack/llama-web-bridge-assets@v0.1.5, update bridge integration docs, and inject COI custom_headers in HF static deploy workflow output.

Validation

dart analyze
dart test -p chrome test/unit/backends/webgpu/webgpu_backend_test.dart
flutter build web --release (from example/chat_app)

Improve web bridge loading resilience and diagnostics for large GGUF models, including cache prefetch wiring, wasm64/fetch fallback controls, and clearer user-facing warnings. Align chat app and docs to bridge assets v0.1.5 and add COI headers to HF static deployment so browser runtime requirements are met by default.

codecov-commenter · 2026-02-27T02:40:39Z

Codecov Report

✅ All modified and coverable lines are covered by tests.
✅ Project coverage is 76.51%. Comparing base (a6c6471) to head (6f20bbc).
⚠️ Report is 6 commits behind head on main.

Additional details and impacted files

@@           Coverage Diff           @@
##             main      #73   +/-   ##
=======================================
  Coverage   76.51%   76.51%           
=======================================
  Files          66       66           
  Lines        8196     8196           
=======================================
  Hits         6271     6271           
  Misses       1925     1925

Flag	Coverage Δ
unittests	`76.51% <ø> (ø)`

Flags with carried forward coverage won't be shown. Click here to find out more.

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

🚀 New features to boost your workflow:

❄️ Test Analytics: Detect flaky tests, report on failures, and find test suite problems.

leehack merged commit fd4ac69 into main Feb 27, 2026
6 checks passed

leehack deleted the feat/webgpu-large-model-readiness branch February 27, 2026 02:10

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat(webgpu): harden large-model web loading readiness#73

feat(webgpu): harden large-model web loading readiness#73
leehack merged 1 commit intomainfrom
feat/webgpu-large-model-readiness

leehack commented Feb 27, 2026

Uh oh!

Uh oh!

codecov-commenter commented Feb 27, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

leehack commented Feb 27, 2026

Summary

Validation

Uh oh!

Uh oh!

codecov-commenter commented Feb 27, 2026

Codecov Report

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants