Skip to content

feat(webgpu): harden large-model web loading readiness#73

Merged
leehack merged 1 commit intomainfrom
feat/webgpu-large-model-readiness
Feb 27, 2026
Merged

feat(webgpu): harden large-model web loading readiness#73
leehack merged 1 commit intomainfrom
feat/webgpu-large-model-readiness

Conversation

@leehack
Copy link
Copy Markdown
Owner

@leehack leehack commented Feb 27, 2026

Summary

  • Improve web large-model loading resilience in WebGpuLlamaBackend with smarter fallback/retry behavior, wasm64/fetch-backend controls, and clearer runtime diagnostics for memory/COI/thread constraints.
  • Add chat app web cache prefetch + eviction wiring and UX polish for large-model flows (progress labels, warning-only messaging, reduced synthetic progress dominance).
  • Pin bridge assets to leehack/llama-web-bridge-assets@v0.1.5, update bridge integration docs, and inject COI custom_headers in HF static deploy workflow output.

Validation

  • dart analyze
  • dart test -p chrome test/unit/backends/webgpu/webgpu_backend_test.dart
  • flutter build web --release (from example/chat_app)

Improve web bridge loading resilience and diagnostics for large GGUF models, including cache prefetch wiring, wasm64/fetch fallback controls, and clearer user-facing warnings. Align chat app and docs to bridge assets v0.1.5 and add COI headers to HF static deployment so browser runtime requirements are met by default.
@leehack leehack merged commit fd4ac69 into main Feb 27, 2026
6 checks passed
@leehack leehack deleted the feat/webgpu-large-model-readiness branch February 27, 2026 02:10
@codecov-commenter
Copy link
Copy Markdown

Codecov Report

✅ All modified and coverable lines are covered by tests.
✅ Project coverage is 76.51%. Comparing base (a6c6471) to head (6f20bbc).
⚠️ Report is 6 commits behind head on main.

Additional details and impacted files
@@           Coverage Diff           @@
##             main      #73   +/-   ##
=======================================
  Coverage   76.51%   76.51%           
=======================================
  Files          66       66           
  Lines        8196     8196           
=======================================
  Hits         6271     6271           
  Misses       1925     1925           
Flag Coverage Δ
unittests 76.51% <ø> (ø)

Flags with carried forward coverage won't be shown. Click here to find out more.

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

🚀 New features to boost your workflow:
  • ❄️ Test Analytics: Detect flaky tests, report on failures, and find test suite problems.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants