feat(native): expose context batch sizing controls by leehack · Pull Request #71 · leehack/llamadart

leehack · 2026-02-26T01:31:54Z

Summary

Add ModelParams.batchSize / ModelParams.microBatchSize so native n_batch and n_ubatch can be tuned independently of contextSize while preserving legacy defaults.
Wire resolved batch sizing into native context creation and prompt decode chunking to support smaller CPU-friendly batch settings safely.
Add unit coverage for batch-size resolution and update docs/changelog, including llama.cpp semantics references for n_batch and n_ubatch.

Validation

dart analyze
dart test test/unit/core/models/inference/model_params_test.dart
dart test test/unit/backends/llama_cpp/llama_cpp_service_test.dart

Add ModelParams batchSize/microBatchSize with legacy defaults and wire them into context creation and prompt decode chunking so users can tune memory/performance without changing existing behavior.

Document that batchSize maps to n_batch and microBatchSize maps to n_ubatch so users can tune these fields with the same mental model as upstream llama.cpp CLI options.

leehack · 2026-02-26T01:33:55Z

@maiguangyang Let me know if this fulfills your request.

codecov-commenter · 2026-02-26T01:50:10Z

Codecov Report

❌ Patch coverage is 86.48649% with 5 lines in your changes missing coverage. Please review.
✅ Project coverage is 76.54%. Comparing base (fd4ac69) to head (102c3bb).
⚠️ Report is 4 commits behind head on main.

Files with missing lines	Patch %	Lines
lib/src/backends/llama_cpp/llama_cpp_service.dart	91.42%	3 Missing ⚠️
lib/src/core/models/inference/model_params.dart	0.00%	2 Missing ⚠️

Additional details and impacted files

@@            Coverage Diff             @@
##             main      #71      +/-   ##
==========================================
+ Coverage   76.51%   76.54%   +0.03%     
==========================================
  Files          66       66              
  Lines        8196     8216      +20     
==========================================
+ Hits         6271     6289      +18     
- Misses       1925     1927       +2

Flag	Coverage Δ
unittests	`76.54% <86.48%> (+0.03%)`	⬆️

Flags with carried forward coverage won't be shown. Click here to find out more.

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

🚀 New features to boost your workflow:

❄️ Test Analytics: Detect flaky tests, report on failures, and find test suite problems.

leehack added 2 commits February 25, 2026 20:20

feat(native): expose context batch sizing controls

821d58f

Add ModelParams batchSize/microBatchSize with legacy defaults and wire them into context creation and prompt decode chunking so users can tune memory/performance without changing existing behavior.

docs(model_params): clarify llama.cpp batch semantics

29222d3

Document that batchSize maps to n_batch and microBatchSize maps to n_ubatch so users can tune these fields with the same mental model as upstream llama.cpp CLI options.

Merge main into feat/issue-68-context-batch-params

102c3bb

leehack merged commit b39e062 into main Feb 27, 2026
6 checks passed

leehack deleted the feat/issue-68-context-batch-params branch February 27, 2026 02:37

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat(native): expose context batch sizing controls#71

feat(native): expose context batch sizing controls#71
leehack merged 3 commits intomainfrom
feat/issue-68-context-batch-params

leehack commented Feb 26, 2026

Uh oh!

leehack commented Feb 26, 2026

Uh oh!

codecov-commenter commented Feb 26, 2026 •

edited

Loading

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

leehack commented Feb 26, 2026

Summary

Validation

Uh oh!

leehack commented Feb 26, 2026

Uh oh!

codecov-commenter commented Feb 26, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Codecov Report

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

codecov-commenter commented Feb 26, 2026 •

edited

Loading