@jan-service-account

Updates dev branch with latest release (b5415) from ggml-org/llama.cpp

jeffbolznv and others added 5 commits May 17, 2025 08:35
* vulkan: move common FA code to flash_attn_base.comp

* vulkan: move common FA index/stride setup code to flash_attn_base.comp

* build fix
* parallel : add option for non-shared and larger prompts

* parallel : update readme [no ci]

* cont : add note about base models [no ci]

* parallel : better var name

ggml-ci
…13595)

* fix: use the current build config for `vulkan-shaders-gen`

* fix: only pass a valid build type to `--config`
* added no-prefill-assistant flag

* reworded documentation comment

* updated server README.md
@jan-service-account jan-service-account merged commit a684bef into dev May 18, 2025
9 checks passed
@jan-service-account jan-service-account deleted the update-dev-from-master-2025-05-18-00-09 branch May 18, 2025 00:19
6 participants