sync : ggml-blas #2238

Merged
ggerganov merged 8 commits into master from sync-blas on Jun 18, 2024
ggerganov (Owner) commented Jun 16, 2024

TODO:

ggerganov and others added 5 commits June 16, 2024 18:38
* Refactor shaders, extract GLSL code from ggml_vk_generate_shaders.py into vulkan-shaders directory

* Improve debug log code

* Add memory debug output option

* Fix flake8

* Fix unnecessary high llama-3 VRAM use
Review thread on src/ggml-blas.cpp (outdated, resolved)
ggerganov and others added 3 commits June 16, 2024 19:23
* whisper : use ggml_backend_sched (wip)

* use sched in whisper_allocr

* whisper : single backend in whisper_context

* whisper : remove whisper_state->backends_used

* whisper : remove whisper_context->backend

* whisper : reset scheduler after init

* whisper : fix external encoder (e.g. CoreML)

* whisper : cleanup

* whisper : handle null GPU buffer types + fix sycl

---------

Co-authored-by: slaren <slarengh@gmail.com>
ggerganov marked this pull request as ready for review June 18, 2024 06:37
ggerganov merged commit 5d950c4 into master Jun 18, 2024
93 checks passed
ggerganov deleted the sync-blas branch June 18, 2024 06:39
3 participants