Skip to content

Conversation

@anzz1
Copy link
Contributor

@anzz1 anzz1 commented Mar 25, 2023

@anzz1 anzz1 merged commit e899bf5 into master Mar 25, 2023
@anzz1 anzz1 deleted the patch-prefix-arg-bounds branch March 25, 2023 12:42
SamuelOliveirads pushed a commit to SamuelOliveirads/llama.cpp that referenced this pull request Dec 29, 2025
* iq1_s_r4: CUDA dequantize

* iq1_s_r4: CUDA GEMV

* iq1_s_r4: MMQ on CUDA

Requires Turing or better (will fall back to dequantize+cuBLAS on older cards).

---------

Co-authored-by: Iwan Kawrakow <iwan.kawrakow@gmail.com>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants