(merge) bounds checking for input prefix #492

anzz1 · 2023-03-25T12:09:00Z

merge changes from Command line args bounds checking #424 with feat: '--in-prefix STRING' option #426

* iq1_s_r4: CUDA dequantize * iq1_s_r4: CUDA GEMV * iq1_s_r4: MMQ on CUDA Requires Turing or better (will fall back to dequantize+cuBLAS on older cards). --------- Co-authored-by: Iwan Kawrakow <iwan.kawrakow@gmail.com>

bounds checking for input prefix

d37af8d

anzz1 requested review from Green-Sky, blackhole89, ggerganov, j-f1, prusnak and sw March 25, 2023 12:09

sw approved these changes Mar 25, 2023

View reviewed changes

anzz1 merged commit e899bf5 into master Mar 25, 2023

anzz1 deleted the patch-prefix-arg-bounds branch March 25, 2023 12:42

Bearsaerker mentioned this pull request Mar 12, 2025

Eval bug: Gemma 3 extremly slow prompt processing when using quantized kv cache. #12352

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

(merge) bounds checking for input prefix #492

(merge) bounds checking for input prefix #492

Uh oh!

anzz1 commented Mar 25, 2023

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

(merge) bounds checking for input prefix #492

(merge) bounds checking for input prefix #492

Uh oh!

Conversation

anzz1 commented Mar 25, 2023

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants