Fix color getting reset before prompt output done by blackhole89 · Pull Request #65 · ggml-org/llama.cpp

blackhole89 · 2023-03-12T22:01:02Z

This should fix the issue where sometimes the last few tokens of the initial prompt will not get colored correctly, because the color reset ANSI code was emitted early.

(cherry picked from commit 7eb2987619feee04c40eff69b604017d09919cb6)

* Long-range falcon upgrade (16k context) Default context is not 2048 The embedding rotation has been adapted to react to context and expected generation Uses "NTK" fourier aware scaling of the rotation space. 7B and 40B have been tested to work well up to a context of 8k Tests at > 8k are incoming once performance at these sizes works better RAM requirements for K/V caches: Falcon 7B at 8k context : ~2 GB RAM Falcon 40B at 8k context : ~5.5 GB RAM In addition falcon_eval() now uses a configuration struct instead of passing many parameters through multiple abstraction layers. This makes it much easier to pass new features from main into libfalcon * perplexity bugfix --------- Co-authored-by: John <cmt-nct@users.noreply.github.com>

* Adding GGML_UNARY_OP_SWIGLU This commit implements the ggml op and CPU compute forward. I see ~3-4% speedup of PP-512 for Phi-3.5-mini. * GGML_UNARY_OP_SWIGLU: CUDA implementation I observe ~12% speedup for PP-512(Phi-3.5-mini). * GGML_UNARY_OP_SWIGLU: Metal implementation We get ~2% speedup for PP-512(Phi-3.5-mini). * GGML_UNARY_OP_SWIGLU: minor improvement on Metal * GGML_UNARY_OP_SWIGLU: cleanup --------- Co-authored-by: Iwan Kawrakow <iwan.kawrakow@gmail.com>

Fix color getting reset before prompt output done

ea84034

(cherry picked from commit 7eb2987619feee04c40eff69b604017d09919cb6)

ggerganov merged commit 404fac0 into ggml-org:master Mar 12, 2023

Bearsaerker mentioned this pull request Mar 12, 2025

Eval bug: Gemma 3 extremly slow prompt processing when using quantized kv cache. #12352

Closed

ccwpys01-beep mentioned this pull request Dec 16, 2025

Hexagon backend issue : GET_ROWS, SET_ROWS, FLASH_ATTN only work on CPU #18075

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix color getting reset before prompt output done#65

Fix color getting reset before prompt output done#65
ggerganov merged 1 commit intoggml-org:masterfrom
blackhole89:fix-colors

blackhole89 commented Mar 12, 2023

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

blackhole89 commented Mar 12, 2023

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants