Skip to content

Fix color getting reset before prompt output done#65

Merged
ggerganov merged 1 commit intoggml-org:masterfrom
blackhole89:fix-colors
Mar 12, 2023
Merged

Fix color getting reset before prompt output done#65
ggerganov merged 1 commit intoggml-org:masterfrom
blackhole89:fix-colors

Conversation

@blackhole89
Copy link
Copy Markdown
Contributor

This should fix the issue where sometimes the last few tokens of the initial prompt will not get colored correctly, because the color reset ANSI code was emitted early.

(cherry picked from commit 7eb2987619feee04c40eff69b604017d09919cb6)
@ggerganov ggerganov merged commit 404fac0 into ggml-org:master Mar 12, 2023
44670 pushed a commit to 44670/llama.cpp that referenced this pull request Aug 2, 2023
* Long-range falcon upgrade (16k context)
Default context is not 2048
The embedding rotation has been adapted to react to context and expected generation
Uses "NTK" fourier aware scaling of the rotation space.

7B and 40B have been tested to work well up to a context of 8k
Tests at > 8k are incoming once performance at these sizes works better

RAM requirements for K/V caches:
Falcon 7B at 8k context : ~2 GB RAM
Falcon 40B at 8k context : ~5.5 GB RAM

In addition falcon_eval() now uses a configuration struct instead of passing many parameters through multiple abstraction layers.
This makes it much easier to pass new features from main into libfalcon

* perplexity bugfix

---------

Co-authored-by: John <cmt-nct@users.noreply.github.com>
SamuelOliveirads pushed a commit to SamuelOliveirads/llama.cpp that referenced this pull request Dec 29, 2025
* Adding GGML_UNARY_OP_SWIGLU

This commit implements the ggml op and CPU compute
forward. I see ~3-4% speedup of PP-512 for Phi-3.5-mini.

* GGML_UNARY_OP_SWIGLU: CUDA implementation

I observe ~12% speedup for PP-512(Phi-3.5-mini).

* GGML_UNARY_OP_SWIGLU: Metal implementation

We get ~2% speedup for PP-512(Phi-3.5-mini).

* GGML_UNARY_OP_SWIGLU: minor improvement on Metal

* GGML_UNARY_OP_SWIGLU: cleanup

---------

Co-authored-by: Iwan Kawrakow <iwan.kawrakow@gmail.com>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants