[metal] alibi for arbitrary number of heads #3426

li-plus · 2023-10-01T16:51:24Z

Support ALiBi op on metal for all kinds of n_heads. Not required power of two any more.

…example * 'master' of github.com:ggerganov/llama.cpp: (24 commits) convert : fix Baichuan2 models by using vocab size in config.json (ggerganov#3299) readme : add project status link ggml : fix build after ggerganov#3329 llm : add Refact model (ggerganov#3329) sync : ggml (conv 1d + 2d updates, UB fixes) (ggerganov#3468) finetune : readme fix typo (ggerganov#3465) ggml : add RISC-V Vector Support for K-Quants and improved the existing intrinsics (ggerganov#3453) main : consistent prefix/suffix coloring (ggerganov#3425) llama : fix session saving/loading (ggerganov#3400) llama : expose model's rope_freq_scale in the API (ggerganov#3418) metal : alibi for arbitrary number of heads (ggerganov#3426) cmake : make LLAMA_NATIVE flag actually use the instructions supported by the processor (ggerganov#3273) Work on the BPE tokenizer (ggerganov#3252) convert : fix vocab size when not defined in hparams (ggerganov#3421) cmake : increase minimum version for add_link_options (ggerganov#3444) CLBlast: Add broadcast support for matrix multiplication (ggerganov#3402) gguf : add BERT, MPT, and GPT-J arch info (ggerganov#3408) gguf : general usability improvements (ggerganov#3409) cmake : make CUDA flags more similar to the Makefile (ggerganov#3420) finetune : fix ggerganov#3404 (ggerganov#3437) ...

[metal] alibi for arbitrary number of heads

c0d710d

ggerganov approved these changes Oct 3, 2023

View reviewed changes

ggerganov merged commit f56e1ba into ggerganov:master Oct 3, 2023
10 checks passed

li-plus deleted the metal-general-alibi branch October 3, 2023 17:08

yusiwen pushed a commit to yusiwen/llama.cpp that referenced this pull request Oct 7, 2023

metal : alibi for arbitrary number of heads (ggerganov#3426)

6bd6c2f

FNsi mentioned this pull request Oct 29, 2023

Parallel RoPE on metal #3024

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[metal] alibi for arbitrary number of heads #3426

[metal] alibi for arbitrary number of heads #3426

li-plus commented Oct 1, 2023

[metal] alibi for arbitrary number of heads #3426

[metal] alibi for arbitrary number of heads #3426

Conversation

li-plus commented Oct 1, 2023