Bug: No docs explain the value for cache-type-k/v

### What happened?

```
-ctk,  --cache-type-k TYPE              KV cache data type for K (default: f16)
                                        (env: LLAMA_ARG_CACHE_TYPE_K)
-ctv,  --cache-type-v TYPE              KV cache data type for V (default: f16)
                                        (env: LLAMA_ARG_CACHE_TYPE_V)
```

The help text gives the default, but what are the other accepted values? I've searched and can't find them documented anywhere.
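For context, here is a minimal sketch of the two ways the help text says the value can be supplied: as command-line flags or through the listed environment variables. It uses only `f16`, the documented default, since that is the one value the help output confirms. The variable names `CACHE_TYPE` and `FLAG_ARGS` are hypothetical helpers for illustration, and the snippet only prints the settings rather than starting the server.

```shell
# The only TYPE value confirmed by the help output is the default, f16.
CACHE_TYPE="f16"

# Form 1: command-line flags (long form of -ctk / -ctv), to be appended
# to the server command line.
FLAG_ARGS="--cache-type-k ${CACHE_TYPE} --cache-type-v ${CACHE_TYPE}"

# Form 2: the environment variables named in the help text.
export LLAMA_ARG_CACHE_TYPE_K="${CACHE_TYPE}"
export LLAMA_ARG_CACHE_TYPE_V="${CACHE_TYPE}"

# Print both forms so they can be compared.
echo "flags: ${FLAG_ARGS}"
echo "env:   LLAMA_ARG_CACHE_TYPE_K=${LLAMA_ARG_CACHE_TYPE_K} LLAMA_ARG_CACHE_TYPE_V=${LLAMA_ARG_CACHE_TYPE_V}"
```

What is missing from the docs is the list of strings that can replace `f16` in either form.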

### Name and Version

 docker run --rm --runtime nvidia --gpus all ghcr.io/ggerganov/llama.cpp:server-cuda --help

### What operating system are you seeing the problem on?

Windows

### Relevant log output

```shell
-ctk,  --cache-type-k TYPE              KV cache data type for K (default: f16)
                                        (env: LLAMA_ARG_CACHE_TYPE_K)
-ctv,  --cache-type-v TYPE              KV cache data type for V (default: f16)
                                        (env: LLAMA_ARG_CACHE_TYPE_V)
```


Bug: No docs explain the value for cache-type-k/v #10373
