Closed as not planned
Description
LocalAI version:
LocalAI Version d9204ea (d9204ea)
Environment, CPU architecture, OS, and Version:
x86, Ubuntu 24.04, CUDA 12.6
Describe the bug
Using docker compose and downloading model. gpt4 and gp4o works.
Downloaded model successfully and when I go to Chat => Select model deepseek-r1-distill-qwen-14b and write to chat,
After sometime, no response generated
To Reproduce
- install with docker compose
- download model
- go to Chat => Select model deepseek-r1-distill-qwen-14b
- write to chat
- no response, no loading progress indicator
Expected behavior
chat works
Logs
Additional context
Running it using DollarDeploy and this docker compose setup: https://github.com/dollardeploy/templates/tree/main/local-ai-nvidia-cuda-12