Name and Version
0.1.0
Operating systems
Other? (Please let us know in description)
GGML backends
CUDA
Hardware
2 identical cards
Models
No response
Problem description & steps to reproduce
Per NickCanCode in Reddit comment https://www.reddit.com/r/LocalLLaMA/comments/1t88zvv/comment/okxsz31/
Doesn't work for me. It gives [log] whenever I make a request.
P.S. Using 2 identical cards.
First Bad Commit
No response
Relevant log output
beellama.cpp-main\ggml\src\ggml-cuda\ggml-cuda.cu:98: CUDA error
CUDA error: an illegal memory access was encountered
Name and Version
0.1.0
Operating systems
Other? (Please let us know in description)
GGML backends
CUDA
Hardware
2 identical cards
Models
No response
Problem description & steps to reproduce
Per NickCanCode in Reddit comment https://www.reddit.com/r/LocalLLaMA/comments/1t88zvv/comment/okxsz31/
First Bad Commit
No response
Relevant log output