Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Concurrency scheduling is not supported. #3590

Closed
hwfancyz7k opened this issue Apr 11, 2024 · 1 comment
Closed

Concurrency scheduling is not supported. #3590

hwfancyz7k opened this issue Apr 11, 2024 · 1 comment
Labels
bug Something isn't working

Comments

@hwfancyz7k
Copy link

What is the issue?

I have multiple Intel CPUs and NVIDIA GPUs, but the generate interface can only initiate one request at a time. Even though I have sufficient resources, it gets stuck without further scheduling. This is a hot bug, please fix it as soon as possible.

What did you expect to see?

No response

Steps to reproduce

No response

Are there any recent changes that introduced the issue?

No response

OS

No response

Architecture

No response

Platform

No response

Ollama version

No response

GPU

No response

GPU info

No response

CPU

No response

Other software

No response

@hwfancyz7k hwfancyz7k added bug Something isn't working needs-triage labels Apr 11, 2024
@pdevine
Copy link
Contributor

pdevine commented Apr 12, 2024

@hwfancyz7k the good news is this is coming very soon (in the next couple of weeks)

I'm going to close the issue though as a dupe of #358

@pdevine pdevine closed this as completed Apr 12, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
bug Something isn't working
Projects
None yet
Development

No branches or pull requests

2 participants