-
Notifications
You must be signed in to change notification settings - Fork 95
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Failures on insufficient GPU memory #94
Comments
@Johnz86 hi! 'Required memory exceeds the GPU's memory' is just a warning, it does not affect inference at all. But CONTRASTcode/3B requires ~8Gb of VRAM at full context. Using 4Gb for large files can lead to OOM. This warning is unclear and we're fix it in the future. |
@Johnz86 you can check error occured in refact.ai below chat (yellow box). Also please give server logs, it should be OOM or something like this. |
Here is an example of docker container logs:
|
model is not loaded (2) -- it can't access the model, according the logs. I guess the good way to go about solving this -- react to configuration changes faster. |
This is my local setup:
The container seem to start and load the model:
I tried to run the vscode extension with and without api key:
I tried to use the extension, but it is inprogress forever:
The logs inside the container signal an issue, but do not specify what:
Only if I launch the webui, then I can see:
Required memory exceeds the GPU's memory.
Could you please improve the logs, that a more detailed messasge is visible and provide clear warning, that there is not enough memory on graphic card?
The text was updated successfully, but these errors were encountered: