Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Slow performance on NVIDIA MX450 GPU - Possible to optimize or downsize the model for ordinary users? #4

Open
brynne8 opened this issue Mar 24, 2023 · 3 comments

Comments

@brynne8
Copy link

brynne8 commented Mar 24, 2023

I am using an NVIDIA MX450 GPU on my laptop and was excited to try your selfhostedAI project. After installing GCC and CUDA 11, I managed to get it running. However, the performance is quite slow.

I am wondering if the model might be too large for my relatively modest laptop hardware. Is there any way to optimize the model or provide a smaller version that would be more suitable for ordinary users like myself?

@josStorer
Copy link
Owner

The recommended minimum video memory is 6GB, and for the MX450, CPU will be used, which may result in slower speed.

@josStorer
Copy link
Owner

Try RWKV-Runner https://github.com/josStorer/RWKV-Runner

@josStorer
Copy link
Owner

Minimum VRAM requirement is 2GB

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants