-
-
Notifications
You must be signed in to change notification settings - Fork 5k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Installing DeepSpeed on Windows - The Correct Procedure / Solution / Solved #4734
Comments
FYI Im working on some code (which will need some looking at) but I can confirm DeepSpeed is loaded on Windows, based off those above instructions! Im using CUDA 12.x which is giving issues, but Im looking into that. Another user is using cuda 11.8 and they seem to have it working fine. #4712 (comment) |
does it have a noticable speed increase for inference? |
@rktvr Yes, I've been working on some code and giving it a go. I need to do a little bit more testing before I can say all is good, so I will confirm in a day or two with a bit more info. On TTS, its about 3-4x faster is my current estimate. On Windows there are limitations to it. e.g. It runs on Python 3.9.18, you have to install the CUDA Toolkit 11.8 or 12 (depending on your CUDA) and a couple of other bits. On Linux you have to install CUDA. Also, you need to change the CUDA_HOME environment, which Text-Generation-WebUI has already set and I'm not sure if this could have any other impacts. My advice is DONT go installing it just yet! You may not see any benefit anyway, because you need DeepSpeed implemented in the code that calls the TTS engine anyway. |
@rktvr Please have a look here #4712 (comment) I've got it working with Coqui_tts and there is an example screenshot in that link. |
Version 11.1 + CUDA 12.1 + stock ooba (3.11) Python install works fine aswell. Also had a look at the 12.4 version but it seems to have way more isues aswell as added deepspeed-kernel which cant be compiled properly either. (ironically is broken aswell). Attached the compiled .whl which im using right now and should be working. env_report.py has a minor fix for windows in that aswell. Edit 07-12-2023: /build_win.bat:
/csrc/quantization/pt_binding.cpp - lines 244-250 - change to:
/csrc/transformer/inference/csrc/pt_binding.cpp
lines 550-551 - change to:
line 1581 - change to:
/deepspeed/env_report.py
line 83 - 100 - change to:
|
Thank you for the .whl, it works perfectly! Generating over a minute of audio only takes ~15 seconds now which is a massive improvement. |
If anyone is interested in a about 40+ voice files, this link should be live for about 6 days https://filebin.net/t97nd69ac7qm2rsf Also, I've fully released the updated Coqui TTS extension now https://github.com/erew123/alltalk_tts If you want to try it :) |
So Python 3.9 and 3.11 seem to be working according to above discussion. I'm on Python 3.10, is there a way to install DeepSpeed? |
Allegedly/supposedly/possibly, this is a pre-built wheel file for python 3.10 https://huggingface.co/Jmica/audiobook_maker/tree/main the one that says cp310 I have not tried it, and wuzzoy tried it and said it didnt work for him #4712 (comment) (Ive not yet tried the 3.11 @S95Sedan method yet... as I've been too deep in other things) |
S95Sedan's build worked fine for me. The one I built myself (successfully) was throwing this error: ERROR: deepspeed-0.8.3+unknown-cp39-cp39-win_amd64.whl is not a supported wheel on this platform. |
If you tried to install it in a 3.11 python environment, it's normal. |
@S95Sedan Just tested out your DeepSpeed 11.2 wheel file and it works great!! Really nice job on that! Are you ok with me including a reference to you having done this and also linking to the file+adding documentation in AllTalk (Will reference you figured this out)? Thanks |
@S95Sedan If it is ok with you, I have published up your instructions here https://github.com/erew123/alltalk_tts?tab=readme-ov-file#deepspeed-112-for-windows--python-311 Happy to remove them and just solely link to you (if you prefer) or remove your name (if you prefer) etc. If you're ok with me using your name etc, Id like to amend the instructions within AllTalk and also reference you at the bottom in the "Thanks" area of the documentation. Let me know! And thanks for the info on this! :) |
@erew123 yeah no worries, feel free to grab whatever you need to make it as complete as possible. Edit1: Your docs seem to mention 11.2 aswell but the version uploaded here is 11.1. |
@S95Sedan Thanks so much! That is awesome! :) Ill make sure you are thanks+referenced and linked in AllTalk. Will be on its next update. |
@S95Sedan Thank you so much for the instructions. If anyone is interested, you can get the prebuild-wheel here |
Closing this ticket off as I think the issue is put to bed now. Thanks everyone who got involved! |
This is not a feature request, this is actually how you install DeepSpeed on Windows. I have requested Microsoft update their GitHub, but if not, here is my post to them microsoft/DeepSpeed#4729
Please note, there are limitations of DeepSpeed on Windows VS the Linux version.
Also note, there could be other things needed to get bits of DS working on Windows fully
Other than using the instructions above, you can also install the Nvidia Cuda Toolkit, Create a new Python 3.9.18 environment, set your CUDA_HOME environment variable in that environment and download someone else's wheel file it. I actually do have both a cuda 11.8 and 12.1 wheel for Python 3.9.18.
@oobabooga
The text was updated successfully, but these errors were encountered: