New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
module 'tutel_custom_kernel' has no attribute 'inject_source' #132
Comments
I don't know whether python & python3 command target to the same python runtime in your environment. |
Sorry I made a mistake, for the tutorial I still used |
Firstly, can you run "python -m pip uninstall tutel" many times to ensure it is fully cleaned? Then, can you run and share the output logs of that installation command |
Thanks for your suggestions!
Thanks! |
I see this old file "helloworld_sharded_experts.py" is in the logs, it indicates that some of these codes are not the latest, and I don't see cpp code used by tutel_custom_kernel is being built. Can you further try the following 2 options to check any one of them can work? Option 1 - Do a clean Install of Tutel from another port: # Get Rid of Environmental Issues
$ python -m pip install --upgrade pip setuptools
$ python -m pip uninstall tutel -y
$ python -m pip uninstall tutel_custom_kernel -y
# Clean Install from Repo
$ python -m pip install --user git+https://github.com/microsoft/tutel@v0.1.x
# Test
$ python -m tutel.examples.helloworld Option 2 - Cleanup early build cache to avoid environmental problems: # Get Rid of Environmental Issues
$ python -m pip install --upgrade pip setuptools
$ python -m pip uninstall tutel -y
$ python -m pip uninstall tutel_custom_kernel -y
# Clean Install from Local
$ rm -r ./tutel/dist ./tutel/build
$ python ./tutel/setup.py install --user
# Test
$ python -m tutel.examples.helloworld |
Thanks very much for your help!!! I will try it.
Will this error affect the following running process?
|
@LisaWang0306 That dependency-missing error will just skip NCCL related optimization, so it shouldn't be related to the next one. Will any of these commands work?
It can help to determine which option triggers your issue, since I cannot reproduce that in all of our environments. Thanks! |
Finally! |
Thanks for your information. |
There are no more issues left. Thanks! |
My cuda version is 11.4, python version is 3.6.5
Following the requirement, my torch and torchvision versions are
torch==1.10.0+cu113
andtorchvision==0.11.1+cu113
.Then I run
git clone https://github.com/microsoft/tutel --branch v0.1.x
python ./tutel/setup.py install --user
then run the tutorial:
python ./tutel/examples/helloworld.py --batch_size=16
but meet the following error:
Do you know how to solve this problem?
Thank you very much!
The text was updated successfully, but these errors were encountered: