After building from source, error occurred: "device kernel image is invalid" #1955

Closed
akakakakakaa opened this issue Jul 18, 2023 · 6 comments

Comments

@akakakakakaa

akakakakakaa commented Jul 18, 2023

I tried this on a V100 with CUDA 11.7.

After digging through the source code, I found that it works when I change the CUDA version in this line:
https://github.com/openai/triton/blob/9e3e10c5edb4a062cf547ae73e6ebfb19aad7bdf/python/setup.py#L129

So, when I want to install Triton from source, do I need to control the CUDA version by editing setup.py?
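
For reference, one way to see which ptxas an installed Triton build actually bundles, compared with the system CUDA toolkit, is something like the sketch below. The third_party/cuda/bin location inside the installed package is an assumption for builds from around this time and may differ between Triton versions.

# Rough check of the ptxas bundled with an installed Triton build (the
# third_party/cuda/bin layout is an assumption; adjust it if your version
# stores the CUDA binaries elsewhere).
import os
import subprocess

import triton

pkg_dir = os.path.dirname(triton.__file__)
bundled_ptxas = os.path.join(pkg_dir, "third_party", "cuda", "bin", "ptxas")

for name, path in [("bundled", bundled_ptxas), ("system", "/usr/local/cuda/bin/ptxas")]:
    if os.path.exists(path):
        out = subprocess.run([path, "--version"], capture_output=True, text=True)
        print(f"{name}: {path}\n{out.stdout.strip()}\n")
    else:
        print(f"{name}: {path} not found")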

@bhack

bhack commented Jul 29, 2023

@akakakakakaa Did you get this error at runtime? Do you have a small code gist to reproduce it?

@akakakakakaa

akakakakakaa commented Jul 31, 2023

@bhack I tried to install exactly as written in the README, but used pip install . instead of pip install -e ., because in my case pip install -e . can't recognize the hidden directory.

git clone https://github.com/openai/triton.git;
cd triton/python;
pip install cmake; # build-time dependency
pip install .

After installing, I tried to run 06-fused-attention.py and at runtime I got the error "device kernel image is invalid".
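
As for a small gist: the error does not seem specific to the fused-attention tutorial; any kernel that goes through Triton's compile-and-launch path should hit it on an affected install. A minimal reproducer along the lines of the 01-vector-add tutorial (a sketch, not the exact tutorial code):

import torch
import triton
import triton.language as tl


@triton.jit
def add_kernel(x_ptr, y_ptr, out_ptr, n_elements, BLOCK_SIZE: tl.constexpr):
    # Each program instance handles one BLOCK_SIZE-wide slice of the vectors.
    pid = tl.program_id(axis=0)
    offsets = pid * BLOCK_SIZE + tl.arange(0, BLOCK_SIZE)
    mask = offsets < n_elements
    x = tl.load(x_ptr + offsets, mask=mask)
    y = tl.load(y_ptr + offsets, mask=mask)
    tl.store(out_ptr + offsets, x + y, mask=mask)


n = 4096
x = torch.rand(n, device="cuda")
y = torch.rand(n, device="cuda")
out = torch.empty_like(x)
grid = (triton.cdiv(n, 256),)
# On an affected install this launch fails with "device kernel image is invalid".
add_kernel[grid](x, y, out, n, BLOCK_SIZE=256)
print(torch.allclose(out, x + y))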

@huangxiao2008

I met the same problem.

@zy-fang

zy-fang commented Jan 19, 2024

I have encountered the same problem. How did you solve it?

@xingjinglu

xingjinglu commented Feb 6, 2024

I have encountered the same problem and solved it.
The reason: on the main branch of Triton, the default versions of ptxas, cuobjdump, and nvdisasm that Triton uses are from CUDA 12.x (this is set in triton/python/setup.py). So when you build Triton for CUDA 11.x, you need to point it at the right CUDA binaries by setting their paths in the environment.

My environment is:

  1. Driver Version: 470.141.03 CUDA Version: 11.4
  2. torch: conda install pytorch==2.1.2 torchvision==0.16.2 torchaudio==2.1.2 pytorch-cuda=11.8 -c pytorch -c nvidia

Build Triton from source as below:

export TRITON_PTXAS_PATH=/usr/local/cuda/bin/ptxas                                                                      
export TRITON_CUOBJDUMP_PATH=/usr/local/cuda/bin/cuobjdump                                                              
export TRITON_NVDISASM_PATH=/usr/local/cuda/bin/nvdisasm  

cd triton/python
pip install -e .

Test:
python tutorials/01-vector-add.py

The result is as below:

tensor([1.3713, 1.3076, 0.4940, ..., 0.6724, 1.2141, 0.9733], device='cuda:0')
tensor([1.3713, 1.3076, 0.4940, ..., 0.6724, 1.2141, 0.9733], device='cuda:0')
The maximum difference between torch and triton is 0.0
vector-add-performance:
size Triton Torch
0 4096.0 11.377778 11.130435
1 8192.0 21.787235 23.813955
2 16384.0 44.521738 41.795915
3 32768.0 73.142858 72.710056
4 65536.0 127.336788 127.336788
5 131072.0 199.399583 200.620406
6 262144.0 283.296835 285.767442
7 524288.0 381.023277 371.659727
8 1048576.0 412.608613 416.101597
9 2097152.0 444.311871 449.646643
10 4194304.0 463.766462 468.393097
11 8388608.0 472.615390 479.385543
12 16777216.0 477.602370 484.554523
13 33554432.0 478.037844 484.414634
14 67108864.0 479.979873 488.623552
15 134217728.0 479.870017 489.126924

This is a short summary of how to build Triton from source.
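
One extra sanity check (a small sketch, assuming the TRITON_PTXAS_PATH variable above is exported in the same shell): confirm that the ptxas you are pointing Triton at really reports a CUDA 11.x version before rebuilding.

# Sanity check: print the version of the ptxas that TRITON_PTXAS_PATH points to.
# Assumes the environment variables from the build steps above are already exported.
import os
import subprocess

ptxas = os.environ.get("TRITON_PTXAS_PATH")
if ptxas is None:
    print("TRITON_PTXAS_PATH is not set; Triton will fall back to its bundled ptxas")
else:
    result = subprocess.run([ptxas, "--version"], capture_output=True, text=True)
    print(result.stdout.strip())  # should report a CUDA 11.x release on a CUDA 11 driver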

@sujuyu

sujuyu commented May 14, 2024


@xingjinglu Your suggestion worked for me, thank you very much.
