nvrtc.nvrtcCompileProgram is changing the preferred encoding from UTF-8 to ANSI_X3.4-1968 #29

redsnic · 2022-10-13T07:39:25Z

Dear developers,

I found out that calling the NVRTC for compilation is changing the preferred encoding for the current Python instance.

For more details and to reproduce the issue, please refer to this StackOverflow question.

Do you have an idea on why this happens, and how it is possible to revert the preferred encoding to its original setting?

Thank you in advance

kmaehashi · 2022-10-15T05:17:18Z

This sounds like an issue of NVRTC rather than CUDA Python. The issue was also reproducible in CuPy built without CUDA Python.

>>> import locale, cupy
>>> locale.getpreferredencoding()
'UTF-8'
>>> cupy.arange(10)
array([0, 1, 2, 3, 4, 5, 6, 7, 8, 9])
>>> locale.getpreferredencoding()
'ANSI_X3.4-1968'

Env: CUDA 11.8 / Ubuntu 20.04

redsnic · 2022-10-15T12:11:18Z

Good to know, I have filed a bug to Nvidia now, let's see.
Thank you again.

redsnic · 2022-10-27T14:55:03Z

Just to give a small update. By discussing the issue with Nvidia we found out that it is possible to export LC_ALL="POSIX" as a workaround to avoid NVCC changing the encoding to ASCII.

The causes of the bug are still unknown and I will report when I have any other news.

vzhurba01 · 2022-12-05T20:32:02Z

Since this is a bug outside of CUDA Python, I'll close this issue.

Thanks for sharing that workaround. If there's a link you can share for where this bug is being tracked, I'm sure folks would appreciate it.

redsnic · 2022-12-06T09:53:31Z

Here is the link to the bug report: https://developer.nvidia.com/nvidia_bug/3833924

Thank you again for your help

…n#29).

…/cuda-python#29

Flamefire · 2024-03-07T15:51:38Z

Just to give a small update. By discussing the issue with Nvidia we found out that it is possible to export LC_ALL="POSIX" as a workaround to avoid NVCC changing the encoding to ASCII.

The causes of the bug are still unknown and I will report when I have any other news.

Any updates here? It is indeed a bug in NVRTC, specifically nvrtcCompileProgram and can be reproduced in C++:

#include <langinfo.h>
#include <cuda.h>
#include <nvrtc.h>
#include <vector>
#include <iostream>

int main(){
    setlocale(LC_ALL, "");
    std::string code = "";
    std::vector<const char*> args = {"--gpu-architecture=sm_80"};
    nvrtcProgram program;
    nvrtcCreateProgram(&program, code.c_str(), nullptr, 0, nullptr, nullptr);
    std::cout << nl_langinfo(CODESET) << '\n';
    nvrtcCompileProgram(program, args.size(), args.data());
    std::cout << nl_langinfo(CODESET) << '\n';
}

Compile with nvcc test.cu -lnvrtc and observe:

$ LC_ALL=en_US.UTF-8 ./a.out 
UTF-8
ANSI_X3.4-1968
$ LC_ALL=C.UTF-8 ./a.out 
UTF-8
ANSI_X3.4-1968
$ LC_ALL=POSIX ./a.out 
ANSI_X3.4-1968
ANSI_X3.4-1968

I'm a bit confused about the statement "export LC_ALL="POSIX" as a workaround to avoid NVCC changing the encoding to ASCII" as that by definition sets the encoding to ASCII in the first place. So the change is only masked.

adrianeboyd mentioned this issue Dec 1, 2022

Encoding change to ANSI_X3.4-1968 on import explosion/spaCy#11909

Closed

vzhurba01 closed this as completed Dec 5, 2022

rmccorm4 mentioned this issue Feb 6, 2023

Custom Build Python Backend Locale Error triton-inference-server/server#5321

Closed

metrizable mentioned this issue Feb 23, 2023

Spacy 3.5 works on Colab with no hardware acceleration but not on Standard GPU googlecolab/colabtools#3424

Closed

dmargala mentioned this issue Mar 24, 2023

unit test failures on Perlmutter desihub/desispec#1963

Closed

kmaehashi mentioned this issue Apr 10, 2023

cupy.arange changes system's default encoding cupy/cupy#7514

Open

leofang mentioned this issue May 24, 2023

NVRTC compilation fails when CuPy is installed under directory path containing non-ASCII cupy/cupy#7581

Open

emeryberger added a commit to plasma-umass/scalene that referenced this issue Jul 15, 2023

Fixes #625, which is due to a bug caused by NVIDIA (NVIDIA/cuda-pytho…

0165c4b

…n#29).

emeryberger mentioned this issue Jul 15, 2023

how to fix the problem of encoding plasma-umass/scalene#625

Closed

miguelwon mentioned this issue Aug 2, 2023

UnicodeDecodeError: 'ascii' codec can't decode byte 0xc3 in position 335: ordinal not in range(128) explosion/spaCy#12872

Closed

sunqm added a commit to sunqm/pyscf that referenced this issue Nov 18, 2023

Explicit file encoding for cuda mixed environment due to issue NVIDIA…

574fa5a

…/cuda-python#29

sunqm mentioned this issue Nov 18, 2023

Switch off sparse algorithm in DFT numerical integration for small systems pyscf/pyscf#1962

Merged

sunqm added a commit to pyscf/pyscf that referenced this issue Nov 18, 2023

Explicit file encoding for cuda mixed environment due to issue NVIDIA…

2ce7c8e

…/cuda-python#29

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

nvrtc.nvrtcCompileProgram is changing the preferred encoding from UTF-8 to ANSI_X3.4-1968 #29

nvrtc.nvrtcCompileProgram is changing the preferred encoding from UTF-8 to ANSI_X3.4-1968 #29

redsnic commented Oct 13, 2022

kmaehashi commented Oct 15, 2022 •

edited

redsnic commented Oct 15, 2022

redsnic commented Oct 27, 2022

vzhurba01 commented Dec 5, 2022

redsnic commented Dec 6, 2022

Flamefire commented Mar 7, 2024

nvrtc.nvrtcCompileProgram is changing the preferred encoding from UTF-8 to ANSI_X3.4-1968 #29

nvrtc.nvrtcCompileProgram is changing the preferred encoding from UTF-8 to ANSI_X3.4-1968 #29

Comments

redsnic commented Oct 13, 2022

kmaehashi commented Oct 15, 2022 • edited

redsnic commented Oct 15, 2022

redsnic commented Oct 27, 2022

vzhurba01 commented Dec 5, 2022

redsnic commented Dec 6, 2022

Flamefire commented Mar 7, 2024

kmaehashi commented Oct 15, 2022 •

edited