New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
PIP bad handling of diacritics in folder names. #3087
Comments
For what it's worth, the first one seems to have been encoded UTF-8 and then decoded CP437; and the second has been encoded Latin-1 and decoded CP850.
No idea whether that helps at all. @rin67630, is your system code page set to any one of those four? If you type |
Ups! the reported shell's codepage is 850 indeed. That, albeit the Registry has: |
Hex FDE9 (decimal 65001) is the Windows name for UTF-8. I'm not sure what that registry entry actually does, but it's theoretically saying that you should be using UTF-8. |
Can you just clarify - what is the command you executed? Also, can you confirm what the commands
return? |
(The reason I ask is that I suspect that it's the exe wrapper that runs pip which is the problem here, rather than pip itself. That wrapper is supplied by distlib, so if that is where the problem lies, we may need to report the issue there). |
I am just running "pip" from the command line, nothing else and immediately get the error. Your py commands run as following: |
Just out of interest, what's this show? py -c "import sys; print(ascii(sys.executable))" Might tell us what the file system encoding is, if nothing else. |
sorry i forgot the second one: Thank you for your help. |
C:\Users\BürgerGegenFluglärm>py -c "import sys; print(ascii(sys.executable))" |
Strange, I would expect ascii() to double all those backslashes. But I think I made an error in recommending this, as it's simply giving us back the same Unicode text we were expecting. The FS encoding isn't visible, but it also doesn't seem to matter, as the decode to Unicode is giving the right results. Also, the console encoding is being correctly handled, as normal Unicode output is giving correct display. My bad, sorry! |
@pfmoore couldn't this possibly be identical to this distlib issue: https://bitbucket.org/pypa/distlib/issues/71/script-wrappers-dont-handle-non-ascii, which you reported back in March this year? |
@rin67630 OK, the fact that the @wm75 Yes, it does look like that one. And that was released in distlib 0.2.1, which doesn't appear to be in the version of pip that's included with Python 3.5rc2 (the latest version I have). @rin67630 This should be fixed by upgrading pip. Can you please try
That should upgrade pip from version 7.1.0 to 7.1.2, which includes a fixed distlib, and should fix the issue. |
FWIW, the fixed version of pip is in Python 3.5rc4 (and hence will be in 3.5 final). |
I have tried the upgrade, but got a bech of exceptions listed below. C:\Users\BürgerGegenFluglärm>python -m pip install --upgrade pip Since i am just starting using python, i am a bit confused by this firework of exceptions... |
Sigh. Agreed, that's a horrible error. To be honest, I have no idea what it means :-( Ah - I see @Random832 just spotted my typo - thanks, yes that would cause you problems. Sorry about that. As stated, if you try again without the trailing ``` it should work better... The advice to reinstall Python, with the newest Python 3.5 release candidate, should also fix your problem (alternatively, using the stable Python 3.4 would also fix the issue, as that would install in C:\Python34 and so avoid the non-ASCII path altogether). I'm sorry your initial experience with Python has been less than ideal - thanks for taking the time to report the issue and help us to ensure we fix things like this, it's much appreciated. |
@rin67630 did you manage to solve your issue? |
Closing this, if it continues to happen please reopen this issue or open a new one with reproduction steps. |
Just after a fresh install of the current downloaded Python with PIP on Windows.
on which the user name contains diacritics, i get upon starting PIP:
Where the correct path is supposed to be "C:\Users\BürgerGegenFluglärm\AppData..."
The funny thing is that the message mentions the path twice, with two
different wrong codings.
That appears to be an issue isn't it?
Isn't PIP supposed to be 100% UTF-8?
The text was updated successfully, but these errors were encountered: