error message of ord is not intuitive #71195

zhangyangyu · 2016-05-12T08:04:50Z

BPO	27008
Nosy	@bitdancer, @serhiy-storchaka, @zhangyangyu

^{Note: these values reflect the state of the issue at the time it was migrated and might not reflect the current state.}

Show more details

GitHub fields:

assignee = None
closed_at = <Date 2016-05-12.16:11:08.262>
created_at = <Date 2016-05-12.08:04:50.468>
labels = ['invalid']
title = 'error message of ord is not intuitive'
updated_at = <Date 2016-05-12.16:11:08.261>
user = 'https://github.com/zhangyangyu'

bugs.python.org fields:

activity = <Date 2016-05-12.16:11:08.261>
actor = 'r.david.murray'
assignee = 'none'
closed = True
closed_date = <Date 2016-05-12.16:11:08.262>
closer = 'r.david.murray'
components = []
creation = <Date 2016-05-12.08:04:50.468>
creator = 'xiang.zhang'
dependencies = []
files = []
hgrepos = []
issue_num = 27008
keywords = []
message_count = 9.0
messages = ['265376', '265377', '265378', '265380', '265382', '265384', '265401', '265409', '265415']
nosy_count = 3.0
nosy_names = ['r.david.murray', 'serhiy.storchaka', 'xiang.zhang']
pr_nums = []
priority = 'normal'
resolution = 'not a bug'
stage = 'resolved'
status = 'closed'
superseder = None
type = None
url = 'https://bugs.python.org/issue27008'
versions = ['Python 3.6']

zhangyangyu · 2016-05-12T08:04:50Z

The error message of ord is not that right. It says 'ord() expected string of length 1'. I don't think in Py3.x string can refer to both bytes and unicodes.

serhiy-storchaka · 2016-05-12T08:18:19Z

This not the only place where "string" means unicode string or bytes string. I don't think this is large issue.

zhangyangyu · 2016-05-12T08:23:57Z

If you think it's OK, it's fine. Please close this thread.

zhangyangyu · 2016-05-12T08:35:05Z

BTW, what do you think about the second TypeError in ord. Shouldn't it be ValueError?

serhiy-storchaka · 2016-05-12T08:53:48Z

In any case it doesn't make much sense to change just one separate docstring. If you want to avoid misleading and support consistent wording, you should examine all occurrences of the word "string" in the documentation, docstrings, error messages and comments -- does it mean Unicode string, bytes-like object that supports the buffer protocol (including memoryview), bytes-like object that supports str-like interface (including bytes, bytearray, but excluding memoryview), either Unicode or bytes string? I tried to do this but abandoned the work on half-way. This is too large work.

BTW, what do you think about the second TypeError in ord. Shouldn't it be ValueError?

It could be ValueError. But for compatibility it should stay TypeError. This is not wrong if we consider strings of size 1 as separate type.

zhangyangyu · 2016-05-12T09:11:55Z

Ohh, you have tried to do that. It must be a large work.

But on the other hand, if this is a too large work, why not solve this case by case? This work is too large to get someone work on it, even you, the most active developer in the community I see have abandoned. And then maybe improving the situation a little bit every time is the only solution.

This is not wrong if we consider strings of size 1 as separate type

This sounds weird. But I can understand the importance of compatibility.

bitdancer · 2016-05-12T12:46:13Z

You are right, if it is too big a job to do it all at once, then we can fix them as we find them. Do you want to propose a patch?

However, in this case I think there is arguably not a bug. It looks as though the intent is that ord only support strings (see the documentation). The fact that it supports bytes-like objects is redudant (ord(b'a') == b'a'[0]). I'd call it a bug that it supports bytes-like objects, but we probably kept (and should keep it) it to make it easier to port python2 code to python3.

So, if any change were to be made here, it would probably be to change the error message if and only if the input is not in fact a string, and perhaps even recommend using the indexing syntax.

On the gripping hand, I've never been a fan of the fact that indexing a byte string gets you an integer :)

zhangyangyu · 2016-05-12T14:38:15Z

I also notice the document. I get surprised when I see the implementation also supports bytes while the doc says one unicode character. But then I tell myself that maybe unicode character also includes bytes? I am not sure about the English description.

But giving your opinion that maybe the bytes supports are not intended now, I think leaving the error message untouched is quite OK. It describes what it is intended to do, same as the doc.

Really glad to have your comments, Serhiy and David.

bitdancer · 2016-05-12T16:11:08Z

All right, we'll close this then.

bitdancer closed this as completed May 12, 2016

bitdancer added the invalid label May 12, 2016

ezio-melotti transferred this issue from another repository Apr 10, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

error message of ord is not intuitive #71195

error message of ord is not intuitive #71195

zhangyangyu commented May 12, 2016

zhangyangyu commented May 12, 2016

serhiy-storchaka commented May 12, 2016

zhangyangyu commented May 12, 2016

zhangyangyu commented May 12, 2016

serhiy-storchaka commented May 12, 2016

zhangyangyu commented May 12, 2016

bitdancer commented May 12, 2016

zhangyangyu commented May 12, 2016

bitdancer commented May 12, 2016

error message of ord is not intuitive #71195

error message of ord is not intuitive #71195

Comments

zhangyangyu commented May 12, 2016

zhangyangyu commented May 12, 2016

serhiy-storchaka commented May 12, 2016

zhangyangyu commented May 12, 2016

zhangyangyu commented May 12, 2016

serhiy-storchaka commented May 12, 2016

zhangyangyu commented May 12, 2016

bitdancer commented May 12, 2016

zhangyangyu commented May 12, 2016

bitdancer commented May 12, 2016