error in np.lexsort axis kwarg #10521

ogauthe · 2018-02-04T17:13:02Z

Hello,

I would like to sort an array according to lexical order, sorting by first the second column. np.lexsort is the function to use, but it seems not to handle the kwarg axis.

>>> import numpy as np
>>> a = np.array([[0,1],[1,0],[0,0],[0,-1],[0,1],[1,-1]])
>>> a.ndim
2
>>> np.lexsort(a.T,axis=1)
Traceback (most recent call last):
File "<stdin>", line 1, in <module>
ValueError: axis(=1) out of bounds

Info version :
Debian Python 3.5.
>>> np.__version__
'1.12.1'

The same bug happens with Anaconda python and numpy.

The function np.argsort does not suffer from this problem, but is not what I want. In my case, I could use np.lexsort(a.T[::-1]) as a workaround.

The text was updated successfully, but these errors were encountered:

jaimefrio · 2018-02-04T18:07:36Z

Not sure, but may be related to #5782. Lexsort could use a serious rehaul...

charris · 2018-02-04T18:50:26Z

The documentation is confusing, indeed, deceptive. The axis is for the case when each key is an array, in which case it the axis is taken from that array. I think the logic in the relevant function, PyArray_LexSort could use a close review. In your case, there are 6 keys stored in the 1D last axis. Now if you had passed keys in the shape `(6, 2, 2)' the keys would be 2D and the axis would be valid.

In [1]: a = np.array([[0,1],[1,0],[0,0],[0,-1],[0,1],[1,-1]])

In [2]: lexsort(a[:,:,None], axis=1)
Out[2]: 
array([[0],
       [0]])

Not sure why the result has that shape, but frankly, I have no idea what the intended use of that keyword is :(

charris · 2018-02-04T19:00:47Z

Apropos the current problem, this may be what you want

In [3]: lexsort(a[None,...], axis=1)
Out[3]: 
array([[0, 1],
       [1, 0],
       [0, 1],
       [1, 0],
       [0, 1],
       [1, 0]])

Note that you probably need to remove the first dimension of the result.

charris · 2018-02-04T19:27:45Z

And whatever is intended, I think it is buggy.

ogauthe · 2018-02-04T23:22:12Z

The syntax np.lexsort(a[None,...], axis=0) seems complicate and counterintuitive to me. More, its behaviour is not accurate:

>>> a[np.lexsort(a[None,...], axis=0)[:,0]]
array([[ 0,  1],
       [ 0,  0],
       [ 0, -1],
       [ 0,  1],
       [ 1,  0],
       [ 1, -1]])

Ok it sorts by first column, but then it forgets the second! Actually, it gives the same result as np.argsort(a,axis=0) here - which it should not.

To sort according to columns, the easy way is to translate, but I do not understand either why the sorting order is second then first line (hence the [::-1])

>>> a[np.lexsort(a.T[::-1])]
array([[ 0, -1],
       [ 0,  0],
       [ 0,  1],
       [ 0,  1],
       [ 1, -1],
       [ 1,  0]])

I would expect lexsort to let me choose the axis and then sort from 0 to N-1, with a simple, similar to argsort syntax:np.lexsort(a,axis=0), and give a 1D array of indices.

adeak · 2021-02-27T22:31:12Z

I just ran into this while poking lexsort. It took me 10 minutes to figure out why I was getting the error

>>> np.lexsort(np.arange(2*3).reshape(2, 3), axis=-1)  # default axis
array([0, 1, 2])
>>> np.lexsort(np.arange(2*3).reshape(2, 3), axis=1)  # 1 is -1 in case of 2d, right?
Traceback (most recent call last):
  File "<stdin>", line 1, in <module>
  File "<__array_function__ internals>", line 5, in lexsort
numpy.AxisError: axis 1 is out of bounds for array of dimension 1

I now understand that lexsort always unpacks along the first dimension (interpreting arrays as mere iterables of arrays with one fewer dimension), and the axis keyword is understood in terms of these smaller arrays.

I haven't checked the blame in these past 3 years but the docstring is still confusing. There's no mention of the axis keyword beyond its own section in the docstring, and the rest of the docstring talks about 1d and 2d keys only (in which case the axis keyword is fairly redundant).

Should we try clarifying the docs? Looking at the above comments I'm not even sure the behaviour is always as expected? It would also be nice to be able to give a better error message, but I suspect that might need too much special-casing in the implementation (if I'm reading it correctly we'd have to patch somewhere around here).

braindevices · 2022-11-09T23:14:41Z

apparently this is still a problem

mattip added the 04 - Documentation label Feb 28, 2021

liang3zy22 mentioned this issue Oct 16, 2023

DOC: Update lexsort docstring for axis kwargs #24935

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

error in np.lexsort axis kwarg #10521

error in np.lexsort axis kwarg #10521

ogauthe commented Feb 4, 2018 •

edited

Loading

jaimefrio commented Feb 4, 2018

charris commented Feb 4, 2018

charris commented Feb 4, 2018

charris commented Feb 4, 2018

ogauthe commented Feb 4, 2018

adeak commented Feb 27, 2021 •

edited

Loading

braindevices commented Nov 9, 2022

error in np.lexsort axis kwarg #10521

error in np.lexsort axis kwarg #10521

Comments

ogauthe commented Feb 4, 2018 • edited Loading

jaimefrio commented Feb 4, 2018

charris commented Feb 4, 2018

charris commented Feb 4, 2018

charris commented Feb 4, 2018

ogauthe commented Feb 4, 2018

adeak commented Feb 27, 2021 • edited Loading

braindevices commented Nov 9, 2022

ogauthe commented Feb 4, 2018 •

edited

Loading

adeak commented Feb 27, 2021 •

edited

Loading