Cannot create sparse matrix from size_t indexes #9437

david-cortes · 2018-11-04T09:44:09Z

If I try to create a sparse matrix in COOrdinate format with row and column indexes of type size_t, coo_matrix throws an error; it seems to cast size_t to float64 somewhere along the way.

The following code illustrates the problem:

import numpy as np
import ctypes
from scipy.sparse import coo_matrix

nrow = 10**4
ncol = 10**5
nnz = 10**6
np.random.seed(1)
row_id = np.random.randint(nrow, size=nnz).astype(ctypes.c_size_t)
col_id = np.random.randint(ncol, size=nnz).astype(ctypes.c_size_t)
vals = np.random.normal(size=nnz)

sp_matrix = coo_matrix((vals, (row_id, col_id)))

Throws:

Traceback (most recent call last):

  File "<ipython-input-1-5bbb7c9f9698>", line 13, in <module>
    sp_matrix = coo_matrix((vals, (row_id, col_id)))

  File "/home/david/anaconda3/lib/python3.6/site-packages/scipy/sparse/coo.py", line 150, in __init__
    self._shape = check_shape((M, N))

  File "/home/david/anaconda3/lib/python3.6/site-packages/scipy/sparse/sputils.py", line 281, in check_shape
    new_shape = tuple(operator.index(arg) for arg in args)

  File "/home/david/anaconda3/lib/python3.6/site-packages/scipy/sparse/sputils.py", line 281, in <genexpr>
    new_shape = tuple(operator.index(arg) for arg in args)

TypeError: 'numpy.float64' object cannot be interpreted as an integer

Passing the shape explicitly, however, allows the matrix to be constructed:

coo_matrix((vals, (row_id, col_id)), shape=(nrow, ncol))

<10000x100000 sparse matrix of type '<class 'numpy.float64'>'
	with 1000000 stored elements in COOrdinate format>

The problem is not present when the indexes are of int or long type:

coo_matrix((vals, (row_id.astype(ctypes.c_long), col_id.astype(ctypes.c_long))))

<10000x100000 sparse matrix of type '<class 'numpy.float64'>'
	with 1000000 stored elements in COOrdinate format>
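
Building on the signed-index workaround above, the cast can be wrapped in a small helper that also guards against values too large for a signed 64-bit integer. This is only a sketch: `to_signed_index` is a hypothetical name, not part of SciPy or NumPy, and it assumes int64 is an acceptable index type for the matrix being built.

```python
import numpy as np

def to_signed_index(idx):
    """Hypothetical helper (not a SciPy API): return idx as an int64 array."""
    idx = np.asarray(idx)
    if idx.dtype.kind not in "iu":
        raise TypeError("index array must have an integer dtype")
    # Refuse values that would wrap around when reinterpreted as int64.
    if idx.size and int(idx.max()) > np.iinfo(np.int64).max:
        raise ValueError("index value does not fit in int64")
    return idx.astype(np.int64, copy=False)

# Small stand-ins for the size_t arrays in the report.
row_id = np.array([0, 3, 7], dtype=np.uint64)
col_id = np.array([1, 2, 5], dtype=np.uint64)
row_signed = to_signed_index(row_id)
col_signed = to_signed_index(col_id)
print(row_signed.dtype, col_signed.dtype)
```

The converted arrays can then be passed to coo_matrix in place of the size_t ones, with or without an explicit shape.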

pv · 2018-11-04T12:48:19Z

The index needs to be signed integer, size_t is unsigned.

The error message is perhaps confusing, but it arises from type promotion: float64 is the next common type that both uint64 and signed integers can be cast to.

EDIT: ok, the shape check should not fail like this nevertheless
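
The promotion described above can be checked directly in NumPy. The snippet below is a minimal sketch, not the SciPy code path itself: it assumes the inferred dimension is computed as the largest index plus one, and uses an explicit `np.int64(1)` to stand in for the signed operand so the result does not depend on how a given NumPy version treats Python integers.

```python
import operator
import numpy as np

# uint64 and int64 have no common integer type, so NumPy falls back to float64.
promoted = np.result_type(np.uint64, np.int64)
print(promoted)  # float64

# Assumed stand-in for "max index + 1" when the index is unsigned
# and the other operand is a signed integer.
dim = np.uint64(9999) + np.int64(1)
print(type(dim).__name__)

# operator.index is what check_shape calls in the traceback; it rejects floats.
try:
    operator.index(dim)
    index_ok = True
except TypeError as exc:
    index_ok = False
    print(exc)
```

With signed indexes on both sides the sum stays an integer, which is why the int and long variants in the report work.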

david-cortes mentioned this issue Nov 4, 2018

BUG: Initialize coo matrix with size_t indexes #9438

Merged

WarrenWeckesser added the defect and scipy.sparse labels Nov 4, 2018

rgommers closed this as completed in #9438 Feb 28, 2019

rgommers added this to the 1.3.0 milestone Feb 28, 2019
