I feel confused about data_utils.read_sparse_file function #24

xuehuachunsheng · 2022-09-19T12:25:25Z

When I run
true_labels = data_utils.read_sparse_file('Sandbox/Data/EUR-Lex/tst_X_Y.txt')
it raises ValueError: buffer size must be a multiple of element size, from sparse.py line 257,
i.e., indices = np.frombuffer(ind, np.int64)
I see that ind is an array with typecode 'l', and each element in ind seems to take 4 bytes on my machine, according to https://docs.python.org/3/library/array.html.

So I am confused: why is the buffer loaded with dtype np.int64?

My machine runs a 64-bit OS.
The Python version is 3.8.8.


kunaldahiya · 2022-09-19T16:34:43Z

Hi

Are you running it on Windows?

xuehuachunsheng · 2022-09-20T02:15:20Z

Yes. So where is the problem?

kunaldahiya · 2022-09-20T16:49:27Z

The issue is Windows specific. Please refer to the solution mentioned here.
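
For context, here is a minimal sketch of why the error is platform dependent. It is not the library's code and not necessarily the linked solution. The array typecode 'l' maps to a C long, which is 4 bytes on 64-bit Windows but 8 bytes on most 64-bit Linux and macOS builds, so reading that buffer as np.int64 fails whenever the byte count is not a multiple of 8. The helper name to_int64 is hypothetical; it reads the buffer at its native item size and then widens to int64.

```python
from array import array

import numpy as np

# 'l' is a C long: itemsize is 4 on 64-bit Windows, 8 on most 64-bit Linux/macOS.
ind = array('l', [0, 3, 7])


def to_int64(buf):
    """Hypothetical helper: read buf at its own item size, then widen to int64."""
    # Build a signed integer dtype that matches the array's real element width.
    native_dtype = np.dtype(f'i{buf.itemsize}')
    native = np.frombuffer(buf, dtype=native_dtype)
    # astype copies, so the result no longer shares memory with buf.
    return native.astype(np.int64)


indices = to_int64(ind)
print(ind.itemsize, indices.dtype, indices.tolist())
```

With three 4-byte elements the buffer is 12 bytes, which is not a multiple of 8, so np.frombuffer(ind, np.int64) raises the ValueError above. With an even number of elements it would not raise but would silently return wrong values, which is why matching the dtype to itemsize is the safer approach.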

xuehuachunsheng closed this as completed Sep 22, 2022
