cuda error in extract_spikes #316

rajatsaxena · 2021-02-01T23:56:31Z

I get the following error:

799.31 sec, 5601 batches, 7512789 spikes 
813.56 sec, 5701 batches, 7529466 spikes 
827.84 sec, 5801 batches, 7541652 spikes 
842.04 sec, 5901 batches, 7551935 spikes 
856.31 sec, 6001 batches, 7560302 spikes 
870.63 sec, 6101 batches, 7567715 spikes 
884.90 sec, 6201 batches, 7574967 spikes 
899.16 sec, 6301 batches, 7582074 spikes 
913.43 sec, 6401 batches, 7588447 spikes 
927.68 sec, 6501 batches, 7594498 spikes 
Error using gpuArray/subsasgn
An unexpected error occurred trying to launch a kernel. The CUDA error was:
invalid configuration argument

Error in extract_spikes (line 97)
    st(5,:) = cF;

Error in main_kilosort3_swil3 (line 40)
[rez, st3, tF]     = extract_spikes(rez);

It seems like in a random batch, the output variable st has the shape 4 x N rather than 6 x N as expected from spikedetector3PC. I will have to go through the mex code to understand why there is a shape mismatch. Let me know if you need any more information.

The text was updated successfully, but these errors were encountered:

marius10p · 2021-02-02T00:28:42Z

That looks like an uninformative CUDA error. Does it always stop in the same place? Can you try to see what's different about that batch? You might also need to upgrade your Nvidia drivers. What is your GPU?

rajatsaxena · 2021-02-02T08:16:08Z

It does always stop at the same place. I couldn't find anything weird with the batch except the batch before this had variable st with size 6 x 0. I tried skipping this batch and running through a different set of batches: got the same error with the same pattern, i.e., the previous batch had variable st with size 6 x 0. The two erroneous batches are of 6 x 23 and 6 x 13 size.

I have updated the drivers. GPU = Nvidia GeForce 1070 Ti

marius10p · 2021-02-02T12:15:05Z

It's probably the batch with 0 spikes where it errored (CUDA errors come up asynchronously, on the next GPU operation). And it's probably 0 spikes because something went wrong inside the kernel, not because there are really 0 spikes (can you check, do you see spikes in that batch?).

rajatsaxena · 2021-02-02T18:20:07Z

yeah, I see spikes in that batch.

AlexSonneborn · 2021-02-04T18:57:15Z

Hi @rajatsaxena, have you made any progress with this in the last couple of days? I just got Kilosort 3 and am also having this problem. It only occurs with certain recordings though, maybe because of a lack of spikes in a batch?

shirquinn · 2021-02-09T11:01:43Z

Hi, @rajatsaxena @AlexSonneborn , any news? I'm also encountering this error

DradeAW · 2021-02-09T14:40:44Z

Hi,

I'm also having the same issue.
Here is my output:

>> main_kilosort3
Looking for data inside /users/nsr/wyngaard/Documents/cells_tracking/0153/session_1/rec 
Time   0s. Computing whitening matrix.. 
Getting channel whitening matrix... 
Channel-whitening matrix computed. 
Time   0s. Loading raw data and applying filters... 
Time  13s. Finished preprocessing 147 batches. 
vertical pitch size is 1.250000e+01 
horizontal pitch size is 250 
  -25.0000  104.1667  233.3333  362.5000  491.6667  620.8333  750.0000

    39

0.04 sec, 1 batches, 109 spikes 
2.80 sec, 101 batches, 10689 spikes 
4.05 sec, 147 batches, 36229 spikes 
time 10.70, Shifted up/down 147 batches. 
0.03 sec, 1 batches, 236 spikes 
Error using gpuArray/subsasgn
An unexpected error occurred trying to launch a kernel. The CUDA error was:
invalid configuration argument

Error in extract_spikes (line 97)
    st(5,:) = cF;

Error in main_kilosort3 (line 40)
[rez, st3, tF]     = extract_spikes(rez);

bryzgalovdm · 2021-02-12T10:25:04Z

Hello,

I had the same error due to mismatch between real and indicated number of channels. (I indicated 68 channels for 64-channel recording).
Maybe, it will help someone - check whether your input (nChan, groups) matches the reality of your data.

DradeAW · 2021-02-12T11:11:51Z

Ah yes, I forgot to change ops.NchanTOT (which was still at 385 instead of 64).

Thank you @bryzgalovdm !

sujayane · 2021-03-14T14:57:57Z

I still have this problem with ops.NchanTOT=384 (i.e right # of channels for NP probe). @rajatsaxena @AlexSonneborn - did you solve this by correcting the channel count?

sujayane · 2021-03-14T20:52:37Z

My problem was also #of channels after all. ops.NchanTOT should be 385 (384 neural channels + 1 sync channel) as was originally in the config file.

JoseGuzman · 2021-03-16T16:29:36Z

I also find this problem with the a correct ops.NchatTOT. It happens when Kilosort3 and Kilosort2.5 finds a long period without spikes -like blank periods at the beginning of the recording, see #358 - but also in recordings where these absent periods occurs for some reason.

0.03 sec, 1 batches, 0 spikes 
Error using gpuArray/subsasgn
An unexpected error occurred trying to launch a kernel. The CUDA error was:
invalid configuration argument

Error in extract_spikes (line 97)
    st(5,:) = cF;

Error in main_kilosort3 (line 49)
[rez, st3, tF]     = extract_spikes(rez);

I can get rid of in Kilosort-2.5 if I add ops.nblocks=0 to my config file

gawygawy · 2021-03-25T10:41:23Z

Hi, I also find that if if I turn off the registration I can avoid this error for certain recordings. Why is that?

JoseGuzman · 2021-03-25T10:54:57Z

I'm not very sure about it but pull #288 suggests a solution that may work when you have zeros in some channels...

shirquinn · 2021-04-01T20:46:38Z

even when nblocks =0, and even with #288 solution, i keep getting this error with kilosort 3
i record with a 32 channel linear neuronexus electrode.. if anyone have more suggestions please share

RishiRajalingham · 2021-04-10T00:51:30Z

@gawygawy @shirquinn This is my guess: this error occurs when there are no spikes in a registration block in a batch. This could be because the batch contains a period of time when there are no spikes (e.g. paused streaming with zero padding), or all the electrodes in that block aren't measuring spiking activity (e.g. all channels in that block sitting out of cortex).

A fix for the former is to make your batch size bigger, or to remove those batches beforehand (as in #288).
A fix for the latter is to make the registration blocks bigger (make ops.nblocks smaller).

jingjie-li · 2021-06-11T14:03:49Z

I also got this kind of error with my data. But setting nblocks =0 cannot solve that.
The problem is that I have a short batch which has firing_rate=0. A way to solve this is to make those batches larger.
To do that, we tried to set a breakpoint at line 15 of https://github.com/MouseLand/Kilosort/blob/main/preProcess/preprocessDataSub.m

and run ops.NT = ops.NT*2.
Therefore we can set the batch size doubled. And it can avoid that error in my data.

Rubinsteinlab · 2022-05-02T12:18:33Z

@celelion , do you just add ops.NT = ops.NT*2 to line 15? can you send me the code to see?
Thank you

johnmbarrett mentioned this issue Feb 3, 2021

idrop is bigger than W in triageTemplates2 #44

Closed

rajatsaxena closed this as completed Feb 12, 2021

JoseGuzman mentioned this issue Mar 16, 2021

Error in spike times when customizing ops.trange(1) #358

Closed

benjamin-heasly mentioned this issue Apr 26, 2023

Retry two GPU operations. benjamin-heasly/Kilosort#2

Merged

carsen-stringer mentioned this issue Feb 29, 2024

adding rtd and badges #595

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

cuda error in extract_spikes #316

cuda error in extract_spikes #316

rajatsaxena commented Feb 1, 2021 •

edited

marius10p commented Feb 2, 2021

rajatsaxena commented Feb 2, 2021

marius10p commented Feb 2, 2021

rajatsaxena commented Feb 2, 2021

AlexSonneborn commented Feb 4, 2021

shirquinn commented Feb 9, 2021

DradeAW commented Feb 9, 2021

bryzgalovdm commented Feb 12, 2021

DradeAW commented Feb 12, 2021

sujayane commented Mar 14, 2021

sujayane commented Mar 14, 2021

JoseGuzman commented Mar 16, 2021 •

edited

gawygawy commented Mar 25, 2021

JoseGuzman commented Mar 25, 2021 •

edited

shirquinn commented Apr 1, 2021

RishiRajalingham commented Apr 10, 2021

jingjie-li commented Jun 11, 2021

Rubinsteinlab commented May 2, 2022

cuda error in extract_spikes #316

cuda error in extract_spikes #316

Comments

rajatsaxena commented Feb 1, 2021 • edited

marius10p commented Feb 2, 2021

rajatsaxena commented Feb 2, 2021

marius10p commented Feb 2, 2021

rajatsaxena commented Feb 2, 2021

AlexSonneborn commented Feb 4, 2021

shirquinn commented Feb 9, 2021

DradeAW commented Feb 9, 2021

bryzgalovdm commented Feb 12, 2021

DradeAW commented Feb 12, 2021

sujayane commented Mar 14, 2021

sujayane commented Mar 14, 2021

JoseGuzman commented Mar 16, 2021 • edited

gawygawy commented Mar 25, 2021

JoseGuzman commented Mar 25, 2021 • edited

shirquinn commented Apr 1, 2021

RishiRajalingham commented Apr 10, 2021

jingjie-li commented Jun 11, 2021

Rubinsteinlab commented May 2, 2022

rajatsaxena commented Feb 1, 2021 •

edited

JoseGuzman commented Mar 16, 2021 •

edited

JoseGuzman commented Mar 25, 2021 •

edited