Analyzer extension exit status by jonahpearl · Pull Request #3347 · SpikeInterface/spikeinterface

jonahpearl · 2024-08-28T17:27:16Z

Here's a first pass at adding information about an extension's completion (or not) to its metadata.

This PR adds a run_info.json file to each extension's folder. This contains keys:

run_completed (whether or not the call to AnalyzerExtension._run() finished),
data_loadable (whether or not AnalyzerExtension.load_data() can be called without an error),
runtime_s (the runtime of AnalyzerExtension._run() in seconds).

The core logic is implemented in AnalyzerExtension.run(), which wraps the call to the extension-specific _run(). When SortingAnalyzer.get_extension() is then called (it redirects to AnalyzerExtension.load() internally) and the run wasn't completed, it simply returns None, as if the extension didn't exist. The user can catch that and re-run the extension.
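The wrapping described above could look roughly like the following. This is a minimal sketch, not the code in this PR: the class `SketchExtension`, its `folder` attribute, and the helper `load_run_info` are hypothetical names for illustration. Only the file name run_info.json and the keys run_completed and runtime_s come from the description (data_loadable is left out here).

```python
import json
import time
from pathlib import Path


class SketchExtension:
    """Hypothetical stand-in for an analyzer extension (not the real class)."""

    def __init__(self, folder):
        self.folder = Path(folder)
        self.run_info = {"run_completed": False, "runtime_s": None}

    def _run(self):
        # Extension-specific computation would go here.
        self.data = {"values": [1, 2, 3]}

    def run(self):
        # Write the "not completed" state first, so an interrupted run leaves a trace on disk.
        self._save_run_info()
        t_start = time.perf_counter()
        self._run()
        self.run_info["runtime_s"] = time.perf_counter() - t_start
        self.run_info["run_completed"] = True
        self._save_run_info()

    def _save_run_info(self):
        self.folder.mkdir(parents=True, exist_ok=True)
        with open(self.folder / "run_info.json", "w") as f:
            json.dump(self.run_info, f, indent=4)


def load_run_info(folder):
    """Read run_info.json back from an extension folder."""
    with open(Path(folder) / "run_info.json") as f:
        return json.load(f)
```

If `_run()` raises, the file on disk still says `run_completed: false`, which is what lets a later load treat the extension as absent.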

There is a mild complication with extensions that use the run_node_pipeline if called through compute_several_extensions, because then AnalyzerExtension.run() is never actually called. Instead, it looks like AnalyzerExtension.save() is used, so I added the relevant lines there to catch that, and assume that if the code makes it there, the run is completed.

TODO:

I'm not sure if the implementations in merge() and copy() are correct; I don't fully understand when those methods are expected to be called or what wraps them.
Write some tests?

Notes:

It is a bit of a slow-down to check if the data are loadable after each run. I'm ambivalent as to whether that check is actually useful, especially since data corruption is more likely to happen later, sometimes long after the run, than immediately after it.
It seemed more reasonable to add a separate run_info.json file than to cram this into info.json, since info.json is more about the code itself.

samuelgarcia · 2024-08-29T07:51:26Z

Hi Jonah.
Thanks a lot, this looks really cool.
I will have a deeper look into this.

data_loadable and run_completed are a bit redundant (unless the computation is done but the writing failed).

jonahpearl · 2024-08-29T14:20:47Z

I also fixed the edge case where (for some reason) the run completes, but then the data file gets deleted; it will now raise a warning upon loading the sorting analyzer, and return None on calls to get_extension(), since the extension is no longer usable and should be re-computed as if it had never been.
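A rough sketch of that load-time check, under stated assumptions: the function name `load_extension_or_none` and the data file name `data.npy` are hypothetical, and a missing run_info.json is treated as "not completed" here, which is a simplification that ignores folders written before this change.

```python
import json
import warnings
from pathlib import Path


def load_extension_or_none(extension_folder, data_file_name="data.npy"):
    """Return the data file path if the extension looks usable, else None.

    Hypothetical helper: the real loading code handles more formats and cases.
    """
    extension_folder = Path(extension_folder)
    run_info_file = extension_folder / "run_info.json"
    if not run_info_file.is_file():
        # Simplification: no run_info.json is treated as "never completed".
        return None

    run_info = json.loads(run_info_file.read_text())
    if not run_info.get("run_completed", False):
        return None

    data_file = extension_folder / data_file_name
    if not data_file.is_file():
        # The run finished, but the data has since disappeared.
        warnings.warn(f"Extension data missing in {extension_folder}; it should be re-computed.")
        return None
    return data_file
```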

jonahpearl · 2024-08-29T15:13:40Z

Also, I still think my naive / default usage of this code would be to check has_extension(), and if that's true, assume everything is ok. Sure, the "check for None" pattern can be shown in the docs, but it's still odd that has_extension() now essentially means "is there a folder", regardless of whether there's data. It looks like internally, sometimes it's used to mean "is there a folder", like here, but then sometimes it means "is there usable data", like here or here or here. One could imagine having a private function for "is there a folder here", and either getting rid of the public-facing version in favor of get_extension() plus a None check, or just making has_extension() essentially do the None check for you.

Just taking the third example:

if sorting_analyzer.has_extension("spike_amplitudes"):
    peak_amplitudes = sorting_analyzer.get_extension("spike_amplitudes").get_data()
else:
    peak_amplitudes = None

right now, that will fail if the extension doesn't have data, since has_extension() will come back True but then get_extension() will return None and the call to get_data() will raise an AttributeError. I guess currently the preferred implementation would be

spike_amps = sorting_analyzer.get_extension("spike_amplitudes")
if spike_amps:
    peak_amplitudes = spike_amps.get_data()
else:
    peak_amplitudes = None

(or with a ternary operator plus assignment expression it can fit in one line)

peak_amplitudes = _ext.get_data() if (_ext := sorting_analyzer.get_extension("spike_amplitudes")) else None

but the pattern still just feels confusing 🤷
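One way to hide the None check behind a single call is a small helper. This is only a sketch: `get_extension_data`, `FakeAnalyzer`, and `FakeExtension` are hypothetical names, and the fake analyzer just mimics the behaviour discussed above (has_extension() reports a folder, get_extension() returns None when there is no usable data).

```python
class FakeExtension:
    """Hypothetical extension object exposing get_data()."""

    def __init__(self, data):
        self._data = data

    def get_data(self):
        return self._data


class FakeAnalyzer:
    """Hypothetical analyzer mimicking the behaviour described in this thread."""

    def __init__(self, folders, extensions):
        self._folders = set(folders)          # extensions that have a folder on disk
        self._extensions = dict(extensions)   # extensions that have usable data

    def has_extension(self, name):
        # "Is there a folder", regardless of whether the run completed.
        return name in self._folders

    def get_extension(self, name):
        # None when the extension is missing or unusable.
        return self._extensions.get(name)


def get_extension_data(analyzer, extension_name, default=None):
    """Return the extension's data, or `default` if the extension is unusable."""
    ext = analyzer.get_extension(extension_name)
    return default if ext is None else ext.get_data()
```

Comparing with `is None` rather than relying on truthiness keeps the check explicit and avoids surprises if an extension object ever defines `__bool__` or `__len__`.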

alejoe91 · 2024-09-05T08:21:45Z

@samuelgarcia this is good to merge on my side. I added backward-compatibility logic for loading folders/zarr produced prior to this change.

samuelgarcia · 2024-09-11T06:50:26Z

            for r, result in enumerate(results):
                extension_name, variable_name = result_routage[r]
                extension_instances[extension_name].data[variable_name] = result
+                extension_instances[extension_name].run_info["runtime_s"] = runtime_s


OK for me, but if we want the total run time, then summing the run times will be wrong because this run time is shared.

This is the best estimate we can get :)
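To make the caveat concrete, here is a small arithmetic sketch with made-up numbers: the extension names `ext_a`, `ext_b`, `ext_c` and the 12 s figure are hypothetical. Each extension computed in the same pipeline pass is assigned the same shared runtime, so a naive sum over-counts.

```python
# Hypothetical: three extensions computed in ONE shared pipeline pass that took 12 s.
shared_runtime_s = 12.0
run_infos = {
    "ext_a": {"runtime_s": shared_runtime_s},
    "ext_b": {"runtime_s": shared_runtime_s},
    "ext_c": {"runtime_s": shared_runtime_s},
}


def naive_total_runtime(run_infos):
    """Sum runtime_s over all extensions (over-counts shared pipeline runs)."""
    return sum(info["runtime_s"] for info in run_infos.values())


naive_total_s = naive_total_runtime(run_infos)  # 36.0, although only 12.0 s elapsed
```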

samuelgarcia · 2024-09-11T06:52:59Z

Thank you very much, Jonah and Alessio.

jonahpearl and others added 4 commits August 28, 2024 11:37

save run_info to check extensions for completion

5c351a4

save run time

725b208

bug fixes for pipeline extensions

d95a754

[pre-commit.ci] auto fixes from pre-commit.com hooks

1699413

for more information, see https://pre-commit.ci

samuelgarcia reviewed Aug 29, 2024

Comment thread src/spikeinterface/core/sortinganalyzer.py Outdated

alejoe91 added the core Changes to core module label Aug 29, 2024

alejoe91 reviewed Aug 29, 2024

Comment thread src/spikeinterface/core/sortinganalyzer.py Outdated

jonahpearl and others added 7 commits August 29, 2024 09:18

Merge branch 'main' into analyzer_extension_exit_status

74ca832

switch to perf counter

869b01a

use perf counter

554f6e3

remove data_loadable and _check_data_loadable

be023ed

edge case where data file is deleted

943e398

[pre-commit.ci] auto fixes from pre-commit.com hooks

f9d7c04

for more information, see https://pre-commit.ci

always return None if extension data is missing

8e9acfc

[pre-commit.ci] auto fixes from pre-commit.com hooks

f1a3d97

for more information, see https://pre-commit.ci

Merge branch 'main' into analyzer_extension_exit_status

c3e742e

alejoe91 reviewed Sep 4, 2024

Comment thread src/spikeinterface/core/sortinganalyzer.py

Fix error in load data, ensure backward compatibility, and add tests

dd53372

alejoe91 reviewed Sep 4, 2024

Comment thread src/spikeinterface/core/sortinganalyzer.py Outdated

Update src/spikeinterface/core/sortinganalyzer.py

8240ee9

alejoe91 approved these changes Sep 5, 2024

alejoe91 added this to the 0.101.1 milestone Sep 10, 2024

alejoe91 reviewed Sep 10, 2024

Comment thread src/spikeinterface/core/sortinganalyzer.py Outdated

Update src/spikeinterface/core/sortinganalyzer.py

5769eff

alejoe91 reviewed Sep 10, 2024

Comment thread src/spikeinterface/core/sortinganalyzer.py Outdated

alejoe91 reviewed Sep 10, 2024

Comment thread src/spikeinterface/core/sortinganalyzer.py Outdated

alejoe91 reviewed Sep 10, 2024

Comment thread src/spikeinterface/core/sortinganalyzer.py

alejoe91 reviewed Sep 10, 2024

Comment thread src/spikeinterface/core/sortinganalyzer.py Outdated

Apply suggestions from code review

cc21f06

alejoe91 approved these changes Sep 10, 2024

samuelgarcia reviewed Sep 11, 2024

samuelgarcia approved these changes Sep 11, 2024

samuelgarcia merged commit 469187a into SpikeInterface:main Sep 11, 2024

samuelgarcia merged 17 commits into SpikeInterface:main from jonahpearl:analyzer_extension_exit_status

3 participants
