In [1]:
%load_ext autoreload
%autoreload 2

The autoreload extension is already loaded. To reload it, use:
  %reload_ext autoreload


In [2]:
import spikeextractors as se
import spiketoolkit as st
import spikewidgets as sw
import time
import numpy as np
import matplotlib.pylab as plt
import scipy.signal as ss
%matplotlib notebook

18:58:00 [I] klustakwik KlustaKwik2 version 0.2.6


### Create toy example dataset

In [3]:
# recording, sorting = se.example_datasets.toy_example(num_channels=4, duration=30)
# The same MEArec file holds both the recording and the ground-truth sorting
recording_file = '/home/alessiob/Documents/Codes/MEArec/data/recordings/recordings_20cells_Neuronexus-32_10.0_10.0uV_20-02-2019:15:11.h5'
recording = se.MEArecRecordingExtractor(recording_file)
sorting = se.MEArecSortingExtractor(recording_file)

Assuming `sorting` is the output of a spike sorter, the `postprocessing` module extracts the relevant information from the paired recording and sorting.

### Extracting waveforms

Waveforms are extracted with the `getUnitWaveforms` function, which cuts snippets out of the recording around each detected spike. Once extracted, they can be stored in the `SortingExtractor` object as features. The window before and after each spike event is set in milliseconds with `ms_before` and `ms_after`. Waveforms are returned as a list of NumPy arrays, one per unit, each of shape (n_spikes, n_channels, n_points).

In [4]:
wf = st.postprocessing.getUnitWaveforms(recording, sorting, ms_before=1, ms_after=2, 
                                        save_as_features=True, verbose=True)

Waveform 1/20
Waveform 2/20
Waveform 3/20
Waveform 4/20
Waveform 5/20
Waveform 6/20
Waveform 7/20
Waveform 8/20
Waveform 9/20
Waveform 10/20
Waveform 11/20
Waveform 12/20
Waveform 13/20
Waveform 14/20
Waveform 15/20
Waveform 16/20
Waveform 17/20
Waveform 18/20
Waveform 19/20
Waveform 20/20


Now `waveforms` is a unit spike feature!

In [5]:
sorting.get_unit_spike_feature_names()
wf[0].shape

(48, 32, 96)
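
To make the shape above concrete, here is a minimal NumPy sketch of snippet extraction on synthetic data. It is not the `spiketoolkit` implementation: the helper `extract_snippets`, the 32 kHz sampling rate, and the choice to skip spikes too close to the edges of the recording are all assumptions made for illustration. With 1 ms before and 2 ms after at 32 kHz, each snippet has 96 points, which is consistent with the last dimension printed above.

```python
import numpy as np

def extract_snippets(traces, spike_frames, fs, ms_before=1.0, ms_after=2.0):
    """Cut fixed-length snippets around each spike (hypothetical helper).

    traces: array of shape (n_channels, n_samples)
    spike_frames: spike times as sample indices
    Returns an array of shape (n_spikes, n_channels, n_points).
    """
    n_before = int(round(ms_before * fs / 1000.0))
    n_after = int(round(ms_after * fs / 1000.0))
    n_samples = traces.shape[1]
    # Assumption: spikes without a full window inside the recording are skipped
    valid = [t for t in spike_frames
             if t - n_before >= 0 and t + n_after <= n_samples]
    snippets = np.zeros((len(valid), traces.shape[0], n_before + n_after))
    for i, t in enumerate(valid):
        snippets[i] = traces[:, t - n_before:t + n_after]
    return snippets

rng = np.random.default_rng(0)
fs = 32000  # assumed sampling rate in Hz
traces = rng.normal(size=(4, fs))  # 1 s of synthetic noise on 4 channels
spike_frames = np.array([10, 5000, 12000, 31990])  # first and last lack a full window
snippets = extract_snippets(traces, spike_frames, fs)
print(snippets.shape)  # (2, 4, 96)
```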

In [6]:
# plotting waveforms of units 0,1,2 on channel 0
plt.figure()
_ = plt.plot(wf[0][:, 0, :].T, color='k', lw=0.3)
_ = plt.plot(wf[1][:, 0, :].T, color='r', lw=0.3)
_ = plt.plot(wf[2][:, 0, :].T, color='b', lw=0.3)


If a certain property (e.g. `group`) is present in the `RecordingExtractor`, waveforms can be extracted only on the channels sharing that property using the `grouping_property` and `compute_property_from_recording` arguments. For example, if channels [0, 1] are in group 0 and channels [2, 3] are in group 1, then a unit whose waveform peak is on channel 0 or 1 is assigned to group 0 and its waveforms have 2 channels; the same holds for group 1.

In [7]:
channel_groups = [[0, 1], [2, 3]]
for ch in recording.get_channel_ids():
    for gr, channel_group in enumerate(channel_groups):
        if ch in channel_group:
            recording.set_channel_property(ch, 'group', gr)
print(recording.get_channel_property(0, 'group'))

0


In [8]:
wf_by_group = st.postprocessing.getUnitWaveforms(recording, sorting, ms_before=1, ms_after=2, 
                                                 save_as_features=False, verbose=True,
                                                 grouping_property='group', compute_property_from_recording=True)

# now waveforms will only have 2 channels
print(wf_by_group[0].shape)

Waveforms by property:  group


ValueError: This property has not been added to this channel
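
A likely cause of the `ValueError` above is that `channel_groups` only covers channels 0 to 3, while the MEArec recording loaded here has 32 channels, so most channels never receive a `group` property. The sketch below shows one way to build a group for every channel; `assign_groups` and the choice of chunking consecutive channel IDs are assumptions for illustration, not part of `spiketoolkit`.

```python
def assign_groups(channel_ids, group_size):
    """Map every channel ID to a group index by chunking consecutive channels.

    Hypothetical helper: real probes may need groups based on geometry instead.
    """
    return {ch: i // group_size for i, ch in enumerate(channel_ids)}

channel_ids = list(range(32))  # stand-in for recording.get_channel_ids()
groups = assign_groups(channel_ids, group_size=2)
print(groups[0], groups[1], groups[2], groups[31])  # 0 0 1 15

# With a real RecordingExtractor, the mapping would be applied as in In [7]:
# for ch, gr in groups.items():
#     recording.set_channel_property(ch, 'group', gr)
```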

### Templates (EAP)

Similarly to waveforms, templates (average waveforms) can be extracted with the `getUnitTemplate` function. When spike trains contain many spikes, the number of waveforms used can be capped with `max_num_waveforms`. If waveforms have already been computed and stored as `features`, those will be used. Templates can be saved as unit properties.

In [9]:
templates = st.postprocessing.getUnitTemplate(recording, sorting, max_num_waveforms=200,
                                              save_as_property=True, verbose=True)

Using 'waveforms' features


In [10]:
sorting.get_unit_property_names()

['template']
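
As an illustration of what a template is, the following sketch averages a stack of synthetic waveforms, optionally using only a capped number of them. It is a minimal example under stated assumptions: `compute_template` is a hypothetical helper, and taking the mean over a random subset is one plausible reading of `max_num_waveforms`, not necessarily what `getUnitTemplate` does internally.

```python
import numpy as np

def compute_template(waveforms, max_num_waveforms=None, seed=0):
    """Average waveforms of shape (n_spikes, n_channels, n_points) over spikes."""
    n_spikes = waveforms.shape[0]
    if max_num_waveforms is not None and n_spikes > max_num_waveforms:
        # Assumption: use a random subset without replacement
        rng = np.random.default_rng(seed)
        keep = rng.choice(n_spikes, size=max_num_waveforms, replace=False)
        waveforms = waveforms[keep]
    return waveforms.mean(axis=0)

rng = np.random.default_rng(1)
waveforms = rng.normal(size=(500, 32, 96))  # synthetic waveforms for one unit
template = compute_template(waveforms, max_num_waveforms=200)
print(template.shape)  # (32, 96)
```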

In [11]:
# plotting templates of units 0,1,2 on all channels
plt.figure()
_ = plt.plot(templates[0].T, color='k')
_ = plt.plot(templates[1].T, color='r')
_ = plt.plot(templates[2].T, color='b')


### Maximum channel

In the same way, one can get the recording channel with the maximum amplitude for each unit and save it as a property.

In [12]:
max_chan = st.postprocessing.getUnitMaxChannel(recording, sorting, save_as_property=True, verbose=True)
print(max_chan)

Using 'template' property
[5, 26, 13, 28, 24, 31, 22, 30, 25, 22, 1, 21, 12, 0, 21, 8, 0, 26, 1, 0]


In [13]:
sorting.get_unit_property_names()

['max_channel', 'template']
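
A minimal sketch of how a maximum channel can be derived from a template is shown below. The helper `max_channel` and the criterion (largest peak absolute amplitude) are assumptions for illustration; `getUnitMaxChannel` may use a different criterion, and this sketch returns a row index rather than a channel ID.

```python
import numpy as np

def max_channel(template):
    """Row index of the channel with the largest peak absolute amplitude.

    template: array of shape (n_channels, n_points)
    """
    peak_per_channel = np.max(np.abs(template), axis=1)
    return int(np.argmax(peak_per_channel))

template = np.zeros((4, 96))
template[1, 32] = -30.0  # smaller negative peak on channel 1
template[2, 32] = -80.0  # largest negative peak on channel 2
print(max_channel(template))  # 2
```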

### PCA scores

For some applications, for example validating the spike sorting output, PCA scores can be computed.


In [14]:
pca_scores = st.postprocessing.computePCAScores(recording, sorting, n_comp=3, verbose=True)

for pc in pca_scores:
    print(pc.shape)

Using 'waveforms' features
Fitting PCA of 3 dimensions on 1649 waveforms
(48, 3)
(29, 3)
(36, 3)
(45, 3)
(43, 3)
(62, 3)
(60, 3)
(68, 3)
(40, 3)
(89, 3)
(53, 3)
(52, 3)
(46, 3)
(61, 3)
(88, 3)
(181, 3)
(155, 3)
(112, 3)
(135, 3)
(246, 3)


In [15]:
fig = plt.figure()
ax = fig.add_subplot(111)
ax.plot(pca_scores[0][:,0], pca_scores[0][:,1], 'r*')
ax.plot(pca_scores[2][:,0], pca_scores[2][:,1], 'b*')


PCA scores can also be computed electrode-wise. In the previous example, PCA was applied to the concatenation of the waveforms over channels.

In [16]:
pca_scores_by_electrode = st.postprocessing.computePCAScores(recording, sorting, n_comp=3, by_electrode=True)

for pc in pca_scores_by_electrode:
    print(pc.shape)

Fitting PCA of 3 dimensions on 52768 waveforms
(48, 32, 3)
(29, 32, 3)
(36, 32, 3)
(45, 32, 3)
(43, 32, 3)
(62, 32, 3)
(60, 32, 3)
(68, 32, 3)
(40, 32, 3)
(89, 32, 3)
(53, 32, 3)
(52, 32, 3)
(46, 32, 3)
(61, 32, 3)
(88, 32, 3)
(181, 32, 3)
(155, 32, 3)
(112, 32, 3)
(135, 32, 3)
(246, 32, 3)


In this case, as expected, 3 principal components are extracted for each electrode.
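
The difference between the two modes comes down to how the waveform array is reshaped before PCA. The sketch below reproduces the output shapes for a single synthetic unit using a plain SVD-based PCA; `pca_fit_transform` is a hypothetical helper, and the actual `computePCAScores` fits on the waveforms of all units pooled together (1649 waveforms, or 1649 × 32 = 52768 per-electrode waveforms in the logs above).

```python
import numpy as np

def pca_fit_transform(X, n_comp):
    """Project the rows of X onto its first n_comp principal components."""
    Xc = X - X.mean(axis=0)  # center each feature
    _, _, vt = np.linalg.svd(Xc, full_matrices=False)
    return Xc @ vt[:n_comp].T

rng = np.random.default_rng(2)
wf = rng.normal(size=(48, 32, 96))  # synthetic (n_spikes, n_channels, n_points)
n_spikes, n_channels, n_points = wf.shape

# Concatenated: one sample per spike, channels flattened into the features
scores = pca_fit_transform(wf.reshape(n_spikes, n_channels * n_points), 3)

# By electrode: one sample per (spike, channel) pair, features are time points
scores_el = pca_fit_transform(wf.reshape(n_spikes * n_channels, n_points), 3)
scores_el = scores_el.reshape(n_spikes, n_channels, 3)

print(scores.shape, scores_el.shape)  # (48, 3) (48, 32, 3)
```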

In [17]:
fig = plt.figure()
ax = fig.add_subplot(111)
# first principal component on electrode 0 vs. electrode 1, for units 0 and 2
ax.plot(pca_scores_by_electrode[0][:, 0, 0], pca_scores_by_electrode[0][:, 1, 0], 'r*')
ax.plot(pca_scores_by_electrode[2][:, 0, 0], pca_scores_by_electrode[2][:, 1, 0], 'b*')


### Data curation using Phy

Finally, it is common to visualize and manually curate the data after spike sorting.
To do so, we interface with Phy (https://phy-contrib.readthedocs.io/en/latest/template-gui/).

First, we need to export the data to the phy format:

In [24]:
st.postprocessing.exportToPhy(recording, sorting, output_folder='phy', electrode_dimensions=[1,2])

Fitting PCA of 5 dimensions on 52768 waveforms
Saved phy format to:  /home/alessiob/Documents/Codes/spike_sorting/spiketoolkit/examples/phy
Run:

phy template-gui  /home/alessiob/Documents/Codes/spike_sorting/spiketoolkit/examples/phy/params.py


In [26]:
!phy template-gui  /home/alessiob/Documents/Codes/spike_sorting/spiketoolkit/examples/phy/params.py --debug

19:03:06 [I] klustakwik KlustaKwik2 version 0.2.6
19:03:06 [D] __init__:48          Copy /home/alessiob/Documents/Codes/expipe/cinpla-base/src/phy-contrib/phycontrib/template/static/state.json to /home/alessiob/.phy/TemplateGUI/state.json
19:03:06 [D] model:395            Loading spike clusters.
19:03:06 [D] model:410            Loading templates.
19:03:06 [D] model:431            Loading the whitening matrix.
19:03:06 [D] model:435            Loading the inverse of the whitening matrix.
19:03:06 [D] model:101            Loading traces at `/home/alessiob/Documents/Codes/spike_sorting/spiketoolkit/examples/phy/recording.dat`.
19:03:06 [D] context:65           Initialize joblib cache dir at `/home/alessiob/Documents/Codes/spike_sorting/spiketoolkit/examples/phy/.phy`.
19:03:06 [D] config:46            Load config file `/home/alessiob/.phy/phy_config.py`.
19:03:06 [D] __init__:33          Loading 0 plugins.


In this case, we manually merged two units in Phy. We can load the curated data back using the `PhySortingExtractor`:

In [22]:
sorting_curated = se.PhySortingExtractor('phy/')

In [25]:
print('Before curation: ', len(sorting.get_unit_ids()))
print('After curation: ', len(sorting_curated.get_unit_ids()))

Before curation:  10
After curation:  9
