epochs event_id as dict and support for multiple marker numbers #133
Sounds easy enough. I can do it while I'm digging around in there anyway. Speaking of which, I'd like to add support for saving the standard errors, too, if it's not already in there. Did I just miss it somewhere?
something like epochs.average(comment='auditory', stderr=True/False) would be nice
very nice idea, +1 for that
glad you both like this idea. Regarding capturing the standard error, what you want is a logger; see for inspiration (or not, if too complex) this PR on scikit-learn: scikit-learn/scikit-learn#1171. But we could do something simpler...
I'm not sure what you mean by standard error---does it have something to do with the subject making errors? I just meant the standard deviation across epochs divided by the sqrt of the number of epochs, for each point in time (to complement the average at each point in time, for plotting purposes, generally)...
i.e., the data that gets put in the evoked files by putting "stderr" in the .ave files in the C code. As you can see, I'm a little hung up on the C code functionality :)
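For reference, the quantity being discussed (the standard deviation across epochs divided by the square root of the number of epochs, per channel and time point) can be sketched with plain NumPy; the array shapes and random data here are illustrative assumptions, not MNE code:

```python
import numpy as np

rng = np.random.RandomState(42)
# Hypothetical epochs array: (n_epochs, n_channels, n_times)
data = rng.randn(40, 2, 100)

n_epochs = data.shape[0]
# Mean and standard error across epochs: one value per channel and
# time point, so stderr can be plotted alongside the average.
mean = data.mean(axis=0)
stderr = data.std(axis=0, ddof=1) / np.sqrt(n_epochs)

print(mean.shape, stderr.shape)  # (2, 100) (2, 100)
```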
LOL. By the way, you probably know it's possible to compute the std in one pass:
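One well-known one-pass approach is Welford's algorithm (my example here, not necessarily the method the comment was linking to), which keeps a running mean and sum of squared deviations so no second sweep over the data is needed:

```python
import numpy as np

def one_pass_std(values):
    """Welford's one-pass algorithm: update a running mean and sum of
    squared deviations per sample, instead of computing the mean first
    and then sweeping the data a second time."""
    n = 0
    mean = 0.0
    m2 = 0.0  # running sum of squared deviations from the mean
    for x in values:
        n += 1
        delta = x - mean
        mean += delta / n
        m2 += delta * (x - mean)
    return (m2 / n) ** 0.5  # population std, matching np.std's default

vals = [2.0, 4.0, 4.0, 4.0, 5.0, 5.0, 7.0, 9.0]
print(one_pass_std(vals))  # 2.0
print(np.std(vals))        # 2.0
```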
I did not know that. I wonder which one numpy.std uses. |
you can read the code : https://github.com/numpy/numpy :) |
lol too :-) no, it was just related to a discussion alex and i had about smartening up the epochs objects. btw. -- somewhat related -- i'm about to finalize my ica WIP-PR soon. I was thinking about allowing the mixing / unmixing matrices to be included in the raw object, similar to projs, so that you could toggle the raw data between ica and channel space. does that make sense to you?
I would suggest enhancing epochs with an ultimate goal of also capturing single-trial properties. I have been working on some classes to represent events, which would provide a large range of functionality. If people like this idea, these classes could be used in one way or another to manage events. For the classes, look at an example or the documentation I have made so far. I have made a branch where I used these objects to implement basically the functionality you are asking for (there is an example script, but you would have to have my package installed). The branch adds a

I think such an event representation would be even more useful for loading events in the first place. This would make it easier to assign more than one label to each event, and to specify e.g. multifactorial designs. Each experimental paradigm would then only need a single label_events function that would add labels based on IDs (example). The whole selection of events could then be made using labels rather than IDs, e.g.:

```
>>> import loader
>>> ds = loader.load_evts('xxxxx/MEG/sample/sample_audvis_raw.fif')
Read a total of 3 projection items:
    PCA-v1 (1 x 102)  idle
    PCA-v2 (1 x 102)  idle
    PCA-v3 (1 x 102)  idle
Adding average EEG reference projection.
Created an SSP operator (subspace dimension = 4)
4 projection items activated
>>> print ds[:10]
eventID   i_start   condition   side   modality
-----------------------------------------------
2         27977     RA          R      A
3         28345     LV          L      V
1         28771     LA          L      A
4         29219     RV          R      V
2         29652     RA          R      A
3         30025     LV          L      V
1         30450     LA          L      A
4         30839     RV          R      V
2         31240     RA          R      A
3         31665     LV          L      V
>>> ds = ds.subset(ds['modality'] == 'A')
>>> print ds[:10]
eventID   i_start   condition   side   modality
-----------------------------------------------
2         27977     RA          R      A
1         28771     LA          L      A
2         29652     RA          R      A
1         30450     LA          L      A
2         31240     RA          R      A
1         32101     LA          L      A
2         32935     RA          R      A
1         33712     LA          L      A
2         34532     RA          R      A
1         35428     LA          L      A
>>>
```

And then epochs could be loaded just for the selected events. Minor comment: the
While we're on the topic of re-coding events: it came to my attention a bit ago that having different numbers of trials in two conditions A and B will cause a bias when you estimate the difference in the magnitude of the activities in the two conditions (e.g., when using the magnitude of the dSPM or current values in the two conditions). For an intuitive sense of this, consider that your noise will go down as sqrt(N) with N trials, so if there are 4x the number of trials in condition A versus condition B, you'll have half the noise amplitude in condition A---once you take the magnitude, noise that used to be zero-mean Gaussian now has a folded normal distribution with a non-zero mean, which biases condition B more than condition A. In any case, if we can build a way to do trial-count equalization at some level into this code, that would also be great. Right now we do it all offline with list-file I/O...
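The bias being described can be demonstrated with a small simulation (pure NumPy, with made-up trial counts): averaging more trials shrinks the noise by sqrt(N), and taking magnitudes turns zero-mean Gaussian noise into a folded normal whose mean grows with the noise level, so the condition with fewer trials is biased upward more.

```python
import numpy as np

rng = np.random.RandomState(0)
n_trials_a, n_trials_b, n_sims = 400, 100, 5000

# True signal is zero in both conditions; only the trial count differs,
# so the averages' noise stds are ~1/sqrt(400) vs ~1/sqrt(100).
noise_a = rng.randn(n_sims, n_trials_a).mean(axis=1)
noise_b = rng.randn(n_sims, n_trials_b).mean(axis=1)

# Taking magnitudes folds the zero-mean Gaussians into folded normals,
# whose means scale with the noise std: B is biased more than A.
bias_a = np.abs(noise_a).mean()
bias_b = np.abs(noise_b).mean()
print(bias_a < bias_b)  # True: fewer trials -> larger magnitude bias
```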
wow, that sounds really interesting. http://patsy.readthedocs.org/en/latest/index.html -- maybe one could create some kind of interface to the stats world via your approach / patsy / pandas.
good that you name it.
I'm working on adding functionality to save stderr as well as means in evoked (including from epochs.average()). It looks like the best way to do it is to modify the guts of the Evoked object to support multiple data sets, i.e. make evoked.data a list. This most closely mirrors how the files are stored. While we could make multiple Evoked objects per condition (or mean / stderr) you wanted to have, that makes for a bunch of unnecessary copies of channel data, etc., that we'd have to add conditions for, so I lean against that solution. From the bit of work I've done, it looks like we'd have to make the following things lists: evoked.data. Other than that, we should be able to leave the structures as-is. Unfortunately this would break backward compatibility for people reading these fields directly, but it seems like the cleanest option from a coding standpoint. What do people think? If backward compatibility is critical, I can instead work on improving read_evoked and write_evoked: instead of just calling the Evoked() and evoked.save() functions, respectively, I can try to get them to do something intelligent to combine (or split) these calls when reading or writing the evoked files. That would get us closer to what the C code did, storing every item in one place, at least.
In any case, it might make sense for now just to extend the functionality of read/write_evoked to allow reading and writing multiple event codes to FIF files in order to maintain backward compatibility. That's what I've implemented in PR 135; let me know what you think.
+1 for that, and +1 for denis' suggestion to find a way to expose epochs as data-frame-like objects: df = epochs.as_dataframe()?
and +1 for adding a summary method to Epochs to print things like:

    eventID   i_start   condition
    2         27977     RA

I am really open to suggestions. And print in seconds rather than index numbers, too.
since a trigger is always one integer, it would be better to do: event_id = {1: ['left', 'auditory'], 2: ['right', 'auditory'], 3: ['left', 'visual'], 4: ['right', 'visual']} @christianmbrodbeck would that suit your needs better?
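As a sketch of how such a trigger-to-labels dict could drive event selection (the select_events helper and the events array below are hypothetical illustrations, not existing MNE API):

```python
import numpy as np

# Proposed mapping: one integer trigger -> list of labels
event_id = {1: ['left', 'auditory'], 2: ['right', 'auditory'],
            3: ['left', 'visual'], 4: ['right', 'visual']}

# Events in MNE's (sample, previous value, trigger) layout, made up here
events = np.array([[27977, 0, 2], [28345, 0, 3],
                   [28771, 0, 1], [29219, 0, 4]])

def select_events(events, event_id, label):
    """Keep only events whose trigger carries the given label."""
    wanted = {trig for trig, labels in event_id.items() if label in labels}
    return events[np.isin(events[:, 2], list(wanted))]

auditory = select_events(events, event_id, 'auditory')
print(auditory[:, 2])  # triggers 2 and 1
```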
@ Data frames: actually I did something like that; it would look like this (inside Epochs):

```python
def to_data_frame(self, frame=True):
    """Get the epochs as a Pandas panel of data frames.

    Parameters
    ----------
    frame : bool
        If True, a data frame will be returned with a hierarchical
        epochs * time-slices index, else a panel object of
        channels * time-slices data frames for each epoch.

    Returns
    -------
    out : depending on arguments
        Data frame object or panel object.
    """
    import numpy as np
    import pandas as pa

    data = self.get_data()
    epoch_ids = ["Epoch %i" % (i + 1) for i in np.arange(data.shape[0])]
    ret = pa.Panel(data=data, items=epoch_ids, major_axis=self.ch_names)
    if frame:
        ret = ret.swapaxes(0, 1).to_frame()
        ret.index.names = ["epochs", "tsl"]
        return ret
    else:
        # the original snippet dropped this result; assign and return it
        return ret.swapaxes(1, 2)
```
And here is a minimal usage example:

```python
import mne
import numpy as np
from mne.fiff import Raw
from mne.datasets import sample
from pandas.stats.api import rolling_mean

data_path = sample.data_path('examples/')
raw_fname = data_path + '/MEG/sample/sample_audvis_filt-0-40_raw.fif'
raw = Raw(raw_fname)
events = mne.find_events(raw, stim_channel='STI 014')
exclude = raw.info['bads'] + ['MEG 2443', 'EEG 053']
picks = mne.fiff.pick_types(raw.info, meg=True, eeg=True, eog=True,
                            stim=False, exclude=exclude)
event_id = 1
tmin, tmax = -0.2, 0.5
baseline = (None, 0)
reject = dict(grad=4000e-13, mag=4e-12, eog=150e-6)
epochs = mne.Epochs(raw, events, event_id, tmin, tmax, proj=True, picks=picks,
                    baseline=baseline, preload=False, reject=reject)

epochs_df = epochs.to_data_frame()
meg_chs = [c for c in epochs.ch_names if c.startswith("MEG")]

# display some channels
epochs_df.ix[:, :10].head(20)

# split time slices
grouped_tsl = epochs_df[meg_chs].groupby(level='tsl')

# then create a quick average plot
grouped_tsl.mean().plot(legend=0)

# or a trellis plot on a few channels
grouped_tsl.mean()[meg_chs[:10]].plot(subplots=1)

# use median instead
grouped_tsl.median().plot(legend=0)

# use a custom numpy function
grouped_tsl.agg(np.std).plot(legend=0)

# average, then smooth using a rolling mean, and finally plot, in one single line!
grouped_tsl.apply(lambda x: rolling_mean(x.mean(), 10)).plot(legend=0)

# apply a different function for each channel
grouped_tsl.agg({"MEG 0113": np.mean, "MEG 0213": np.median})

# investigate epochs and create a string table for dumping into a file
grouped_epochs = epochs_df[meg_chs].groupby(level='epochs')
result_table = (grouped_epochs.max().ix[:, 1:3] * 1e15).to_string()

# investigate a specific channel's std across epochs
grouped_epochs.std()["MEG 0113"].plot()
```

What do you think of that, worth a PR?
@dengemann no, I haven't come across patsy, thanks for pointing it out! I will have to have a closer look at it. I have come across Pandas, but I understood it's not made for higher-dimensional data. For simplifying sensor analysis I've been using an additional "ndvar" class which I have not documented externally yet; the main idea is to have a numpy array that "knows" about its dimensions, which can be used in indexing and plotting.
Here are some examples with random data, and here is an example with the mne sample data. If that's of any use, it would be great to integrate it with other packages. A limitation is representing data from different types (e.g. gradiometer and magnetometer) together. However, I think source estimates could well be represented with ndvars, the relevant dimension being a source space.
That would be somewhat more flexible, but might get somewhat complicated with multiple factors. Also, what I mean by trial information is something like e.g. lexical frequency for words, which often cannot be encoded in the triggers because there are not enough values. Rather, we'd have an external list with those values and would have to use that information together with the data. Having a proper data model for the epochs would provide a natural way to interact with that information. However, especially because of the .fif file format limitations (?), maybe it would make more sense to store the data model externally? The epochs object could restrict itself to representing the case index in some way (so that after rejecting epochs the correspondence to the model can be reconstructed), but epochs could of course also represent simple label codes for situations where a more complex label is unnecessary; i.e., someone with a simple model would not have to worry about the model representation?
@christianmbrodbeck Hi chris, that looks very appealing, nice! The pandas folks are working on nd-dataframes. However, in fact the data frame already supports higher-dimensional structures via a hierarchical or multi-index, that is, a tuple-based index: http://pandas.pydata.org/pandas-docs/stable/indexing.html -- also see the recent WIP #137. You can easily achieve what we are looking for with pandas, e.g. subjects x conditions x trials x timeslices indices. So for now pandas might be a good choice for doing analysis-related restructuring. We could then catch up with the API step by step, at our own pace, and see where to set the internal / foreign functionality border. Does that make sense?
Hi @agramfort @Eric89GXL @mluessi @christianmbrodbeck --- what actually happened to this? Thinking about future directions, 'sexing' / 'smartening' up the Epochs object still is a nice one. Where are we at with this?
not far... I guess we need a dedicated contributor to embrace the project ;)
You damn pointer ;-)
i can only second that, each sentence.
Sure. @agramfort, @christianmbrodbeck, @mluessi, speak now or forever hold your peace...
Just to add a few thoughts, or just one, basically. Before the 0.5 release i am planning to refactor the pandas export. I think the dict features will make it straightforward to export proper design tables to pandas / R -- so you could easily feed back stats to an mne session at a critical point, the epochs processing. the equalisation features would be nice to obtain the equal cells expected by most procedures.
I like the idea.. but I think we should try not to include a bunch of new features right before the 0.5 release. Given that the idea is to have a release in less than 2 weeks, features that we include now will only receive minimal testing, and we may end up having a release that is buggy.
that's true too. so let's first make sure the new exciting dict feature does what we want, and also there are still other things pending / waiting...
Makes sense. Are we going to split development between stable 0.5 (patching bugs and the like) and development (0.6dev) versions? That would prevent delays on 0.6 feature development. But it is possible to just hold off on merging changes if that's easier to manage.
@Eric89GXL Sounds useful, and makes me think of my earlier proposal to integrate my factors to manage events: with those factors you could just use an interaction (I used the
internally you could then use those factors:
@christianmbrodbeck that would be cool, too. I think supporting both modes would be convenient, like a "simple" versus "advanced" mode. Sorry for being dense, but what is the ('None', 'None') in that example...?
i think that would be brilliant.
@mluessi I realized it might be as simple as sending my pull request for this design into a 0.6dev fork (which would also get the commits from 0.5 as they roll in). Is that right? |
@Eric89GXL
On 'smiley' and 'button' trials, modality and side are = 'None' :), so
@Eric89GXL I am answering your long text above. +1 for 1/. For 2/, this is somehow related to the mne.merge_events function; I am wondering if this logic should be done in Epochs or a priori, working with the events array. Just a remark: we should be careful not to have too-clever objects. Think about how you would explain it to a new user.
right, forgot about merge_events. so as it is now, you could at any time merge events before you create new epochs.
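A minimal sketch of that kind of pre-epoching merge, in the spirit of mne.merge_events (the helper below is an illustration, not the actual MNE implementation):

```python
import numpy as np

def merge_event_ids(events, old_ids, new_id):
    """Collapse several trigger codes into one: rows keep their sample
    index, only the trigger column is rewritten."""
    events = events.copy()
    mask = np.isin(events[:, 2], old_ids)
    events[mask, 2] = new_id
    return events

events = np.array([[100, 0, 1], [200, 0, 2], [300, 0, 3], [400, 0, 4]])
# Collapse left/right auditory (1, 2) into a single 'auditory' code 10
merged = merge_event_ids(events, [1, 2], 10)
print(merged[:, 2])  # [10 10  3  4]
```

Epochs created from `merged` would then treat the two original conditions as one, which is exactly the "merge before you create new epochs" workflow described above.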
Maybe it's 5 lines of code, so yes, but it gets messy; let's avoid too much complexity.
@agramfort I do expect it only to be about 5 lines of code. I'll draft a PR (for 0.6) and you can see what you think. The advantage to having the object be smarter is that, the way I've talked about it, you can load all the data exactly once (and save the epochs FIF), and then take a subset as required by the analysis. It should be substantially faster than having to re-load from raw every time.
as i proposed, if the reluctance is about the fear of polluting our Epochs object, we could write a function that lives in mne, just like merge_events.
this function could then do the remapping in a flexible fashion, to also support factors, patsy and the other smart things from #133. so the object may remain a bit less smart, and we can eat and have our cakes...
Ahh, I see what you mean. I'll do that! |
Great! Let's think about how we do it in a way that is extendible, either
…tuff to what's new. @chrstianmbrodbeck you also made contributions to 0.5, you should mention these. Added you to the dict as the rest of the #133 crew (hope no one is missing).
@dengemann, @agramfort, @christianmbrodbeck now that we've made some progress on this, my vote is to close this issue, and start a new, more clearly titled issue when someone formulates the next good potential extension (for 0.6, presumably).
Good bye #133 and thanks for the fruitful as well as nice discussion we had. Let's continue this in subsequent issues and PRs. |
I'd like to be able to write
and evoked_auditory.comment would then contain "Evoked auditory", and not "unknown" as it does now.
cc/ @Eric89GXL @mluessi @christianmbrodbeck @dengemann if you want to give it a try...