BaseRaw + io module #1003

agramfort · 2013-12-22T17:31:33Z

I'd like to summarize what I feel needs to be done on the BaseRaw class + io module. Here are the few steps I would take

rename mne.fiff to mne.io
mv mne/fiff/evoked.py to mne/evoked.py. It's much more natural to type mne.read_evoked etc...
mv mne/fiff/pick.py to mne/pick.py
add all the necessary deprecation warnings
make sure all examples still run
update all examples
make sure the doc still builds

then introduce a BaseRaw class in mne/io/base.py , define the interface and update the base class in every object in the Raw family

do I forget something?

any volunteer? :)

dengemann · 2013-12-24T09:08:31Z

@agramfort I'll take care of it over the holidays.

dengemann · 2013-12-24T14:09:27Z

I'm on it now

dengemann · 2013-12-24T16:34:41Z

@agramfort now that I'm in the middle of my edits it occurs to me how inconsistent it feels to have mne.Evoked, mne.Epochs but not mne.Raw., it really looks stupid next to each other in any example.
Also I think it would be cool to be able to say from mne import Raw, Epochs, Evoked
I think io should really be about converting, low level reading and writing.
My 2 cents.

agramfort · 2013-12-24T16:44:44Z

well the meaning of io is raw io in my mind.

the io module should contain only the BaseRaw and Raw* classes.

larsoner · 2013-12-24T16:49:13Z

@agramfort it would be strange to me to have a submodule named mne.io, but not have things like read_evoked, read_source_estimate, etc. not also be under it. That is what I had been assuming would happen -- all our I/O-related functions would be grouped under that module. Sorry I didn't bring it up earlier when you first proposed this...

dengemann · 2013-12-24T16:55:34Z

@agramfort I think we agree on this. We simply should expose the most frequently used callables in mne. To me this would be Raw but not Base classes or conversion classes. It really looks kind of unpractical in an example to import io only to access the Raw constructor while mne.Epochs and mne.Evoked works. Same for pick_types ... cleanup is overdue

agramfort · 2013-12-24T16:57:49Z

@Eric89GXL if we move all the read_* functions in io then we move many
many things there and need to split existing files / classes. Would
you move the source_estimate.py file there and remove the
SourceEstimate class from mne namespace? to me it would be too much.

agramfort · 2013-12-24T16:59:43Z

I have in mind to type:

mne.io.read_raw_fif
mne.io.read_raw_edf
mne.io.read_raw_kit

and

picks = mne.pick_types(...)

On Tue, Dec 24, 2013 at 5:55 PM, Denis A. Engemann <notifications@github.com

wrote:

@agramfort https://github.com/agramfort I think we agree on this. We
simply should expose the most frequently used callables in mne. To me
this would be Raw but not Base classes or conversion classes. It really
looks kind of unpractical in an example to import io only to access the
Raw constructor while mne.Epochs and mne.Evoked works. Same for pick_types... cleanup is overdue

—
Reply to this email directly or view it on GitHubhttps://github.com//issues/1003#issuecomment-31178748
.

dengemann · 2013-12-24T17:05:36Z

I know that I suggested this.
But thinking about it it while editing this I think a data object constructors should have a consistent top-level API. flat is better than nested :-) (don't reply with namespaces are a honking great idea).

On Dec 24, 2013, at 5:59 PM, Alexandre Gramfort notifications@github.com wrote:

I have in mind to type:

mne.io.read_raw_fif
mne.io.read_raw_edf
mne.io.read_raw_kit

and

picks = mne.pick_types(...)

On Tue, Dec 24, 2013 at 5:55 PM, Denis A. Engemann <notifications@github.com

wrote:

@agramfort https://github.com/agramfort I think we agree on this. We
simply should expose the most frequently used callables in mne. To me
this would be Raw but not Base classes or conversion classes. It really
looks kind of unpractical in an example to import io only to access the
Raw constructor while mne.Epochs and mne.Evoked works. Same for pick_types... cleanup is overdue

—
Reply to this email directly or view it on GitHubhttps://github.com//issues/1003#issuecomment-31178748
.

—
Reply to this email directly or view it on GitHub.

larsoner · 2013-12-24T17:05:43Z

@agramfort that makes sense. Do you think we should not expose Raw as a way of reading raw files, but instead have people use read_raw_fif (e.g., for the FIF case)? @dengemann one disadvantage to having mne.Raw is that it would break this pattern, and make FIF-raw reading "exceptional" in how it's accessed/used.

larsoner · 2013-12-24T17:08:39Z

I think it makes the most sense to deprecate direct instantiation of raw using Raw(...) and instead have people use read_raw_fif, since it's more explicit. And we can do the same thing with Evoked(...) versus read_evoked. This seems the cleanest, most consistent way to do it to me. I'm fine with having these be under mne or mne.io.

dengemann · 2013-12-24T17:14:42Z

i'm fine with functions, but it really should be consistent. no historical excuses ;-)
also I don't see strong reasons to withhold the most commonly used functions from being exposed as top-level functions. usability over taxonomic desires, please.

dengemann · 2013-12-24T17:24:53Z

I won't continue here before we reach consensus, this PR is too fidgety otherwise. Let's shoot for a post-holiday / xmas hangout if necessary.

larsoner · 2013-12-24T17:31:21Z

The only remaining debate is over whether to put the read_raw_* functions under mne or mne.io, right?

dengemann · 2013-12-24T17:32:38Z

The only remaining debate is over whether to put the read_raw_* functions under mne or mne.io, right?

and how many of all other read_* functions should follow, e.g. read_ica, etc.

likewise, how much of pick to expose in mne name space.

dengemann · 2013-12-24T17:34:08Z

I'd prefer mne to io (read already makes the purpose explicit) but consistency would even be more important to me. Now that we have the chance to change it we should stop scattering io functions across subpackages ...

mluessi · 2013-12-24T21:01:06Z

I think it would be good to have all file-reading functions in a separate sub-package, so I vote for either using mne.io.read_raw_fif etc. or mne.io.fiff.read_raw. The former would be a flatter hierarchy, the later would be better if we ever decide to be able to things other than raw from non-fif files. E.g., we would use mne.io.fiff.read_epochs() or mne.io.h5.read_epochs(). Having a separate sub-package for each file type will make it more obvious what read/write operations are supported (esp. when e.g. using auto-completion in IPython).

Regarding having a BaseRaw class in mne/io/base.py. I think we should put it in mne/raw.py, this will be more consistent with other types (Evoked etc.).

dengemann · 2013-12-24T21:45:04Z

I'm confused. Let me try to do some score-keeping.

so I vote for either using mne.io.read_raw_fif etc. or mne.io.fiff.read_raw

currently the thing fiff is contrasted with is not hdf5, etc., but formats from other MEG vendors using a suffix notation, not a general prefix notation. I'm not sure what the solution might be here. Maybe:

mne.io.fiff.read_raw
mne.io.kit.read_raw
mne.io.hdf5.read_raw
mne.io.hdf5_ctf_from_my_lab.read_raw

# and so on?

The former would be a flatter hierarchy, the later would be better if we ever decide to be able to things other than raw from non-fif files.

I agree. but currently this is not in sight. hdf5 might be the next thing cf. NEO support. We should definitely chose an API that does not block such future developments. on the other hand side we should not give too much weight to thing things we (Y)AGN(I). Admittedly, in this case is a bit different in that we try to find an API that is both maximum general and simple ....

Having a separate sub-package for each file type will make it more obvious what read/write operations are supported (esp. when e.g. using auto-completion in IPython).

well auto-completion works equally well with underscores ;-)

Regarding having a BaseRaw class in mne/io/base.py. I think we should put it in mne/raw.py, this will be more consistent with other types (Evoked etc.).

I'm trying to wrap up. So you (@mluessi) would like to have a base class in mne but all readers spreaded across format specific modules (prefix notation). @agramfort and @Eric89GXL want do deprecate constructors but basically converge on io semantics using a suffix notation. I would like to have either functions or constructors (preferably functions) in way that is as consistent and flat as possible without excuses (read_ica, read_stc, read_raw in one place OR in their dedicated packages, but not one top-level, the other nested). But I would actually have a strong preference for top level readers because read already indicates the purpose ( io ) while implementations and low level things should live in a dedicated io module. All great packages have general readers (numpy, scipy, pandas) while special tools live in nested io modules --- stuff you only want to get in, mostly read only. I'm really afraid that we'r about to do something really bureaucratic or too smart.

I guess we need postpone this to a point where all of us can meet in a hangout.

mluessi · 2013-12-24T21:59:31Z

I don't feel very strongly about it, I'm fine with either mne.io.read_raw_fif etc. or mne.io.fiff.read_raw. I just wanted to point out that the latter has potential advantages if we decide to support non-fiff formats for writing at some point.

agramfort · 2013-12-26T21:28:17Z

It looks like we really need to hangout to converge... let's plan this for after the holidays. I'm back at work on jan 6

larsoner · 2013-12-27T01:52:16Z

Jan 6th or later works for me, but I am not too passionate about it so I'm okay if you can work it out without me (esp. since I am the limiting time factor).

maedoc · 2014-01-15T10:11:31Z

@dengemann pointed me here to discuss how to create bipolar montages for stereotactic EEG, for which I'm currently manipulating the info and _data attributes on a Raw instance directly. He pointed out that the idiomatic way to do this is with an appropriate subclass.

While we are simply recomputing channel data and changing labels, bipolar sEEG is, technically, the use of adjacent channels as references, but I didn't see any way to do that in MNE currently.

Does this fit somewhere into the above discussion?

agramfort · 2014-01-15T10:15:26Z

can you share a gist of the "hack" you do with the info and the _data field?
we recently added a method to set the EEG reference in Raw object,
maybe we can do something for sEEG too.

dengemann · 2014-01-15T10:16:44Z

@mmWoodman indeed not all aspects of the above discussion match your use case. But the general topic BaseRaw is essentially on how to support custom data in MNE-Python objects that don't match the existing schemes. A gist would be helpful to see how you read in you sEEG data and how you do the referencing.

maedoc · 2014-01-15T10:52:30Z

I have four steps from the raw bti data import

l. 16 pick EEG channels
l. 58 pick ['chs'][:]['kind']==2 as after picking EEG, some MEG chs remained
l. 93 sort channels by name
l. 151 bipolarize channels

Sorry for the excessive length, it's exported from a notebook I'm working on.

dengemann · 2014-01-15T11:02:41Z

@mmWoodman that's indeed very helpful. It helps me understanding that the io is not the crucial part of your problem since you apparently record the sEEG with the 4D Magnes system. Maybe it's a separate issue but we could probably add parts of your code to the BTi converter and / or add a designated sEEG montage function ... to be discussed.

mluessi · 2014-01-15T15:05:56Z

Any chance the bipolar montage could be computed using an SSP projector? This would be more efficient and convenient.

mluessi · 2014-01-15T19:22:20Z

@mmWoodman if you can write it as a projection it may work. Have a look here for the EEG average reference projection:

https://github.com/mne-tools/mne-python/blob/master/mne/fiff/proj.py#L524

Using an SSP would have the advantage that you wouldn't need to use preloading and you can switch on/off the projection later (e.g. when plotting evoked responses).

maedoc · 2014-01-15T21:21:53Z

@mluessi thanks for the tip. Are these the projections in raw.info['projs']? Can they be sparse? I had to work with this by hand today because the picks= functionality didn't manipulate the projections correctly and the arrays (at least for the EEG average) are dense.

The preloading and switching on/off is indeed useful as we often see bipolar/monopolar as complementary, not exclusive, so I'll look into writing an appropriate function.

larsoner · 2014-01-15T21:24:34Z

Why do you need them to be sparse? I don't anticipate hitting memory problems, and it may or may not be faster depending on whether sparse matrix multiplication in scipy is actually faster than MKL + multithreaded BLAS in numpy...

It seems like if all you're doing is subtracting pairs of channels, it should be possible with a single matrix. Whether or not this can be represented as a projection vector, I'm not sure.

maedoc · 2014-01-15T21:34:10Z

They don't need to be sparse; 'twas but a curiosity. The number of sEEG electrodes doesn't exceed ~60, at least in our case due to the amplifier.

It does require a matrix, e.g. for a single implantation it would be (diag(ones((n, ))) - diag(ones((n-1, )), -1))[:-1].

agramfort · 2014-01-16T10:37:39Z

using the SSP mechanism seem complicated

I see a few things that could be done:

add an sEEG channel type in constants.FIFF and add the corresponding seeg
param to the pick_types functions in pick.py
create a function like apply_montage (for which we would define a good
API to be defined)

dengemann · 2014-01-16T10:46:20Z

In addition we could add a bti function for renaming / setting channel types.

maedoc · 2014-01-16T10:53:07Z

I opened #1070 to continue sEEG discussion and avoid derailing the original discussion here.

dengemann · 2014-01-27T08:50:02Z

@agramfort where would you put the RawBase?

agramfort · 2014-01-27T09:29:52Z

In fiff/base.py

On 27 janv. 2014, at 09:50, "Denis A. Engemann" notifications@github.com wrote:

@agramfort where would you put the RawBase?

—
Reply to this email directly or view it on GitHub.

dengemann · 2014-01-27T09:31:08Z

@agramfort I'll open a PR later such that we can discuss the interface the ABCMeta is supposed to dictate.

mluessi · 2014-01-27T14:40:25Z

Regarding being able to read non-fiff formats for things other than Raw, we are currently discussing read/write functions for FieldTrip data (Epochs, Evoked, etc.), which btw is the reason for mne-tools/mne-matlab#9 which will allow FT to read Epochs. So we should keep this in mind when designing the new io module.

dengemann · 2014-01-27T14:41:34Z

@mluessi I was also thinking about about .to_fieldtrip etc methods ...

dengemann · 2014-01-27T14:46:24Z

mluessi · 2014-01-27T14:46:45Z

@dengemann I think for saving, maybe we could have a format kwarg in the save methods.

mluessi · 2014-01-27T14:47:29Z

@dengemann ?? I don't get it..

dengemann · 2014-01-27T14:51:33Z

@mluessi vote for the ABCMeta methods ;-)

dengemann · 2014-01-27T15:12:17Z

@agramfort @mluessi a few of the attributes are computed during runtime. We would need to make them abc.abstractproperties to make them serve as interface definitions.
Another question is which things shall already be implemented in RawBase as concrete methods ...

Any thoughts?

mluessi · 2014-01-27T15:30:26Z

@agramfort why do you suggest to put the RawBase class into fiff/base.py? Maybe I'm misunderstanding something, but isn't the point of this PR to make IO less fiff-centric? I think it would make more sense to have the base class in mne/raw.py and format specific readers in the .py file for the respective file format.

@dengemann are you trying to decide which methods/attributes from Raw should go into BaseRaw? I think almost everything except the things that are fiff-specific (read block, etc). You want as many methods as possible in the base class so all the features are available in all derived classes (which implement few format specific methods for reading and writing).

agramfort · 2014-01-27T15:31:19Z

whatever method is not specific to fiff files :)

agramfort · 2014-01-27T15:35:41Z

first we create the base class then we refactor the io module to move stuff
from mne/fiff to mne/ ok?

mluessi · 2014-01-27T15:42:15Z

@agramfort +1 let's create the base class and the derived class for fiff first.

Given that we are now planing to implement readers for FT, do you guys think it is still the best idea to implement a flat hierarchy or would hierarchical be better? I.e., should we use mne.io.read_epochs_fieldtrip etc or put things for each file format in a separate sub-package, i.e., mne.io.fieldtrip.read_epochs. IMO, the later would allow for a cleaner organization and one could use from mne.io.fieldtrip import * for less typing.

agramfort · 2014-01-27T15:55:45Z

fine with mne.io.fieldtrip

dengemann · 2014-01-27T16:04:53Z

@agramfort the problem I see is that using a proper ABC class won't allow for inheritance. All methods expected by the API will be needed to be implemented by the advanced user, if only wrapping super calls to the base class. I'm not sure this is really what we want.

agramfort · 2014-01-27T16:23:28Z

abstract methods should be only the methods required by subclasses.

the other methods are inherited

agramfort · 2014-06-11T16:16:06Z

duplicate

dengemann mentioned this issue Dec 24, 2013

ENH: Add KIT-157 layout #1008

Merged

dengemann closed this as completed Dec 24, 2013

dengemann reopened this Dec 24, 2013

maedoc mentioned this issue Jan 16, 2014

Handle sEEG data sensibly #1070

Closed

dengemann mentioned this issue Feb 4, 2014

Next hangout #1113

Closed

dengemann mentioned this issue Apr 12, 2014

ENH: Make base raw class #1223

Merged

agramfort closed this as completed Jun 11, 2014

larsoner mentioned this issue Oct 26, 2021

Make mne.io.Raw available as mne.Raw #9560

Closed

BaseRaw + io module #1003

BaseRaw + io module #1003

Comments

agramfort commented Dec 22, 2013

dengemann commented Dec 24, 2013

dengemann commented Dec 24, 2013

dengemann commented Dec 24, 2013

agramfort commented Dec 24, 2013

larsoner commented Dec 24, 2013

dengemann commented Dec 24, 2013

agramfort commented Dec 24, 2013

agramfort commented Dec 24, 2013

dengemann commented Dec 24, 2013

larsoner commented Dec 24, 2013

larsoner commented Dec 24, 2013

dengemann commented Dec 24, 2013

dengemann commented Dec 24, 2013

larsoner commented Dec 24, 2013

dengemann commented Dec 24, 2013

dengemann commented Dec 24, 2013

mluessi commented Dec 24, 2013

dengemann commented Dec 24, 2013

mluessi commented Dec 24, 2013

agramfort commented Dec 26, 2013

larsoner commented Dec 27, 2013

maedoc commented Jan 15, 2014

agramfort commented Jan 15, 2014

dengemann commented Jan 15, 2014

maedoc commented Jan 15, 2014

dengemann commented Jan 15, 2014

mluessi commented Jan 15, 2014

mluessi commented Jan 15, 2014

maedoc commented Jan 15, 2014

larsoner commented Jan 15, 2014

maedoc commented Jan 15, 2014

agramfort commented Jan 16, 2014

dengemann commented Jan 16, 2014

maedoc commented Jan 16, 2014

dengemann commented Jan 27, 2014

agramfort commented Jan 27, 2014

dengemann commented Jan 27, 2014

mluessi commented Jan 27, 2014

dengemann commented Jan 27, 2014

dengemann commented Jan 27, 2014

mluessi commented Jan 27, 2014

mluessi commented Jan 27, 2014

dengemann commented Jan 27, 2014

dengemann commented Jan 27, 2014

mluessi commented Jan 27, 2014

agramfort commented Jan 27, 2014

agramfort commented Jan 27, 2014

mluessi commented Jan 27, 2014

agramfort commented Jan 27, 2014

dengemann commented Jan 27, 2014

agramfort commented Jan 27, 2014

agramfort commented Jun 11, 2014