If we want to mimic pandas, we should support automatic alignment of coordinate labels in:

- Mathematical operations (in-place; see also WIP: Automatic label alignment for mathematical operations #184)
- All operations that add new dataset variables (merge, update, __setitem__)
- All operations that create a new dataset (__init__, concat)

For the latter two cases, it is not clear that using an inner join on coordinate labels is the right choice, because that could lead to some surprising destructive operations. This should be considered carefully.
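To make the concern concrete, here is a small sketch using pandas (whose alignment semantics this issue proposes to mimic) showing why an inner join can be destructive: non-shared labels are silently dropped, whereas an outer join preserves them.

```python
import pandas as pd

s1 = pd.Series([1.0, 2.0, 3.0], index=["a", "b", "c"])
s2 = pd.Series([10.0, 20.0, 30.0], index=["b", "c", "d"])

# An inner join keeps only the shared labels: "a" and "d" are silently dropped.
inner1, inner2 = s1.align(s2, join="inner")
print(list(inner1.index))  # ['b', 'c']

# An outer join keeps every label, padding the gaps with NaN.
outer1, outer2 = s1.align(s2, join="outer")
print(list(outer1.index))  # ['a', 'b', 'c', 'd']
```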
I have most of a working implementation for this that will be up for a PR shortly. Here's some of my thinking on expected behavior.
Based on the principle that combining arrays into a new dataset should not remove information, it makes sense to use outer joins for Dataset.__init__ and Dataset.merge. This is the same behavior pandas uses.
When adding an item to an existing dataset, it would be surprising if indexes or dimension sizes changed. So I think we should be using left joins for __setitem__ and update. This is also what pandas does.
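Again for reference, the pandas behavior being described: assigning a Series into an existing DataFrame reindexes the Series to the frame's index (a left join), so the frame's shape and labels never change.

```python
import pandas as pd

df = pd.DataFrame({"x": [1, 2]}, index=["a", "b"])
s = pd.Series([10, 20, 30], index=["b", "c", "d"])

# __setitem__ left-joins on the existing index: "c" and "d" are
# discarded, "a" gets NaN, and df keeps its original shape.
df["y"] = s
print(list(df.index))  # ['a', 'b']
```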
Right now, we use Dataset.merge to handle all operations that add new items to a dataset. Adding automatic alignment is turning that into even more of a kludgy mess than it already was. I think some simplification of scope for update/__setitem__/merge would help, though I'm not sure it's worth breaking existing code.