
added size_splits to functional #3837


Merged: 9 commits merged into pytorch:master on Jan 4, 2018

Conversation

@ptrblck (Collaborator) commented Nov 22, 2017

This pull request addresses issue #3223.

The split function splits tensors into equally sized chunks.
split_sizes lets the user define a list of sizes, one per chunk.

tf.split combines both functionalities in one function. Maybe this is also desired for PyTorch?

split_sizes seems to be a bit slower (6.991s vs. 6.704s).
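
For illustration, a rough sketch of how the combined API reads once both modes live in one function, mirroring the torch.split(tensor, split_size_or_sections, dim) signature discussed later in the thread (shapes in the comments are for this example only):

import torch

x = torch.randn(10, 4)

# An int splits into equal chunks of that size along dim (the last chunk may be smaller).
a, b = torch.split(x, 5, dim=0)             # shapes: (5, 4) and (5, 4)

# A list gives one chunk per entry; the entries must sum to x.size(dim).
p, q, r = torch.split(x, [2, 3, 5], dim=0)  # shapes: (2, 4), (3, 4), (5, 4)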

@colesbury (Member)

Yeah, I think a single function would be nicer. Both tf.split and numpy.split have that sort of API.

@ptrblck (Collaborator, Author) commented Nov 24, 2017

Thanks for the feedback, I will merge it into torch.split then.

@ptrblck (Collaborator, Author) commented Nov 25, 2017

I merged both functions now.

Before that, I timed all functions on my machine (tensor = torch.randn(200, 10, 2, 2) split into 40 chunks of size 5 in dim=0):

  • merged function with split_size_or_sections = [5] * 40: ~8.1586s
  • merged function with split_size_or_sections = 5: ~7.4982s
  • native split with split_size=5: ~7.2688s

What are your thoughts? Any suggestions on the code / naming / documentation?
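
For reference, a minimal sketch of how such a comparison could be reproduced with timeit; the exact benchmark script is not part of this thread, and the iteration count here is arbitrary:

import timeit
import torch

tensor = torch.randn(200, 10, 2, 2)

# Same workload as above: 40 chunks of size 5 along dim=0.
t_sections = timeit.timeit(lambda: torch.split(tensor, [5] * 40, dim=0), number=100000)
t_int      = timeit.timeit(lambda: torch.split(tensor, 5, dim=0), number=100000)
print(t_sections, t_int)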

@ezyang (Contributor) commented Nov 27, 2017

@pytorchbot test this please

@flennerhag

@ptrblck great initiative, I've been using my own wrapper for a while. It would be nice to have it in the codebase.

You could simplify the code quite a bit, though, by using plain Python operations instead of invoking torch overhead, e.g. something like:

def split(tensor, sizes, dim=0):
    if dim < 0:
        dim += tensor.dim()

    if isinstance(sizes, int):
        # original code ...
        return chunks

    if tensor.size(dim) != sum(sizes):
        raise ValueError("Sizes do not match tensor size in dim")

    # narrow(dim, start, length): each section starts at the running offset
    offsets = [0]
    for size in sizes[:-1]:
        offsets.append(offsets[-1] + size)
    return tuple(tensor.narrow(dim, offset, size)
                 for offset, size in zip(offsets, sizes))

Should be slightly faster too.
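
For example, the sketch above could be exercised like this (list-of-sizes branch only, since the int branch is elided):

import torch

x = torch.randn(10, 3)
parts = split(x, [2, 3, 5], dim=0)
print([p.size(0) for p in parts])  # [2, 3, 5]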

@soumith (Member) commented Dec 18, 2017

@ptrblck as soon as you add unit tests for the list-of-splits case, I can merge this in.

@ptrblck (Collaborator, Author) commented Dec 18, 2017

@flennerhag Thanks for the suggestions! I changed some PyTorch calls to plain Python operations.
@soumith I also added some tests in test_split. If that's not sufficient, I can add more test cases.
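
Illustrative only; the actual assertions in test_split may differ. A minimal check for the list-of-sizes case might look like:

import torch

x = torch.randn(12, 3)
sections = [2, 4, 6]
chunks = torch.split(x, sections, dim=0)

assert len(chunks) == len(sections)
assert [c.size(0) for c in chunks] == sections
assert torch.equal(torch.cat(chunks, dim=0), x)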

@soumith (Member) commented Jan 4, 2018

@pytorchbot test this please

@soumith merged commit 7c729e6 into pytorch:master on Jan 4, 2018

@soumith (Member) commented Jan 4, 2018

Thanks a lot @ptrblck!
