scatter_add for non variable tensors #2358

Closed
altaetran opened this issue May 13, 2016 · 33 comments
Labels
stat:contribution welcome Status - Contributions welcome

Comments

@altaetran

Hi,

I am interested in using scatter_add when the tensor being updated is not a variable. Is this possible?

I am looking to do something like this:

X1_ph = tf.placeholder(tf.float32, shape=(None, 3))
ind_ph = tf.placeholder(tf.int32, shape=(None,))

N_feat = 3
#Z = tf.Variable(tf.zeros([10, 3]))
Z = tf.zeros([10, N_feat])

X1 = np.array([[1, 0.00, 1],
               [2, 0.00, 1],
               [3, 0.00, 1],
               [5, 0.00, 1.1],
               [6, 1.0, 1.8]])

ind = [0, 1, 1, 0, 0]

Z = tf.scatter_add(Z, ind_ph, X1)

If I declare Z as a tf.Variable, I can do this, but I need to call this operation hundreds of thousands of times, and do not want to store any copies of Z once I am done with them. If I were to declare Z as a Variable, would there be any way to destroy Z once I am done with it (maybe with a garbage collector or something similar)? Thank you so much for your help!

@girving
Contributor

girving commented May 14, 2016

Use tf.sparse_to_dense.
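
For context, a minimal sketch of tf.sparse_to_dense in TF 1.x (the shapes and values below are made up for illustration). Note that it places values at distinct indices and, unlike scatter_add, does not accumulate duplicate indices, so it covers construction of a dense tensor rather than repeated accumulation:

import tensorflow as tf

# Illustrative only: build a dense [4, 3] tensor with given values at given
# (row, col) positions; every other entry is default_value.
indices = tf.constant([[0, 0], [1, 2], [3, 1]])
values = tf.constant([1.0, 2.0, 3.0])
dense = tf.sparse_to_dense(indices, output_shape=[4, 3],
                           sparse_values=values, default_value=0.0)

with tf.Session() as sess:
    print(sess.run(dense))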

@girving
Copy link
Contributor

girving commented May 14, 2016

Actually, by "this operation" do you mean the whole thing, or do you want to allocate a Z and do a bunch of separate scatters into it before deallocating it?

@altaetran
Author

I'd like to be able to allocate Z, do a bunch of scatters, and then deallocate it while allocating another Z for a different version. Is there a function I can call to deallocate a variable?

@girving
Contributor

girving commented May 14, 2016

@altaetran: We could certainly make a deallocation op, but I don't think we have one at the moment? However, using it would be somewhat awkward.

@yuanbyu: Do you have any ideas?

@altaetran
Author

Is there any way to retrofit the scatter_add function to work directly on regular tensors produced by other tensorflow operations?

@girving
Contributor

girving commented May 16, 2016

Non-Variable tensors are immutable, so supporting scatter into them would break an important part of the model. The uninitialize op is much easier. Would you be interested in submitting a patch? :)

@mrry
Contributor

mrry commented May 16, 2016

Judging by #2367, it appears that @altaetran also requires gradients for this operation. It sounds to me like a functional op would be preferable for this purpose.

@rryan
Member

rryan commented Jul 14, 2016

/me whistles innocently ;)

var = gen_state_ops._temporary_variable(shape=shape, dtype=tensor_dtype)

@girving
Contributor

girving commented Jul 14, 2016

Excellent. I won't have time to work on this PR in the near term. @rryan Would you want to either take over or elaborate about how to use temporary_variable for this purpose?

@rryan
Member

rryan commented Jul 15, 2016

It won't help with the gradient bit (and this is a non-public op), but based on your example code you could do:

from tensorflow.python.ops import gen_state_ops

Z = gen_state_ops._temporary_variable(shape=..., dtype=...)
Z_name = Z.op.name
destroy_op = gen_state_ops._destroy_temporary_variable(Z, var_name=Z_name)
X1_ph = tf.placeholder(tf.float32, shape=(None, 3))
ind_ph = tf.placeholder(tf.int32, shape=(None,))
Z = tf.scatter_add(Z, ind_ph, X1_ph)


with tf.Session() as sess:
  for _ in xrange(steps):
    # run Z with placeholders filled in for X1_ph and ind_ph
    sess.run(Z, feed_dict={X1_ph: ..., ind_ph: ...})
  # clean up -- destroy_op returns the value of Z and destroys the temporary variable
  sess.run(destroy_op)

@girving
Contributor

girving commented Jul 15, 2016

@rryan Is it possible to define valid gradients for this using gradient_override_map if we use temporary_variable?
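
(For readers following along: gradient_override_map itself is used roughly as in the sketch below, with @tf.RegisterGradient registering a replacement gradient under a new name. Whether that machinery can give a valid gradient for scatter_add into a temporary variable is exactly the open question here; the op and gradient names below are made up for illustration.)

import tensorflow as tf

# Register a gradient function under a made-up name.
@tf.RegisterGradient("MyIdentityGrad")
def _my_identity_grad(op, grad):
    # Pass-through gradient, purely for illustration.
    return grad

g = tf.Graph()
with g.as_default():
    x = tf.placeholder(tf.float32, shape=[3])
    # Tell the graph to use the registered gradient instead of the default
    # one for ops of type "Identity" created inside this context.
    with g.gradient_override_map({"Identity": "MyIdentityGrad"}):
        y = tf.identity(x)
    dy_dx = tf.gradients(y, x)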

@aselle aselle removed the triaged label Jul 28, 2016
@girving
Contributor

girving commented Aug 9, 2016

Removing my assignment since I won't have time to work on this personally. @rryan Could you comment on my gradient question? If there's a reasonable path forward we can mark this contributions welcome, but I'm not sure how the temporary variables stuff works.

@girving girving removed their assignment Aug 9, 2016
@girving girving added the stat:awaiting tensorflower Status - Awaiting response from tensorflower label Aug 9, 2016
@yaroslavvb
Contributor

I thought there was a thread a while back (Feb) from @vanhoucke about how to do scatter_add without using variables. If you can do without variables, you can use the same op hundreds of thousands of times by using persistent Tensors, since memory is recycled as soon as the Python handle is unassigned.
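
For reference, the raw persistent-tensor mechanism looks roughly like this in TF 1.x (a minimal sketch following the tf.get_session_handle documentation, not specific to scatter_add):

import tensorflow as tf

with tf.Session() as sess:
    z = tf.zeros([10, 3])
    # Produce a handle to a persistent tensor that outlives this run call.
    handle = sess.run(tf.get_session_handle(z))

    # Feed the handle back in to keep computing on the stored value.
    holder, z_restored = tf.get_session_tensor(handle.handle, tf.float32)
    z_next = z_restored + 1.0
    print(sess.run(z_next, feed_dict={holder: handle.handle}))
    # The stored value can be released explicitly via tf.delete_session_tensor,
    # or implicitly when the Python handle object is no longer referenced.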


@altaetran
Author

Hey, this sounds really promising. Do you think you could provide a simple example of how one might do this? I'm not too familiar with persistent tensors. Thanks!

@yaroslavvb
Contributor

sure, I'll put an example together tomorrow


@girving girving added stat:contribution welcome Status - Contributions welcome and removed stat:awaiting tensorflower Status - Awaiting response from tensorflower labels Aug 10, 2016
@yaroslavvb
Contributor

@altaetran here's an example that uses a helper I wrote to simplify dealing with persistent tensors

  1. Download imperative.py and save it in a place where you can import it (i.e., same directory as your script)

import tensorflow as tf
import imperative
env = imperative.Env(tf)
tfi = env.tf

N_feat = 3
Z = tfi.zeros([10, N_feat])
X1 = tfi.constant([[1, 0.00, 1],
                   [2, 0.00, 1],
                   [3, 0.00, 1],
                   [5, 0.00, 1.1],
                   [6, 1.0, 1.8]])

ind = [0, 1, 1, 0, 0]
for right_pos, left_pos in enumerate(ind):
    new_row = Z[left_pos, :] + X1[right_pos, :]
    # turn vector into 1-by-x matrix so we can concat it
    new_row_mat = tfi.reshape(new_row, [1, -1])
    # make new Tensor with old row replaced by updated version
    Z = tfi.concat(0, [Z[:left_pos, :], new_row_mat, Z[left_pos+1:, :]])
print Z

That should give

ITensor([[ 12.           1.           3.89999986]
 [  5.           0.           2.        ]
 [  0.           0.           0.        ]
 [  0.           0.           0.        ]
 [  0.           0.           0.        ]
 [  0.           0.           0.        ]
 [  0.           0.           0.        ]
 [  0.           0.           0.        ]
 [  0.           0.           0.        ]
 [  0.           0.           0.        ]], dtype=float32)

I'm working on more docs for imperative.py, meanwhile there's an overview with some slides here
https://github.com/yaroslavvb/imperative/blob/master/imperative_slides.pdf

@AdityaGudimella

AdityaGudimella commented Jan 23, 2017

Would this work? Since this is just using TensorFlow ops under the hood, it propagates gradients too.

def scatter_add_tensor(ref, indices, updates, name=None):
    """
    Adds sparse updates to a tensor.

    This operation outputs ref after the update is done. This makes it easier to chain operations that need to use the
    updated value.

    Duplicate indices are handled correctly: if multiple indices reference the same location, their contributions add.

    Because this is built on tf.scatter_nd, indices must be (multi-)indices into ref, e.g. shape [num_updates, 1] when
    adding rows to a 2-D ref, and requires updates.shape = indices.shape[:-1] + ref.shape[indices.shape[-1]:].
    :param ref: A Tensor. Must be one of the following types: float32, float64, int64, int32, uint8, uint16,
        int16, int8, complex64, complex128, qint8, quint8, qint32, half.
    :param indices: A Tensor. Must be one of the following types: int32, int64. A tensor of indices into the first
        dimension of ref.
    :param updates: A Tensor. Must have the same dtype as ref. A tensor of updated values to add to ref.
    :param name: A name for the operation (optional).
    :return: Same as ref. Returned as a convenience for operations that want to use the updated values after the update
        is done.
    """
    with tf.name_scope(name, 'scatter_add_tensor', [ref, indices, updates]) as scope:
        ref = tf.convert_to_tensor(ref, name='ref')
        indices = tf.convert_to_tensor(indices, name='indices')
        updates = tf.convert_to_tensor(updates, name='updates')
        ref_shape = tf.shape(ref, out_type=indices.dtype, name='ref_shape')
        scattered_updates = tf.scatter_nd(indices, updates, ref_shape, name='scattered_updates')
        with tf.control_dependencies(
                [tf.assert_equal(ref_shape, tf.shape(scattered_updates, out_type=indices.dtype))]):
            output = tf.add(ref, scattered_updates, name=scope)
        return output
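
For example (a hypothetical usage sketch mirroring the numbers from the original question; the indices are shaped [N, 1] because of the scatter_nd contract noted above):

import tensorflow as tf

ref = tf.zeros([10, 3])
indices = tf.constant([[0], [1], [1], [0], [0]], dtype=tf.int32)
updates = tf.constant([[1, 0.00, 1],
                       [2, 0.00, 1],
                       [3, 0.00, 1],
                       [5, 0.00, 1.1],
                       [6, 1.0, 1.8]], dtype=tf.float32)

result = scatter_add_tensor(ref, indices, updates)
with tf.Session() as sess:
    # rows 0 and 1 accumulate their duplicate updates; other rows stay zero
    print(sess.run(result))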

@aliosmanulusoy

Thanks @AdityaGudimella ! I've tested the gradients and it seems to work. Can anybody else confirm?

@aliosmanulusoy

@AdityaGudimella you wrote "Duplicate indices are handled correctly: if multiple indices reference the same location, their contributions add." for your function. I've tested this and it seems correct. However, I don't understand why :) Can you please explain? tf.scatter_nd doesn't seem to provide any guarantees when multiple indices reference the same location.

@cjf00000

@aliosmanulusoy According to #8102, it seems that tf.scatter_nd currently adds up duplicate updates
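
A quick way to check the duplicate-index behavior referenced above (TF 1.x graph mode; the values are arbitrary):

import tensorflow as tf

indices = tf.constant([[0], [0], [2]])
updates = tf.constant([1.0, 2.0, 5.0])
# Duplicate index 0 receives 1.0 + 2.0
result = tf.scatter_nd(indices, updates, shape=[4])

with tf.Session() as sess:
    print(sess.run(result))  # [3. 0. 5. 0.]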

@itsmeolivia
Contributor

Automatically closing due to lack of recent activity. Since this issue is old at this point, please reopen the issue if it still occurs when tried with the latest version of Tensorflow. Thank you.

@xcyan

xcyan commented Jul 19, 2017

The latest version of TensorFlow (1.2) still only supports scatter_add on mutable tensors (Variables).

Also, I think there is a bug in the function above: "ref" itself is not updated.

@bodokaiser

Any updates here?

@xcyan

xcyan commented Sep 12, 2017

@bodokaiser I don't think so. They are not working on this thread actively. One solution is to implement your own scatter_add() layer.

@nikonikolov

nikonikolov commented Oct 25, 2017

It would really be very useful if this were implemented.

albertz referenced this issue in Spotlight0xff/returnn Mar 2, 2018
@albertz
Contributor

albertz commented Jan 28, 2019

@altaetran @yaroslavvb @girving @mrry @rryan @aselle @itsmeolivia (or anyone who has access): Can this be reopened? It is still missing, and it would still be a useful addition, as it could potentially speed up some code (where you currently need to use tf.sparse_to_dense, tf.where, or similar instead).

@alextp
Contributor

alextp commented Apr 9, 2019

tensor_scatter_nd_add and friends are the solution to this problem, I think, so there's no need to reopen.
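
For the original example, that would look roughly like the sketch below (assuming a recent TF version where tf.tensor_scatter_nd_add is available; no Variable is involved, and duplicate indices should add their contributions as with tf.scatter_nd):

import tensorflow as tf

Z = tf.zeros([10, 3])
ind = tf.constant([[0], [1], [1], [0], [0]])
X1 = tf.constant([[1, 0.00, 1],
                  [2, 0.00, 1],
                  [3, 0.00, 1],
                  [5, 0.00, 1.1],
                  [6, 1.0, 1.8]])

# Returns a new tensor; Z itself is not mutated.
Z_updated = tf.tensor_scatter_nd_add(Z, ind, X1)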

@albertz
Contributor

albertz commented Apr 11, 2019

Ah thanks, it seems I missed that. When exactly was this added? (The documentation does not mention it.)

@alextp
Contributor

alextp commented Apr 11, 2019

@albertz this was added fairly recently :-)

@ypxie

ypxie commented Apr 23, 2019

What if I don't need gradients? Can I do scatter_update on a tensor just like on a variable?

@alextp
Contributor

alextp commented Apr 23, 2019 via email

@ypxie

ypxie commented Apr 23, 2019

@alextp Hey, thanks! But that API is in TensorFlow 1.13. Is there any workaround for TF 1.12?

@alextp
Contributor

alextp commented Apr 23, 2019 via email
