Does this work for any model with ReLU as the activation function? #34

Open

kailashg26 opened this issue Mar 3, 2022 · 4 comments

@kailashg26

Hello, I'm trying to use ActNN with MADDPG (a MARL algorithm). The model has just 3 layers with ReLU activation functions. Can you let us know whether this mechanism will still give good results with smaller models like this?

Thank you.

Link to MADDPG: https://github.com/marlbenchmark/off-policy/tree/release/offpolicy/algorithms/maddpg

@merrymercy
Member

It should work with the ReLU activation function, but we haven't tested any RL tasks. Did you run into memory issues even with this small model?

@kailashg26
Author

When I train MADDPG, almost 10 GB of memory gets used, so I wanted to try some compression techniques. It would be a great help if you could provide some insights on how to test ActNN with MADDPG.

Thanks

@merrymercy
Member

merrymercy commented Mar 10, 2022

You can follow the usage instructions and replace the layers in your model with ActNN layers. Start with a higher number of bits and check whether the lossy compression hurts the reward.
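
A minimal sketch of what that could look like for a small MLP, assuming ActNN's `QModule` wrapper, `QLinear`/`QReLU` layers, and `set_optimization_level` helper behave as described in the README; the network shape and names below are illustrative, not the actual MADDPG actor/critic:

```python
import torch.nn as nn
import actnn

# Hypothetical 3-layer actor network, similar in spirit to the MADDPG actor.
class Actor(nn.Module):
    def __init__(self, obs_dim, act_dim, hidden=64):
        super().__init__()
        self.net = nn.Sequential(
            nn.Linear(obs_dim, hidden), nn.ReLU(),
            nn.Linear(hidden, hidden), nn.ReLU(),
            nn.Linear(hidden, act_dim),
        )

    def forward(self, obs):
        return self.net(obs)

# Option 1: wrap the whole model so ActNN converts supported layers automatically.
actor = actnn.QModule(Actor(obs_dim=16, act_dim=4))

# Option 2: build the network directly from ActNN layers.
q_actor = nn.Sequential(
    actnn.QLinear(16, 64), actnn.QReLU(),
    actnn.QLinear(64, 64), actnn.QReLU(),
    actnn.QLinear(64, 4),
)

# Start with a conservative (higher-bit) optimization level and only move to
# more aggressive compression if the reward curve is unaffected; see the
# README for the exact meaning of each level.
actnn.set_optimization_level("L1")
```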

@kailashg26
Author

Thank you. I'll look into it and post here if I have any doubts.
