
Compatibility with PyTorch 0.4 #32

Closed
seyiqi opened this issue Mar 27, 2018 · 11 comments

Comments

seyiqi commented Mar 27, 2018

Thanks very much for sharing this implementation. I forked the code, and it works great on PyTorch 0.3.1. But when I ran it with 0.4.0 (master version), I got the following error (I made some minor changes, so the line numbers won't match):

    File "../networks/densenet_efficient.py", line 330, in forward
      bn_input_var = Variable(type(inputs[0])(storage).resize_(size), volatile=True)
    TypeError: Variable data has to be a tensor, but got torch.cuda.FloatStorage

It turns out the problem is this line:

    bn_input_var = Variable(type(inputs[0])(storage).resize_(size), volatile=True)

The inputs in version 0.3.1 are FloatTensors, but in 0.4.0 they are Variables.
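For context, a small sketch of the behavior change (written against a current PyTorch build; `Variable` is now kept only for backward compatibility, and `volatile=True` was replaced by the `torch.no_grad()` context):

```python
import torch
from torch.autograd import Variable

# Since PyTorch 0.4, Variable and Tensor are merged: wrapping a tensor
# in Variable is a no-op that simply returns a Tensor.
t = torch.ones(3)
v = Variable(t)
print(isinstance(v, torch.Tensor))  # True

# volatile=True no longer exists; inference is expressed with no_grad.
with torch.no_grad():
    y = v * 2
print(y.requires_grad)  # False
```

So any code that assumes `inputs[0]` is a raw tensor (and reaches for its storage directly) needs to account for the merged type.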

I am wondering what's the best way to update the code for 0.4.0?

Many thanks!

seyiqi (Author) commented Mar 27, 2018

I forgot to mention that this issue is related to this file:

https://github.com/gpleiss/efficient_densenet_pytorch/blob/master/models/densenet_efficient.py

gpleiss (Owner) commented Mar 30, 2018

Right... the way it's currently written is incompatible with PyTorch 0.4. I've mostly been testing against 0.3. I'll try to make it compatible with both versions.

gpleiss (Owner) commented Apr 16, 2018

Sorry for the slow reply. @taineleau pointed out that PyTorch 0.4 has a checkpoint feature. This will essentially do most of the work that's being done in the efficient implementation right now. I'll look into this, hopefully next week.
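For reference, a hedged sketch of what that checkpoint feature looks like applied to a DenseNet-style concat-BN-ReLU-conv bottleneck (module sizes are illustrative, not taken from this repo; `use_reentrant=False` selects the non-reentrant variant added in later PyTorch releases):

```python
import torch
from torch.utils.checkpoint import checkpoint

# Illustrative DenseNet-style bottleneck: concat -> BN -> ReLU -> 1x1 conv.
bn = torch.nn.BatchNorm2d(8)
conv = torch.nn.Conv2d(8, 4, kernel_size=1, bias=False)

def bottleneck(*features):
    x = torch.cat(features, dim=1)   # concatenate incoming feature maps
    return conv(torch.relu(bn(x)))

f1 = torch.randn(2, 4, 8, 8, requires_grad=True)
f2 = torch.randn(2, 4, 8, 8, requires_grad=True)

# checkpoint() discards the intermediate activations during the forward
# pass and recomputes them during backward, trading compute for memory.
out = checkpoint(bottleneck, f1, f2, use_reentrant=False)
out.sum().backward()
print(f1.grad.shape)  # torch.Size([2, 4, 8, 8])
```

This is essentially the same recompute-on-backward trick the efficient implementation does by hand with a shared memory buffer.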

wandering007 (Contributor) commented

@gpleiss @seyiqi I just released code runnable with PyTorch 0.4, and it passes the single-GPU case. However, it does not work in the multi-GPU case, which is weird. Help needed~

taineleau-zz (Collaborator) commented

@wandering007 I suggest you take a look at the checkpoint feature. That helper function should handle multiple GPUs nicely.

gpleiss (Owner) commented Apr 26, 2018

@taineleau @wandering007 I'm thinking we should re-write this code to just work with PyTorch 0.4, using the checkpointing feature. We can make a branch/tag of the current code for people who are still using PyTorch 0.3.

wandering007 (Contributor) commented

@taineleau Actually, what we do is basically the same as the checkpoint feature, and ours seems more efficient...

gpleiss (Owner) commented Apr 26, 2018

Sorry @wandering007 not sure what you mean. Are you saying that you currently have an implementation that's using checkpointing?

wandering007 (Contributor) commented Apr 26, 2018

@gpleiss uh... no, I didn't use the checkpoint feature. But I've looked into the source code of the checkpointing feature, and the implementations are very similar. The inefficiencies of checkpointing may include these points:

  1. Though concat-bn-relu can be checkpointed, the relu output cannot use the same shared memory unless we hack it like what you've done before.
  2. If concat-bn-relu-conv is checkpointed, the conv operations need to be recomputed during the backward pass, which is time consuming.

Besides, we still need to restore the BN statistics (running_mean and running_var), since the checkpoint feature simply ignores that they change.
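The running-stats point can be seen directly with the checkpoint API (a minimal sketch on a current PyTorch build; `use_reentrant=True` selects the original reentrant behavior): the recomputation during backward passes the same batch through the BatchNorm layer a second time, so the running statistics are updated twice unless they are saved and restored.

```python
import torch
from torch.utils.checkpoint import checkpoint

bn = torch.nn.BatchNorm2d(4)       # in training mode by default
x = torch.randn(2, 4, 8, 8, requires_grad=True)

out = checkpoint(bn, x, use_reentrant=True)
after_forward = bn.running_mean.clone()

out.sum().backward()               # re-runs bn(x) to rebuild activations
after_backward = bn.running_mean.clone()

# The recomputed forward updated the running statistics a second time.
print(torch.equal(after_forward, after_backward))  # False
```

A hand-rolled implementation can snapshot `running_mean`/`running_var` before the recompute and restore them afterwards, which the built-in checkpoint does not do.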

gpleiss (Owner) commented Apr 26, 2018

@wandering007 checkpointing takes care of it all. I profiled it, and the models are far more memory efficient than they ever were. The PyTorch team has optimized the memory usage of autograd like crazy.

Besides that, the checkpointing feature seems to be really smart. When memory isn't an issue (e.g. for big GPUs/smaller models), the checkpointing code applies less aggressive memory optimizations so the code runs faster. When memory IS an issue (e.g. for bigger models), the checkpointing code squeezes out a TON of memory savings.

gpleiss (Owner) commented Apr 27, 2018

Closed by #35

gpleiss closed this as completed Apr 27, 2018