
Memory Error For TWITTER-US Dataset #9

Closed
MortonWang opened this issue Apr 7, 2019 · 3 comments

Comments

MortonWang commented Apr 7, 2019

Hi, Tiiiger

I am very interested in your recent SGC work.

I want to apply your SGC code to semi-supervised user geolocation, one of the downstream tasks in your paper.

The GEOTEXT dataset works fine, but when I switch to TWITTER-US and TWITTER-WORLD, the code crashes. The error is listed below:

File "/home/wtl/桌面/wtlCode/geoSGC/dataProcess.py", line 96, in process_data
features = torch.FloatTensor(features.to_dense())
RuntimeError: $ Torch: not enough memory: you tried to allocate 417GB. Buy new RAM! at /pytorch/aten/src/TH/THGeneral.cpp:201

I have tried different versions of Python and torch, such as Python 2.7 + torch 1.0.1.post2 and Python 3.5 + torch 1.0.1.post2, but both failed. I also googled for a solution, but none of the methods I found worked.

Have you encountered a similar error, and how did you fix it? My machine runs Ubuntu 16.04 with 40GB of memory.

Many thanks for your help.

-Morton


Tiiiger commented Apr 8, 2019

@felixgwu


felixgwu commented Apr 8, 2019

Hi Morton,

The reason is that the TWITTER-US and TWITTER-WORLD have high-dimensional sparse features.
Converting these features to a dense tensor would require too much memory.
In our experiments, we keep the features as a sparse tensor. We use Afshin's code base written in Theano, so we didn't implement it in PyTorch; however, you may consider converting the input features to a torch.sparse.FloatTensor.
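A minimal sketch of that conversion, assuming the features are loaded as a SciPy sparse matrix (the helper name is illustrative, not part of the repository):

```python
import numpy as np
import scipy.sparse as sp
import torch

def scipy_to_torch_sparse(mat):
    """Convert a SciPy sparse matrix into a torch.sparse.FloatTensor (COO layout)."""
    mat = sp.coo_matrix(mat).astype(np.float32)
    indices = torch.from_numpy(np.vstack((mat.row, mat.col)).astype(np.int64))
    values = torch.from_numpy(mat.data)
    return torch.sparse.FloatTensor(indices, values, torch.Size(mat.shape))
```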
BTW, we don't pre-compute the propagated features on these datasets. Instead, using the associative property, we multiply the node features by the weight matrix first and then do the K-step propagation, which reduces memory usage.
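A rough illustration of that associativity trick, assuming a normalized sparse adjacency adj and sparse features x (the module and argument names are assumptions, not the code we used):

```python
import torch
import torch.nn as nn

class SparseSGC(nn.Module):
    """SGC over sparse inputs: multiply by the weights first, then propagate K times."""
    def __init__(self, in_dim, out_dim, K):
        super().__init__()
        self.W = nn.Linear(in_dim, out_dim)
        self.K = K

    def forward(self, adj, x):
        # adj: sparse (N, N) normalized adjacency; x: sparse (N, in_dim) features
        h = torch.sparse.mm(x, self.W.weight.t()) + self.W.bias  # XW: dense (N, out_dim)
        for _ in range(self.K):
            h = torch.sparse.mm(adj, h)  # S^K (XW), no dense feature matrix needed
        return h
```

The only dense tensor created here is N x out_dim, so the dense feature conversion that triggered the 417GB allocation above is never needed.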

-Felix


Tiiiger commented Apr 9, 2019

@MortonWang I am closing this since there don't seem to be any further questions. Feel free to reopen.

Tiiiger closed this as completed Apr 9, 2019
felixgwu mentioned this issue Jun 19, 2019