Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

When I run the preprocess/PA.py,I encounter the ValueError: NULL pointer access error #6

Closed
lh123cha opened this issue Dec 11, 2023 · 7 comments

Comments

@lh123cha
Copy link

lh123cha commented Dec 11, 2023

I have already install the dc_dgl successfully.But I encounter this error:

nullptraccess

@lh123cha lh123cha changed the title When I run 跳河 When I run the preprocess/PA.py,I encounter the ValueError: NULL pointer access error Dec 11, 2023
@lh123cha
Copy link
Author

And I how can I get the coo.txt file like uk_coo.txt from gnnlab. I can only get samgraph/uk-2006-05/coo.bin and samgraph/twitter/coo.bin from gnnlab.

屏幕截图 2023-12-11 204532
How can I get the coo.txt file used in preprocess/UU_UK_TW.py

@initzhang
Copy link
Owner

Hi @lh123cha , thanks for your interest in our work!

I have already install the dc_dgl successfully.But I encounter this error:

It seems that the problem is in OGB loading, could you try the following code and check the results?

from ogb.nodeproppred import DglNodePropPredDataset
dataset = DglNodePropPredDataset(name='ogbn-papers100M')

And I how can I get the coo.txt file like uk_coo.txt from gnnlab. I can only get samgraph/uk-2006-05/coo.bin and samgraph/twitter/coo.bin from gnnlab.

You can modify the webgraph utility in gnnlab to (de)serialise the bin file to coo file. Specifically, you can change the line of code and print/save to stdout/file instead.

@lh123cha
Copy link
Author

Thank you for the answer!But when I run run_allocate.py on uk dataset
CUDA_VISIBLE_DEVICES=0 python run_allocate.py --dataset uk --fanouts 15,15,15 --fake-dim 100 --total-budget 1
The error occurs

屏幕截图 2023-12-13 121318

It seems like the np.polyfit() package failed on uk dataset.But I can run successfully on twitter dataset.Is there something wrong in uk dataset?

@initzhang
Copy link
Owner

I haven't encountered such problem before, but according to the error message, it seems that the program fails to fitting the curve due to large variance in the system running time? Maybe you can try the following:

(1) avoid sharing the GPU/Machine with other users, because the contention on GPU/PCIe leads to unstable system running time, which could cause the failure for np.polyfit().
(2) set pre_batches and pre_epochs to larger values (such as 1000 & 10), this can mitigate the instability of system running time.

@lh123cha
Copy link
Author

lh123cha commented Dec 13, 2023

Thank for the answer!I have sloved the problem.The problem is the total_buget is smaller than nfeat_buget that cause the nfeat_stats's length is one.And it is wrong to fitting the curve with only one point.So I set --total-buget to a larger number like 10.

@initzhang
Copy link
Owner

Indeed this is a corner case not covered by the current Allocator lol, I will consider add sanity check for this part maybe later. Thank you for reporting the case!

@initzhang
Copy link
Owner

Hi, I have updated the script to fix the problem and I will close this issue. Please feel free to reopen it or open a new issue If you have any further problems, thanks.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants