Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

loading SOF data #6

Closed
hojjatkarami opened this issue Nov 1, 2022 · 1 comment
Closed

loading SOF data #6

hojjatkarami opened this issue Nov 1, 2022 · 1 comment

Comments

@hojjatkarami
Copy link

Hi,
I cannot load stackoverflow data with pickle module:

UnpicklingError: A load persistent id instruction was encountered,
but no persistent_load function was specified.

in the notebook you referred to "train_manifold_format.pkl" but it does not exist in the data folder

Thank you

@BIRD-TAO
Copy link
Owner

BIRD-TAO commented Nov 2, 2022

pls read the Readme.md:
We provide the MOOC and Stack Overflow datasets in ./data/

And Retweet dataset can be downloaded from Google Drive. Download it and copy it into ./data/retweet/

To preprocess the data, run the following commands

python ./scripts/generate_mooc_data.py
python ./scripts/generate_stackoverflow_data.py
python ./scripts/generate_retweet_data.py

@BIRD-TAO BIRD-TAO closed this as completed Nov 2, 2022
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants