Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

CUDA OUT OF MEMORY #2

Open
yangkang2318 opened this issue Oct 11, 2021 · 2 comments
Open

CUDA OUT OF MEMORY #2

yangkang2318 opened this issue Oct 11, 2021 · 2 comments

Comments

@yangkang2318
Copy link

yangkang2318 commented Oct 11, 2021

Hello,
When I run "roberta_mlm_pretrain.sh",it always occur the error that CUDA out of memory(I have 4 TITAN RTX GPUs,which means the memory is enough).Is it because the data file all_text.txt is too big?
PS:The terminal show the info:INFO - main - Creating features from dataset file at ../bio_script/tasks/unlabeled/all_text.txt,and then it occurs the error.
Thank you in advance.

@yangkang2318
Copy link
Author

Maybe I find the reason that the size of unlabeled data we downloaded from "ftp.ncbi.nlm.nih.gov/pubmed/baseline" is 19G.According to your paper,should I sample the unlabeled data and then convert it to weak labeled data?
Thanks!

@HMJiangGatech
Copy link
Contributor

Yes, I did sampling at the beginning (before convert it to weak labeled data?). It was too large for our Machine.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants