cannot find openwebtext.tar.xz #18
Comments
The corpus can be downloaded from this Google Drive folder: https://drive.google.com/drive/folders/1IaD_SIIB-K3Sij_-JjWoPy_UrWqQRdjx
thanks a lot
gone?
I think the GDrive link came from here: https://skylion007.github.io/OpenWebTextCorpus/ and that page now points to the OpenWebText data repo on the Model Hub: https://huggingface.co/datasets/Skylion007/openwebtext :)
Thank you. It’s not clear from this page how to download it.
(Sent by email in reply to Stefan Schweter's comment of Wednesday, 17 April 2024 at 22:39.)
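Since the Hugging Face page does not spell out a download step, here is a minimal sketch of one way to read the corpus, assuming the third-party `datasets` package is installed (`pip install datasets`). The helper names `dataset_url` and `iter_documents` are hypothetical, and the `"text"` column name is an assumption about this dataset's schema. Depending on the installed `datasets` version, loading this repo may need extra options.

```python
"""Sketch: read OpenWebText from the Hugging Face Hub (assumes `pip install datasets`)."""

REPO_ID = "Skylion007/openwebtext"  # dataset repo linked in the thread


def dataset_url(repo_id: str = REPO_ID) -> str:
    """Build the Hub page URL for a dataset repo."""
    return f"https://huggingface.co/datasets/{repo_id}"


def iter_documents(repo_id: str = REPO_ID, limit: int = 3):
    """Yield the first `limit` documents, streaming instead of downloading everything."""
    # Imported lazily so this module can be imported without the dependency.
    from datasets import load_dataset

    stream = load_dataset(repo_id, split="train", streaming=True)
    for i, row in enumerate(stream):
        if i >= limit:
            break
        yield row["text"]  # assumption: documents are stored in a "text" column


if __name__ == "__main__":
    print("Reading from", dataset_url())
    for doc in iter_documents():
        print(doc[:200].replace("\n", " "), "...")
```

Dropping `streaming=True` should download and cache the full train split instead. Note that this yields the corpus text and does not necessarily reproduce the `openwebtext.tar.xz` archive the issue asks about.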
I could not find a URL to download openwebtext.tar.xz. Could you share the URL?