Raw training data #167
Comments
thank you |
I have serious problems downloading the files. I have tried to download 92c658d7.zip several times, and each time the download gets interrupted after a few minutes. I tried with both Firefox and Chrome, but neither seems to be able to download it. Neither browser is able to resume the download, so I must start from the beginning after each failure. The same issue happened earlier with the other files, but I was able to download them after retrying a few times. |
Perhaps FTP would be better? There are nice freeware clients that will continue broken downloads and have various options. |
FTP is a far, far worse option as it's much less reliable through firewalls. HTTP supports resuming just as well. There's a reason FTP is not used anywhere any more. Unfortunately I can't really do much here. They are on Google Cloud storage buckets. It's possible downloading them with gsutil allows resuming to work, but I'm not sure. I can try splitting up the data into smaller chunks, maybe. |
You can use the |
Well, one can try Firefox Add-ons. |
Firefox (and Chrome and every browser) can resume HTTP downloads themselves. There must be something with Google's Cloud Platform that breaks HTTP resumes if it doesn't work. |
https://cloud.google.com/compute/docs/instances/transfer-files On Windows workstations, use the WinSCP client to manage files on your instances through a graphical file browser interface.
|
I have a little question. I am running "dump_supervised all.sgf train.txt" on all.sgf that has 900K games. |
It passes over the SGF multiple times, each time writing out one out of every x positions in a random rotation, so yes. |
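To make the description above concrete, here is a rough Python sketch of multi-pass subsampling with a random board symmetry. It is not the actual `dump_supervised` code: the function names, the use of a 1-in-x random draw, and the choice of all 8 rotations/reflections are assumptions for illustration only.

```python
import random


def rotate90(board):
    """Rotate a square board (list of rows) 90 degrees clockwise."""
    return [list(row) for row in zip(*board[::-1])]


def symmetry(board, k):
    """Apply symmetry k in 0..7: optional mirror (k >= 4), then k % 4 rotations."""
    out = [list(row) for row in board]
    if k >= 4:
        out = [row[::-1] for row in out]
    for _ in range(k % 4):
        out = rotate90(out)
    return out


def sample_positions(positions, every_x, passes, rng):
    """Yield roughly 1/every_x of the positions per pass, each randomly transformed.

    Assumption: "one out of every x" is modelled as an independent random draw.
    """
    for _ in range(passes):
        for board in positions:
            if rng.randrange(every_x) == 0:
                yield symmetry(board, rng.randrange(8))


if __name__ == "__main__":
    toy_positions = [[[1, 2], [3, 4]]] * 10  # stand-in for parsed SGF positions
    picked = list(sample_positions(toy_positions, every_x=4, passes=3, rng=random.Random(0)))
    print(len(picked), "positions written")
```

Because every pass draws a fresh subset and a fresh symmetry, the same game contributes different samples each time, which is why the output can contain more than one sample per game.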
Over several attempts, I have consistently failed to download these files. The server doesn't allow me to resume, and the connection is really slow and unstable. Would it be a bad idea to create BitTorrent seeds for these? |
Have you tried "wget -c"? (I haven't tried it because I'm able to download the files in one shot.) |
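For anyone without wget, the same idea (ask the server only for the bytes you are missing) can be sketched in Python with an HTTP `Range` header. This is a minimal sketch, not a tested recipe for this particular server: `resume_download` and `range_header` are hypothetical helper names, and whether the host honours range requests is exactly what is in question in this thread.

```python
import os
import urllib.error
import urllib.request


def range_header(bytes_on_disk):
    """Build a Range header requesting everything after the bytes already saved."""
    return {"Range": f"bytes={bytes_on_disk}-"} if bytes_on_disk > 0 else {}


def resume_download(url, dest, chunk_size=1 << 20):
    """Download url to dest, continuing a partial file if the server allows it."""
    have = os.path.getsize(dest) if os.path.exists(dest) else 0
    request = urllib.request.Request(url, headers=range_header(have))
    try:
        response = urllib.request.urlopen(request)
    except urllib.error.HTTPError as err:
        if err.code == 416:  # requested range not satisfiable: nothing left to fetch
            return
        raise
    with response:
        # 206 means the range was honoured; 200 means the server is sending
        # the whole file again, so the partial file must be overwritten.
        mode = "ab" if response.status == 206 else "wb"
        with open(dest, mode) as out:
            while True:
                chunk = response.read(chunk_size)
                if not chunk:
                    break
                out.write(chunk)
```

Calling `resume_download(url, "train.zip")` again after an interruption continues from the current file size, provided the server answers with status 206.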
I'm going to download them and create a torrent as I have around 200Mbps to ogs. |
@killerducky Currently the download is ongoing... But it will take at least a week per file, so for the answer to be clear it will take some time. |
Here's the magnet link |
@zediir Thank you for sharing. The bittorrent client is trying to download it, hopefully it will be faster. |
@isty2e I'm also uploading it to my seedbox so that might make downloading faster. It'll take a while though as I'll have to upload 156GB. |
@isty2e Managed to get the seedbox working. Your download speed should be somewhat faster now. |
@zediir Indeed it is! It was like 35KB/s before and it is much faster now. |
I'm going to keep uploading the training data to the seedbox and posting (by editing this post) an updated magnet link to a new torrent whenever @gcp uploads a new batch of training data. |
Unfortunately seedbox.io, which I use, only offers servers in the Netherlands, Romania and France. Though @isty2e seems to be downloading much faster (about 2MB/s I think) now that they are downloading from the seedbox. |
@zediir I downloaded it at 12MB/s (from the Netherlands) |
Ah that was you :). 12.5MB/s (100Mbps) is the limit for that particular seedbox. |
Perhaps @isty2e sets up his own Chinese seedbox to mirror off yours and then downloads from that one once it's all there. :) (Google tells me there are many Chinese seedbox companies.) |
Well I am not in China, and my network connection and bandwidth are more than fine. They are exceptionally bad for OGS strangely. Anyway, thanks to @zediir, the training data are all downloaded. |
@gcp Will you upload training data this week? |
leela.online-go.com hasn't worked for a day. Gives error 522 "Connection timed out" for me. |
@Ttl: There seems to be an issue going on at the provider, I've opened up a support ticket to get the issue resolved. |
Any progress? The old networks are also offline as they are hosted on the storage server, and that also means I can't clean up those on the live server... |
I filed a follow up ticket with them yesterday after 24 hours, still haven't heard anything back. Quite frustrating. Needless to say we'll be moving to another provider, but we still need to get access to those files. I'll get another server up today and get you the details so you can at least clean stuff off the live server. For future people googling reliability and support, be warned and stay away from delimiter.com |
I started re-uploading old networks from my local backup. This is about 108G and it will take me about 2 days. Re-uploading the >1TB of training data... well, you do the math. I will start uploading the new data and hope the old server eventually pops back up long enough so we can transfer from there. I think we also had the 9x9 networks, the SGF files for matches and self-play on there (just mentioning this for my own reference). Thanks for the help @anoek. The new server seems to be a lot more responsive too, so yay. |
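Doing the math from the figures above, as a back-of-the-envelope sketch: it assumes the upload rate stays at 108 GB per 2 days and takes 1 TB as 1000 GB, so the real number would be higher for anything over 1 TB.

```python
def upload_days(size_gb, ref_size_gb=108.0, ref_days=2.0):
    """Estimate upload time in days, scaling linearly from a reference upload."""
    rate_gb_per_day = ref_size_gb / ref_days  # 54 GB/day for the figures quoted
    return size_gb / rate_gb_per_day


if __name__ == "__main__":
    # 1000 GB / 54 GB per day is a bit over 18.5 days
    print(f"{upload_days(1000):.1f} days")
```

So at that rate the training data alone would take roughly two and a half weeks or more to re-upload.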
Yeah I noticed it was a lot faster as well, so at least that's a win. Terribly sorry about the problems all. I've never had a service provider just drop a server on the floor and not even respond to any tickets before. I'll let you know if they do manage to get it back up before you finish uploading everything so we can finish with a server to server transfer. |
Everything should be back up except for older training data. |
The file train_c9fb22c7.zip uploaded on May 3 is 7.4 GB in size. Are we going to get new training data this week? |
When someone has a chance, could we get new Tensorboard logs posted? Right now there aren't any LZ Tensorboard logs available for download. |
I'll redump and reupload this but that will take a while. Possibly the second dump was missing the part of the data that went out of the training window.
Uploading should finish in about 10 minutes. |
I have most of the old files. I can upload them if necessary.
Thanks a lot. |
Up in https://sjeng.org/zero |
This file was re-uploaded now. |
Can we also have an update of ac9bcd63? Judging from the file size, the one uploaded on August 1 probably does not contain all of the self-play games. |
Ok, will check. |
train_a4f5d99a.zip 1433662354 2019-Aug-19 09:06 |
Redumped and re-uploaded ac9bcd63. |
many thanks |
It looks like the self-play games from v241 (466fa23a) are missing in the update today? |
@gcp When will it be updated? It's been a while since it was last updated. |
@anoek The server seems to have been down for a few days, can you investigate? |
@gcp fixed. For some reason the VM was powered off, I'll investigate further. I guess I should have had a heartbeat check on that server. |
Thanks! Will try to upload missing data overnight. |
I just finished re-uploading all the files that were lost in one of the server moves/outages. So all the training data is complete now. It will stay up as long as @anoek keeps hosting it. |
Do these training data also include the output of the value function for a board position? |
I don't think so. Should be just input position (8 moves), color of the current player, outcome of the game and visits distribution. |
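As a way to picture what one training sample carries according to the answer above, here is a small Python sketch. It is not the on-disk file format: the class name, field names, and encodings are hypothetical, and reading "8 moves" as the last 8 board states is an assumption. Note there is no field for the network's value output.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class TrainingPosition:
    """One sample as described in the thread (hypothetical layout)."""
    history: List[str]   # the 8 most recent board states (encoding left abstract)
    to_move: str         # colour of the current player, "B" or "W"
    outcome: int         # final game result for this sample, e.g. +1 or -1
    visits: List[int]    # search visit count per candidate move

    def policy_target(self) -> List[float]:
        """Normalise visit counts into a probability distribution over moves."""
        total = sum(self.visits)
        if total == 0:
            raise ValueError("no visits recorded for this position")
        return [v / total for v in self.visits]


if __name__ == "__main__":
    sample = TrainingPosition(history=["..."] * 8, to_move="B", outcome=1, visits=[1, 3])
    print(sample.policy_target())
```

The game outcome is the only value-like signal in such a sample; a per-position value estimate would have to be recomputed by running a network on the position.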
http://leela.online-go.com/training/
License: Public Domain
Hosting sponsored by OGS.