Turkish Mirror Doesn't have Page about the referendum #24
Comments
Where do we get that page? |
This has already gone live ~18 hours ago |
Mh I wonder why that's not part of the dump. We can try patching it in. |
I accidentally looked at en wiki. |
@Kubuxu It seems the Turkish version of the page was first created on January 21, 2017, though: https://tr.wikipedia.org/w/index.php?title=2017_T%C3%BCrkiye_anayasa_de%C4%9Fi%C5%9Fikli%C4%9Fi_referandumu&offset=20170128113837&limit=500&action=history |
Hmm, interesting. |
The most important part is about the controversy surrounding the referendum, which only came after the votes were 'counted' on April 16th. The IPNS hash has the information, but the URL does not. |
Sounds like we need to talk to kiwix about their dumps - see if there are more complete or more current versions available. |
Update: we are still trying to get a more recent dump from kiwix. The Turkish snapshot is definitely outdated -- the styling was updated for all of the new dumps but hasn't been updated for the Turkish dump. |
Oh man. I hope I don't have to pin a whole new snapshot. It took 72 hours of grueling downloads. |
Note: this is off topic. If you want to discuss bandwidth needs and improvements, please redirect the conversation to https://discuss.ipfs.io. We have a lot of bandwidth optimizations coming. In the long run we intend for IPFS to be at least as fast as HTTP and in many cases faster. We are all super motivated to work on this. This is part of what motivated us to create https://github.com/ipfs/test-lab -- it lets us spin up test networks and test performance in various scenarios. |
It seems that not only does the dump not contain all articles that existed as of 2017-04, but some of the articles it does contain are versions several months older than what existed when other included articles were last edited. |
The solution here indeed seems pretty simple: download the latest version of the ZIM file. Here is the permalink; a new version was published a few days ago: download.kiwix.org/zim/wikipedia_tr_all.zim. As far as I could see the last time I checked, you were using a ZIM file from January 2017. @meyerscr The file is only 3 GB, so it should take a few hours at most. If you have the feeling the mirror is slow, better use BitTorrent. |
Was the file updated? I was downloading it from there 1.5 months ago. |
@Kubuxu Yes this is a permalink and a new version has been published a few days ago. |
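Since the permalink always points at the latest file, one way to tell whether it has changed since a previous download is to look at the HTTP `Last-Modified` header. The following is a minimal sketch, not something used in this repo: the function names are hypothetical, and it assumes the download server (or the mirror it redirects to) actually returns a `Last-Modified` header.

```python
from datetime import datetime
from email.utils import parsedate_to_datetime
from typing import Optional
from urllib.request import Request, urlopen


def parse_last_modified(header_value: str) -> datetime:
    """Parse an HTTP date such as a Last-Modified header value."""
    return parsedate_to_datetime(header_value)


def is_newer_than(header_value: str, previous: datetime) -> bool:
    """Return True if the header date is later than the previous download time."""
    return parse_last_modified(header_value) > previous


def fetch_last_modified(url: str) -> Optional[datetime]:
    """Send a HEAD request and return the Last-Modified date, if the server sends one.

    Assumption: the server answers HEAD requests and includes Last-Modified.
    """
    request = Request(url, method="HEAD")
    with urlopen(request, timeout=30) as response:
        value = response.headers.get("Last-Modified")
    return parse_last_modified(value) if value else None
```

Calling `fetch_last_modified` with the permalink (with an explicit `http://` or `https://` scheme added) and comparing the result against the date of the earlier download would show whether a re-download is needed.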
Not sure if it's that simple. Even the ZIM file from January 2017 doesn't seem to include all the pages, or we are not including all pages from it, as noted in the above comments. |
@victorbjelkholm I do not follow you. Can you be more specific please? This ticket is about the missing https://tr.wikipedia.org/wiki/2017_T%C3%BCrkiye_anayasa_de%C4%9Fi%C5%9Fikli%C4%9Fi_referandumu. You yourself identified that it was created on January 21, and the ZIM file used was published on January 13th. Case closed!? |
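The date argument above can be written as a small check. This is only an illustration of the reasoning, with a hypothetical function name; both dates are the 2017 dates mentioned in the thread. Note that the condition is necessary but not sufficient: a page created before the publication date can still be missing from a dump.

```python
from datetime import date


def page_can_be_in_dump(page_created: date, dump_published: date) -> bool:
    """A page created after the dump was published cannot be in that dump."""
    return page_created <= dump_published


# Dates from the thread: page created 2017-01-21, ZIM file published 2017-01-13.
created = date(2017, 1, 21)
published = date(2017, 1, 13)
print(page_can_be_in_dump(created, published))  # prints False
```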
Let's pull down the new ZIM and see if it's there. Fingers crossed... |
Doing it right now. |
Extraction took 1m 30s. I am using the old search graph. I might ask @magik6k to create a script in this repo for rebuilding the graph. Finishing up as we speak. |
Ticket could probably be closed. |
The Turkish referendum was the inciting incident that led Erdogan to block Wikipedia. That is because it mentioned how he stole the election. It would be excellent if you could update that single page, if at all possible, before this goes live.