Join GitHub today
GitHub is home to over 28 million developers working together to host and review code, manage projects, and build software together.Sign up
Table of Contents
WikiTeam software is a set of tools for archiving wikis. They work on MediaWiki wikis, but we want to expand to other wiki engines. As of February 2014, WikiTeam has preserved more than 10,000 stand-alone wikis, several WikiFarms, regular Wikipedia dumps and 24TB of Wikimedia Commons images.
There are thousands of wikis in the Internet. Everyday some of them are no longer publicly available and, due to lack of backups, lost forever. Millions of people download tons of media files (movies, music, books, etc) from the Internet, implementing a kind of distributed backup. Wikis, most of them under free licenses, disappear from time to time because nobody grabbed a copy of them. That is a shame that we would like to solve.
WikiTeam is the Archive Team subcommittee on wikis. It was founded and originally developed by Emilio J. Rodríguez-Posada, a Wikipedia veteran editor and amateur archivist. Many people have help sending suggestions, reporting bugs, writing documentation, providing help in the mailing list and making wiki backups. Thanks to all, especially to: Federico Leva, Alex Buie, Scott Boyd, Hydriz, Platonides, Ian !McEwen and Mike Dupont.
The last news (follow us on Twitter too!):
- 2014-02-14: Today, WikiTeam collection at Internet Archive has surpassed the 10,000 wikis milestone. We love wikis. Happy Valentine's Day!
- 2013-10-14: WikiTeam has just finished archiving all Wikimedia Commons files up to 2012 on the Internet Archive. About 24 TB of content.
- 2013-03-17: Niklas Laxström has kindly provided a dump of translatewiki.net (13th biggest wiki by number of pages).
- 2013-01-27: Wikia wikis dumps archived. About 34,000 wikis have dumps, one tenth of existing Wikia wikis. Wikia has now restricted dump requests to "autoconfirmed" users.
You can do a lot of things:
- Read our FAQ
- Download our scripts and software, read our Tutorial and backup your favorite wiki. Send us a copy of the backup if you can!
- Download some backups of wikis. Additionally, there are more backups at Internet Archive.
- Report a bug or request a feature
- Use the backups to research how wiki communities evolve
- Subscribe to the mailing lists: general discussion and developer discussion
- Follow us on Twitter! @_WikiTeam
Are you a researcher? If you have conducted any research about wikis using our software or the backups we made available, we would be glad to hear from your experience.
- ARCHIVE TEAM: A Distributed Preservation of Service Attack (Jason Scott at DEFCON 19)
- Backups de wikis con WikiTeam (!IberOgre, 2011-06-04) (webcite)
- WikiTeam: un repositorio de wikis (Barrapunto, 2011-05-10) (webcite)
This project contains a lot of material. Be careful with the licenses.
- The source code is GPLv3
- The documentation in this site is Creative Commons 3.0 BY-SA
- Every backup of every wiki has its own license. Visit the wiki and search for the license of texts and images. Read more in our FAQ.
If you want to contact with the WikiTeam community, you can do it:
- For bug reports, new features requests, notices about new wikis to backup, visit our issues tracker.
- General discussion about WikiTeam tools and help for new users, visit our mailing list
- Developers discussion about bugs, patches, features, visit the developers' mailing list
- There is a social IRC channel in Efnet: irc://efnet/wikiteam (webchat access)