StackOverflow to Kiwix: Script to create a Zim of StackExchange websites
JavaScript Python HTML CSS Makefile
Fetching latest commit…
Cannot retrieve the latest commit at this time.
Permalink
Failed to load latest commit information.
static
templates
.gitignore
Makefile
README.md
reference.txt
requirements.txt
sotoki.py

README.md

Sotoki

Stack Overflow to Kiwix

The goal of this project is to create a suite of tools to create zim files required by kiwix reader to make available Stack Overflow offline (without access to Internet).

Getting started

Download the last stackexchange dump using BitTorrent (only "superuser.com.7z" is necessary) and put it in the Sotoki source code root.

Clone this repository:

git clone https://github.com/kiwix/sotoki.git

Install non python dependencies:

sudo apt-get install jpegoptim pngquant gifsicle advancecomp python-pip python-virtualenv python-dev libxml2-dev libxslt1-dev libbz2-dev p7zip-full

Create a virtual environment for python:

virtualenv --with-system-site-packages venv

Activate the virtual enviroment:

source venv/bin/activate

Install the python requirements:

pip install -r requirements.txt

Copy superuser.com.7z and unzip it to work/dump/:

mkdir -p work/dump/
cp superuser.com.7z work/dump/
cd work/dump
7z e superuser.com.7z
rename 'y/A-Z/a-z/' *

Go back at the sotoki root and run the pipeline:

python sotoki.py run [url of stackechange website] [publisher] [--directory (optional)]

It's use https://github.com/testlnord/sedumpy