Skip to content
Switch branches/tags

KH Coder: for Quantitative Content Analysis or Text Mining




KH Coder is a free software for quantitative content analysis or text mining. It is also utilized for computational linguistics. You can analyze Catalan, Chinese (simplified), Dutch, English, French, German, Italian, Japanese, Korean, Portuguese, Russian, Slovenian and Spanish text with KH Coder.

Official book (in Japanese):

How to run source code of KH Coder on Windows

  1. Download & install Perl:
  2. (Fork and) clone this repository
  3. Download released *.exe file (Winzip self-extractor) of KH Coder 3
  4. Unzip the downloaded file into the clone directory
  5. Open command prompt window and go to the clone directory, type "perl", and hit "Enter" key

If you get errors like "Can't locate in @INC", you need to install Perl module called "Jcode". To install it, type "cpanm Jcode" and hit "Enter" key on your command prompt window.

Above procedure is for people who want to develop or modify KH Coder. If you want to just try or use KH Coder, you don't need Perl. Please just download and unzip released *.exe file, then double click extracted "kh_coder.exe".

On Linux or other Un*x like system

You need:

  • MySQL
  • Perl (and some Perl modules)
  • R (and some R packages)
  • Morphological Analysis and POS Tagging software
    • ChaSen or MeCab for analyzing Japanese text
    • FreeLing or Stanford POS Tagger for analyzing English text
    • FreeLing for analyzing Catalan, French, German, Italian, Portuguese, Russian or Spanish text
    • MeCab and HanDic for analyzing Korean text
    • Stanford Word Segmenter and Stanford POS Tagger for analyzing Chinese text

See issue #91 for more details.


GNU GPL version 2 or later