Skip to content
Python 2/3 compatible character encoding detector.
Branch: master
Clone or download
saintamh and dan-blanchard Typo fix (#159)
Bulgairan -> Bulgarian
Latest commit b5194bf Aug 9, 2018
Type Name Latest commit message Commit time
Failed to load latest commit information.
chardet Typo fix (#159) Aug 8, 2018
docs Prefer over (#129) Jun 5, 2017
tests Remove extra CDATA tags from Hungarian XML test data Apr 10, 2017
.gitattributes Add .gitattributes file to keep text line encodings straight. Dec 15, 2013
.travis.yml Drop support for Python 2.6 (#143) Dec 11, 2017
LICENSE Add LGPL because it should be here. Nov 28, 2012
README.rst Convert single-byte charset probers to use nested dicts for language … Jun 8, 2017
setup.cfg Include license file in the generated wheel package Oct 21, 2017 Drop support for Python 2.6 (#143) Dec 11, 2017


Chardet: The Universal Character Encoding Detector

Build status Latest version on PyPI


  • ASCII, UTF-8, UTF-16 (2 variants), UTF-32 (4 variants)
  • Big5, GB2312, EUC-TW, HZ-GB-2312, ISO-2022-CN (Traditional and Simplified Chinese)
  • EUC-JP, SHIFT_JIS, CP932, ISO-2022-JP (Japanese)
  • EUC-KR, ISO-2022-KR (Korean)
  • KOI8-R, MacCyrillic, IBM855, IBM866, ISO-8859-5, windows-1251 (Cyrillic)
  • ISO-8859-5, windows-1251 (Bulgarian)
  • ISO-8859-1, windows-1252 (Western European languages)
  • ISO-8859-7, windows-1253 (Greek)
  • ISO-8859-8, windows-1255 (Visual and Logical Hebrew)
  • TIS-620 (Thai)


Our ISO-8859-2 and windows-1250 (Hungarian) probers have been temporarily disabled until we can retrain the models.

Requires Python 2.7 or 3.4+.


Install from PyPI:

pip install chardet


For users, docs are now available at

Command-line Tool

chardet comes with a command-line script which reports on the encodings of one or more files:

% chardetect somefile someotherfile
somefile: windows-1252 with confidence 0.5
someotherfile: ascii with confidence 1.0


This is a continuation of Mark Pilgrim's excellent chardet. Previously, two versions needed to be maintained: one that supported python 2.x and one that supported python 3.x. We've recently merged with Ian Cordasco's charade fork, so now we have one coherent version that works for Python 2.7+ and 3.4+.

maintainer:Dan Blanchard
You can’t perform that action at this time.