Skip to content

Character Set Support

John Gardner edited this page Dec 24, 2019 · 5 revisions

Supported Character Sets

We should decide which character sets to support for each/all plugins. Other character sets could be specified outside of this set and they would be ignored if not understood by the editor. This is simply a list of character sets that all plugins should attempt to support at a minimum.

If an implementation wishes to define a new character set, it is recommended that it consults outside authorities such as the WHATWG Encoding Standard and the IANA Character Set registry. Using a special prefix such as x- is neither required nor recommended, as unsupported values are simply ignored.

Setting character sets in text editors


Vim character encodings can be set with the filencoding command (see documentation).

Setting UTF-8

set fileencoding=utf-8

Setting Latin1

set fileencoding=latin1


As noted here, Emacs character encodings and end-of-line formats are both set using the set-buffer-file-coding-system command.

Setting UTF-8

(set-buffer-file-coding-system 'utf-8-unix)
(set-buffer-file-coding-system 'utf-8-dos)
(set-buffer-file-coding-system 'utf-8-mac)

Setting Latin1

(set-buffer-file-coding-system 'iso-latin-1-unix)
(set-buffer-file-coding-system 'iso-latin-1-dos)
(set-buffer-file-coding-system 'iso-latin-1-mac)
You can’t perform that action at this time.