Skip to content

Latest commit

 

History

History
49 lines (37 loc) · 5.55 KB

IMEs.md

File metadata and controls

49 lines (37 loc) · 5.55 KB

Chinese IME (Input Method Editor)

Explanation

There’s two main methods to write chinese. A phonetic-based method and a shape-based method. A pronunciation-based, used in PRC, requires no extra effort to learn alongside the learning the pinyin system. Stroke-based methods require a few weeks to learn the rules and practice, but in exchange: are Chinese-language-variant neutral, are easy to type characters without prior knowledge of their reading (eg. an image with no text), reinforces character writing memory (tends to atrophy due to typing), and may support character variants depending on the method. Both are roughly equally fast when you reading/production are relatively quick, and you can set a shortcut to switch between traditional and simplified.

Types of Stroke-based Systems

  • Cangjie (倉頡)- Unintuitive codes, complicated rules, traditional-focused but simplified support was added later. Used in Hong Kong and Taiwan. Typically three to five keypresses per characters

  • Sucheng (速成) or Quick/Simplified Cangjie - Same as Cangjie but with the only the first and last codes. Likely the most used input method by natives in Hong Kong and Taiwan, especially on phones.

  • Zhengma (郑码) - Created post-Cangjie and designed to be logical and consistent. Designed to support both simplified and traditional, and even supports character variant. Typically four keypresses per character.

  • Wubi (五笔字型) - Shares a history Zhengma, designed to be more logical and consistent than Cangjie. Simplified-focused but also supports traditional. Typically four keypresses per character.

  • Boshiamy - Proprietary. Supposedly the fastest method.

  • Dayi - Similar to Cangjie but shift key usage to expand the number keys to support more components.

  • Wubihua - Incomplete Not to be confused with Wubi.

  • Sijiao - Incomplete

I recommend Wubi if for general use or Zhengma. More detailed descriptions and Advice by Gareth can be found here.

Software

Phonetic-based

Typically input methods do not support tone discrimination on default, but some methods do have the option to enable it. For Mandarin there are Pinyin and Zhuyin methods. For Cantonese, some methods support Jyutping, Yale, or both.

  • System installed [ZH]
    • Windows - For Windows 7, you want to find “Chinese Simplified (PRC) → Microsoft Pinyin New Experience Input St.” You can switch between Traditional and Simplified, and even assign that to a shortcut key. On Windows 10, you can simply add Pinyin via Settings -> Time & Language -> Region & Language -> Add a language.
    • Mac
    • Linux
  • Rime [CA][ZH] - Multi-OS Support. Allows for installed dictionaries.
  • Google Input [ZH] - Supports both simplified and traditional. I imagine it has better character recognition but need an opinion of someone who has used this.
  • CAP [CA] - Dead? Multi-OS support, Yale or Similar to Rime in terms of dictionary.
  • CPIME Jyutping and Yale phonetic input for Cantonese.
  • FHL Taigi-Hakka IME

Shape-based

You can use Zdic to lookup codes for Cangjie, Wubi, and Zhengma.

  • System installed Windows supports Cangjie, Sucheng, and Zhengma (pre-Win8).
  • Rime [倉頡][五笔][郑码] - Due to its support of custom dictionaries. You can add Wubi and Zhengma
  • Sogou - Has a modes for Mandarin Pinyin, Wubi, and both input and displays the Wubi codes. Though it supports continuous typing, typos will match to nearest character.
  • FreeWB [五笔] - Use to be the best Wubi input method. Not sure about now, but if you don’t trust Sougou as a company here is your alternative.

Linux

Unlike the Apple and Windows systems where it’s a two part process, Linux distributions separate the input system into three parts: keymapping, input method frameworks, and IME.

Keymapping (xev and xmodmap) remaps keyboard signals to other keys or commands (so quite hardware specific), useful for when your keyboard is a different layout from a standard US keyboard for example.

An input method framework is the intermediary for input method editors (IMEs), of which there are two popular ones for linux: iBus, fcitx, etc.

Finally the IME is a specific implementation of a keyboard for specific to the language. You can have several IMEs for a single language.

I’d recommend installing fcitx and installing either the Google Pinyin IME or Rime.io IME. iBus and fcitx are setup so that you swap between a main and secondary IME. You can toggle between two of them with a keyboard shortcut. When you’ve toggled on the secondary IME, you can the cycle between all the extra IMEs that you have installed (if you have more than two).