Tested Software version 8.07.3 on Darwin Full results available at ucs-detect repository path data/macos-zoc-8.07.3.yaml
The best wide unicode table version for zoc appears to be 15.0.0, this is from a summary of the following results:
version | n_errors | n_total | pct_success |
---|---|---|---|
'5.1.0' |
|
|
100.0% |
'5.2.0' |
|
|
79.6% |
'6.0.0' |
|
|
23.1% |
'9.0.0' |
|
|
99.5% |
'10.0.0' |
|
|
99.2% |
'11.0.0' |
|
|
100.0% |
'12.0.0' |
|
|
80.6% |
'12.1.0' |
|
|
100.0% |
'13.0.0' |
|
|
99.6% |
'14.0.0' |
|
|
95.1% |
'15.0.0' |
|
|
93.3% |
'15.1.0' |
|
|
20.0% |
Sequence of a WIDE character from Unicode Version 15.0.0, from midpoint of alignment failure records:
Codepoint | Python | Category | wcwidth | Name |
---|---|---|---|---|
U+0001F6DC | '\U0001f6dc' | So |
|
WIRELESS |
Total codepoints: 1
Shell test using printf(1),
'|'
should align in output:$ printf "\xf0\x9f\x9b\x9c|\\n12|\\n" 🛜| 12|
- python wcwidth.wcswidth() measures width 2, while zoc measures width 1.
The best Emoji ZWJ table version for zoc appears to be None, this is from a summary of the following results:
version | n_errors | n_total | pct_success |
---|---|---|---|
'2.0' |
|
|
0.0% |
'4.0' |
|
|
0.0% |
'5.0' |
|
|
0.0% |
'11.0' |
|
|
0.0% |
'12.0' |
|
|
0.0% |
'12.1' |
|
|
0.0% |
'13.0' |
|
|
0.0% |
'13.1' |
|
|
0.0% |
'14.0' |
|
|
0.0% |
'15.0' |
|
|
0.0% |
'15.1' |
|
|
0.0% |
Sequence of an Emoji ZWJ Sequence from Emoji Version 15.1, from midpoint of alignment failure records:
Codepoint | Python | Category | wcwidth | Name |
---|---|---|---|---|
U+0001F9D1 | '\U0001f9d1' | So |
|
ADULT |
U+200D | '\u200d' | Cf |
|
ZERO WIDTH JOINER |
U+0001F9BC | '\U0001f9bc' | So |
|
MOTORIZED WHEELCHAIR |
U+200D | '\u200d' | Cf |
|
ZERO WIDTH JOINER |
U+27A1 | '\u27a1' | So |
|
BLACK RIGHTWARDS ARROW |
U+FE0F | '\ufe0f' | Mn |
|
VARIATION SELECTOR-16 |
Total codepoints: 6
Shell test using printf(1),
'|'
should align in output:$ printf "\xf0\x9f\xa7\x91\xe2\x80\x8d\xf0\x9f\xa6\xbc\xe2\x80\x8d\xe2\x9e\xa1\xef\xb8\x8f|\\n12|\\n" 🧑🦼➡️| 12|
- python wcwidth.wcswidth() measures width 2, while zoc measures width 8.
Emoji VS-16 results for zoc is 0 errors out of 100 total codepoints tested, 100.0% success. All codepoint combinations with Variation Selector-16 tested were successful.
The following 7 languages were tested with 100% success:
Adyghe, Idoma, Kabardian, Tamazight, Central Atlas (Tifinagh), Tamazight, Standard Morocan, Vai, Yukaghir, Northern.
The following 91 languages are not fully supported:
lang | n_errors | n_total | pct_success |
---|---|---|---|
Javanese (Javanese) |
|
|
0.0% |
Nuosu |
|
|
0.0% |
Cherokee (cased) |
|
|
1.4% |
Tai Dam |
|
|
2.2% |
Maldivian |
|
|
2.9% |
Tamil |
|
|
3.1% |
Tamil (Sri Lanka) |
|
|
3.1% |
Burmese |
|
|
3.7% |
Mon |
|
|
4.2% |
Shan |
|
|
4.4% |
Dzongkha |
|
|
4.7% |
Gujarati |
|
|
5.7% |
Tibetan, Central |
|
|
5.7% |
Malayalam |
|
|
6.2% |
Tamang, Eastern |
|
|
6.7% |
Kannada |
|
|
6.7% |
Khün |
|
|
6.8% |
Khmer, Central |
|
|
6.8% |
Bengali |
|
|
7.4% |
Chakma |
|
|
7.4% |
Telugu |
|
|
9.1% |
Nepali |
|
|
9.7% |
Sanskrit |
|
|
11.2% |
Sanskrit (Grantha) |
|
|
11.5% |
Marathi |
|
|
12.4% |
Hindi |
|
|
13.2% |
Sinhala |
|
|
13.3% |
Panjabi, Eastern |
|
|
13.5% |
Bhojpuri |
|
|
14.4% |
Thai (2) |
|
|
14.7% |
Maithili |
|
|
18.4% |
Thai |
|
|
19.9% |
Magahi |
|
|
22.2% |
Vietnamese |
|
|
24.2% |
Tagalog (Tagalog) |
|
|
32.3% |
Lao |
|
|
36.6% |
Lingala (tones) |
|
|
40.8% |
Vietnamese (Han nom) |
|
|
46.2% |
Pular (Adlam) |
|
|
52.1% |
Yiddish, Eastern |
|
|
52.9% |
Bamun |
|
|
56.1% |
Orok |
|
|
60.6% |
Tem |
|
|
61.2% |
Nanai |
|
|
68.6% |
Evenki |
|
|
70.3% |
Yaneshaʼ |
|
|
71.6% |
Ticuna |
|
|
71.7% |
Amarakaeri |
|
|
72.3% |
South Azerbaijani |
|
|
72.4% |
Yoruba |
|
|
77.0% |
Chickasaw |
|
|
78.0% |
Siona |
|
|
81.7% |
Fur |
|
|
87.6% |
Chinantec, Chiltepec |
|
|
87.7% |
Gumuz |
|
|
89.7% |
Bora |
|
|
89.9% |
Mòoré |
|
|
90.8% |
Mongolian, Halh (Mongolian) |
|
|
90.9% |
Lamnso' |
|
|
91.2% |
Navajo |
|
|
91.4% |
Tamazight, Central Atlas |
|
|
91.5% |
Gilyak |
|
|
91.8% |
Ditammari |
|
|
92.6% |
Assyrian Neo-Aramaic |
|
|
93.6% |
Farsi, Western |
|
|
94.4% |
Otomi, Mezquital |
|
|
95.4% |
Veps |
|
|
95.5% |
Waama |
|
|
96.2% |
Dinka, Northeastern |
|
|
96.3% |
Dari |
|
|
96.5% |
Éwé |
|
|
97.5% |
Baatonum |
|
|
97.6% |
Urdu (2) |
|
|
97.7% |
Urdu |
|
|
97.8% |
Uduk |
|
|
97.8% |
Mazahua Central |
|
|
97.8% |
Secoya |
|
|
97.9% |
Gen |
|
|
98.0% |
Picard |
|
|
98.2% |
Mixtec, Metlatónoc |
|
|
98.2% |
Arabic, Standard |
|
|
98.5% |
Ga |
|
|
98.7% |
Panjabi, Western |
|
|
99.1% |
Dangme |
|
|
99.2% |
Dagaare, Southern |
|
|
99.3% |
Serer-Sine |
|
|
99.6% |
Fon |
|
|
99.6% |
Aja |
|
|
99.7% |
Pashto, Northern |
|
|
99.8% |
Dendi |
|
|
99.9% |
Seraiki |
|
|
99.9% |
Sequence of language Javanese (Javanese) from midpoint of alignment failure records:
Codepoint | Python | Category | wcwidth | Name |
---|---|---|---|---|
U+A9CB | '\ua9cb' | Po |
|
JAVANESE PADA ADEG ADEG |
U+A9B1 | '\ua9b1' | Lo |
|
JAVANESE LETTER SA |
U+A9A7 | '\ua9a7' | Lo |
|
JAVANESE LETTER BA |
U+A9BC | '\ua9bc' | Mn |
|
JAVANESE VOWEL SIGN PEPET |
U+A9A4 | '\ua9a4' | Lo |
|
JAVANESE LETTER NA |
Total codepoints: 5
Shell test using printf(1),
'|'
should align in output:$ printf "\xea\xa7\x8b\xea\xa6\xb1\xea\xa6\xa7\xea\xa6\xbc\xea\xa6\xa4|\\n1234|\\n" ꧋ꦱꦧꦼꦤ| 1234|
- python wcwidth.wcswidth() measures width 4, while zoc measures width 10.
Sequence of language Nuosu from midpoint of alignment failure records:
Codepoint | Python | Category | wcwidth | Name |
---|---|---|---|---|
U+300A | '\u300a' | Ps |
|
LEFT DOUBLE ANGLE BRACKET |
U+A2E7 | '\ua2e7' | Lo |
|
YI SYLLABLE ZZYT |
U+A0C5 | '\ua0c5' | Lo |
|
YI SYLLABLE MU |
U+A2BD | '\ua2bd' | Lo |
|
YI SYLLABLE COT |
U+A305 | '\ua305' | Lo |
|
YI SYLLABLE NZY |
U+A14D | '\ua14d' | Lo |
|
YI SYLLABLE DDU |
U+A11C | '\ua11c' | Lo |
|
YI SYLLABLE TI |
U+A2CA | '\ua2ca' | Lo |
|
YI SYLLABLE CYT |
U+A12F | '\ua12f' | Lo |
|
YI SYLLABLE TEP |
U+A489 | '\ua489' | Lo |
|
YI SYLLABLE YY |
U+300B | '\u300b' | Pe |
|
RIGHT DOUBLE ANGLE BRACKET |
Total codepoints: 11
Shell test using printf(1),
'|'
should align in output:$ printf "\xe3\x80\x8a\xea\x8b\xa7\xea\x83\x85\xea\x8a\xbd\xea\x8c\x85\xea\x85\x8d\xea\x84\x9c\xea\x8b\x8a\xea\x84\xaf\xea\x92\x89\xe3\x80\x8b|\\n1234567890123456789012|\\n" 《ꋧꃅꊽꌅꅍꄜꋊꄯꒉ》| 1234567890123456789012|
- python wcwidth.wcswidth() measures width 22, while zoc measures width 13.
Sequence of language Cherokee (cased) from midpoint of alignment failure records:
Codepoint | Python | Category | wcwidth | Name |
---|---|---|---|---|
U+13C2 | '\u13c2' | Lu |
|
CHEROKEE LETTER NI |
U+AB7C | '\uab7c' | Ll |
|
CHEROKEE SMALL LETTER GV |
U+AB8E | '\uab8e' | Ll |
|
CHEROKEE SMALL LETTER NA |
U+ABAB | '\uabab' | Ll |
|
CHEROKEE SMALL LETTER DV |
Total codepoints: 4
Shell test using printf(1),
'|'
should align in output:$ printf "\xe1\x8f\x82\xea\xad\xbc\xea\xae\x8e\xea\xae\xab|\\n1234|\\n" Ꮒꭼꮎꮫ| 1234|
- python wcwidth.wcswidth() measures width 4, while zoc measures width 7.
Sequence of language Tai Dam from midpoint of alignment failure records:
Codepoint | Python | Category | wcwidth | Name |
---|---|---|---|---|
U+AA81 | '\uaa81' | Lo |
|
TAI VIET LETTER HIGH KO |
U+AAAB | '\uaaab' | Lo |
|
TAI VIET LETTER HIGH VO |
U+AAB1 | '\uaab1' | Lo |
|
TAI VIET VOWEL AA |
U+AAA3 | '\uaaa3' | Lo |
|
TAI VIET LETTER HIGH MO |
Total codepoints: 4
Shell test using printf(1),
'|'
should align in output:$ printf "\xea\xaa\x81\xea\xaa\xab\xea\xaa\xb1\xea\xaa\xa3|\\n1234|\\n" ꪁꪫꪱꪣ| 1234|
- python wcwidth.wcswidth() measures width 4, while zoc measures width 8.
Sequence of language Maldivian from midpoint of alignment failure records:
Codepoint | Python | Category | wcwidth | Name |
---|---|---|---|---|
U+0791 | '\u0791' | Lo |
|
THAANA LETTER DAVIYANI |
U+07A8 | '\u07a8' | Mn |
|
THAANA IBIFILI |
U+0790 | '\u0790' | Lo |
|
THAANA LETTER SEENU |
U+07AC | '\u07ac' | Mn |
|
THAANA EBEFILI |
U+0789 | '\u0789' | Lo |
|
THAANA LETTER MEEMU |
U+07B0 | '\u07b0' | Mn |
|
THAANA SUKUN |
U+0784 | '\u0784' | Lo |
|
THAANA LETTER BAA |
U+07A6 | '\u07a6' | Mn |
|
THAANA ABAFILI |
U+0783 | '\u0783' | Lo |
|
THAANA LETTER RAA |
Total codepoints: 9
Shell test using printf(1),
'|'
should align in output:$ printf "\xde\x91\xde\xa8\xde\x90\xde\xac\xde\x89\xde\xb0\xde\x84\xde\xa6\xde\x83|\\n12345|\\n" ޑިސެމްބަރ| 12345|
- python wcwidth.wcswidth() measures width 5, while zoc measures width 9.
Sequence of language Tamil from midpoint of alignment failure records:
Codepoint | Python | Category | wcwidth | Name |
---|---|---|---|---|
U+0BAE | '\u0bae' | Lo |
|
TAMIL LETTER MA |
U+0BA9 | '\u0ba9' | Lo |
|
TAMIL LETTER NNNA |
U+0BBF | '\u0bbf' | Mc |
|
TAMIL VOWEL SIGN I |
U+0BA4 | '\u0ba4' | Lo |
|
TAMIL LETTER TA |
Total codepoints: 4
Shell test using printf(1),
'|'
should align in output:$ printf "\xe0\xae\xae\xe0\xae\xa9\xe0\xae\xbf\xe0\xae\xa4|\\n123|\\n" மனித| 123|
- python wcwidth.wcswidth() measures width 3, while zoc measures width 4.
Sequence of language Tamil (Sri Lanka) from midpoint of alignment failure records:
Codepoint | Python | Category | wcwidth | Name |
---|---|---|---|---|
U+0BAE | '\u0bae' | Lo |
|
TAMIL LETTER MA |
U+0BA9 | '\u0ba9' | Lo |
|
TAMIL LETTER NNNA |
U+0BBF | '\u0bbf' | Mc |
|
TAMIL VOWEL SIGN I |
U+0BA4 | '\u0ba4' | Lo |
|
TAMIL LETTER TA |
Total codepoints: 4
Shell test using printf(1),
'|'
should align in output:$ printf "\xe0\xae\xae\xe0\xae\xa9\xe0\xae\xbf\xe0\xae\xa4|\\n123|\\n" மனித| 123|
- python wcwidth.wcswidth() measures width 3, while zoc measures width 4.
Sequence of language Burmese from midpoint of alignment failure records:
Codepoint | Python | Category | wcwidth | Name |
---|---|---|---|---|
U+1021 | '\u1021' | Lo |
|
MYANMAR LETTER A |
U+1015 | '\u1015' | Lo |
|
MYANMAR LETTER PA |
U+103C | '\u103c' | Mc |
|
MYANMAR CONSONANT SIGN MEDIAL RA |
U+100A | '\u100a' | Lo |
|
MYANMAR LETTER NNYA |
U+103A | '\u103a' | Mn |
|
MYANMAR SIGN ASAT |
U+1015 | '\u1015' | Lo |
|
MYANMAR LETTER PA |
U+103C | '\u103c' | Mc |
|
MYANMAR CONSONANT SIGN MEDIAL RA |
U+100A | '\u100a' | Lo |
|
MYANMAR LETTER NNYA |
U+103A | '\u103a' | Mn |
|
MYANMAR SIGN ASAT |
U+1006 | '\u1006' | Lo |
|
MYANMAR LETTER CHA |
U+102D | '\u102d' | Mn |
|
MYANMAR VOWEL SIGN I |
U+102F | '\u102f' | Mn |
|
MYANMAR VOWEL SIGN U |
U+1004 | '\u1004' | Lo |
|
MYANMAR LETTER NGA |
U+103A | '\u103a' | Mn |
|
MYANMAR SIGN ASAT |
U+101B | '\u101b' | Lo |
|
MYANMAR LETTER RA |
U+102C | '\u102c' | Mc |
|
MYANMAR VOWEL SIGN AA |
Total codepoints: 16
Shell test using printf(1),
'|'
should align in output:$ printf "\xe1\x80\xa1\xe1\x80\x95\xe1\x80\xbc\xe1\x80\x8a\xe1\x80\xba\xe1\x80\x95\xe1\x80\xbc\xe1\x80\x8a\xe1\x80\xba\xe1\x80\x86\xe1\x80\xad\xe1\x80\xaf\xe1\x80\x84\xe1\x80\xba\xe1\x80\x9b\xe1\x80\xac|\\n12345678|\\n" အပြည်ပြည်ဆိုင်ရာ| 12345678|
- python wcwidth.wcswidth() measures width 8, while zoc measures width 16.
Sequence of language Mon from midpoint of alignment failure records:
Codepoint | Python | Category | wcwidth | Name |
---|---|---|---|---|
U+101C | '\u101c' | Lo |
|
MYANMAR LETTER LA |
U+102D | '\u102d' | Mn |
|
MYANMAR VOWEL SIGN I |
U+1000 | '\u1000' | Lo |
|
MYANMAR LETTER KA |
U+103A | '\u103a' | Mn |
|
MYANMAR SIGN ASAT |
U+101C | '\u101c' | Lo |
|
MYANMAR LETTER LA |
U+101C | '\u101c' | Lo |
|
MYANMAR LETTER LA |
U+1031 | '\u1031' | Mc |
|
MYANMAR VOWEL SIGN E |
U+102C | '\u102c' | Mc |
|
MYANMAR VOWEL SIGN AA |
U+105A | '\u105a' | Lo |
|
MYANMAR LETTER MON NGA |
U+103A | '\u103a' | Mn |
|
MYANMAR SIGN ASAT |
Total codepoints: 10
Shell test using printf(1),
'|'
should align in output:$ printf "\xe1\x80\x9c\xe1\x80\xad\xe1\x80\x80\xe1\x80\xba\xe1\x80\x9c\xe1\x80\x9c\xe1\x80\xb1\xe1\x80\xac\xe1\x81\x9a\xe1\x80\xba|\\n12345|\\n" လိက်လလောၚ်| 12345|
- python wcwidth.wcswidth() measures width 5, while zoc measures width 10.
Sequence of language Shan from midpoint of alignment failure records:
Codepoint | Python | Category | wcwidth | Name |
---|---|---|---|---|
U+101C | '\u101c' | Lo |
|
MYANMAR LETTER LA |
U+102D | '\u102d' | Mn |
|
MYANMAR VOWEL SIGN I |
U+1075 | '\u1075' | Lo |
|
MYANMAR LETTER SHAN KA |
U+103A | '\u103a' | Mn |
|
MYANMAR SIGN ASAT |
U+1088 | '\u1088' | Mc |
|
MYANMAR SIGN SHAN TONE-3 |
U+1015 | '\u1015' | Lo |
|
MYANMAR LETTER PA |
U+102D | '\u102d' | Mn |
|
MYANMAR VOWEL SIGN I |
U+102F | '\u102f' | Mn |
|
MYANMAR VOWEL SIGN U |
U+107C | '\u107c' | Lo |
|
MYANMAR LETTER SHAN NA |
U+103A | '\u103a' | Mn |
|
MYANMAR SIGN ASAT |
U+107D | '\u107d' | Lo |
|
MYANMAR LETTER SHAN PHA |
U+1062 | '\u1062' | Mc |
|
MYANMAR VOWEL SIGN SGAW KAREN EU |
U+101D | '\u101d' | Lo |
|
MYANMAR LETTER WA |
U+103A | '\u103a' | Mn |
|
MYANMAR SIGN ASAT |
U+1087 | '\u1087' | Mc |
|
MYANMAR SIGN SHAN TONE-2 |
Total codepoints: 15
Shell test using printf(1),
'|'
should align in output:$ printf "\xe1\x80\x9c\xe1\x80\xad\xe1\x81\xb5\xe1\x80\xba\xe1\x82\x88\xe1\x80\x95\xe1\x80\xad\xe1\x80\xaf\xe1\x81\xbc\xe1\x80\xba\xe1\x81\xbd\xe1\x81\xa2\xe1\x80\x9d\xe1\x80\xba\xe1\x82\x87|\\n123456|\\n" လိၵ်ႈပိုၼ်ၽၢဝ်ႇ| 123456|
- python wcwidth.wcswidth() measures width 6, while zoc measures width 15.
Sequence of language Dzongkha from midpoint of alignment failure records:
Codepoint | Python | Category | wcwidth | Name |
---|---|---|---|---|
U+0F60 | '\u0f60' | Lo |
|
TIBETAN LETTER -A |
U+0F42 | '\u0f42' | Lo |
|
TIBETAN LETTER GA |
U+0FB2 | '\u0fb2' | Mn |
|
TIBETAN SUBJOINED LETTER RA |
U+0F7C | '\u0f7c' | Mn |
|
TIBETAN VOWEL SIGN O |
U+0F0B | '\u0f0b' | Po |
|
TIBETAN MARK INTERSYLLABIC TSHEG |
U+0F56 | '\u0f56' | Lo |
|
TIBETAN LETTER BA |
U+0F0B | '\u0f0b' | Po |
|
TIBETAN MARK INTERSYLLABIC TSHEG |
U+0F58 | '\u0f58' | Lo |
|
TIBETAN LETTER MA |
U+0F72 | '\u0f72' | Mn |
|
TIBETAN VOWEL SIGN I |
U+0F60 | '\u0f60' | Lo |
|
TIBETAN LETTER -A |
U+0F72 | '\u0f72' | Mn |
|
TIBETAN VOWEL SIGN I |
U+0F0B | '\u0f0b' | Po |
|
TIBETAN MARK INTERSYLLABIC TSHEG |
U+0F51 | '\u0f51' | Lo |
|
TIBETAN LETTER DA |
U+0F56 | '\u0f56' | Lo |
|
TIBETAN LETTER BA |
U+0F44 | '\u0f44' | Lo |
|
TIBETAN LETTER NGA |
U+0F0B | '\u0f0b' | Po |
|
TIBETAN MARK INTERSYLLABIC TSHEG |
U+0F46 | '\u0f46' | Lo |
|
TIBETAN LETTER CHA |
U+0F0B | '\u0f0b' | Po |
|
TIBETAN MARK INTERSYLLABIC TSHEG |
U+0F42 | '\u0f42' | Lo |
|
TIBETAN LETTER GA |
U+0F72 | '\u0f72' | Mn |
|
TIBETAN VOWEL SIGN I |
U+0F0B | '\u0f0b' | Po |
|
TIBETAN MARK INTERSYLLABIC TSHEG |
U+0F60 | '\u0f60' | Lo |
|
TIBETAN LETTER -A |
U+0F5B | '\u0f5b' | Lo |
|
TIBETAN LETTER DZA |
U+0F58 | '\u0f58' | Lo |
|
TIBETAN LETTER MA |
U+0F0B | '\u0f0b' | Po |
|
TIBETAN MARK INTERSYLLABIC TSHEG |
U+0F42 | '\u0f42' | Lo |
|
TIBETAN LETTER GA |
U+0FB3 | '\u0fb3' | Mn |
|
TIBETAN SUBJOINED LETTER LA |
U+0F72 | '\u0f72' | Mn |
|
TIBETAN VOWEL SIGN I |
U+0F44 | '\u0f44' | Lo |
|
TIBETAN LETTER NGA |
U+0F0B | '\u0f0b' | Po |
|
TIBETAN MARK INTERSYLLABIC TSHEG |
U+0F42 | '\u0f42' | Lo |
|
TIBETAN LETTER GA |
U+0F66 | '\u0f66' | Lo |
|
TIBETAN LETTER SA |
U+0F63 | '\u0f63' | Lo |
|
TIBETAN LETTER LA |
U+0F0B | '\u0f0b' | Po |
|
TIBETAN MARK INTERSYLLABIC TSHEG |
U+0F56 | '\u0f56' | Lo |
|
TIBETAN LETTER BA |
U+0F66 | '\u0f66' | Lo |
|
TIBETAN LETTER SA |
U+0F92 | '\u0f92' | Mn |
|
TIBETAN SUBJOINED LETTER GA |
U+0FB2 | '\u0fb2' | Mn |
|
TIBETAN SUBJOINED LETTER RA |
U+0F42 | '\u0f42' | Lo |
|
TIBETAN LETTER GA |
U+0F66 | '\u0f66' | Lo |
|
TIBETAN LETTER SA |
U+0F0D | '\u0f0d' | Po |
|
TIBETAN MARK SHAD |
Total codepoints: 41
Shell test using printf(1),
'|'
should align in output:$ printf "\xe0\xbd\xa0\xe0\xbd\x82\xe0\xbe\xb2\xe0\xbd\xbc\xe0\xbc\x8b\xe0\xbd\x96\xe0\xbc\x8b\xe0\xbd\x98\xe0\xbd\xb2\xe0\xbd\xa0\xe0\xbd\xb2\xe0\xbc\x8b\xe0\xbd\x91\xe0\xbd\x96\xe0\xbd\x84\xe0\xbc\x8b\xe0\xbd\x86\xe0\xbc\x8b\xe0\xbd\x82\xe0\xbd\xb2\xe0\xbc\x8b\xe0\xbd\xa0\xe0\xbd\x9b\xe0\xbd\x98\xe0\xbc\x8b\xe0\xbd\x82\xe0\xbe\xb3\xe0\xbd\xb2\xe0\xbd\x84\xe0\xbc\x8b\xe0\xbd\x82\xe0\xbd\xa6\xe0\xbd\xa3\xe0\xbc\x8b\xe0\xbd\x96\xe0\xbd\xa6\xe0\xbe\x92\xe0\xbe\xb2\xe0\xbd\x82\xe0\xbd\xa6\xe0\xbc\x8d|\\n12345678901234567890123456789012|\\n" འགྲོ་བ་མིའི་དབང་ཆ་གི་འཛམ་གླིང་གསལ་བསྒྲགས།| 12345678901234567890123456789012|
- python wcwidth.wcswidth() measures width 32, while zoc measures width 41.
Sequence of language Gujarati from midpoint of alignment failure records:
Codepoint | Python | Category | wcwidth | Name |
---|---|---|---|---|
U+0AAE | '\u0aae' | Lo |
|
GUJARATI LETTER MA |
U+0ABE | '\u0abe' | Mc |
|
GUJARATI VOWEL SIGN AA |
U+0AA8 | '\u0aa8' | Lo |
|
GUJARATI LETTER NA |
U+0AB5 | '\u0ab5' | Lo |
|
GUJARATI LETTER VA |
Total codepoints: 4
Shell test using printf(1),
'|'
should align in output:$ printf "\xe0\xaa\xae\xe0\xaa\xbe\xe0\xaa\xa8\xe0\xaa\xb5|\\n123|\\n" માનવ| 123|
- python wcwidth.wcswidth() measures width 3, while zoc measures width 4.
Sequence of language Tibetan, Central from midpoint of alignment failure records:
Codepoint | Python | Category | wcwidth | Name |
---|---|---|---|---|
U+0F61 | '\u0f61' | Lo |
|
TIBETAN LETTER YA |
U+0F7C | '\u0f7c' | Mn |
|
TIBETAN VOWEL SIGN O |
U+0F44 | '\u0f44' | Lo |
|
TIBETAN LETTER NGA |
U+0F66 | '\u0f66' | Lo |
|
TIBETAN LETTER SA |
U+0F0B | '\u0f0b' | Po |
|
TIBETAN MARK INTERSYLLABIC TSHEG |
U+0F41 | '\u0f41' | Lo |
|
TIBETAN LETTER KHA |
U+0FB1 | '\u0fb1' | Mn |
|
TIBETAN SUBJOINED LETTER YA |
U+0F56 | '\u0f56' | Lo |
|
TIBETAN LETTER BA |
U+0F0B | '\u0f0b' | Po |
|
TIBETAN MARK INTERSYLLABIC TSHEG |
U+0F42 | '\u0f42' | Lo |
|
TIBETAN LETTER GA |
U+0F66 | '\u0f66' | Lo |
|
TIBETAN LETTER SA |
U+0F63 | '\u0f63' | Lo |
|
TIBETAN LETTER LA |
U+0F0B | '\u0f0b' | Po |
|
TIBETAN MARK INTERSYLLABIC TSHEG |
U+0F56 | '\u0f56' | Lo |
|
TIBETAN LETTER BA |
U+0F66 | '\u0f66' | Lo |
|
TIBETAN LETTER SA |
U+0F92 | '\u0f92' | Mn |
|
TIBETAN SUBJOINED LETTER GA |
U+0FB2 | '\u0fb2' | Mn |
|
TIBETAN SUBJOINED LETTER RA |
U+0F42 | '\u0f42' | Lo |
|
TIBETAN LETTER GA |
U+0F66 | '\u0f66' | Lo |
|
TIBETAN LETTER SA |
U+0F0B | '\u0f0b' | Po |
|
TIBETAN MARK INTERSYLLABIC TSHEG |
U+0F60 | '\u0f60' | Lo |
|
TIBETAN LETTER -A |
U+0F42 | '\u0f42' | Lo |
|
TIBETAN LETTER GA |
U+0FB2 | '\u0fb2' | Mn |
|
TIBETAN SUBJOINED LETTER RA |
U+0F7C | '\u0f7c' | Mn |
|
TIBETAN VOWEL SIGN O |
U+0F0B | '\u0f0b' | Po |
|
TIBETAN MARK INTERSYLLABIC TSHEG |
U+0F56 | '\u0f56' | Lo |
|
TIBETAN LETTER BA |
U+0F0B | '\u0f0b' | Po |
|
TIBETAN MARK INTERSYLLABIC TSHEG |
U+0F58 | '\u0f58' | Lo |
|
TIBETAN LETTER MA |
U+0F72 | '\u0f72' | Mn |
|
TIBETAN VOWEL SIGN I |
U+0F60 | '\u0f60' | Lo |
|
TIBETAN LETTER -A |
U+0F72 | '\u0f72' | Mn |
|
TIBETAN VOWEL SIGN I |
U+0F0B | '\u0f0b' | Po |
|
TIBETAN MARK INTERSYLLABIC TSHEG |
U+0F50 | '\u0f50' | Lo |
|
TIBETAN LETTER THA |
U+0F7C | '\u0f7c' | Mn |
|
TIBETAN VOWEL SIGN O |
U+0F56 | '\u0f56' | Lo |
|
TIBETAN LETTER BA |
U+0F0B | '\u0f0b' | Po |
|
TIBETAN MARK INTERSYLLABIC TSHEG |
U+0F50 | '\u0f50' | Lo |
|
TIBETAN LETTER THA |
U+0F44 | '\u0f44' | Lo |
|
TIBETAN LETTER NGA |
U+0F0C | '\u0f0c' | Po |
|
TIBETAN MARK DELIMITER TSHEG BSTAR |
U+0F0D | '\u0f0d' | Po |
|
TIBETAN MARK SHAD |
Total codepoints: 40
Shell test using printf(1),
'|'
should align in output:$ printf "\xe0\xbd\xa1\xe0\xbd\xbc\xe0\xbd\x84\xe0\xbd\xa6\xe0\xbc\x8b\xe0\xbd\x81\xe0\xbe\xb1\xe0\xbd\x96\xe0\xbc\x8b\xe0\xbd\x82\xe0\xbd\xa6\xe0\xbd\xa3\xe0\xbc\x8b\xe0\xbd\x96\xe0\xbd\xa6\xe0\xbe\x92\xe0\xbe\xb2\xe0\xbd\x82\xe0\xbd\xa6\xe0\xbc\x8b\xe0\xbd\xa0\xe0\xbd\x82\xe0\xbe\xb2\xe0\xbd\xbc\xe0\xbc\x8b\xe0\xbd\x96\xe0\xbc\x8b\xe0\xbd\x98\xe0\xbd\xb2\xe0\xbd\xa0\xe0\xbd\xb2\xe0\xbc\x8b\xe0\xbd\x90\xe0\xbd\xbc\xe0\xbd\x96\xe0\xbc\x8b\xe0\xbd\x90\xe0\xbd\x84\xe0\xbc\x8c\xe0\xbc\x8d|\\n1234567890123456789012345678901|\\n" ཡོངས་ཁྱབ་གསལ་བསྒྲགས་འགྲོ་བ་མིའི་ཐོབ་ཐང༌།| 1234567890123456789012345678901|
- python wcwidth.wcswidth() measures width 31, while zoc measures width 40.
Sequence of language Malayalam from midpoint of alignment failure records:
Codepoint | Python | Category | wcwidth | Name |
---|---|---|---|---|
U+0D2E | '\u0d2e' | Lo |
|
MALAYALAM LETTER MA |
U+0D28 | '\u0d28' | Lo |
|
MALAYALAM LETTER NA |
U+0D41 | '\u0d41' | Mn |
|
MALAYALAM VOWEL SIGN U |
U+0D37 | '\u0d37' | Lo |
|
MALAYALAM LETTER SSA |
U+0D4D | '\u0d4d' | Mn |
|
MALAYALAM SIGN VIRAMA |
U+0D2F | '\u0d2f' | Lo |
|
MALAYALAM LETTER YA |
U+0D3E | '\u0d3e' | Mc |
|
MALAYALAM VOWEL SIGN AA |
U+0D35 | '\u0d35' | Lo |
|
MALAYALAM LETTER VA |
U+0D15 | '\u0d15' | Lo |
|
MALAYALAM LETTER KA |
U+0D3E | '\u0d3e' | Mc |
|
MALAYALAM VOWEL SIGN AA |
U+0D36 | '\u0d36' | Lo |
|
MALAYALAM LETTER SHA |
U+0D19 | '\u0d19' | Lo |
|
MALAYALAM LETTER NGA |
U+0D4D | '\u0d4d' | Mn |
|
MALAYALAM SIGN VIRAMA |
U+0D19 | '\u0d19' | Lo |
|
MALAYALAM LETTER NGA |
U+0D33 | '\u0d33' | Lo |
|
MALAYALAM LETTER LLA |
U+0D46 | '\u0d46' | Mc |
|
MALAYALAM VOWEL SIGN E |
U+0D15 | '\u0d15' | Lo |
|
MALAYALAM LETTER KA |
U+0D4D | '\u0d4d' | Mn |
|
MALAYALAM SIGN VIRAMA |
U+0D15 | '\u0d15' | Lo |
|
MALAYALAM LETTER KA |
U+0D41 | '\u0d41' | Mn |
|
MALAYALAM VOWEL SIGN U |
U+0D31 | '\u0d31' | Lo |
|
MALAYALAM LETTER RRA |
U+0D3F | '\u0d3f' | Mc |
|
MALAYALAM VOWEL SIGN I |
U+0D15 | '\u0d15' | Lo |
|
MALAYALAM LETTER KA |
U+0D4D | '\u0d4d' | Mn |
|
MALAYALAM SIGN VIRAMA |
U+0D15 | '\u0d15' | Lo |
|
MALAYALAM LETTER KA |
U+0D41 | '\u0d41' | Mn |
|
MALAYALAM VOWEL SIGN U |
U+0D28 | '\u0d28' | Lo |
|
MALAYALAM LETTER NA |
U+0D4D | '\u0d4d' | Mn |
|
MALAYALAM SIGN VIRAMA |
U+0D28 | '\u0d28' | Lo |
|
MALAYALAM LETTER NA |
Total codepoints: 29
Shell test using printf(1),
'|'
should align in output:$ printf "\xe0\xb4\xae\xe0\xb4\xa8\xe0\xb5\x81\xe0\xb4\xb7\xe0\xb5\x8d\xe0\xb4\xaf\xe0\xb4\xbe\xe0\xb4\xb5\xe0\xb4\x95\xe0\xb4\xbe\xe0\xb4\xb6\xe0\xb4\x99\xe0\xb5\x8d\xe0\xb4\x99\xe0\xb4\xb3\xe0\xb5\x86\xe0\xb4\x95\xe0\xb5\x8d\xe0\xb4\x95\xe0\xb5\x81\xe0\xb4\xb1\xe0\xb4\xbf\xe0\xb4\x95\xe0\xb5\x8d\xe0\xb4\x95\xe0\xb5\x81\xe0\xb4\xa8\xe0\xb5\x8d\xe0\xb4\xa8|\\n12345678901234567|\\n" മനുഷ്യാവകാശങ്ങളെക്കുറിക്കുന്ന| 12345678901234567|
- python wcwidth.wcswidth() measures width 17, while zoc measures width 29.
Sequence of language Tamang, Eastern from midpoint of alignment failure records:
Codepoint | Python | Category | wcwidth | Name |
---|---|---|---|---|
U+092E | '\u092e' | Lo |
|
DEVANAGARI LETTER MA |
U+094D | '\u094d' | Mn |
|
DEVANAGARI SIGN VIRAMA |
U+0939 | '\u0939' | Lo |
|
DEVANAGARI LETTER HA |
U+0940 | '\u0940' | Mc |
|
DEVANAGARI VOWEL SIGN II |
U+0938 | '\u0938' | Lo |
|
DEVANAGARI LETTER SA |
U+0947 | '\u0947' | Mn |
|
DEVANAGARI VOWEL SIGN E |
Total codepoints: 6
Shell test using printf(1),
'|'
should align in output:$ printf "\xe0\xa4\xae\xe0\xa5\x8d\xe0\xa4\xb9\xe0\xa5\x80\xe0\xa4\xb8\xe0\xa5\x87|\\n123|\\n" म्हीसे| 123|
- python wcwidth.wcswidth() measures width 3, while zoc measures width 6.
Sequence of language Kannada from midpoint of alignment failure records:
Codepoint | Python | Category | wcwidth | Name |
---|---|---|---|---|
U+0CAE | '\u0cae' | Lo |
|
KANNADA LETTER MA |
U+0CBE | '\u0cbe' | Mc |
|
KANNADA VOWEL SIGN AA |
U+0CA8 | '\u0ca8' | Lo |
|
KANNADA LETTER NA |
U+0CB5 | '\u0cb5' | Lo |
|
KANNADA LETTER VA |
Total codepoints: 4
Shell test using printf(1),
'|'
should align in output:$ printf "\xe0\xb2\xae\xe0\xb2\xbe\xe0\xb2\xa8\xe0\xb2\xb5|\\n123|\\n" ಮಾನವ| 123|
- python wcwidth.wcswidth() measures width 3, while zoc measures width 4.
Sequence of language Khün from midpoint of alignment failure records:
Codepoint | Python | Category | wcwidth | Name |
---|---|---|---|---|
U+1A20 | '\u1a20' | Lo |
|
TAI THAM LETTER HIGH KA |
U+1A32 | '\u1a32' | Lo |
|
TAI THAM LETTER HIGH TA |
U+1A65 | '\u1a65' | Mn |
|
TAI THAM VOWEL SIGN I |
U+1A20 | '\u1a20' | Lo |
|
TAI THAM LETTER HIGH KA |
U+1A63 | '\u1a63' | Mc |
|
TAI THAM VOWEL SIGN AA |
U+1A45 | '\u1a45' | Lo |
|
TAI THAM LETTER WA |
U+1A64 | '\u1a64' | Mc |
|
TAI THAM VOWEL SIGN TALL AA |
U+1A75 | '\u1a75' | Mn |
|
TAI THAM SIGN TONE-1 |
U+1A2F | '\u1a2f' | Lo |
|
TAI THAM LETTER DA |
U+1A60 | '\u1a60' | Mn |
|
TAI THAM SIGN SAKOT |
U+1A45 | '\u1a45' | Lo |
|
TAI THAM LETTER WA |
U+1A60 | '\u1a60' | Mn |
|
TAI THAM SIGN SAKOT |
U+1A3F | '\u1a3f' | Lo |
|
TAI THAM LETTER LOW YA |
U+1A62 | '\u1a62' | Mn |
|
TAI THAM VOWEL SIGN MAI SAT |
U+1A3E | '\u1a3e' | Lo |
|
TAI THAM LETTER MA |
U+1A36 | '\u1a36' | Lo |
|
TAI THAM LETTER NA |
U+1A69 | '\u1a69' | Mn |
|
TAI THAM VOWEL SIGN U |
U+1A54 | '\u1a54' | Lo |
|
TAI THAM LETTER GREAT SA |
U+1A29 | '\u1a29' | Lo |
|
TAI THAM LETTER LOW CA |
U+1A63 | '\u1a63' | Mc |
|
TAI THAM VOWEL SIGN AA |
U+1A60 | '\u1a60' | Mn |
|
TAI THAM SIGN SAKOT |
U+1A32 | '\u1a32' | Lo |
|
TAI THAM LETTER HIGH TA |
Total codepoints: 22
Shell test using printf(1),
'|'
should align in output:$ printf "\xe1\xa8\xa0\xe1\xa8\xb2\xe1\xa9\xa5\xe1\xa8\xa0\xe1\xa9\xa3\xe1\xa9\x85\xe1\xa9\xa4\xe1\xa9\xb5\xe1\xa8\xaf\xe1\xa9\xa0\xe1\xa9\x85\xe1\xa9\xa0\xe1\xa8\xbf\xe1\xa9\xa2\xe1\xa8\xbe\xe1\xa8\xb6\xe1\xa9\xa9\xe1\xa9\x94\xe1\xa8\xa9\xe1\xa9\xa3\xe1\xa9\xa0\xe1\xa8\xb2|\\n123456789012|\\n" ᨠᨲᩥᨠᩣᩅᩤ᩵ᨯ᩠ᩅ᩠ᨿᩢᨾᨶᩩᩔᨩᩣ᩠ᨲ| 123456789012|
- python wcwidth.wcswidth() measures width 12, while zoc measures width 22.
Sequence of language Khmer, Central from midpoint of alignment failure records:
Codepoint | Python | Category | wcwidth | Name |
---|---|---|---|---|
U+179F | '\u179f' | Lo |
|
KHMER LETTER SA |
U+17C1 | '\u17c1' | Mc |
|
KHMER VOWEL SIGN E |
U+1785 | '\u1785' | Lo |
|
KHMER LETTER CA |
U+1780 | '\u1780' | Lo |
|
KHMER LETTER KA |
U+17D2 | '\u17d2' | Mn |
|
KHMER SIGN COENG |
U+178A | '\u178a' | Lo |
|
KHMER LETTER DA |
U+17B8 | '\u17b8' | Mn |
|
KHMER VOWEL SIGN II |
U+1794 | '\u1794' | Lo |
|
KHMER LETTER BA |
U+17D2 | '\u17d2' | Mn |
|
KHMER SIGN COENG |
U+179A | '\u179a' | Lo |
|
KHMER LETTER RO |
U+1780 | '\u1780' | Lo |
|
KHMER LETTER KA |
U+17B6 | '\u17b6' | Mc |
|
KHMER VOWEL SIGN AA |
U+179F | '\u179f' | Lo |
|
KHMER LETTER SA |
U+1787 | '\u1787' | Lo |
|
KHMER LETTER CO |
U+17B6 | '\u17b6' | Mc |
|
KHMER VOWEL SIGN AA |
U+179F | '\u179f' | Lo |
|
KHMER LETTER SA |
U+1780 | '\u1780' | Lo |
|
KHMER LETTER KA |
U+179B | '\u179b' | Lo |
|
KHMER LETTER LO |
U+179F | '\u179f' | Lo |
|
KHMER LETTER SA |
U+17D2 | '\u17d2' | Mn |
|
KHMER SIGN COENG |
U+178A | '\u178a' | Lo |
|
KHMER LETTER DA |
U+17B8 | '\u17b8' | Mn |
|
KHMER VOWEL SIGN II |
U+1796 | '\u1796' | Lo |
|
KHMER LETTER PO |
U+17B8 | '\u17b8' | Mn |
|
KHMER VOWEL SIGN II |
U+179F | '\u179f' | Lo |
|
KHMER LETTER SA |
U+17B7 | '\u17b7' | Mn |
|
KHMER VOWEL SIGN I |
U+1791 | '\u1791' | Lo |
|
KHMER LETTER TO |
U+17D2 | '\u17d2' | Mn |
|
KHMER SIGN COENG |
U+1792 | '\u1792' | Lo |
|
KHMER LETTER THO |
U+17B7 | '\u17b7' | Mn |
|
KHMER VOWEL SIGN I |
U+1798 | '\u1798' | Lo |
|
KHMER LETTER MO |
U+1793 | '\u1793' | Lo |
|
KHMER LETTER NO |
U+17BB | '\u17bb' | Mn |
|
KHMER VOWEL SIGN U |
U+179F | '\u179f' | Lo |
|
KHMER LETTER SA |
U+17D2 | '\u17d2' | Mn |
|
KHMER SIGN COENG |
U+179F | '\u179f' | Lo |
|
KHMER LETTER SA |
Total codepoints: 36
Shell test using printf(1),
'|'
should align in output:$ printf "\xe1\x9e\x9f\xe1\x9f\x81\xe1\x9e\x85\xe1\x9e\x80\xe1\x9f\x92\xe1\x9e\x8a\xe1\x9e\xb8\xe1\x9e\x94\xe1\x9f\x92\xe1\x9e\x9a\xe1\x9e\x80\xe1\x9e\xb6\xe1\x9e\x9f\xe1\x9e\x87\xe1\x9e\xb6\xe1\x9e\x9f\xe1\x9e\x80\xe1\x9e\x9b\xe1\x9e\x9f\xe1\x9f\x92\xe1\x9e\x8a\xe1\x9e\xb8\xe1\x9e\x96\xe1\x9e\xb8\xe1\x9e\x9f\xe1\x9e\xb7\xe1\x9e\x91\xe1\x9f\x92\xe1\x9e\x92\xe1\x9e\xb7\xe1\x9e\x98\xe1\x9e\x93\xe1\x9e\xbb\xe1\x9e\x9f\xe1\x9f\x92\xe1\x9e\x9f|\\n1234567890123456789012|\\n" សេចក្ដីប្រកាសជាសកលស្ដីពីសិទ្ធិមនុស្ស| 1234567890123456789012|
- python wcwidth.wcswidth() measures width 22, while zoc measures width 36.
Sequence of language Bengali from midpoint of alignment failure records:
Codepoint | Python | Category | wcwidth | Name |
---|---|---|---|---|
U+09AE | '\u09ae' | Lo |
|
BENGALI LETTER MA |
U+09BE | '\u09be' | Mc |
|
BENGALI VOWEL SIGN AA |
U+09A8 | '\u09a8' | Lo |
|
BENGALI LETTER NA |
U+09AC | '\u09ac' | Lo |
|
BENGALI LETTER BA |
U+09BE | '\u09be' | Mc |
|
BENGALI VOWEL SIGN AA |
U+09A7 | '\u09a7' | Lo |
|
BENGALI LETTER DHA |
U+09BF | '\u09bf' | Mc |
|
BENGALI VOWEL SIGN I |
U+0995 | '\u0995' | Lo |
|
BENGALI LETTER KA |
U+09BE | '\u09be' | Mc |
|
BENGALI VOWEL SIGN AA |
U+09B0 | '\u09b0' | Lo |
|
BENGALI LETTER RA |
U+09C7 | '\u09c7' | Mc |
|
BENGALI VOWEL SIGN E |
U+09B0 | '\u09b0' | Lo |
|
BENGALI LETTER RA |
Total codepoints: 12
Shell test using printf(1),
'|'
should align in output:$ printf "\xe0\xa6\xae\xe0\xa6\xbe\xe0\xa6\xa8\xe0\xa6\xac\xe0\xa6\xbe\xe0\xa6\xa7\xe0\xa6\xbf\xe0\xa6\x95\xe0\xa6\xbe\xe0\xa6\xb0\xe0\xa7\x87\xe0\xa6\xb0|\\n1234567|\\n" মানবাধিকারের| 1234567|
- python wcwidth.wcswidth() measures width 7, while zoc measures width 12.
Sequence of language Chakma from midpoint of alignment failure records:
Codepoint | Python | Category | wcwidth | Name |
---|---|---|---|---|
U+0001111F | '\U0001111f' | Lo |
|
CHAKMA LETTER MAA |
U+0001111A | '\U0001111a' | Lo |
|
CHAKMA LETTER NAA |
U+0001112C | '\U0001112c' | Mc |
|
CHAKMA VOWEL SIGN E |
U+0001112D | '\U0001112d' | Mn |
|
CHAKMA VOWEL SIGN AI |
U+00011103 | '\U00011103' | Lo |
|
CHAKMA LETTER AA |
U+00011107 | '\U00011107' | Lo |
|
CHAKMA LETTER KAA |
U+00011134 | '\U00011134' | Mn |
|
CHAKMA MAAYYAA |
U+00011107 | '\U00011107' | Lo |
|
CHAKMA LETTER KAA |
U+00011125 | '\U00011125' | Lo |
|
CHAKMA LETTER SAA |
U+00011127 | '\U00011127' | Mn |
|
CHAKMA VOWEL SIGN A |
U+00011101 | '\U00011101' | Mn |
|
CHAKMA SIGN ANUSVARA |
U+00011122 | '\U00011122' | Lo |
|
CHAKMA LETTER RAA |
U+00011134 | '\U00011134' | Mn |
|
CHAKMA MAAYYAA |
Total codepoints: 13
Shell test using printf(1),
'|'
should align in output:$ printf "\xf0\x91\x84\x9f\xf0\x91\x84\x9a\xf0\x91\x84\xac\xf0\x91\x84\xad\xf0\x91\x84\x83\xf0\x91\x84\x87\xf0\x91\x84\xb4\xf0\x91\x84\x87\xf0\x91\x84\xa5\xf0\x91\x84\xa7\xf0\x91\x84\x81\xf0\x91\x84\xa2\xf0\x91\x84\xb4|\\n1234567|\\n" 𑄟𑄚𑄬𑄭𑄃𑄇𑄴𑄇𑄥𑄧𑄁𑄢𑄴| 1234567|
- python wcwidth.wcswidth() measures width 7, while zoc measures width 13.
Sequence of language Telugu from midpoint of alignment failure records:
Codepoint | Python | Category | wcwidth | Name |
---|---|---|---|---|
U+0C2E | '\u0c2e' | Lo |
|
TELUGU LETTER MA |
U+0C3E | '\u0c3e' | Mn |
|
TELUGU VOWEL SIGN AA |
U+0C28 | '\u0c28' | Lo |
|
TELUGU LETTER NA |
U+0C35 | '\u0c35' | Lo |
|
TELUGU LETTER VA |
U+0C38 | '\u0c38' | Lo |
|
TELUGU LETTER SA |
U+0C4D | '\u0c4d' | Mn |
|
TELUGU SIGN VIRAMA |
U+0C35 | '\u0c35' | Lo |
|
TELUGU LETTER VA |
U+0C24 | '\u0c24' | Lo |
|
TELUGU LETTER TA |
U+0C4D | '\u0c4d' | Mn |
|
TELUGU SIGN VIRAMA |
U+0C35 | '\u0c35' | Lo |
|
TELUGU LETTER VA |
U+0C2E | '\u0c2e' | Lo |
|
TELUGU LETTER MA |
U+0C41 | '\u0c41' | Mc |
|
TELUGU VOWEL SIGN U |
U+0C32 | '\u0c32' | Lo |
|
TELUGU LETTER LA |
Total codepoints: 13
Shell test using printf(1),
'|'
should align in output:$ printf "\xe0\xb0\xae\xe0\xb0\xbe\xe0\xb0\xa8\xe0\xb0\xb5\xe0\xb0\xb8\xe0\xb1\x8d\xe0\xb0\xb5\xe0\xb0\xa4\xe0\xb1\x8d\xe0\xb0\xb5\xe0\xb0\xae\xe0\xb1\x81\xe0\xb0\xb2|\\n123456789|\\n" మానవస్వత్వముల| 123456789|
- python wcwidth.wcswidth() measures width 9, while zoc measures width 13.
Sequence of language Nepali from midpoint of alignment failure records:
Codepoint | Python | Category | wcwidth | Name |
---|---|---|---|---|
U+092E | '\u092e' | Lo |
|
DEVANAGARI LETTER MA |
U+093E | '\u093e' | Mc |
|
DEVANAGARI VOWEL SIGN AA |
U+0928 | '\u0928' | Lo |
|
DEVANAGARI LETTER NA |
U+0935 | '\u0935' | Lo |
|
DEVANAGARI LETTER VA |
Total codepoints: 4
Shell test using printf(1),
'|'
should align in output:$ printf "\xe0\xa4\xae\xe0\xa4\xbe\xe0\xa4\xa8\xe0\xa4\xb5|\\n123|\\n" मानव| 123|
- python wcwidth.wcswidth() measures width 3, while zoc measures width 4.
Sequence of language Sanskrit from midpoint of alignment failure records:
Codepoint | Python | Category | wcwidth | Name |
---|---|---|---|---|
U+092E | '\u092e' | Lo |
|
DEVANAGARI LETTER MA |
U+093E | '\u093e' | Mc |
|
DEVANAGARI VOWEL SIGN AA |
U+0928 | '\u0928' | Lo |
|
DEVANAGARI LETTER NA |
U+0935 | '\u0935' | Lo |
|
DEVANAGARI LETTER VA |
U+093E | '\u093e' | Mc |
|
DEVANAGARI VOWEL SIGN AA |
U+0927 | '\u0927' | Lo |
|
DEVANAGARI LETTER DHA |
U+093F | '\u093f' | Mc |
|
DEVANAGARI VOWEL SIGN I |
U+0915 | '\u0915' | Lo |
|
DEVANAGARI LETTER KA |
U+093E | '\u093e' | Mc |
|
DEVANAGARI VOWEL SIGN AA |
U+0930 | '\u0930' | Lo |
|
DEVANAGARI LETTER RA |
U+093E | '\u093e' | Mc |
|
DEVANAGARI VOWEL SIGN AA |
U+0923 | '\u0923' | Lo |
|
DEVANAGARI LETTER NNA |
U+093E | '\u093e' | Mc |
|
DEVANAGARI VOWEL SIGN AA |
U+0902 | '\u0902' | Mn |
|
DEVANAGARI SIGN ANUSVARA |
Total codepoints: 14
Shell test using printf(1),
'|'
should align in output:$ printf "\xe0\xa4\xae\xe0\xa4\xbe\xe0\xa4\xa8\xe0\xa4\xb5\xe0\xa4\xbe\xe0\xa4\xa7\xe0\xa4\xbf\xe0\xa4\x95\xe0\xa4\xbe\xe0\xa4\xb0\xe0\xa4\xbe\xe0\xa4\xa3\xe0\xa4\xbe\xe0\xa4\x82|\\n1234567|\\n" मानवाधिकाराणां| 1234567|
- python wcwidth.wcswidth() measures width 7, while zoc measures width 14.
Sequence of language Sanskrit (Grantha) from midpoint of alignment failure records:
Codepoint | Python | Category | wcwidth | Name |
---|---|---|---|---|
U+0001132E | '\U0001132e' | Lo |
|
GRANTHA LETTER MA |
U+0001133E | '\U0001133e' | Mc |
|
GRANTHA VOWEL SIGN AA |
U+00011328 | '\U00011328' | Lo |
|
GRANTHA LETTER NA |
U+00011335 | '\U00011335' | Lo |
|
GRANTHA LETTER VA |
U+0001133E | '\U0001133e' | Mc |
|
GRANTHA VOWEL SIGN AA |
U+00011327 | '\U00011327' | Lo |
|
GRANTHA LETTER DHA |
U+0001133F | '\U0001133f' | Mc |
|
GRANTHA VOWEL SIGN I |
U+00011315 | '\U00011315' | Lo |
|
GRANTHA LETTER KA |
U+0001133E | '\U0001133e' | Mc |
|
GRANTHA VOWEL SIGN AA |
U+00011330 | '\U00011330' | Lo |
|
GRANTHA LETTER RA |
U+0001133E | '\U0001133e' | Mc |
|
GRANTHA VOWEL SIGN AA |
U+00011323 | '\U00011323' | Lo |
|
GRANTHA LETTER NNA |
U+0001133E | '\U0001133e' | Mc |
|
GRANTHA VOWEL SIGN AA |
U+00011302 | '\U00011302' | Mc |
|
GRANTHA SIGN ANUSVARA |
Total codepoints: 14
Shell test using printf(1),
'|'
should align in output:$ printf "\xf0\x91\x8c\xae\xf0\x91\x8c\xbe\xf0\x91\x8c\xa8\xf0\x91\x8c\xb5\xf0\x91\x8c\xbe\xf0\x91\x8c\xa7\xf0\x91\x8c\xbf\xf0\x91\x8c\x95\xf0\x91\x8c\xbe\xf0\x91\x8c\xb0\xf0\x91\x8c\xbe\xf0\x91\x8c\xa3\xf0\x91\x8c\xbe\xf0\x91\x8c\x82|\\n1234567|\\n" 𑌮𑌾𑌨𑌵𑌾𑌧𑌿𑌕𑌾𑌰𑌾𑌣𑌾𑌂| 1234567|
- python wcwidth.wcswidth() measures width 7, while zoc measures width 14.
Sequence of language Marathi from midpoint of alignment failure records:
Codepoint | Python | Category | wcwidth | Name |
---|---|---|---|---|
U+092E | '\u092e' | Lo |
|
DEVANAGARI LETTER MA |
U+093E | '\u093e' | Mc |
|
DEVANAGARI VOWEL SIGN AA |
U+0928 | '\u0928' | Lo |
|
DEVANAGARI LETTER NA |
U+0935 | '\u0935' | Lo |
|
DEVANAGARI LETTER VA |
U+0940 | '\u0940' | Mc |
|
DEVANAGARI VOWEL SIGN II |
Total codepoints: 5
Shell test using printf(1),
'|'
should align in output:$ printf "\xe0\xa4\xae\xe0\xa4\xbe\xe0\xa4\xa8\xe0\xa4\xb5\xe0\xa5\x80|\\n123|\\n" मानवी| 123|
- python wcwidth.wcswidth() measures width 3, while zoc measures width 5.
Sequence of language Hindi from midpoint of alignment failure records:
Codepoint | Python | Category | wcwidth | Name |
---|---|---|---|---|
U+092E | '\u092e' | Lo |
|
DEVANAGARI LETTER MA |
U+093E | '\u093e' | Mc |
|
DEVANAGARI VOWEL SIGN AA |
U+0928 | '\u0928' | Lo |
|
DEVANAGARI LETTER NA |
U+0935 | '\u0935' | Lo |
|
DEVANAGARI LETTER VA |
Total codepoints: 4
Shell test using printf(1),
'|'
should align in output:$ printf "\xe0\xa4\xae\xe0\xa4\xbe\xe0\xa4\xa8\xe0\xa4\xb5|\\n123|\\n" मानव| 123|
- python wcwidth.wcswidth() measures width 3, while zoc measures width 4.
Sequence of language Sinhala from midpoint of alignment failure records:
Codepoint | Python | Category | wcwidth | Name |
---|---|---|---|---|
U+0DB8 | '\u0db8' | Lo |
|
SINHALA LETTER MAYANNA |
U+0DCF | '\u0dcf' | Mc |
|
SINHALA VOWEL SIGN AELA-PILLA |
U+0DB1 | '\u0db1' | Lo |
|
SINHALA LETTER DANTAJA NAYANNA |
U+0DC0 | '\u0dc0' | Lo |
|
SINHALA LETTER VAYANNA |
Total codepoints: 4
Shell test using printf(1),
'|'
should align in output:$ printf "\xe0\xb6\xb8\xe0\xb7\x8f\xe0\xb6\xb1\xe0\xb7\x80|\\n123|\\n" මානව| 123|
- python wcwidth.wcswidth() measures width 3, while zoc measures width 4.
Sequence of language Panjabi, Eastern from midpoint of alignment failure records:
Codepoint | Python | Category | wcwidth | Name |
---|---|---|---|---|
U+0A2E | '\u0a2e' | Lo |
|
GURMUKHI LETTER MA |
U+0A28 | '\u0a28' | Lo |
|
GURMUKHI LETTER NA |
U+0A41 | '\u0a41' | Mn |
|
GURMUKHI VOWEL SIGN U |
U+0A71 | '\u0a71' | Mn |
|
GURMUKHI ADDAK |
U+0A16 | '\u0a16' | Lo |
|
GURMUKHI LETTER KHA |
U+0A40 | '\u0a40' | Mc |
|
GURMUKHI VOWEL SIGN II |
Total codepoints: 6
Shell test using printf(1),
'|'
should align in output:$ printf "\xe0\xa8\xae\xe0\xa8\xa8\xe0\xa9\x81\xe0\xa9\xb1\xe0\xa8\x96\xe0\xa9\x80|\\n123|\\n" ਮਨੁੱਖੀ| 123|
- python wcwidth.wcswidth() measures width 3, while zoc measures width 6.
Sequence of language Bhojpuri from midpoint of alignment failure records:
Codepoint | Python | Category | wcwidth | Name |
---|---|---|---|---|
U+092E | '\u092e' | Lo |
|
DEVANAGARI LETTER MA |
U+093E | '\u093e' | Mc |
|
DEVANAGARI VOWEL SIGN AA |
U+0928 | '\u0928' | Lo |
|
DEVANAGARI LETTER NA |
U+0935 | '\u0935' | Lo |
|
DEVANAGARI LETTER VA |
U+093E | '\u093e' | Mc |
|
DEVANAGARI VOWEL SIGN AA |
U+0927 | '\u0927' | Lo |
|
DEVANAGARI LETTER DHA |
U+093F | '\u093f' | Mc |
|
DEVANAGARI VOWEL SIGN I |
U+0915 | '\u0915' | Lo |
|
DEVANAGARI LETTER KA |
U+093E | '\u093e' | Mc |
|
DEVANAGARI VOWEL SIGN AA |
U+0930 | '\u0930' | Lo |
|
DEVANAGARI LETTER RA |
Total codepoints: 10
Shell test using printf(1),
'|'
should align in output:$ printf "\xe0\xa4\xae\xe0\xa4\xbe\xe0\xa4\xa8\xe0\xa4\xb5\xe0\xa4\xbe\xe0\xa4\xa7\xe0\xa4\xbf\xe0\xa4\x95\xe0\xa4\xbe\xe0\xa4\xb0|\\n123456|\\n" मानवाधिकार| 123456|
- python wcwidth.wcswidth() measures width 6, while zoc measures width 10.
Sequence of language Thai (2) from midpoint of alignment failure records:
Codepoint | Python | Category | wcwidth | Name |
---|---|---|---|---|
U+0E1B | '\u0e1b' | Lo |
|
THAI CHARACTER PO PLA |
U+0E0F | '\u0e0f' | Lo |
|
THAI CHARACTER TO PATAK |
U+0E34 | '\u0e34' | Mn |
|
THAI CHARACTER SARA I |
U+0E0D | '\u0e0d' | Lo |
|
THAI CHARACTER YO YING |
U+0E0D | '\u0e0d' | Lo |
|
THAI CHARACTER YO YING |
U+0E32 | '\u0e32' | Lo |
|
THAI CHARACTER SARA AA |
U+0E2A | '\u0e2a' | Lo |
|
THAI CHARACTER SO SUA |
U+0E32 | '\u0e32' | Lo |
|
THAI CHARACTER SARA AA |
U+0E01 | '\u0e01' | Lo |
|
THAI CHARACTER KO KAI |
U+0E25 | '\u0e25' | Lo |
|
THAI CHARACTER LO LING |
U+0E27 | '\u0e27' | Lo |
|
THAI CHARACTER WO WAEN |
U+0E48 | '\u0e48' | Mn |
|
THAI CHARACTER MAI EK |
U+0E32 | '\u0e32' | Lo |
|
THAI CHARACTER SARA AA |
U+0E14 | '\u0e14' | Lo |
|
THAI CHARACTER DO DEK |
U+0E49 | '\u0e49' | Mn |
|
THAI CHARACTER MAI THO |
U+0E27 | '\u0e27' | Lo |
|
THAI CHARACTER WO WAEN |
U+0E22 | '\u0e22' | Lo |
|
THAI CHARACTER YO YAK |
U+0E2A | '\u0e2a' | Lo |
|
THAI CHARACTER SO SUA |
U+0E34 | '\u0e34' | Mn |
|
THAI CHARACTER SARA I |
U+0E17 | '\u0e17' | Lo |
|
THAI CHARACTER THO THAHAN |
U+0E18 | '\u0e18' | Lo |
|
THAI CHARACTER THO THONG |
U+0E34 | '\u0e34' | Mn |
|
THAI CHARACTER SARA I |
U+0E21 | '\u0e21' | Lo |
|
THAI CHARACTER MO MA |
U+0E19 | '\u0e19' | Lo |
|
THAI CHARACTER NO NU |
U+0E38 | '\u0e38' | Mn |
|
THAI CHARACTER SARA U |
U+0E29 | '\u0e29' | Lo |
|
THAI CHARACTER SO RUSI |
U+0E22 | '\u0e22' | Lo |
|
THAI CHARACTER YO YAK |
U+0E0A | '\u0e0a' | Lo |
|
THAI CHARACTER CHO CHANG |
U+0E19 | '\u0e19' | Lo |
|
THAI CHARACTER NO NU |
Total codepoints: 29
Shell test using printf(1),
'|'
should align in output:$ printf "\xe0\xb8\x9b\xe0\xb8\x8f\xe0\xb8\xb4\xe0\xb8\x8d\xe0\xb8\x8d\xe0\xb8\xb2\xe0\xb8\xaa\xe0\xb8\xb2\xe0\xb8\x81\xe0\xb8\xa5\xe0\xb8\xa7\xe0\xb9\x88\xe0\xb8\xb2\xe0\xb8\x94\xe0\xb9\x89\xe0\xb8\xa7\xe0\xb8\xa2\xe0\xb8\xaa\xe0\xb8\xb4\xe0\xb8\x97\xe0\xb8\x98\xe0\xb8\xb4\xe0\xb8\xa1\xe0\xb8\x99\xe0\xb8\xb8\xe0\xb8\xa9\xe0\xb8\xa2\xe0\xb8\x8a\xe0\xb8\x99|\\n12345678901234567890123|\\n" ปฏิญญาสากลว่าด้วยสิทธิมนุษยชน| 12345678901234567890123|
- python wcwidth.wcswidth() measures width 23, while zoc measures width 29.
Sequence of language Maithili from midpoint of alignment failure records:
Codepoint | Python | Category | wcwidth | Name |
---|---|---|---|---|
U+0938 | '\u0938' | Lo |
|
DEVANAGARI LETTER SA |
U+093E | '\u093e' | Mc |
|
DEVANAGARI VOWEL SIGN AA |
U+0930 | '\u0930' | Lo |
|
DEVANAGARI LETTER RA |
U+094D | '\u094d' | Mn |
|
DEVANAGARI SIGN VIRAMA |
U+0935 | '\u0935' | Lo |
|
DEVANAGARI LETTER VA |
U+092D | '\u092d' | Lo |
|
DEVANAGARI LETTER BHA |
U+094C | '\u094c' | Mc |
|
DEVANAGARI VOWEL SIGN AU |
U+092E | '\u092e' | Lo |
|
DEVANAGARI LETTER MA |
Total codepoints: 8
Shell test using printf(1),
'|'
should align in output:$ printf "\xe0\xa4\xb8\xe0\xa4\xbe\xe0\xa4\xb0\xe0\xa5\x8d\xe0\xa4\xb5\xe0\xa4\xad\xe0\xa5\x8c\xe0\xa4\xae|\\n12345|\\n" सार्वभौम| 12345|
- python wcwidth.wcswidth() measures width 5, while zoc measures width 8.
Sequence of language Thai from midpoint of alignment failure records:
Codepoint | Python | Category | wcwidth | Name |
---|---|---|---|---|
U+0E1B | '\u0e1b' | Lo |
|
THAI CHARACTER PO PLA |
U+0E0F | '\u0e0f' | Lo |
|
THAI CHARACTER TO PATAK |
U+0E34 | '\u0e34' | Mn |
|
THAI CHARACTER SARA I |
U+0E0D | '\u0e0d' | Lo |
|
THAI CHARACTER YO YING |
U+0E0D | '\u0e0d' | Lo |
|
THAI CHARACTER YO YING |
U+0E32 | '\u0e32' | Lo |
|
THAI CHARACTER SARA AA |
U+0E2A | '\u0e2a' | Lo |
|
THAI CHARACTER SO SUA |
U+0E32 | '\u0e32' | Lo |
|
THAI CHARACTER SARA AA |
U+0E01 | '\u0e01' | Lo |
|
THAI CHARACTER KO KAI |
U+0E25 | '\u0e25' | Lo |
|
THAI CHARACTER LO LING |
U+0E27 | '\u0e27' | Lo |
|
THAI CHARACTER WO WAEN |
U+0E48 | '\u0e48' | Mn |
|
THAI CHARACTER MAI EK |
U+0E32 | '\u0e32' | Lo |
|
THAI CHARACTER SARA AA |
U+0E14 | '\u0e14' | Lo |
|
THAI CHARACTER DO DEK |
U+0E49 | '\u0e49' | Mn |
|
THAI CHARACTER MAI THO |
U+0E27 | '\u0e27' | Lo |
|
THAI CHARACTER WO WAEN |
U+0E22 | '\u0e22' | Lo |
|
THAI CHARACTER YO YAK |
U+0E2A | '\u0e2a' | Lo |
|
THAI CHARACTER SO SUA |
U+0E34 | '\u0e34' | Mn |
|
THAI CHARACTER SARA I |
U+0E17 | '\u0e17' | Lo |
|
THAI CHARACTER THO THAHAN |
U+0E18 | '\u0e18' | Lo |
|
THAI CHARACTER THO THONG |
U+0E34 | '\u0e34' | Mn |
|
THAI CHARACTER SARA I |
U+0E21 | '\u0e21' | Lo |
|
THAI CHARACTER MO MA |
U+0E19 | '\u0e19' | Lo |
|
THAI CHARACTER NO NU |
U+0E38 | '\u0e38' | Mn |
|
THAI CHARACTER SARA U |
U+0E29 | '\u0e29' | Lo |
|
THAI CHARACTER SO RUSI |
U+0E22 | '\u0e22' | Lo |
|
THAI CHARACTER YO YAK |
U+0E0A | '\u0e0a' | Lo |
|
THAI CHARACTER CHO CHANG |
U+0E19 | '\u0e19' | Lo |
|
THAI CHARACTER NO NU |
Total codepoints: 29
Shell test using printf(1),
'|'
should align in output:$ printf "\xe0\xb8\x9b\xe0\xb8\x8f\xe0\xb8\xb4\xe0\xb8\x8d\xe0\xb8\x8d\xe0\xb8\xb2\xe0\xb8\xaa\xe0\xb8\xb2\xe0\xb8\x81\xe0\xb8\xa5\xe0\xb8\xa7\xe0\xb9\x88\xe0\xb8\xb2\xe0\xb8\x94\xe0\xb9\x89\xe0\xb8\xa7\xe0\xb8\xa2\xe0\xb8\xaa\xe0\xb8\xb4\xe0\xb8\x97\xe0\xb8\x98\xe0\xb8\xb4\xe0\xb8\xa1\xe0\xb8\x99\xe0\xb8\xb8\xe0\xb8\xa9\xe0\xb8\xa2\xe0\xb8\x8a\xe0\xb8\x99|\\n12345678901234567890123|\\n" ปฏิญญาสากลว่าด้วยสิทธิมนุษยชน| 12345678901234567890123|
- python wcwidth.wcswidth() measures width 23, while zoc measures width 29.
Sequence of language Magahi from midpoint of alignment failure records:
Codepoint | Python | Category | wcwidth | Name |
---|---|---|---|---|
U+092E | '\u092e' | Lo |
|
DEVANAGARI LETTER MA |
U+093E | '\u093e' | Mc |
|
DEVANAGARI VOWEL SIGN AA |
U+0928 | '\u0928' | Lo |
|
DEVANAGARI LETTER NA |
U+0935 | '\u0935' | Lo |
|
DEVANAGARI LETTER VA |
U+093E | '\u093e' | Mc |
|
DEVANAGARI VOWEL SIGN AA |
U+0927 | '\u0927' | Lo |
|
DEVANAGARI LETTER DHA |
U+093F | '\u093f' | Mc |
|
DEVANAGARI VOWEL SIGN I |
U+0915 | '\u0915' | Lo |
|
DEVANAGARI LETTER KA |
U+093E | '\u093e' | Mc |
|
DEVANAGARI VOWEL SIGN AA |
U+0930 | '\u0930' | Lo |
|
DEVANAGARI LETTER RA |
Total codepoints: 10
Shell test using printf(1),
'|'
should align in output:$ printf "\xe0\xa4\xae\xe0\xa4\xbe\xe0\xa4\xa8\xe0\xa4\xb5\xe0\xa4\xbe\xe0\xa4\xa7\xe0\xa4\xbf\xe0\xa4\x95\xe0\xa4\xbe\xe0\xa4\xb0|\\n123456|\\n" मानवाधिकार| 123456|
- python wcwidth.wcswidth() measures width 6, while zoc measures width 10.
Sequence of language Vietnamese from midpoint of alignment failure records:
Codepoint | Python | Category | wcwidth | Name |
---|---|---|---|---|
U+0074 | 't' | Ll |
|
LATIN SMALL LETTER T |
U+006F | 'o' | Ll |
|
LATIN SMALL LETTER O |
U+0061 | 'a' | Ll |
|
LATIN SMALL LETTER A |
U+0300 | '\u0300' | Mn |
|
COMBINING GRAVE ACCENT |
U+006E | 'n' | Ll |
|
LATIN SMALL LETTER N |
Total codepoints: 5
Shell test using printf(1),
'|'
should align in output:$ printf "toa\xcc\x80n|\\n1234|\\n" toàn| 1234|
- python wcwidth.wcswidth() measures width 4, while zoc measures width 5.
Sequence of language Tagalog (Tagalog) from midpoint of alignment failure records:
Codepoint | Python | Category | wcwidth | Name |
---|---|---|---|---|
U+170E | '\u170e' | Lo |
|
TAGALOG LETTER LA |
U+1711 | '\u1711' | Lo |
|
TAGALOG LETTER HA |
U+1706 | '\u1706' | Lo |
|
TAGALOG LETTER TA |
U+1714 | '\u1714' | Mn |
|
TAGALOG SIGN VIRAMA |
Total codepoints: 4
Shell test using printf(1),
'|'
should align in output:$ printf "\xe1\x9c\x8e\xe1\x9c\x91\xe1\x9c\x86\xe1\x9c\x94|\\n123|\\n" ᜎᜑᜆ᜔| 123|
- python wcwidth.wcswidth() measures width 3, while zoc measures width 4.
Sequence of language Lao from midpoint of alignment failure records:
Codepoint | Python | Category | wcwidth | Name |
---|---|---|---|---|
U+0E9B | '\u0e9b' | Lo |
|
LAO LETTER PO |
U+0EB0 | '\u0eb0' | Lo |
|
LAO VOWEL SIGN A |
U+0E81 | '\u0e81' | Lo |
|
LAO LETTER KO |
U+0EB2 | '\u0eb2' | Lo |
|
LAO VOWEL SIGN AA |
U+0E94 | '\u0e94' | Lo |
|
LAO LETTER DO |
U+0EAA | '\u0eaa' | Lo |
|
LAO LETTER SO SUNG |
U+0EB2 | '\u0eb2' | Lo |
|
LAO VOWEL SIGN AA |
U+0E81 | '\u0e81' | Lo |
|
LAO LETTER KO |
U+0EBB | '\u0ebb' | Mn |
|
LAO VOWEL SIGN MAI KON |
U+0E99 | '\u0e99' | Lo |
|
LAO LETTER NO |
Total codepoints: 10
Shell test using printf(1),
'|'
should align in output:$ printf "\xe0\xba\x9b\xe0\xba\xb0\xe0\xba\x81\xe0\xba\xb2\xe0\xba\x94\xe0\xba\xaa\xe0\xba\xb2\xe0\xba\x81\xe0\xba\xbb\xe0\xba\x99|\\n123456789|\\n" ປະກາດສາກົນ| 123456789|
- python wcwidth.wcswidth() measures width 9, while zoc measures width 10.
Sequence of language Lingala (tones) from midpoint of alignment failure records:
Codepoint | Python | Category | wcwidth | Name |
---|---|---|---|---|
U+004D | 'M' | Lu |
|
LATIN CAPITAL LETTER M |
U+004F | 'O' | Lu |
|
LATIN CAPITAL LETTER O |
U+004C | 'L' | Lu |
|
LATIN CAPITAL LETTER L |
U+0186 | '\u0186' | Lu |
|
LATIN CAPITAL LETTER OPEN O |
U+0301 | '\u0301' | Mn |
|
COMBINING ACUTE ACCENT |
U+004E | 'N' | Lu |
|
LATIN CAPITAL LETTER N |
U+0047 | 'G' | Lu |
|
LATIN CAPITAL LETTER G |
U+0186 | '\u0186' | Lu |
|
LATIN CAPITAL LETTER OPEN O |
U+0301 | '\u0301' | Mn |
|
COMBINING ACUTE ACCENT |
Total codepoints: 9
Shell test using printf(1),
'|'
should align in output:$ printf "MOL\xc6\x86\xcc\x81NG\xc6\x86\xcc\x81|\\n1234567|\\n" MOLƆ́NGƆ́| 1234567|
- python wcwidth.wcswidth() measures width 7, while zoc measures width 9.
Sequence of language Vietnamese (Han nom) from midpoint of alignment failure records:
Codepoint | Python | Category | wcwidth | Name |
---|---|---|---|---|
U+0002321C | '\U0002321c' | Lo |
|
CJK UNIFIED IDEOGRAPH-2321C |
U+0031 | '1' | Nd |
|
DIGIT ONE |
U+0030 | '0' | Nd |
|
DIGIT ZERO |
U+00023383 | '\U00023383' | Lo |
|
CJK UNIFIED IDEOGRAPH-23383 |
U+0031 | '1' | Nd |
|
DIGIT ONE |
U+0032 | '2' | Nd |
|
DIGIT TWO |
U+000221A5 | '\U000221a5' | Lo |
|
CJK UNIFIED IDEOGRAPH-221A5 |
U+0031 | '1' | Nd |
|
DIGIT ONE |
U+0039 | '9' | Nd |
|
DIGIT NINE |
U+0034 | '4' | Nd |
|
DIGIT FOUR |
U+0038 | '8' | Nd |
|
DIGIT EIGHT |
Total codepoints: 11
Shell test using printf(1),
'|'
should align in output:$ printf "\xf0\xa3\x88\x9c10\xf0\xa3\x8e\x8312\xf0\xa2\x86\xa51948|\\n12345678901234|\\n" 𣈜10𣎃12𢆥1948| 12345678901234|
- python wcwidth.wcswidth() measures width 14, while zoc measures width 13.
Sequence of language Pular (Adlam) from midpoint of alignment failure records:
Codepoint | Python | Category | wcwidth | Name |
---|---|---|---|---|
U+0001E916 | '\U0001e916' | Lu |
|
ADLAM CAPITAL LETTER HA |
U+0001E90B | '\U0001e90b' | Lu |
|
ADLAM CAPITAL LETTER I |
U+0001E902 | '\U0001e902' | Lu |
|
ADLAM CAPITAL LETTER LAAM |
U+0001E946 | '\U0001e946' | Mn |
|
ADLAM GEMINATION MARK |
U+0001E900 | '\U0001e900' | Lu |
|
ADLAM CAPITAL LETTER ALIF |
U+0001E912 | '\U0001e912' | Lu |
|
ADLAM CAPITAL LETTER YA |
U+0001E900 | '\U0001e900' | Lu |
|
ADLAM CAPITAL LETTER ALIF |
U+0001E910 | '\U0001e910' | Lu |
|
ADLAM CAPITAL LETTER NUN |
U+0001E911 | '\U0001e911' | Lu |
|
ADLAM CAPITAL LETTER KAF |
U+0001E90C | '\U0001e90c' | Lu |
|
ADLAM CAPITAL LETTER O |
U+0001E945 | '\U0001e945' | Mn |
|
ADLAM VOWEL LENGTHENER |
U+0001E908 | '\U0001e908' | Lu |
|
ADLAM CAPITAL LETTER RA |
U+0001E909 | '\U0001e909' | Lu |
|
ADLAM CAPITAL LETTER E |
Total codepoints: 13
Shell test using printf(1),
'|'
should align in output:$ printf "\xf0\x9e\xa4\x96\xf0\x9e\xa4\x8b\xf0\x9e\xa4\x82\xf0\x9e\xa5\x86\xf0\x9e\xa4\x80\xf0\x9e\xa4\x92\xf0\x9e\xa4\x80\xf0\x9e\xa4\x90\xf0\x9e\xa4\x91\xf0\x9e\xa4\x8c\xf0\x9e\xa5\x85\xf0\x9e\xa4\x88\xf0\x9e\xa4\x89|\\n12345678901|\\n" 𞤖𞤋𞤂𞥆𞤀𞤒𞤀𞤐𞤑𞤌𞥅𞤈𞤉| 12345678901|
- python wcwidth.wcswidth() measures width 11, while zoc measures width 13.
Sequence of language Yiddish, Eastern from midpoint of alignment failure records:
Codepoint | Python | Category | wcwidth | Name |
---|---|---|---|---|
U+05D0 | '\u05d0' | Lo |
|
HEBREW LETTER ALEF |
U+05B7 | '\u05b7' | Mn |
|
HEBREW POINT PATAH |
U+05DC | '\u05dc' | Lo |
|
HEBREW LETTER LAMED |
U+05F0 | '\u05f0' | Lo |
|
HEBREW LIGATURE YIDDISH DOUBLE VAV |
U+05E2 | '\u05e2' | Lo |
|
HEBREW LETTER AYIN |
U+05DC | '\u05dc' | Lo |
|
HEBREW LETTER LAMED |
U+05D8 | '\u05d8' | Lo |
|
HEBREW LETTER TET |
U+05DC | '\u05dc' | Lo |
|
HEBREW LETTER LAMED |
U+05E2 | '\u05e2' | Lo |
|
HEBREW LETTER AYIN |
U+05DB | '\u05db' | Lo |
|
HEBREW LETTER KAF |
U+05E2 | '\u05e2' | Lo |
|
HEBREW LETTER AYIN |
Total codepoints: 11
Shell test using printf(1),
'|'
should align in output:$ printf "\xd7\x90\xd6\xb7\xd7\x9c\xd7\xb0\xd7\xa2\xd7\x9c\xd7\x98\xd7\x9c\xd7\xa2\xd7\x9b\xd7\xa2|\\n1234567890|\\n" אַלװעלטלעכע| 1234567890|
- python wcwidth.wcswidth() measures width 10, while zoc measures width 11.
Sequence of language Bamun from midpoint of alignment failure records:
Codepoint | Python | Category | wcwidth | Name |
---|---|---|---|---|
U+004E | 'N' | Lu |
|
LATIN CAPITAL LETTER N |
U+004A | 'J' | Lu |
|
LATIN CAPITAL LETTER J |
U+0055 | 'U' | Lu |
|
LATIN CAPITAL LETTER U |
U+0301 | '\u0301' | Mn |
|
COMBINING ACUTE ACCENT |
Total codepoints: 4
Shell test using printf(1),
'|'
should align in output:$ printf "NJU\xcc\x81|\\n123|\\n" NJÚ| 123|
- python wcwidth.wcswidth() measures width 3, while zoc measures width 4.
Sequence of language Orok from midpoint of alignment failure records:
Codepoint | Python | Category | wcwidth | Name |
---|---|---|---|---|
U+0427 | '\u0427' | Lu |
|
CYRILLIC CAPITAL LETTER CHE |
U+0438 | '\u0438' | Ll |
|
CYRILLIC SMALL LETTER I |
U+043F | '\u043f' | Ll |
|
CYRILLIC SMALL LETTER PE |
U+0430 | '\u0430' | Ll |
|
CYRILLIC SMALL LETTER A |
U+0304 | '\u0304' | Mn |
|
COMBINING MACRON |
U+043B | '\u043b' | Ll |
|
CYRILLIC SMALL LETTER EL |
U+0438 | '\u0438' | Ll |
|
CYRILLIC SMALL LETTER I |
U+043D | '\u043d' | Ll |
|
CYRILLIC SMALL LETTER EN |
U+043D | '\u043d' | Ll |
|
CYRILLIC SMALL LETTER EN |
U+0435 | '\u0435' | Ll |
|
CYRILLIC SMALL LETTER IE |
U+0304 | '\u0304' | Mn |
|
COMBINING MACRON |
U+0441 | '\u0441' | Ll |
|
CYRILLIC SMALL LETTER ES |
U+0430 | '\u0430' | Ll |
|
CYRILLIC SMALL LETTER A |
U+043B | '\u043b' | Ll |
|
CYRILLIC SMALL LETTER EL |
Total codepoints: 14
Shell test using printf(1),
'|'
should align in output:$ printf "\xd0\xa7\xd0\xb8\xd0\xbf\xd0\xb0\xcc\x84\xd0\xbb\xd0\xb8\xd0\xbd\xd0\xbd\xd0\xb5\xcc\x84\xd1\x81\xd0\xb0\xd0\xbb|\\n123456789012|\\n" Чипа̄линне̄сал| 123456789012|
- python wcwidth.wcswidth() measures width 12, while zoc measures width 14.
Sequence of language Tem from midpoint of alignment failure records:
Codepoint | Python | Category | wcwidth | Name |
---|---|---|---|---|
U+0196 | '\u0196' | Lu |
|
LATIN CAPITAL LETTER IOTA |
U+0072 | 'r' | Ll |
|
LATIN SMALL LETTER R |
U+028A | '\u028a' | Ll |
|
LATIN SMALL LETTER UPSILON |
U+0301 | '\u0301' | Mn |
|
COMBINING ACUTE ACCENT |
U+002D | '-' | Pd |
|
HYPHEN-MINUS |
U+0064 | 'd' | Ll |
|
LATIN SMALL LETTER D |
U+025B | '\u025b' | Ll |
|
LATIN SMALL LETTER OPEN E |
U+0301 | '\u0301' | Mn |
|
COMBINING ACUTE ACCENT |
U+025B | '\u025b' | Ll |
|
LATIN SMALL LETTER OPEN E |
Total codepoints: 9
Shell test using printf(1),
'|'
should align in output:$ printf "\xc6\x96r\xca\x8a\xcc\x81-d\xc9\x9b\xcc\x81\xc9\x9b|\\n1234567|\\n" Ɩrʊ́-dɛ́ɛ| 1234567|
- python wcwidth.wcswidth() measures width 7, while zoc measures width 9.
Sequence of language Nanai from midpoint of alignment failure records:
Codepoint | Python | Category | wcwidth | Name |
---|---|---|---|---|
U+041D | '\u041d' | Lu |
|
CYRILLIC CAPITAL LETTER EN |
U+0430 | '\u0430' | Ll |
|
CYRILLIC SMALL LETTER A |
U+0438 | '\u0438' | Ll |
|
CYRILLIC SMALL LETTER I |
U+0306 | '\u0306' | Mn |
|
COMBINING BREVE |
Total codepoints: 4
Shell test using printf(1),
'|'
should align in output:$ printf "\xd0\x9d\xd0\xb0\xd0\xb8\xcc\x86|\\n123|\\n" Най| 123|
- python wcwidth.wcswidth() measures width 3, while zoc measures width 4.
Sequence of language Evenki from midpoint of alignment failure records:
Codepoint | Python | Category | wcwidth | Name |
---|---|---|---|---|
U+0411 | '\u0411' | Lu |
|
CYRILLIC CAPITAL LETTER BE |
U+0443 | '\u0443' | Ll |
|
CYRILLIC SMALL LETTER U |
U+0433 | '\u0433' | Ll |
|
CYRILLIC SMALL LETTER GHE |
U+0430 | '\u0430' | Ll |
|
CYRILLIC SMALL LETTER A |
U+0304 | '\u0304' | Mn |
|
COMBINING MACRON |
U+0434 | '\u0434' | Ll |
|
CYRILLIC SMALL LETTER DE |
U+0443 | '\u0443' | Ll |
|
CYRILLIC SMALL LETTER U |
Total codepoints: 7
Shell test using printf(1),
'|'
should align in output:$ printf "\xd0\x91\xd1\x83\xd0\xb3\xd0\xb0\xcc\x84\xd0\xb4\xd1\x83|\\n123456|\\n" Буга̄ду| 123456|
- python wcwidth.wcswidth() measures width 6, while zoc measures width 7.
Sequence of language Yaneshaʼ from midpoint of alignment failure records:
Codepoint | Python | Category | wcwidth | Name |
---|---|---|---|---|
U+0303 | '\u0303' | Mn |
|
COMBINING TILDE |
U+0061 | 'a' | Ll |
|
LATIN SMALL LETTER A |
U+006C | 'l' | Ll |
|
LATIN SMALL LETTER L |
U+006C | 'l' | Ll |
|
LATIN SMALL LETTER L |
U+006F | 'o' | Ll |
|
LATIN SMALL LETTER O |
U+0068 | 'h' | Ll |
|
LATIN SMALL LETTER H |
U+0075 | 'u' | Ll |
|
LATIN SMALL LETTER U |
U+0065 | 'e' | Ll |
|
LATIN SMALL LETTER E |
U+006E | 'n' | Ll |
|
LATIN SMALL LETTER N |
Total codepoints: 9
Shell test using printf(1),
'|'
should align in output:$ printf "\xcc\x83allohuen|\\n12345678|\\n" ̃allohuen| 12345678|
- python wcwidth.wcswidth() measures width 8, while zoc measures width 9.
Sequence of language Ticuna from midpoint of alignment failure records:
Codepoint | Python | Category | wcwidth | Name |
---|---|---|---|---|
U+004E | 'N' | Lu |
|
LATIN CAPITAL LETTER N |
U+00FC | '\xfc' | Ll |
|
LATIN SMALL LETTER U WITH DIAERESIS |
U+0078 | 'x' | Ll |
|
LATIN SMALL LETTER X |
U+00FC | '\xfc' | Ll |
|
LATIN SMALL LETTER U WITH DIAERESIS |
U+0303 | '\u0303' | Mn |
|
COMBINING TILDE |
Total codepoints: 5
Shell test using printf(1),
'|'
should align in output:$ printf "N\xc3\xbcx\xc3\xbc\xcc\x83|\\n1234|\\n" Nüxü̃| 1234|
- python wcwidth.wcswidth() measures width 4, while zoc measures width 5.
Sequence of language Amarakaeri from midpoint of alignment failure records:
Codepoint | Python | Category | wcwidth | Name |
---|---|---|---|---|
U+006F | 'o' | Ll |
|
LATIN SMALL LETTER O |
U+0027 | "'" | Po |
|
APOSTROPHE |
U+006E | 'n' | Ll |
|
LATIN SMALL LETTER N |
U+006F | 'o' | Ll |
|
LATIN SMALL LETTER O |
U+0070 | 'p' | Ll |
|
LATIN SMALL LETTER P |
U+006F | 'o' | Ll |
|
LATIN SMALL LETTER O |
U+0065 | 'e' | Ll |
|
LATIN SMALL LETTER E |
U+0331 | '\u0331' | Mn |
|
COMBINING MACRON BELOW |
U+0070 | 'p' | Ll |
|
LATIN SMALL LETTER P |
U+006F | 'o' | Ll |
|
LATIN SMALL LETTER O |
Total codepoints: 10
Shell test using printf(1),
'|'
should align in output:$ printf "o'nopoe\xcc\xb1po|\\n123456789|\\n" o'nopoe̱po| 123456789|
- python wcwidth.wcswidth() measures width 9, while zoc measures width 10.
Sequence of language South Azerbaijani from midpoint of alignment failure records:
Codepoint | Python | Category | wcwidth | Name |
---|---|---|---|---|
U+0049 | 'I' | Lu |
|
LATIN CAPITAL LETTER I |
U+0307 | '\u0307' | Mn |
|
COMBINING DOT ABOVE |
U+004E | 'N' | Lu |
|
LATIN CAPITAL LETTER N |
U+0053 | 'S' | Lu |
|
LATIN CAPITAL LETTER S |
U+0041 | 'A' | Lu |
|
LATIN CAPITAL LETTER A |
U+004E | 'N' | Lu |
|
LATIN CAPITAL LETTER N |
Total codepoints: 6
Shell test using printf(1),
'|'
should align in output:$ printf "I\xcc\x87NSAN|\\n12345|\\n" İNSAN| 12345|
- python wcwidth.wcswidth() measures width 5, while zoc measures width 6.
Sequence of language Yoruba from midpoint of alignment failure records:
Codepoint | Python | Category | wcwidth | Name |
---|---|---|---|---|
U+1EB8 | '\u1eb8' | Lu |
|
LATIN CAPITAL LETTER E WITH DOT BELOW |
U+0300 | '\u0300' | Mn |
|
COMBINING GRAVE ACCENT |
U+0054 | 'T' | Lu |
|
LATIN CAPITAL LETTER T |
U+1ECC | '\u1ecc' | Lu |
|
LATIN CAPITAL LETTER O WITH DOT BELOW |
U+0301 | '\u0301' | Mn |
|
COMBINING ACUTE ACCENT |
Total codepoints: 5
Shell test using printf(1),
'|'
should align in output:$ printf "\xe1\xba\xb8\xcc\x80T\xe1\xbb\x8c\xcc\x81|\\n123|\\n" Ẹ̀TỌ́| 123|
- python wcwidth.wcswidth() measures width 3, while zoc measures width 5.
Sequence of language Chickasaw from midpoint of alignment failure records:
Codepoint | Python | Category | wcwidth | Name |
---|---|---|---|---|
U+004D | 'M' | Lu |
|
LATIN CAPITAL LETTER M |
U+00F3 | '\xf3' | Ll |
|
LATIN SMALL LETTER O WITH ACUTE |
U+0331 | '\u0331' | Mn |
|
COMBINING MACRON BELOW |
U+006D | 'm' | Ll |
|
LATIN SMALL LETTER M |
U+0061 | 'a' | Ll |
|
LATIN SMALL LETTER A |
Total codepoints: 5
Shell test using printf(1),
'|'
should align in output:$ printf "M\xc3\xb3\xcc\xb1ma|\\n1234|\\n" Mó̱ma| 1234|
- python wcwidth.wcswidth() measures width 4, while zoc measures width 5.
Sequence of language Siona from midpoint of alignment failure records:
Codepoint | Python | Category | wcwidth | Name |
---|---|---|---|---|
U+0067 | 'g' | Ll |
|
LATIN SMALL LETTER G |
U+0075 | 'u' | Ll |
|
LATIN SMALL LETTER U |
U+00EB | '\xeb' | Ll |
|
LATIN SMALL LETTER E WITH DIAERESIS |
U+0331 | '\u0331' | Mn |
|
COMBINING MACRON BELOW |
U+006E | 'n' | Ll |
|
LATIN SMALL LETTER N |
U+0061 | 'a' | Ll |
|
LATIN SMALL LETTER A |
Total codepoints: 6
Shell test using printf(1),
'|'
should align in output:$ printf "gu\xc3\xab\xcc\xb1na|\\n12345|\\n" guë̱na| 12345|
- python wcwidth.wcswidth() measures width 5, while zoc measures width 6.
Sequence of language Fur from midpoint of alignment failure records:
Codepoint | Python | Category | wcwidth | Name |
---|---|---|---|---|
U+0044 | 'D' | Lu |
|
LATIN CAPITAL LETTER D |
U+00E1 | '\xe1' | Ll |
|
LATIN SMALL LETTER A WITH ACUTE |
U+0331 | '\u0331' | Mn |
|
COMBINING MACRON BELOW |
U+006C | 'l' | Ll |
|
LATIN SMALL LETTER L |
U+0064 | 'd' | Ll |
|
LATIN SMALL LETTER D |
U+0268 | '\u0268' | Ll |
|
LATIN SMALL LETTER I WITH STROKE |
U+0301 | '\u0301' | Mn |
|
COMBINING ACUTE ACCENT |
U+014B | '\u014b' | Ll |
|
LATIN SMALL LETTER ENG |
U+00E1 | '\xe1' | Ll |
|
LATIN SMALL LETTER A WITH ACUTE |
U+A78C | '\ua78c' | Ll |
|
LATIN SMALL LETTER SALTILLO |
U+014B | '\u014b' | Ll |
|
LATIN SMALL LETTER ENG |
Total codepoints: 11
Shell test using printf(1),
'|'
should align in output:$ printf "D\xc3\xa1\xcc\xb1ld\xc9\xa8\xcc\x81\xc5\x8b\xc3\xa1\xea\x9e\x8c\xc5\x8b|\\n123456789|\\n" Dá̱ldɨ́ŋáꞌŋ| 123456789|
- python wcwidth.wcswidth() measures width 9, while zoc measures width 11.
Sequence of language Chinantec, Chiltepec from midpoint of alignment failure records:
Codepoint | Python | Category | wcwidth | Name |
---|---|---|---|---|
U+006D | 'm' | Ll |
|
LATIN SMALL LETTER M |
U+0061 | 'a' | Ll |
|
LATIN SMALL LETTER A |
U+006B | 'k' | Ll |
|
LATIN SMALL LETTER K |
U+0061 | 'a' | Ll |
|
LATIN SMALL LETTER A |
U+006C | 'l' | Ll |
|
LATIN SMALL LETTER L |
U+006F | 'o' | Ll |
|
LATIN SMALL LETTER O |
U+006F | 'o' | Ll |
|
LATIN SMALL LETTER O |
U+0331 | '\u0331' | Mn |
|
COMBINING MACRON BELOW |
Total codepoints: 8
Shell test using printf(1),
'|'
should align in output:$ printf "makaloo\xcc\xb1|\\n1234567|\\n" makaloo̱| 1234567|
- python wcwidth.wcswidth() measures width 7, while zoc measures width 8.
Sequence of language Gumuz from midpoint of alignment failure records:
Codepoint | Python | Category | wcwidth | Name |
---|---|---|---|---|
U+006D | 'm' | Ll |
|
LATIN SMALL LETTER M |
U+0061 | 'a' | Ll |
|
LATIN SMALL LETTER A |
U+0067 | 'g' | Ll |
|
LATIN SMALL LETTER G |
U+0061 | 'a' | Ll |
|
LATIN SMALL LETTER A |
U+0063 | 'c' | Ll |
|
LATIN SMALL LETTER C |
U+0327 | '\u0327' | Mn |
|
COMBINING CEDILLA |
Total codepoints: 6
Shell test using printf(1),
'|'
should align in output:$ printf "magac\xcc\xa7|\\n12345|\\n" magaç| 12345|
- python wcwidth.wcswidth() measures width 5, while zoc measures width 6.
Sequence of language Bora from midpoint of alignment failure records:
Codepoint | Python | Category | wcwidth | Name |
---|---|---|---|---|
U+006D | 'm' | Ll |
|
LATIN SMALL LETTER M |
U+0268 | '\u0268' | Ll |
|
LATIN SMALL LETTER I WITH STROKE |
U+0301 | '\u0301' | Mn |
|
COMBINING ACUTE ACCENT |
U+0061 | 'a' | Ll |
|
LATIN SMALL LETTER A |
U+006D | 'm' | Ll |
|
LATIN SMALL LETTER M |
U+00FA | '\xfa' | Ll |
|
LATIN SMALL LETTER U WITH ACUTE |
U+006E | 'n' | Ll |
|
LATIN SMALL LETTER N |
U+0061 | 'a' | Ll |
|
LATIN SMALL LETTER A |
U+0061 | 'a' | Ll |
|
LATIN SMALL LETTER A |
Total codepoints: 9
Shell test using printf(1),
'|'
should align in output:$ printf "m\xc9\xa8\xcc\x81am\xc3\xbanaa|\\n12345678|\\n" mɨ́amúnaa| 12345678|
- python wcwidth.wcswidth() measures width 8, while zoc measures width 9.
Sequence of language Mòoré from midpoint of alignment failure records:
Codepoint | Python | Category | wcwidth | Name |
---|---|---|---|---|
U+0073 | 's' | Ll |
|
LATIN SMALL LETTER S |
U+0065 | 'e' | Ll |
|
LATIN SMALL LETTER E |
U+0303 | '\u0303' | Mn |
|
COMBINING TILDE |
U+006E | 'n' | Ll |
|
LATIN SMALL LETTER N |
Total codepoints: 4
Shell test using printf(1),
'|'
should align in output:$ printf "se\xcc\x83n|\\n123|\\n" sẽn| 123|
- python wcwidth.wcswidth() measures width 3, while zoc measures width 4.
Sequence of language Mongolian, Halh (Mongolian) from midpoint of alignment failure records:
Codepoint | Python | Category | wcwidth | Name |
---|---|---|---|---|
U+1828 | '\u1828' | Lo |
|
MONGOLIAN LETTER NA |
U+1821 | '\u1821' | Lo |
|
MONGOLIAN LETTER E |
U+1837 | '\u1837' | Lo |
|
MONGOLIAN LETTER RA |
U+180E | '\u180e' | Cf |
|
MONGOLIAN VOWEL SEPARATOR |
U+1821 | '\u1821' | Lo |
|
MONGOLIAN LETTER E |
Total codepoints: 5
Shell test using printf(1),
'|'
should align in output:$ printf "\xe1\xa0\xa8\xe1\xa0\xa1\xe1\xa0\xb7\xe1\xa0\x8e\xe1\xa0\xa1|\\n1234|\\n" ᠨᠡᠷᠡ| 1234|
- python wcwidth.wcswidth() measures width 4, while zoc measures width 5.
Sequence of language Lamnso' from midpoint of alignment failure records:
Codepoint | Python | Category | wcwidth | Name |
---|---|---|---|---|
U+0064 | 'd' | Ll |
|
LATIN SMALL LETTER D |
U+007A | 'z' | Ll |
|
LATIN SMALL LETTER Z |
U+0259 | '\u0259' | Ll |
|
LATIN SMALL LETTER SCHWA |
U+0259 | '\u0259' | Ll |
|
LATIN SMALL LETTER SCHWA |
U+0300 | '\u0300' | Mn |
|
COMBINING GRAVE ACCENT |
U+006E | 'n' | Ll |
|
LATIN SMALL LETTER N |
Total codepoints: 6
Shell test using printf(1),
'|'
should align in output:$ printf "dz\xc9\x99\xc9\x99\xcc\x80n|\\n12345|\\n" dzəə̀n| 12345|
- python wcwidth.wcswidth() measures width 5, while zoc measures width 6.
Sequence of language Navajo from midpoint of alignment failure records:
Codepoint | Python | Category | wcwidth | Name |
---|---|---|---|---|
U+0042 | 'B' | Lu |
|
LATIN CAPITAL LETTER B |
U+0065 | 'e' | Ll |
|
LATIN SMALL LETTER E |
U+0065 | 'e' | Ll |
|
LATIN SMALL LETTER E |
U+0068 | 'h' | Ll |
|
LATIN SMALL LETTER H |
U+0061 | 'a' | Ll |
|
LATIN SMALL LETTER A |
U+007A | 'z' | Ll |
|
LATIN SMALL LETTER Z |
U+0105 | '\u0105' | Ll |
|
LATIN SMALL LETTER A WITH OGONEK |
U+0301 | '\u0301' | Mn |
|
COMBINING ACUTE ACCENT |
U+0105 | '\u0105' | Ll |
|
LATIN SMALL LETTER A WITH OGONEK |
Total codepoints: 9
Shell test using printf(1),
'|'
should align in output:$ printf "Beehaz\xc4\x85\xcc\x81\xc4\x85|\\n12345678|\\n" Beehazą́ą| 12345678|
- python wcwidth.wcswidth() measures width 8, while zoc measures width 9.
Sequence of language Tamazight, Central Atlas from midpoint of alignment failure records:
Codepoint | Python | Category | wcwidth | Name |
---|---|---|---|---|
U+0054 | 'T' | Lu |
|
LATIN CAPITAL LETTER T |
U+0049 | 'I' | Lu |
|
LATIN CAPITAL LETTER I |
U+0053 | 'S' | Lu |
|
LATIN CAPITAL LETTER S |
U+0323 | '\u0323' | Mn |
|
COMBINING DOT BELOW |
U+0045 | 'E' | Lu |
|
LATIN CAPITAL LETTER E |
U+0052 | 'R' | Lu |
|
LATIN CAPITAL LETTER R |
U+0052 | 'R' | Lu |
|
LATIN CAPITAL LETTER R |
U+0049 | 'I' | Lu |
|
LATIN CAPITAL LETTER I |
U+0048 | 'H' | Lu |
|
LATIN CAPITAL LETTER H |
U+0323 | '\u0323' | Mn |
|
COMBINING DOT BELOW |
U+0054 | 'T' | Lu |
|
LATIN CAPITAL LETTER T |
Total codepoints: 11
Shell test using printf(1),
'|'
should align in output:$ printf "TIS\xcc\xa3ERRIH\xcc\xa3T|\\n123456789|\\n" TIṢERRIḤT| 123456789|
- python wcwidth.wcswidth() measures width 9, while zoc measures width 11.
Sequence of language Gilyak from midpoint of alignment failure records:
Codepoint | Python | Category | wcwidth | Name |
---|---|---|---|---|
U+043D | '\u043d' | Ll |
|
CYRILLIC SMALL LETTER EN |
U+0430 | '\u0430' | Ll |
|
CYRILLIC SMALL LETTER A |
U+043C | '\u043c' | Ll |
|
CYRILLIC SMALL LETTER EM |
U+0430 | '\u0430' | Ll |
|
CYRILLIC SMALL LETTER A |
U+0434 | '\u0434' | Ll |
|
CYRILLIC SMALL LETTER DE |
U+0438 | '\u0438' | Ll |
|
CYRILLIC SMALL LETTER I |
U+0432 | '\u0432' | Ll |
|
CYRILLIC SMALL LETTER VE |
U+04CA | '\u04ca' | Ll |
|
CYRILLIC SMALL LETTER EN WITH TAIL |
U+0447 | '\u0447' | Ll |
|
CYRILLIC SMALL LETTER CHE |
U+043E | '\u043e' | Ll |
|
CYRILLIC SMALL LETTER O |
U+0493 | '\u0493' | Ll |
|
CYRILLIC SMALL LETTER GHE WITH STROKE |
U+0440 | '\u0440' | Ll |
|
CYRILLIC SMALL LETTER ER |
U+030C | '\u030c' | Mn |
|
COMBINING CARON |
Total codepoints: 13
Shell test using printf(1),
'|'
should align in output:$ printf "\xd0\xbd\xd0\xb0\xd0\xbc\xd0\xb0\xd0\xb4\xd0\xb8\xd0\xb2\xd3\x8a\xd1\x87\xd0\xbe\xd2\x93\xd1\x80\xcc\x8c|\\n123456789012|\\n" намадивӊчоғр̌| 123456789012|
- python wcwidth.wcswidth() measures width 12, while zoc measures width 13.
Sequence of language Ditammari from midpoint of alignment failure records:
Codepoint | Python | Category | wcwidth | Name |
---|---|---|---|---|
U+006D | 'm' | Ll |
|
LATIN SMALL LETTER M |
U+0075 | 'u' | Ll |
|
LATIN SMALL LETTER U |
U+0077 | 'w' | Ll |
|
LATIN SMALL LETTER W |
U+025B | '\u025b' | Ll |
|
LATIN SMALL LETTER OPEN E |
U+0303 | '\u0303' | Mn |
|
COMBINING TILDE |
U+0072 | 'r' | Ll |
|
LATIN SMALL LETTER R |
U+0069 | 'i' | Ll |
|
LATIN SMALL LETTER I |
U+006D | 'm' | Ll |
|
LATIN SMALL LETTER M |
U+0075 | 'u' | Ll |
|
LATIN SMALL LETTER U |
Total codepoints: 9
Shell test using printf(1),
'|'
should align in output:$ printf "muw\xc9\x9b\xcc\x83rimu|\\n12345678|\\n" muwɛ̃rimu| 12345678|
- python wcwidth.wcswidth() measures width 8, while zoc measures width 9.
Sequence of language Assyrian Neo-Aramaic from midpoint of alignment failure records:
Codepoint | Python | Category | wcwidth | Name |
---|---|---|---|---|
U+072C | '\u072c' | Lo |
|
SYRIAC LETTER TAW |
U+071D | '\u071d' | Lo |
|
SYRIAC LETTER YUDH |
U+0712 | '\u0712' | Lo |
|
SYRIAC LETTER BETH |
U+0742 | '\u0742' | Mn |
|
SYRIAC RUKKAKHA |
U+0720 | '\u0720' | Lo |
|
SYRIAC LETTER LAMADH |
U+071D | '\u071d' | Lo |
|
SYRIAC LETTER YUDH |
U+0710 | '\u0710' | Lo |
|
SYRIAC LETTER ALAPH |
Total codepoints: 7
Shell test using printf(1),
'|'
should align in output:$ printf "\xdc\xac\xdc\x9d\xdc\x92\xdd\x82\xdc\xa0\xdc\x9d\xdc\x90|\\n123456|\\n" ܬܝܒ݂ܠܝܐ| 123456|
- python wcwidth.wcswidth() measures width 6, while zoc measures width 7.
Sequence of language Farsi, Western from midpoint of alignment failure records:
Codepoint | Python | Category | wcwidth | Name |
---|---|---|---|---|
U+06A9 | '\u06a9' | Lo |
|
ARABIC LETTER KEHEH |
U+0644 | '\u0644' | Lo |
|
ARABIC LETTER LAM |
U+06CC | '\u06cc' | Lo |
|
ARABIC LETTER FARSI YEH |
U+0647 | '\u0647' | Lo |
|
ARABIC LETTER HEH |
U+0654 | '\u0654' | Mn |
|
ARABIC HAMZA ABOVE |
Total codepoints: 5
Shell test using printf(1),
'|'
should align in output:$ printf "\xda\xa9\xd9\x84\xdb\x8c\xd9\x87\xd9\x94|\\n1234|\\n" کلیهٔ| 1234|
- python wcwidth.wcswidth() measures width 4, while zoc measures width 5.
Sequence of language Otomi, Mezquital from midpoint of alignment failure records:
Codepoint | Python | Category | wcwidth | Name |
---|---|---|---|---|
U+0058 | 'X' | Lu |
|
LATIN CAPITAL LETTER X |
U+0049 | 'I' | Lu |
|
LATIN CAPITAL LETTER I |
U+004A | 'J' | Lu |
|
LATIN CAPITAL LETTER J |
U+004D | 'M' | Lu |
|
LATIN CAPITAL LETTER M |
U+004F | 'O' | Lu |
|
LATIN CAPITAL LETTER O |
U+004A | 'J' | Lu |
|
LATIN CAPITAL LETTER J |
U+004F | 'O' | Lu |
|
LATIN CAPITAL LETTER O |
U+0331 | '\u0331' | Mn |
|
COMBINING MACRON BELOW |
U+0049 | 'I' | Lu |
|
LATIN CAPITAL LETTER I |
Total codepoints: 9
Shell test using printf(1),
'|'
should align in output:$ printf "XIJMOJO\xcc\xb1I|\\n12345678|\\n" XIJMOJO̱I| 12345678|
- python wcwidth.wcswidth() measures width 8, while zoc measures width 9.
Sequence of language Veps from midpoint of alignment failure records:
Codepoint | Python | Category | wcwidth | Name |
---|---|---|---|---|
U+0075 | 'u' | Ll |
|
LATIN SMALL LETTER U |
U+0308 | '\u0308' | Mn |
|
COMBINING DIAERESIS |
U+0068 | 'h' | Ll |
|
LATIN SMALL LETTER H |
U+0074 | 't' | Ll |
|
LATIN SMALL LETTER T |
U+0068 | 'h' | Ll |
|
LATIN SMALL LETTER H |
U+0069 | 'i' | Ll |
|
LATIN SMALL LETTER I |
U+006E | 'n' | Ll |
|
LATIN SMALL LETTER N |
U+0065 | 'e' | Ll |
|
LATIN SMALL LETTER E |
Total codepoints: 8
Shell test using printf(1),
'|'
should align in output:$ printf "u\xcc\x88hthine|\\n1234567|\\n" ühthine| 1234567|
- python wcwidth.wcswidth() measures width 7, while zoc measures width 8.
Sequence of language Waama from midpoint of alignment failure records:
Codepoint | Python | Category | wcwidth | Name |
---|---|---|---|---|
U+006E | 'n' | Ll |
|
LATIN SMALL LETTER N |
U+0300 | '\u0300' | Mn |
|
COMBINING GRAVE ACCENT |
Total codepoints: 2
Shell test using printf(1),
'|'
should align in output:$ printf "n\xcc\x80|\\n1|\\n" ǹ| 1|
- python wcwidth.wcswidth() measures width 1, while zoc measures width 2.
Sequence of language Dinka, Northeastern from midpoint of alignment failure records:
Codepoint | Python | Category | wcwidth | Name |
---|---|---|---|---|
U+0062 | 'b' | Ll |
|
LATIN SMALL LETTER B |
U+025B | '\u025b' | Ll |
|
LATIN SMALL LETTER OPEN E |
U+0308 | '\u0308' | Mn |
|
COMBINING DIAERESIS |
U+0069 | 'i' | Ll |
|
LATIN SMALL LETTER I |
Total codepoints: 4
Shell test using printf(1),
'|'
should align in output:$ printf "b\xc9\x9b\xcc\x88i|\\n123|\\n" bɛ̈i| 123|
- python wcwidth.wcswidth() measures width 3, while zoc measures width 4.
Sequence of language Dari from midpoint of alignment failure records:
Codepoint | Python | Category | wcwidth | Name |
---|---|---|---|---|
U+06A9 | '\u06a9' | Lo |
|
ARABIC LETTER KEHEH |
U+0644 | '\u0644' | Lo |
|
ARABIC LETTER LAM |
U+06CC | '\u06cc' | Lo |
|
ARABIC LETTER FARSI YEH |
U+0647 | '\u0647' | Lo |
|
ARABIC LETTER HEH |
U+0654 | '\u0654' | Mn |
|
ARABIC HAMZA ABOVE |
Total codepoints: 5
Shell test using printf(1),
'|'
should align in output:$ printf "\xda\xa9\xd9\x84\xdb\x8c\xd9\x87\xd9\x94|\\n1234|\\n" کلیهٔ| 1234|
- python wcwidth.wcswidth() measures width 4, while zoc measures width 5.
Sequence of language Éwé from midpoint of alignment failure records:
Codepoint | Python | Category | wcwidth | Name |
---|---|---|---|---|
U+0068 | 'h' | Ll |
|
LATIN SMALL LETTER H |
U+006C | 'l' | Ll |
|
LATIN SMALL LETTER L |
U+0254 | '\u0254' | Ll |
|
LATIN SMALL LETTER OPEN O |
U+0303 | '\u0303' | Mn |
|
COMBINING TILDE |
U+006E | 'n' | Ll |
|
LATIN SMALL LETTER N |
U+0075 | 'u' | Ll |
|
LATIN SMALL LETTER U |
U+0077 | 'w' | Ll |
|
LATIN SMALL LETTER W |
U+0254 | '\u0254' | Ll |
|
LATIN SMALL LETTER OPEN O |
U+0077 | 'w' | Ll |
|
LATIN SMALL LETTER W |
U+0254 | '\u0254' | Ll |
|
LATIN SMALL LETTER OPEN O |
Total codepoints: 10
Shell test using printf(1),
'|'
should align in output:$ printf "hl\xc9\x94\xcc\x83nuw\xc9\x94w\xc9\x94|\\n123456789|\\n" hlɔ̃nuwɔwɔ| 123456789|
- python wcwidth.wcswidth() measures width 9, while zoc measures width 10.
Sequence of language Baatonum from midpoint of alignment failure records:
Codepoint | Python | Category | wcwidth | Name |
---|---|---|---|---|
U+006D | 'm' | Ll |
|
LATIN SMALL LETTER M |
U+025B | '\u025b' | Ll |
|
LATIN SMALL LETTER OPEN E |
U+0300 | '\u0300' | Mn |
|
COMBINING GRAVE ACCENT |
Total codepoints: 3
Shell test using printf(1),
'|'
should align in output:$ printf "m\xc9\x9b\xcc\x80|\\n12|\\n" mɛ̀| 12|
- python wcwidth.wcswidth() measures width 2, while zoc measures width 3.
Sequence of language Urdu (2) from midpoint of alignment failure records:
Codepoint | Python | Category | wcwidth | Name |
---|---|---|---|---|
U+0627 | '\u0627' | Lo |
|
ARABIC LETTER ALEF |
U+0642 | '\u0642' | Lo |
|
ARABIC LETTER QAF |
U+0648 | '\u0648' | Lo |
|
ARABIC LETTER WAW |
U+0627 | '\u0627' | Lo |
|
ARABIC LETTER ALEF |
U+0645 | '\u0645' | Lo |
|
ARABIC LETTER MEEM |
U+0650 | '\u0650' | Mn |
|
ARABIC KASRA |
Total codepoints: 6
Shell test using printf(1),
'|'
should align in output:$ printf "\xd8\xa7\xd9\x82\xd9\x88\xd8\xa7\xd9\x85\xd9\x90|\\n12345|\\n" اقوامِ| 12345|
- python wcwidth.wcswidth() measures width 5, while zoc measures width 6.
Sequence of language Urdu from midpoint of alignment failure records:
Codepoint | Python | Category | wcwidth | Name |
---|---|---|---|---|
U+0627 | '\u0627' | Lo |
|
ARABIC LETTER ALEF |
U+0642 | '\u0642' | Lo |
|
ARABIC LETTER QAF |
U+0648 | '\u0648' | Lo |
|
ARABIC LETTER WAW |
U+0627 | '\u0627' | Lo |
|
ARABIC LETTER ALEF |
U+0645 | '\u0645' | Lo |
|
ARABIC LETTER MEEM |
U+0650 | '\u0650' | Mn |
|
ARABIC KASRA |
Total codepoints: 6
Shell test using printf(1),
'|'
should align in output:$ printf "\xd8\xa7\xd9\x82\xd9\x88\xd8\xa7\xd9\x85\xd9\x90|\\n12345|\\n" اقوامِ| 12345|
- python wcwidth.wcswidth() measures width 5, while zoc measures width 6.
Sequence of language Uduk from midpoint of alignment failure records:
Codepoint | Python | Category | wcwidth | Name |
---|---|---|---|---|
U+0070 | 'p' | Ll |
|
LATIN SMALL LETTER P |
U+0331 | '\u0331' | Mn |
|
COMBINING MACRON BELOW |
U+0061 | 'a' | Ll |
|
LATIN SMALL LETTER A |
U+0072 | 'r' | Ll |
|
LATIN SMALL LETTER R |
U+0061 | 'a' | Ll |
|
LATIN SMALL LETTER A |
Total codepoints: 5
Shell test using printf(1),
'|'
should align in output:$ printf "p\xcc\xb1ara|\\n1234|\\n" p̱ara| 1234|
- python wcwidth.wcswidth() measures width 4, while zoc measures width 5.
Sequence of language Mazahua Central from midpoint of alignment failure records:
Codepoint | Python | Category | wcwidth | Name |
---|---|---|---|---|
U+0054 | 'T' | Lu |
|
LATIN CAPITAL LETTER T |
U+0045 | 'E' | Lu |
|
LATIN CAPITAL LETTER E |
U+0331 | '\u0331' | Mn |
|
COMBINING MACRON BELOW |
U+0027 | "'" | Po |
|
APOSTROPHE |
U+0045 | 'E' | Lu |
|
LATIN CAPITAL LETTER E |
U+0331 | '\u0331' | Mn |
|
COMBINING MACRON BELOW |
Total codepoints: 6
Shell test using printf(1),
'|'
should align in output:$ printf "TE\xcc\xb1'E\xcc\xb1|\\n1234|\\n" TE̱'E̱| 1234|
- python wcwidth.wcswidth() measures width 4, while zoc measures width 6.
Sequence of language Secoya from midpoint of alignment failure records:
Codepoint | Python | Category | wcwidth | Name |
---|---|---|---|---|
U+0063 | 'c' | Ll |
|
LATIN SMALL LETTER C |
U+0061 | 'a' | Ll |
|
LATIN SMALL LETTER A |
U+006E | 'n' | Ll |
|
LATIN SMALL LETTER N |
U+00EB | '\xeb' | Ll |
|
LATIN SMALL LETTER E WITH DIAERESIS |
U+006F | 'o' | Ll |
|
LATIN SMALL LETTER O |
U+0077 | 'w' | Ll |
|
LATIN SMALL LETTER W |
U+00EB | '\xeb' | Ll |
|
LATIN SMALL LETTER E WITH DIAERESIS |
U+0331 | '\u0331' | Mn |
|
COMBINING MACRON BELOW |
Total codepoints: 8
Shell test using printf(1),
'|'
should align in output:$ printf "can\xc3\xabow\xc3\xab\xcc\xb1|\\n1234567|\\n" canëowë̱| 1234567|
- python wcwidth.wcswidth() measures width 7, while zoc measures width 8.
Sequence of language Gen from midpoint of alignment failure records:
Codepoint | Python | Category | wcwidth | Name |
---|---|---|---|---|
U+0064 | 'd' | Ll |
|
LATIN SMALL LETTER D |
U+0254 | '\u0254' | Ll |
|
LATIN SMALL LETTER OPEN O |
U+0300 | '\u0300' | Mn |
|
COMBINING GRAVE ACCENT |
U+006E | 'n' | Ll |
|
LATIN SMALL LETTER N |
U+006E | 'n' | Ll |
|
LATIN SMALL LETTER N |
U+0061 | 'a' | Ll |
|
LATIN SMALL LETTER A |
Total codepoints: 6
Shell test using printf(1),
'|'
should align in output:$ printf "d\xc9\x94\xcc\x80nna|\\n12345|\\n" dɔ̀nna| 12345|
- python wcwidth.wcswidth() measures width 5, while zoc measures width 6.
Sequence of language Picard from midpoint of alignment failure records:
Codepoint | Python | Category | wcwidth | Name |
---|---|---|---|---|
U+0076 | 'v' | Ll |
|
LATIN SMALL LETTER V |
U+0072 | 'r' | Ll |
|
LATIN SMALL LETTER R |
U+0065 | 'e' | Ll |
|
LATIN SMALL LETTER E |
U+030A | '\u030a' | Mn |
|
COMBINING RING ABOVE |
U+0079 | 'y' | Ll |
|
LATIN SMALL LETTER Y |
U+006D | 'm' | Ll |
|
LATIN SMALL LETTER M |
U+0069 | 'i' | Ll |
|
LATIN SMALL LETTER I |
U+006E | 'n' | Ll |
|
LATIN SMALL LETTER N |
U+0074 | 't' | Ll |
|
LATIN SMALL LETTER T |
Total codepoints: 9
Shell test using printf(1),
'|'
should align in output:$ printf "vre\xcc\x8aymint|\\n12345678|\\n" vre̊ymint| 12345678|
- python wcwidth.wcswidth() measures width 8, while zoc measures width 9.
Sequence of language Mixtec, Metlatónoc from midpoint of alignment failure records:
Codepoint | Python | Category | wcwidth | Name |
---|---|---|---|---|
U+006E | 'n' | Ll |
|
LATIN SMALL LETTER N |
U+0061 | 'a' | Ll |
|
LATIN SMALL LETTER A |
U+0027 | "'" | Po |
|
APOSTROPHE |
U+006E | 'n' | Ll |
|
LATIN SMALL LETTER N |
U+0075 | 'u' | Ll |
|
LATIN SMALL LETTER U |
U+0331 | '\u0331' | Mn |
|
COMBINING MACRON BELOW |
Total codepoints: 6
Shell test using printf(1),
'|'
should align in output:$ printf "na'nu\xcc\xb1|\\n12345|\\n" na'nu̱| 12345|
- python wcwidth.wcswidth() measures width 5, while zoc measures width 6.
Sequence of language Arabic, Standard from midpoint of alignment failure records:
Codepoint | Python | Category | wcwidth | Name |
---|---|---|---|---|
U+0627 | '\u0627' | Lo |
|
ARABIC LETTER ALEF |
U+0639 | '\u0639' | Lo |
|
ARABIC LETTER AIN |
U+062A | '\u062a' | Lo |
|
ARABIC LETTER TEH |
U+064F | '\u064f' | Mn |
|
ARABIC DAMMA |
U+0645 | '\u0645' | Lo |
|
ARABIC LETTER MEEM |
U+062F | '\u062f' | Lo |
|
ARABIC LETTER DAL |
Total codepoints: 6
Shell test using printf(1),
'|'
should align in output:$ printf "\xd8\xa7\xd8\xb9\xd8\xaa\xd9\x8f\xd9\x85\xd8\xaf|\\n12345|\\n" اعتُمد| 12345|
- python wcwidth.wcswidth() measures width 5, while zoc measures width 6.
Sequence of language Ga from midpoint of alignment failure records:
Codepoint | Python | Category | wcwidth | Name |
---|---|---|---|---|
U+0061 | 'a' | Ll |
|
LATIN SMALL LETTER A |
U+0073 | 's' | Ll |
|
LATIN SMALL LETTER S |
U+0068 | 'h' | Ll |
|
LATIN SMALL LETTER H |
U+0254 | '\u0254' | Ll |
|
LATIN SMALL LETTER OPEN O |
U+0303 | '\u0303' | Mn |
|
COMBINING TILDE |
Total codepoints: 5
Shell test using printf(1),
'|'
should align in output:$ printf "ash\xc9\x94\xcc\x83|\\n1234|\\n" ashɔ̃| 1234|
- python wcwidth.wcswidth() measures width 4, while zoc measures width 5.
Sequence of language Panjabi, Western from midpoint of alignment failure records:
Codepoint | Python | Category | wcwidth | Name |
---|---|---|---|---|
U+0627 | '\u0627' | Lo |
|
ARABIC LETTER ALEF |
U+064F | '\u064f' | Mn |
|
ARABIC DAMMA |
U+0646 | '\u0646' | Lo |
|
ARABIC LETTER NOON |
U+06CC | '\u06cc' | Lo |
|
ARABIC LETTER FARSI YEH |
Total codepoints: 4
Shell test using printf(1),
'|'
should align in output:$ printf "\xd8\xa7\xd9\x8f\xd9\x86\xdb\x8c|\\n123|\\n" اُنی| 123|
- python wcwidth.wcswidth() measures width 3, while zoc measures width 4.
Sequence of language Dangme from midpoint of alignment failure records:
Codepoint | Python | Category | wcwidth | Name |
---|---|---|---|---|
U+006E | 'n' | Ll |
|
LATIN SMALL LETTER N |
U+0254 | '\u0254' | Ll |
|
LATIN SMALL LETTER OPEN O |
U+0301 | '\u0301' | Mn |
|
COMBINING ACUTE ACCENT |
Total codepoints: 3
Shell test using printf(1),
'|'
should align in output:$ printf "n\xc9\x94\xcc\x81|\\n12|\\n" nɔ́| 12|
- python wcwidth.wcswidth() measures width 2, while zoc measures width 3.
Sequence of language Dagaare, Southern from midpoint of alignment failure records:
Codepoint | Python | Category | wcwidth | Name |
---|---|---|---|---|
U+006B | 'k' | Ll |
|
LATIN SMALL LETTER K |
U+0075 | 'u' | Ll |
|
LATIN SMALL LETTER U |
U+0303 | '\u0303' | Mn |
|
COMBINING TILDE |
U+0075 | 'u' | Ll |
|
LATIN SMALL LETTER U |
U+0303 | '\u0303' | Mn |
|
COMBINING TILDE |
Total codepoints: 5
Shell test using printf(1),
'|'
should align in output:$ printf "ku\xcc\x83u\xcc\x83|\\n123|\\n" kũũ| 123|
- python wcwidth.wcswidth() measures width 3, while zoc measures width 5.
Sequence of language Serer-Sine from midpoint of alignment failure records:
Codepoint | Python | Category | wcwidth | Name |
---|---|---|---|---|
U+0070 | 'p' | Ll |
|
LATIN SMALL LETTER P |
U+0301 | '\u0301' | Mn |
|
COMBINING ACUTE ACCENT |
U+0061 | 'a' | Ll |
|
LATIN SMALL LETTER A |
U+0073 | 's' | Ll |
|
LATIN SMALL LETTER S |
U+0069 | 'i' | Ll |
|
LATIN SMALL LETTER I |
U+006C | 'l' | Ll |
|
LATIN SMALL LETTER L |
Total codepoints: 6
Shell test using printf(1),
'|'
should align in output:$ printf "p\xcc\x81asil|\\n12345|\\n" ṕasil| 12345|
- python wcwidth.wcswidth() measures width 5, while zoc measures width 6.
Sequence of language Fon from midpoint of alignment failure records:
Codepoint | Python | Category | wcwidth | Name |
---|---|---|---|---|
U+0061 | 'a' | Ll |
|
LATIN SMALL LETTER A |
U+006B | 'k' | Ll |
|
LATIN SMALL LETTER K |
U+0254 | '\u0254' | Ll |
|
LATIN SMALL LETTER OPEN O |
U+0301 | '\u0301' | Mn |
|
COMBINING ACUTE ACCENT |
U+006E | 'n' | Ll |
|
LATIN SMALL LETTER N |
Total codepoints: 5
Shell test using printf(1),
'|'
should align in output:$ printf "ak\xc9\x94\xcc\x81n|\\n1234|\\n" akɔ́n| 1234|
- python wcwidth.wcswidth() measures width 4, while zoc measures width 5.
Sequence of language Aja from midpoint of alignment failure records:
Codepoint | Python | Category | wcwidth | Name |
---|---|---|---|---|
U+00E8 | '\xe8' | Ll |
|
LATIN SMALL LETTER E WITH GRAVE |
U+0067 | 'g' | Ll |
|
LATIN SMALL LETTER G |
U+0062 | 'b' | Ll |
|
LATIN SMALL LETTER B |
U+025B | '\u025b' | Ll |
|
LATIN SMALL LETTER OPEN E |
U+0300 | '\u0300' | Mn |
|
COMBINING GRAVE ACCENT |
U+006D | 'm' | Ll |
|
LATIN SMALL LETTER M |
U+025B | '\u025b' | Ll |
|
LATIN SMALL LETTER OPEN E |
U+0300 | '\u0300' | Mn |
|
COMBINING GRAVE ACCENT |
Total codepoints: 8
Shell test using printf(1),
'|'
should align in output:$ printf "\xc3\xa8gb\xc9\x9b\xcc\x80m\xc9\x9b\xcc\x80|\\n123456|\\n" ègbɛ̀mɛ̀| 123456|
- python wcwidth.wcswidth() measures width 6, while zoc measures width 8.
Sequence of language Pashto, Northern from midpoint of alignment failure records:
Codepoint | Python | Category | wcwidth | Name |
---|---|---|---|---|
U+0627 | '\u0627' | Lo |
|
ARABIC LETTER ALEF |
U+0633 | '\u0633' | Lo |
|
ARABIC LETTER SEEN |
U+0627 | '\u0627' | Lo |
|
ARABIC LETTER ALEF |
U+0633 | '\u0633' | Lo |
|
ARABIC LETTER SEEN |
U+0627 | '\u0627' | Lo |
|
ARABIC LETTER ALEF |
U+064B | '\u064b' | Mn |
|
ARABIC FATHATAN |
Total codepoints: 6
Shell test using printf(1),
'|'
should align in output:$ printf "\xd8\xa7\xd8\xb3\xd8\xa7\xd8\xb3\xd8\xa7\xd9\x8b|\\n12345|\\n" اساساً| 12345|
- python wcwidth.wcswidth() measures width 5, while zoc measures width 6.
Sequence of language Dendi from midpoint of alignment failure records:
Codepoint | Python | Category | wcwidth | Name |
---|---|---|---|---|
U+0062 | 'b' | Ll |
|
LATIN SMALL LETTER B |
U+0254 | '\u0254' | Ll |
|
LATIN SMALL LETTER OPEN O |
U+0303 | '\u0303' | Mn |
|
COMBINING TILDE |
U+014B | '\u014b' | Ll |
|
LATIN SMALL LETTER ENG |
U+0254 | '\u0254' | Ll |
|
LATIN SMALL LETTER OPEN O |
U+002E | '.' | Po |
|
FULL STOP |
Total codepoints: 6
Shell test using printf(1),
'|'
should align in output:$ printf "b\xc9\x94\xcc\x83\xc5\x8b\xc9\x94.|\\n12345|\\n" bɔ̃ŋɔ.| 12345|
- python wcwidth.wcswidth() measures width 5, while zoc measures width 6.
Sequence of language Seraiki from midpoint of alignment failure records:
Codepoint | Python | Category | wcwidth | Name |
---|---|---|---|---|
U+0627 | '\u0627' | Lo |
|
ARABIC LETTER ALEF |
U+064F | '\u064f' | Mn |
|
ARABIC DAMMA |
U+062A | '\u062a' | Lo |
|
ARABIC LETTER TEH |
U+06D2 | '\u06d2' | Lo |
|
ARABIC LETTER YEH BARREE |
Total codepoints: 4
Shell test using printf(1),
'|'
should align in output:$ printf "\xd8\xa7\xd9\x8f\xd8\xaa\xdb\x92|\\n123|\\n" اُتے| 123|
- python wcwidth.wcswidth() measures width 3, while zoc measures width 4.