Skip to content

Commit

Permalink
Add Thai example from 2023-08-10 telecon
Browse files Browse the repository at this point in the history
- Thanks to @r12a for the page:

https://r12a.github.io/scripts/thai/th.html#webSegmentation

- I used the word 'toilet' from this example because it has a sara am
  (U+0E33)
- I added mention of the sara am
  • Loading branch information
aphillips committed Aug 10, 2023
1 parent 56189c6 commit 83e3743
Showing 1 changed file with 3 additions and 3 deletions.
6 changes: 3 additions & 3 deletions questions/qa-backwards-deletion.en.html
Original file line number Diff line number Diff line change
Expand Up @@ -219,13 +219,13 @@ <h4>Try It</h4>
<button type="button" onclick="reset('tryTamil', '&#xb95;&#xbcb;')" style="font-family:monospaced;font-size:9pt;">Reset</button></p>
</div>

<p>Indic scripts, such as the Devanagari and Tamil examples above, are not the only scripts affected by this. The same can be found for combining marks in many languages. For example, the first cluster in this Thai word: <q>คืออะไร</q>.</p>
<p>Indic scripts, such as the Devanagari and Tamil examples above, are not the only scripts affected by this. The same can be found for combining marks in many languages. For example, the first cluster in this Thai word <q lang="th">ห้องน้ำ</q> has similar behavior. The end of this word shows additional complexity: the <span class="codepoint"><bdi lang="th">&#xe33;</bdi><code class="uname">U+0E33 THAI CHARACTER SARA AM</code></span> appears as a separate typographical unit for layout, but as a single grapheme for the purposes of selection, cursoring, and forward deletion.</p>

<!-- TODO [get better example; demonstrate middle cursor deletion effects in Thai] -->
<div style="border:1px solid black; background-color:#aaa;">
<h4>Try It</h4>
<p><input id="tryThai" type="text" name="tryThai" lang="th" style="font-size:24pt;" value="คืออะไร"></input>
<button type="button" onclick="reset('tryThai', 'คืออะไร')" style="font-family:monospaced;font-size:9pt;">Reset</button></p>
<p><input id="tryThai" type="text" name="tryThai" lang="th" style="font-size:24pt;" value="ห้องน้ำ"></input>
<button type="button" onclick="reset('tryThai', 'ห้องน้ำ')" style="font-family:monospaced;font-size:9pt;">Reset</button></p>
</div>

<p>Some character sequences can be written in either a "composed" or a "decomposed" forms that affect how selection and deletion performs. For example, Korean characters can be written in either a precomposed form or using a sequence of combining marks (called <em>jamo</em>). Here's one example: </p>
Expand Down

0 comments on commit 83e3743

Please sign in to comment.