Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Thai source update #19

Merged
merged 8 commits into from
Apr 8, 2018
Merged

Thai source update #19

merged 8 commits into from
Apr 8, 2018

Conversation

thep
Copy link
Contributor

@thep thep commented Oct 31, 2017

In this pull request, I've prepared Thai hyphenation source updates done over the two years from tlwg/thailatex repository.

Since I'm not sure about further workflow to get the generated patterns updated in the tree, I've omitted that part for now.

AFAIK, updating hyph-th.tex and languages.rb and then running generate-plain-patterns.rb and generate-ptex-patterns.rb get changes in Latin patterns as well. That makes me unsure how it should properly be done.

* diff-dicts.sh, test-hyphen.sh:
  - Convert pdflatex output to UTF-8 before applying grep,
    as newer grep seems to treat TIS-620 string as binary
    and output error message instead.
* tdict-city.txt:
* tdict-common.txt:
* tdict-country.txt:
* tdict-district.txt:
* tdict-ict.txt:
* tdict-lang-ethnic.txt:
* tdict-proper.txt:
* tdict-science.txt:
* tdict-spell.txt:
  - Update dict from libthai 0.1.25.

* tdict-std.txt:
  - Correct hyphenation for 'สาธารณสมบัติ'.
* tdict-city.txt:
* tdict-common.txt:
* tdict-geo.txt:
* tdict-ict.txt:
* tdict-proper.txt:
* tdict-science.txt:
* tdict-spell.txt:
* tdict-std-compound.txt:
* tdict-std.txt:
  - Update dict from libthai 0.1.26.

Make hyphenation of "*สเตอร์*" portions consistent.

* tdict-city.txt:
  - Correct hyphenation for 'อัมสเตอร์ดัม'.
* tdict-common.txt:
  - Correct hyphenation for 'ฮิปสเตอร์'.
* README, hyph-th.tex.in:
  - Use HTTPS for LTN project pages

* hyph-th.tex.in:
  - Replace obsolete LTN SVN URL with that of GitHub
* tdict-city.txt:
* tdict-common.txt:
* tdict-district.txt:
* tdict-geo.txt:
* tdict-history.txt:
* tdict-ict.txt:
* tdict-lang-ethnic.txt:
* tdict-proper.txt:
* tdict-science.txt:
* tdict-spell.txt:
  - Update dict from libthai 0.1.27.

* thai-exc.pat:
  - Drop 'ฮ7โ' exception, which duplicated the generated 'ฮ1โ'.
* thai-exc.pat:
  + Add forcing exceptions for:
    - ทัก~ษ-อร
    - เด-ชะ
    - โก~โบ-ริ
  + Add blocking exceptions for:
    - นิร~ช-รา, เน~รัญ~ช-รา
@mojca
Copy link
Member

mojca commented Nov 1, 2017

AFAIK, updating hyph-th.tex and languages.rb and then running generate-plain-patterns.rb and generate-ptex-patterns.rb get changes in Latin patterns as well. That makes me unsure how it should properly be done.

Changes in other patterns are not to be expected and might result from "incomplete changes" in the past. I'll take a look and fix the issues.

These changes look ok (I was away for a while, so I need to catch up with stuff), I'll merge them.

If you prepare just the patterns (hyph-th.tex), I can do the rest anyway. We'll probably move away from the database in ruby to YAML, see the yaml branch in this repository for work-in-progress that will be merged soon (after I complete the review for all languages). It would be nice to get the patterns directly with YAML headers, I can send further instructions or examples.

See for example https://github.com/hyphenation/tex-hyphen/blob/yaml-headers/hyph-utf8/tex/generic/hyph-utf8/patterns/tex/hyph-th.tex (there might be further minor changes).

* hyph-th.tex.in:
  - Reformat the template file header as YAML.
@thep
Copy link
Contributor Author

thep commented Nov 2, 2017

OK. hyph-th.tex has been updated, with headers reformatted as YAML.

@reutenauer
Copy link
Member

I’m merging the pull request in preparation for the upload to CTAN; thank you very much, Theppitak.

@reutenauer reutenauer merged commit 0789cb2 into hyphenation:master Apr 8, 2018
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants