Releases: prohippo/pyelly

Fix Unicode Recognition Bugs

18 Nov 06:25

The EllyChar class was incorrectly recognizing Unicode control characters. This was discovered in the upcoming PyElly v2.0 release, and the fix has been copied back to v1.6 to make v1.6.1.
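For readers unfamiliar with the issue, here is a minimal sketch of one common way to test for Unicode control characters in Python, using the standard `unicodedata` module. It is not the EllyChar code; the helper name `is_control` and the sample characters are assumptions chosen for illustration.

```python
import unicodedata

def is_control(ch):
    """Return True if ch has Unicode general category Cc (control).

    Hypothetical helper for illustration; not taken from EllyChar.
    """
    return unicodedata.category(ch) == "Cc"

# TAB and NEL (U+0085) are controls; NO-BREAK SPACE and "A" are not
samples = ["\u0009", "\u0085", "\u00a0", "A"]
flags = [is_control(c) for c in samples]
print(flags)  # [True, True, False, False]
```

Note that characters such as U+00A0 look like whitespace but fall in category Zs, so a check based on code point ranges alone can easily misclassify them.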

Handle Chinese Unicode Input

19 Oct 07:17

Generalize code to work optionally with Chinese Unicode text input. Before, all text input had to be in a Latin alphabet.

Small Change in English Suffix Rules

11 Jul 15:26

This is mostly a resync of GitHub code after moving to a new computer. After 40 years, suffix table rules are still incomplete.

More Adjustments for Handling Greek Letters

21 Dec 19:29

The use of Greek letters in chemical nomenclature requires changes in how the $ wildcard is matched in the PyElly FSA for syntactic typing of tokens. Update documentation.

Minor Bug Fix

10 Dec 08:24

Recognize a comma followed immediately by a Greek letter as a special case for breaking a token in ellyBuffer. Extend "chemic" rules and integration test. Update documentation.

Minor Bug Fix

07 Dec 23:38

Fix recognition of Unicode prime char in setting bounds in patternTable module for FSA matching. Extend "chemic" rules and integration testing. Update documentation.

Minor Fix in Morphological Matching

29 Nov 17:07

Suffix removal needs to be aware of previous prefix removal from a text input token for analyzing and rewriting. More rules for "chemic" example application and more examples for "chemic" integration testing. Update documentation.

Improve Handling of Commas, Error Reporting

20 Nov 08:19

This fixes problems discovered in identification of locants in chemical names. This requires that embedded commas be taken as a separate token. The "chemic" integration test was expanded. Documentation was revised.
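As a rough illustration of what treating embedded commas as separate tokens means for locants, the sketch below splits a chemical name so that each comma becomes its own token. This is not PyElly's tokenizer; the function name `split_embedded_commas` and the example names are assumptions for illustration only.

```python
import re

def split_embedded_commas(token):
    """Split a token so that each embedded comma is its own token.

    Hypothetical helper for illustration; not PyElly code.
    """
    # A capturing group makes re.split keep the commas in its result;
    # empty strings from leading or trailing commas are dropped.
    return [piece for piece in re.split(r"(,)", token) if piece]

pieces = split_embedded_commas("1,2-dichloroethane")
print(pieces)  # ['1', ',', '2-dichloroethane']
```

With the commas separated out, the locants "1" and "2" can be examined individually rather than as part of one opaque token.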

Fix Handling of Prefixes and Suffixes

12 Nov 08:25

Improvement and debugging of various PyElly modules to support a new example application to recognize structural chemical nomenclature in text ("chemic"). This requires using PyElly prefix and suffix analysis in new ways. The "chemic" rules and integration test continue to grow.

Reorganize Input Processing To Handle Prefix Morphological Rules

08 Nov 08:19

PyElly tokenization of input was improperly handling the recognition and splitting off of prefixes. This was mainly due to problems with the '+' character used to mark prefixes and separated roots. The problems showed up in the "chemic" example application, which has to find chemical nomenclature in various styles of text. The names are too numerous to list out fully and so require a bit of analysis.