Bart Massey 2021 (version 0.1.0)
Test for vowels in Romance languages.
Deciding whether some grapheme is a "vowel" (represents a "vowel sound") is language dependent, and the author is not aware of any standardization efforts for vowel identification.
Even for the Romance languages, the situation is a bit complicated. The basic Romance vowels are "a", "e", "i", "o", "u". However, for example:
- Uppercase versions are also vowels.
- Accented versions are also vowels.
- "Sometimes 'y', sometimes 'w'." These letters are consonant in some situations, vowels in others.
This code attempts to provide reasonable heuristic answers as to the Romance-language-"vowelness" of a Unicode codepoint.
No attempt is made here to deal with non-Romance languages, even though some non-Romance vowel letters are borrowed in Romance languages: for example, "æ" and "Æ" are used in some British English. It would be great to use vowel letter tables for a variety of languages, but so far the author has been unable to locate such tables (surprisingly).
This crate is made available under the "MIT
license". Please see the file LICENSE
in this distribution
for license terms.
Thanks to the cargo-readme
crate for generation of this README
.