GitHub - termsurf/talk.js: Generate the Pronunciation of a Word from Traditional Orthography

@termsurf/talk

A tool to convert traditional writing across languages into a normalized form for pronunciation.

Welcome

Talk is a bunch of utils for transforming traditional orthography into ChatText, which makes it available as well for ToneText, amongst other things. ChatText normalizes the way you write pronunciation, into a more human readable (and easier to type on the computer) form, and ToneText is a rune-like script which streamlines writing pronunciations further.

Check out the ./code/language folder for the list of languages we are starting to work on. It's not always possible to do this across every language, especially on a language like English, where it is impossible to generate pronunciation based on written words. You must memorize individual cases in English, and in some other languages. However, some languages do have the ability to get pretty close to correct pronunciation based purely on the native spelling, which is pretty cool. Taking advantage of that fact here!

Table of Languages

Here is a table explaining which languages we've looked at so far which can and can't have pronunciations automatically done.

language	automatic	note
Chinese (Mandarin)	yes but not perfect	Pinyin can be used to auto generate pronunciations, but it doesn't always accurately reflect how people actually say each word, so it would be better to manually write each pronunciation if possible.
Korean	yes but not perfect
Sanskrit	yes	With Devanagari, each sound has an exact pronunciation in Sanskrit, so we can get pretty close to exact pronunciations automatically done.
Finnish	yes
Navajo	yes	Since it was fairly recently transcribed intoa Latin alphabet, it is phonetic for the most part.
Akkadian	yes	Because it is no longer spoken, we have at least a standard way f representing things.
Spanish	yes	Because it is no longer spoken, we have at least a standard way f representing things.
Hebrew	partially yes, but only for consonants unless diacritics given
Arabic	partially yes, but only for consonants unless diacritics given
English	no	Too many words need to have pronunciation memorized.
Tibetan	no	Modern Tibetan has evolved to where the script no longer is phonetic.
Vietnamese	no

Goals

Include romanization transliterations of texts in various forms.
Include pronunciation of words and word parts in Chat text and possibly IPA.
Maybe include keyboard layout data for various languages.
Have structured script data, such as what are the vowels, etc..
Script detection.

License

Licensed under the Apache License, Version 2.0 (the "License"); you may not use this file except in compliance with the License. You may obtain a copy of the License at

http://www.apache.org/licenses/LICENSE-2.0

Unless required by applicable law or agreed to in writing, software distributed under the License is distributed on an "AS IS" BASIS, WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied. See the License for the specific language governing permissions and limitations under the License.

TermSurf

This is being developed by the folks at TermSurf, a California-based project for helping humanity master information and computation. Find us on Twitter, LinkedIn, and Facebook. Check out our other GitHub projects as well!

Name		Name	Last commit message	Last commit date
Latest commit History 46 Commits
.vscode		.vscode
code		code
test/script		test/script
.eslintrc.json		.eslintrc.json
.gitignore		.gitignore
.npmignore		.npmignore
.prettierignore		.prettierignore
.prettierrc.json		.prettierrc.json
package.json		package.json
pnpm-lock.yaml		pnpm-lock.yaml
readme.md		readme.md
tsconfig.json		tsconfig.json

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

.vscode

.vscode

code

code

test/script

test/script

.eslintrc.json

.eslintrc.json

.gitignore

.gitignore

.npmignore

.npmignore

.prettierignore

.prettierignore

.prettierrc.json

.prettierrc.json

package.json

package.json

pnpm-lock.yaml

pnpm-lock.yaml

readme.md

readme.md

tsconfig.json

tsconfig.json

Repository files navigation

@termsurf/talk

Welcome

Table of Languages

Goals

License

TermSurf

About

Releases

Sponsor this project

Packages

Languages

termsurf/talk.js

Folders and files

Latest commit

History

Repository files navigation

@termsurf/talk

Welcome

Table of Languages

Goals

License

TermSurf

About

Resources

Code of conduct

Security policy

Stars

Watchers

Forks

Sponsor this project

Languages