Displaying unicode or different language like Bengali doesn't work. #598

monirz · 2020-01-04T14:52:15Z

I'm trying to make an application using Fyne, where I need to display Bengali. But didn't find a way to display it. Is there a way to display Unicode/different language?

andydotxyz · 2020-01-04T18:57:05Z

The building fonts do not support all languages and Bengali is one that is not included sorry. If you download a TTF don’t file and specify its path in the FYNE_FONT environment variable then you should be able to use the characters you want. “Noto Sans Bengali” would match the builtin fonts well.

monirz · 2020-01-05T13:43:34Z

Okay, I tried with FYNE_FONT variable and it now shows letters instead of blank boxes but it's broken. I tried with few popular Bengali fonts. The result is same.
Also, I need to the fonts combined in the binary for user. So the user won't have to provide the font, guess the user has fonts already installed.

andydotxyz · 2020-01-05T13:55:49Z

Can you expand more please on “its broken”? If the letters are displayed what else do you expect?

monirz · 2020-01-05T16:28:37Z

I think I used the term "letter" wrong, it's more of a font is broken. Meaning it can not render the font properly.

monirz · 2020-01-05T16:34:49Z

This is the link of the image with Fyne and how it should be link

andydotxyz · 2020-01-05T18:23:55Z

As someone who knows the language can you explain what the differences are - or what a possible cause may be?

It looks to me like characters are supposed to combine in some way - but that is an uneducated guess.

monirz · 2020-01-06T12:10:00Z

Well that problem happens with some other cases like in chrome because of when the proper font is not installed on the system. But in this case I'm not sure.

mrezai · 2020-01-08T06:51:56Z

I can't read Bengali but it seems this problem is related to "text shaping" and to support it you need something like HarfBuzz
Some related links:
https://en.wikipedia.org/wiki/Complex_text_layout
https://github.com/grisha/hbshape

mrezai · 2020-01-08T08:58:28Z

My previous comment was a guess and now after run the demo, it seems text rendering for languages known as "complex scripts" hadn't implemented at all.
In addition to HarfBuzz something like ICU is needed for BiDi.
I think its a good idea to add CTL to milestone 2.0 and all of this means "open Pandora's box"!

beoran · 2020-01-08T11:19:45Z

For non-complex scripts such as Japanese or Chinese, etc. it would be enough to be able to embed a custom font in the binary using, e.g. go-bindata, if only fyne had a way to set that font other than setting FYNE_FONT.

Edit: additionally support for .odf and .ttc fonts might be very useful.

andydotxyz · 2020-01-08T11:33:51Z

Applications can provide their own font by implementing a custom theme (like https://github.com/andydotxyz/beebui).
The process for doing so should be easier though!

andydotxyz · 2020-01-08T11:35:13Z

Also I agree we should aim for full internationalisation in 2.0

kkartaltepe · 2020-02-15T03:48:00Z

IMO the easiest example of lack of support for shaping is in Arabic (and related languages). A simple test like
مرحبا

You can see the lack of RTL support (Full height Vertical character will appear on the right), and the lack of shaping (characters rendered individually instead of connected like cursive).

Nik-U · 2020-07-24T05:09:54Z

Rendering "non-complex scripts" (e.g., for English, Russian, or Korean) can be done with a naïve text shaper that simply queries the font file for a glyph on a rune-by-rune basis, as long as one supplies an appropriate font file. One annoying limitation is that the SFNT format (used by TTF/OTF files) is limited to 2¹⁶ glyphs, which is not enough to support all languages + emoji. Thus, even a text shaper that is limited to non-complex scripts will need to implement "font fallback" behavior in practice if one wants to properly render strings containing arbitrary languages. For the Gio project we implemented this with support for OpenType Collection files like these giant merged Noto fonts that I prepared. Alternatively, you can use the installed system fonts and read the fallback order from the system configuration (in most Linux distributions this is handled by fontconfig).

The Gio project has recently run into the same text shaping limitations for "complex scripts" (e.g., for Arabic or Bengali) that are discussed in this GitHub issue. See gio#146 for a more detailed write-up and suggested path forward. Fyne will need to walk a similar path to resolve this issue.

To summarize:

The key components for a solution are using HarfBuzz to compute glyphs and offsets, implementing an algorithm similar to the Pango library for preparing calls to hb_shape, implementing the unicode bidirectional algorithm, and then gluing the resulting glyph data to the existing font and rendering systems. HarfBuzz is the project that implements text shaping for complex scripts with the highest accuracy. It is so dominant in this space that it is used by the browser rendering engines (Chromium and Firefox), the major desktop GUI toolkits (Qt and GTK+), Android, and Java, among others. The project is large, complex, requires a lot of resources to maintain (shaping bugfixes for languages still come in weekly even after a decade of development), and is written in C++. Duplicating the effort is probably intractable; the Rustaceans opted to duplicate the effort with Allsorts, but it lags behind HarfBuzz in terms of language coverage. Windows and macOS each have a proprietary alternative (Uniscribe and Core Text, respectively).

There are a few possible paths forward discussed in gio#146, summarized below:

Write a cgo wrapper for HarfBuzz. This adds a sizeable system dependency for compilation.
Try to do an automated translation of HarfBuzz code into native Go or a specialty font encoding like Graphite. We're talking about (specialized) C++ to Go transpilation here, which is definitely non-trivial. Translation of lookup tables or unicode data files alone is insufficient.
Try to do something clever with linking (e.g., distributing a precompiled HarfBuzz and/or linking with .syso files).
Accept incomplete language coverage for now. This might mean implementing support for Graphite fonts and having users wait for appropriate fonts to be released, or it might mean porting or linking to the Allsorts project.
Something else we haven't thought of yet.

None of these options are without drawbacks. However, resolving the HarfBuzz linking problem in one way or another would be very beneficial for a lot of Go projects—not just Fyne and Gio, but pretty much anything that needs to render arbitrary text.

kkartaltepe · 2020-07-24T05:26:41Z

If you resign yourself to linking HB, you can simply link fribidi (with no dependencies its fairly nice) for bidi.

beoran · 2020-07-24T11:27:40Z

I looked at HarfBuzz and I think 50% of the code is C(C++) support, like memory management, etc., that we don't need in Go, because the language provides it for us, furthermore, the HarfBuzz api isn't all that great, it suffers from being very C-like due to memory allocation problems. In Go we have the standard library "unicode", "golang.org/x/text/language", and "x/image/font". HarfBuzz uses Ragel parsers which can also be generated as Go language in stead of C. The remaining few thousands lines of code should not be too hard for a manual conversion, at least not for the basic shaping API. This could look something like this:

    buffer = gotesh.Buffer(text)
    buffer.SetDirection(gotesh.LeftToRight)
    buffer.SetScript(language.Bengali.Script());
    buffer.SetLanguage(language.Bengali);
    glyphInfo, glyphPosition, err := buffer.Shape(fontFace,features);

Nothing there yet, but go here if you like the challenge ;)
https://gitlab.com/beoran/gotesh

andydotxyz · 2020-07-24T11:41:46Z

you can simply link fribidi

Unfortunately statically linking fribidi is not an option due to licensing.

andydotxyz · 2020-07-24T11:46:35Z

Thus, even a text shaper that is limited to non-complex scripts will need to implement "font fallback" behavior in practice

Quite right @Nik-U - we have the fallback in place so when someone loads, for example, a Japanese font, they still see the english text that the project includes etc.

I think that realistically there are multiple steps -> user defined font -> app defined font -> language font -> toolkit fallback.

We have avoided doing system lookups this far because a consistent experience was deemed important and some distros (particularly some lightweight Linux ones) don't even ship vector fonts by default.

I did consider a massive combined font file - but this resulted in around 200MB once Japanese and Chinese (traditional and simplified) were added - which is clearly more than we can reasonably embed in binaries.
Mostly for that reason I think we may need to go down the route of a "language pack" that can be downloaded and used in a font-lookup-order mechanism like described above.

beoran · 2020-08-13T07:24:52Z

On investigating the issue, I'd say that HarfBuzz, etc, are not the right idea, while, in stead, Graphite is: https://scripts.sil.org/cms/scripts/page.php?site_id=projects&item_id=graphite_home . In stead of having to program the language rules manually, graphite compiles them directly into the font. All that is needed is to implement a VM. That would be even easier to do in Go.

Move from iterating on rune to glyphID. Fixes fyne-io#598

andydotxyz · 2022-12-02T14:37:02Z

The rendering issues referred to here should be resolved in v2.3.0. The code that follows seems to render well:

package main

import (
	"os"

	"fyne.io/fyne/v2/app"
	"fyne.io/fyne/v2/container"
	"fyne.io/fyne/v2/widget"
)

func main() {
	os.Setenv("FYNE_FONT", "/Users/andy/Downloads/shruti.ttf")
	a := app.New()
	w := a.NewWindow("Hello")

	hello := widget.NewLabel("િદ્ધની")
	w.SetContent(container.NewVBox(
		hello,
	))

	w.ShowAndRun()
}

beoran · 2022-12-02T14:39:23Z

Great job for porting the whole of haarfbuzz from C++ to Go! You are a master amongst masters!

andydotxyz · 2022-12-02T15:41:03Z

Great job for porting the whole of haarfbuzz from C++ to Go! You are a master amongst masters!

I appreciate the enthusiasm, but honestly all of the thanks belong to @benoitkugler for the porting work and @whereswaldon for building most of the https://github.com/go-text/typesetting project that made this possible!
There will be more improvements to the rendering efficiency which we will be sharing through go-text as well, but so far we are standing on the shoulders of giants!

beoran · 2022-12-02T15:43:27Z

Sorry, I had that mixed up. Honor to those who deserve it! Giants indeed!

andydotxyz added this to the 2.0 - Discussing next major release milestone Jan 8, 2020

beoran mentioned this issue Jan 8, 2020

not support chinese #604

Closed

andydotxyz modified the milestones: 2.0 - Discussing next major release, 3.0 - Multimedia and more Apr 6, 2020

andydotxyz mentioned this issue Nov 17, 2020

Special characters missing in default font #480

Closed

andydotxyz added the Internationalisation I18n and support for non-bundled fonts and languages label Jan 5, 2021

andydotxyz mentioned this issue Feb 4, 2021

please add Chinese char support to build-in font #1901

Closed

AlbinoGeek mentioned this issue Feb 19, 2021

utf8 support #1976

Closed

Jacalz mentioned this issue Mar 7, 2021

support chinese characters and emoji bug #2065

Closed

Jacalz mentioned this issue Mar 19, 2021

Chineese character support #2100

Closed

Jacalz mentioned this issue Apr 23, 2021

Arabic letters do not display correctly #2189

Closed

Jacalz mentioned this issue Oct 18, 2021

windows Chinese directory is garbled Jacalz/rymdport#26

Closed

andydotxyz mentioned this issue Mar 30, 2022

please add supported for chinese font fyne-io/defyne#14

Closed

andydotxyz modified the milestones: 3.0 - Multimedia and more, Cragganmore Release (late sept 2022) Jun 6, 2022

andydotxyz added a commit to andydotxyz/fyne that referenced this issue Dec 2, 2022

Fix rendering of compound glyphs

27c1284

Move from iterating on rune to glyphID. Fixes fyne-io#598

andydotxyz mentioned this issue Dec 2, 2022

Fix rendering of compound glyphs #3449

Merged

3 tasks

andydotxyz closed this as completed Dec 2, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Displaying unicode or different language like Bengali doesn't work. #598

Displaying unicode or different language like Bengali doesn't work. #598

monirz commented Jan 4, 2020

andydotxyz commented Jan 4, 2020

monirz commented Jan 5, 2020

andydotxyz commented Jan 5, 2020

monirz commented Jan 5, 2020

monirz commented Jan 5, 2020 •

edited

andydotxyz commented Jan 5, 2020

monirz commented Jan 6, 2020

mrezai commented Jan 8, 2020

mrezai commented Jan 8, 2020

beoran commented Jan 8, 2020 •

edited

andydotxyz commented Jan 8, 2020

andydotxyz commented Jan 8, 2020

kkartaltepe commented Feb 15, 2020

Nik-U commented Jul 24, 2020

kkartaltepe commented Jul 24, 2020

beoran commented Jul 24, 2020 •

edited

andydotxyz commented Jul 24, 2020

andydotxyz commented Jul 24, 2020

beoran commented Aug 13, 2020

andydotxyz commented Dec 2, 2022

beoran commented Dec 2, 2022

andydotxyz commented Dec 2, 2022

beoran commented Dec 2, 2022

Displaying unicode or different language like Bengali doesn't work. #598

Displaying unicode or different language like Bengali doesn't work. #598

Comments

monirz commented Jan 4, 2020

andydotxyz commented Jan 4, 2020

monirz commented Jan 5, 2020

andydotxyz commented Jan 5, 2020

monirz commented Jan 5, 2020

monirz commented Jan 5, 2020 • edited

andydotxyz commented Jan 5, 2020

monirz commented Jan 6, 2020

mrezai commented Jan 8, 2020

mrezai commented Jan 8, 2020

beoran commented Jan 8, 2020 • edited

andydotxyz commented Jan 8, 2020

andydotxyz commented Jan 8, 2020

kkartaltepe commented Feb 15, 2020

Nik-U commented Jul 24, 2020

kkartaltepe commented Jul 24, 2020

beoran commented Jul 24, 2020 • edited

andydotxyz commented Jul 24, 2020

andydotxyz commented Jul 24, 2020

beoran commented Aug 13, 2020

andydotxyz commented Dec 2, 2022

beoran commented Dec 2, 2022

andydotxyz commented Dec 2, 2022

beoran commented Dec 2, 2022

monirz commented Jan 5, 2020 •

edited

beoran commented Jan 8, 2020 •

edited

beoran commented Jul 24, 2020 •

edited