TeX escaping ≥ and ≤ #3444

HFriberg · 2017-02-22T08:39:12Z

The non-strict greater than and smaller than symbols, ≥ and ≤, are not defined in the TeX builder of Sphinx. I've studied the case and found that the perhaps best solution is having these definitions in the preamble:

\\DeclareUnicodeCharacter{2265}{\\ensuremath{\\geq}}
\\DeclareUnicodeCharacter{2264}{\\ensuremath{\\leq}}

I wanted to contribute this to Sphinx, but the question is where? Should this go into the ADDITIONAL_SETTINGS of sphinx/writers/latex.py where \\nobreakspace is defined similarly, or should it rather go into sphinx/util/texescape.py where most unicode definitions seem to be?

The text was updated successfully, but these errors were encountered:

jfbu · 2017-02-22T10:02:22Z

Thanks for the suggestion ! The texescape will apply to all engines, hence is not appropriate: with xetex and luatex you can input character as is with OpenType font, even in math mode with package unicode-math.

The latex.py location could be the place for pdflatex engine. However there are many Unicode characters ;-). Hence my opinion is that this should not be incorporated to Sphinx-core. What do you think @tk0miya ?

If the document uses Unicode characters extensively it is best compiled with xelatex/lualatex and OpenType fonts. If with pdflatex user can always insert the \\DeclareUnicodeCharacter... via latex-elements's 'preamble' key in conf.py or in a use of 'inputenc' key as addition to \\usepackage[utf8]{inputenc}. (or since Sphinx 1.5 using the templating approach briefly evoked at bottom of latex customization.)

By the way I can recommend newunicodechar LaTeX package. With it you can do things like

\newunicodechar{≤}{\ensuremath{\leq}}
\newunicodechar{⩽}{\ensuremath{\leqslant}}% requires amssymb or other math font package

without needing to check a Unicode codepoint (assuming you have some way via your keyboard or copy paste to insert the character).

Note: the rendering of some glyphs may be considered language dependent. For example, French uses ⩽ although 90% of people writing scientific papers have lost that memory due to the big effort to redefine \leq to be \leqslant in TeX, which they don't know how to do.

This is another reason why it complicates things to try to put such mappings in Sphinx-core.

jfbu · 2017-02-22T10:23:20Z

by the way, for the specific case you mentioned you can use \\usepackage[utf8x]{inputenc} for the 'inputenc' key of latex_elements dictionary.

jfbu · 2017-02-22T10:54:25Z

ah sorry, utf8x option causes \DeclareUnicodeCharacter macro to be redefined and (in my brief investigation) it then wants decimal not hexadecimal input. As Sphinx puts \\DeclareUnicodeCharacter{00A0}{\\nobreakspace} in LaTeX file this causes bug.

It appears that with utf8x, one should do \\DeclareUnicodeCharacter{160}{\\nobreakspace}.
Currently, Sphinx user must use template approach to workaround this and use utf8x option with inputenc.

HFriberg · 2017-02-22T11:47:25Z

Thank you! I see the French redefinition is a complicating factor and will just stick to my 'preamble' key in conf.py (sorry for not mentioning that I already knew this trick).

I seem to recall horrors with utf8x (although I forgot why), but the newunicodechar package is a very nice suggestion I will make use of. Great thanks also for mentioning the experimental templating feature in tex, but I am already doing customizations via preamble commands such as:

\\providecommand*{\\DUrolestrikethrough}[1]{\\sout{#1}}

which is sufficient for my needs.

tk0miya · 2017-02-22T16:13:01Z

The latex.py location could be the place for pdflatex engine. However there are many Unicode characters ;-). Hence my opinion is that this should not be incorporated to Sphinx-core. What do you think @tk0miya ?

It's difficult question. Surely there are many unicode characters. so it is hard to incorporate all of them into sphinx-core.
But, as you said, these characters cause the errors with non unicode support LaTeX engines; pdflatex and platex.

So it would be nice to add the characters to texescape.py when we receive issue. I know it is not a perfect way. But it saves our time.

BTW, fundamentally, it would be nice if we move to the engines supports unicode; XeTeX, LuaTeX and upLaTeX.

jfbu added builder:latex type:question labels Feb 22, 2017

This was referenced Feb 22, 2017

setting 'inputenc' key to \\usepackage[utf8x]{inputenc} leads to failed PDF build #3445

Closed

Fix #3445: (latex) make \\usepackage[utf8x]{inputenc} usable #3446

Merged

HFriberg closed this as completed Feb 22, 2017

tk0miya reopened this Feb 22, 2017

tk0miya mentioned this issue Mar 5, 2017

Adding more unicode characters to the generated *.tex file #3511

Closed

jfbu mentioned this issue Oct 9, 2017

U+2212 MINUS SIGN breaking PDF generation #4136

Closed

AA-Turner added this to the some future version milestone Sep 29, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

TeX escaping ≥ and ≤ #3444

TeX escaping ≥ and ≤ #3444

HFriberg commented Feb 22, 2017

jfbu commented Feb 22, 2017 •

edited

jfbu commented Feb 22, 2017

jfbu commented Feb 22, 2017 •

edited

HFriberg commented Feb 22, 2017

tk0miya commented Feb 22, 2017

TeX escaping ≥ and ≤ #3444

TeX escaping ≥ and ≤ #3444

Comments

HFriberg commented Feb 22, 2017

jfbu commented Feb 22, 2017 • edited

jfbu commented Feb 22, 2017

jfbu commented Feb 22, 2017 • edited

HFriberg commented Feb 22, 2017

tk0miya commented Feb 22, 2017

jfbu commented Feb 22, 2017 •

edited

jfbu commented Feb 22, 2017 •

edited