l3text-case: Undefined \@@_change_case_char_UTFviii:nnnNNNN #939

aminophen · 2021-05-23T14:12:40Z

The following code generates an error on pLaTeX:

\RequirePackage{latexbug}
\documentclass{article}
\begin{document}
\ExplSyntaxOn
\text_lowercase:n{日本語}
\ExplSyntaxOff
\end{document}

! Undefined control sequence.
<argument> ...xt_change_case_char_UTFviii:nnnNNNN

Here \@@_change_case_char_UTFviii:nnnNNNN appears:

latex3/l3kernel/l3text-case.dtx

Line 637 in b9fb9cb

{ \@@_change_case_char_UTFviii:nnnNNNN }

The name used in the definition is wrong: \@@_change_case_char_UTFviii:nnnNNNNN has a surplus "N" (the definition takes seven parameters, which matches nnnNNNN):

latex3/l3kernel/l3text-case.dtx

Line 169 in b9fb9cb

% \begin{macro}[EXP]{\@@_change_case_char_UTFviii:nnnNNNNN}

latex3/l3kernel/l3text-case.dtx

Line 647 in b9fb9cb

\cs_new:Npn \@@_change_case_char_UTFviii:nnnNNNNN #1#2#3#4#5#6#7
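
A minimal sketch of why the mismatch produces "Undefined control sequence": in expl3 the argument signature is part of the control sequence name, so two names that differ by a single N are unrelated commands. The \demo_case:... names below are hypothetical and exist only for this illustration; they are not part of l3text-case.

```latex
\documentclass{article}
\begin{document}
\ExplSyntaxOn
% Hypothetical function whose defined name carries one N too many:
% the signature says nNN, but only two parameters are declared.
\cs_new:Npn \demo_case:nNN #1#2 { #1 }
% The name that was actually defined exists ...
\cs_if_exist:NTF \demo_case:nNN
  { \iow_term:n { demo_case:nNN~is~defined } }
  { \iow_term:n { demo_case:nNN~is~undefined } }
% ... but the name a caller would expect (two arguments) does not,
% so using it would raise "Undefined control sequence".
\cs_if_exist:NTF \demo_case:nN
  { \iow_term:n { demo_case:nN~is~defined } }
  { \iow_term:n { demo_case:nN~is~undefined } }
\ExplSyntaxOff
\end{document}
```

Note that \cs_new:Npn does not compare the signature with the parameter text, which is how a surplus N can go unnoticed until the correctly spelled name is first used.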


car222222 · 2021-05-23T15:22:20Z

But you do not have cases for such glyphs:-)!

Thanks for discovering this and counting the Ns.

aminophen · 2021-05-23T15:33:13Z

Actually, I encountered this error when using biblatex: some BIB files containing Japanese entries cannot be processed by biblatex+biber. Adding the option casechanger=latex2e resolved the issue.

FrankMittelbach · 2021-05-23T15:51:36Z

@aminophen what exactly is the intended result of a lowercased kanji? unchanged? or is there a convention?

aminophen · 2021-05-23T15:53:35Z

@FrankMittelbach Unchanged. Actually, the \lowercase / \uppercase primitives have no effect on Japanese characters on (u)pTeX.

FrankMittelbach · 2021-05-23T16:06:08Z

@aminophen so I thought, was just checking, but of course unchanged \neq error :-)

josephwright · 2021-05-23T16:30:31Z

There's also some other bug ... I'll fix the lot

aminophen · 2021-05-23T22:34:12Z

> some other bug

That may be specific to pTeX/upTeX and not occur on 8-bit pdfTeX. The problem lies in the unnecessary and incorrect handling of JP character tokens, which should simply be passed through as-is.

It may be necessary to consider how JP tokens are handled: the safe method differs slightly between pTeX (simple) and upTeX (extended so that catcode information can also be stored for a JP token).

for pTeX

\documentclass{article}
\makeatletter
\def\CHARS#1{%
  \@tfor\xx@char:=#1\do{%
    % from here on, \xx@char is a \def'ed single character token
    % ===== case of pTeX
    % it's really simple:
    %   * 2 byte code = JP token
    %   * 1 byte code = Latin token
    \expandafter\@tempcnta\expandafter=\expandafter`\xx@char\relax
    \ifnum\@tempcnta>255\relax
      \typeout{[\xx@char]: 2 byte = JP}%
    \else
      \typeout{[\xx@char]: 1 byte = Latin}%
    \fi
    % =====
  }%
}
\makeatother
\begin{document}

\CHARS{日あ、αA}% => JP, JP, JP, JP, Latin

\end{document}

for upTeX

\documentclass{article}
\makeatletter
\def\CHARS#1{%
  \@tfor\xx@char:=#1\do{%
    % from here on, \xx@char is a \def'ed single character token
    % ===== case of upTeX
    % concept: using \Ucharcat, generate a character token
    % which has a charcode=256 (outside ASCII) and
    % a kcatcode=16,17,18,19 which represents a JP token
    \expandafter\ifcat\Ucharcat256 16 \xx@char\relax
      \typeout{[\xx@char]: This is 16 = JP}%
    \else
    \expandafter\ifcat\Ucharcat256 17 \xx@char\relax
      \typeout{[\xx@char]: This is 17 = JP}%
    \else
    \expandafter\ifcat\Ucharcat256 18 \xx@char\relax
      \typeout{[\xx@char]: This is 18 = JP}%
    \else
    \expandafter\ifcat\Ucharcat256 19 \xx@char\relax
      \typeout{[\xx@char]: This is 19 = JP}%
    \else
      \typeout{This is not 16--19 = Latin}%
    \fi\fi\fi\fi
    % =====
  }%
}
\makeatother
\begin{document}

\CHARS{日、あ☃é}% => 16, 18, 17, 18, Latin (2 bytes)

\def\JP{あ}% kcatcode is stored as 17

\expandafter\CHARS\JP % => 17

\kcatcode`あ=15 % change "あ" into non-JP

\CHARS{あ}% => Latin (3 bytes)
\expandafter\CHARS\JP % =>17

\end{document}

blefloch · 2021-05-24T10:43:38Z

@aminophen Perhaps you could also comment on the question I just asked on stackexchange about kcatcode? I'm hoping to make various pieces of expl3 (such as l3tl-analysis, the peek analysis code, and l3regex) "do the right thing" in pTeX and upTeX.

aminophen · 2021-05-24T11:15:45Z

Within expl3, you can simply pass Japanese character tokens through as-is (without changing anything). Anyway, OK, I will answer on TeX.SX.
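
As a rough illustration of "pass as-is", the sketch below combines that idea with the pTeX test shown earlier in this thread (character code above 255 means a Japanese token). The helper \demo_lowercase_char:N is hypothetical, handles one character token only, and is not the code that went into l3text-case. It also assumes pTeX only: as noted above, upTeX needs a different test.

```latex
\documentclass{article}
\begin{document}
\ExplSyntaxOn
% Hypothetical helper: lowercase one character token, but leave it
% untouched when running on pTeX and its character code exceeds 255.
\cs_new:Npn \demo_lowercase_char:N #1
  {
    \bool_lazy_and:nnTF
      { \sys_if_engine_ptex_p: }
      { \int_compare_p:nNn { `#1 } > { 255 } }
      { #1 }                      % Japanese token: pass through as-is
      { \text_lowercase:n {#1} }  % anything else: normal case change
  }
% On any engine, an ASCII letter takes the normal case-changing branch.
\demo_lowercase_char:N A
\ExplSyntaxOff
\end{document}
```

Because \bool_lazy_and:nnTF evaluates its second test only when the first is true, the character-code comparison is never reached on engines other than pTeX.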

josephwright · 2021-06-14T12:21:56Z

I'm hoping to at least avoid the hard error - I should have time today


josephwright · 2021-06-14T15:09:54Z

Hmm, just changing the incorrect name looks OK.

josephwright · 2021-06-14T15:15:29Z

Ah, I see ...

josephwright · 2021-06-14T15:59:06Z

Could someone check my idea ... I think I've excluded the right things

FrankMittelbach added the bug Something isn't working label May 23, 2021

josephwright self-assigned this May 23, 2021

aminophen mentioned this issue Jun 14, 2021

Japanese characters in metadata on (u)pLaTeX latex3/pdfresources#18

Open

josephwright added a commit that referenced this issue Jun 14, 2021

Correct an internal function name

01c796e

See #939.

josephwright added a commit that referenced this issue Jun 14, 2021

Avoid case changing high chars in (u)pTeX (issue #939)

0e42f6c

aminophen mentioned this issue Jun 20, 2021

To support expl3 on (u)pTeX texjporg/tex-jp-build#122

Open

josephwright closed this as completed Aug 26, 2021
