Multiscript support in biblatex/biber #416

plk · 2016-04-26T08:48:49Z

No description provided.

duncdrum · 2016-05-03T14:54:06Z

I can't get the MWE from this SE thread to work.

\documentclass{article}
\usepackage{fontspec} 
\usepackage{polyglossia} 
\setdefaultlanguage{english}
\usepackage{xeCJK}
\setCJKmainfont{Hiragino Mincho Pro}

\usepackage[style=authoryear,%
            language=auto,%
            autolang=langname,%
          vform=romanised]{biblatex}
\addbibresource{literature.bib}
\usepackage{filecontents}
\begin{filecontents}{literature.bib}
@COLLECTION{yanagida_zengaku_sosho_1975,
  LANGID = {japanese},
  EDITOR = {柳田聖山},
  EDITOR_romanised = {Yanagida, Seizan},
  TITLE = {禪學叢書},
  TITLE_romanised = {Chūbun shuppansha},
  TITLE_translated_english = {Collected Materials for the Study of Zen},
  LOCATION = {京都},
  LOCATION_romanised = {Kyōto},
  LOCATION_translated_english = {Kyoto},
  PUBLISHER = {中文出版社},
  PUBLISHER_romanised = {Chūbun shuppansha},
  DATE = {1974/1977}
}
\end{filecontents}
\begin{document}
Hello World.\footcite{yanagida_zengaku_sosho_1975}
\nocite{*}

\printbibliography
\end{document}
}

The vform=romanised isn't recognised

Package xkeyval Error: `vform' undefined in families `blx@opt@pre'.

Is this still be the preferred option to include multi-lingual data in a biblatex file? It seems that romanised is a specific use case, whereas transliteration would be more fitting. Sanskrit texts for example where transcribed into Chinese, without being "romanised." Are there limits on the number of transcriptions and translation one can include?

Are there any restrictions for the contents of the LANGID field? Most catalogues that include language information seem to have these in ISO form. Does each item require one primary language, or would a bilingual edition have two LANGIDs, how do these interact with babel/polyglossia?

plk · 2016-05-03T15:20:29Z

The multiscript code was never in a released version and was in a separate git branch but it is currently in limbo and hasn't been updated in a long time because it really over-complicated the biblatex internals and I hit several problems. I would like to look at it again at some point but at the moment it's not really useable.

duncdrum · 2016-05-03T15:30:16Z

I see so for biblatex export of our records, we should wrap transliteration and translation into standard biblatex fields?

...
EDITOR = {Yanagida~Seizan, 柳田聖山},
...

since biblatexml seems to be in a similar state of limbo?

plk · 2016-05-03T15:50:25Z

It's not so much anything to do with the data source format (biblatexml has had a lot of work for version 3.4 which will be released soon) but the internals which handle multi-script. There is no current obvious way to deal with this apart from perhaps with the related entry functionality - you could have multiple entries with RELATED fields and then you'd have to write driver macros to support this. You could ask on TSE. That format you suggest won't work because name fields have to be parsed by the usual bibtex name parsing rules.

maieul · 2016-05-06T12:04:18Z

dear @plk, I will be glad to make any feedback for test in such features. If I remember well conversation on previous topic, two points are very important:

the fact that biblatex and biber setting must be correlated
the possibility to specify normalisated form in different language for different field of the same entry.

u-fischer · 2016-05-19T09:09:32Z

I'm too still interested in this topic. Imho problems are

lists (of location or names) as it can happen that only some names needs a translation/transscription.
the handling of names where "first name lastname" doesn't work in the same way as in english.
How to avoid too complicated "if this variant exists then ... else if this variant exist ... else ..." chains in the drivers.

plk · 2016-05-19T09:18:07Z

It's exactly point three you mention that made me think that the last PoC I did of this was not the right way - it got extremely complicated and ugly in the internals. Point two about names should I think be possible already with the name changes in 3.3 - that was partly the motivation - you can define custom name parts and also customise how sort keys are constructed for name parts (see 93-nameparts.tex sample file which implements basic Russian patronymics).

A problem I forsee is how to determine labelname, labeltitle etc. without demanding that every field they reference has every script variant defined. The alternative is a messy interface to select the script variant of them and I'd rather avoid that.

u-fischer · 2016-05-19T09:38:23Z

Can the name parts interface handle different name types in one author list in one go? That means a chinese name, a russian name with patronoymics and a german? And the main question: How can one manage a bib in xml-format. Things like your answer here http://tex.stackexchange.com/a/308761/2388 looks very good but I doubt that user want to write xml manually.

Perhaps biber could handle an input syntax like this:

 author={\namepart{family}{Fischer} \namepart{given}{Ulrike}  and \namepart{family}{...} \namepart{prefix}{von}  }

Then one wouldn't have to convert everything to xml to explore the power of the \namepart system.

Regarding the labelname: I think one shouldn't overdo the automation but allow user to define them manually if the wishes gets to special, e.g. labelname_typeX={...} with some interface to select such labelnames.

plk · 2016-05-19T09:44:46Z

I would need to experiment a bit but essentially, any name parts defined by \DeclareDatamodelConstant[type=list]{nameparts}{ ... } are available to names. This doesn't really address the multi-script requirement though. The XML format has had a lot of work for 3.4. You don't need to write it by hand - tool mode can convert to/from this (I use the XML format for my own work).

I am a bit loathe to extend the bibtex format as it usually means hacking the btparse C library and this is painful and fragile. It's also a general CPAN module and so it must always remain backwards compatible with any generic bibtex usage.

u-fischer · 2016-05-19T12:19:00Z

The problem with converting bib<->xml is that it will (probably) only work as long the content is usable in both formats. But I do understand that extending the format of normal name fields is difficult. What about a new field format with a strict input syntax with name parts? Then one could use xauthor={....}.

Btw: I get errors when converting to xml and back to bib:

    G:\biblatextest>biber --tool --output-format=biblatexml biblatex-examples.bib
    G:\biblatextest>biber --tool --output-format=bibtex biblatex-examples_bibertool.bltxml

plk · 2016-05-19T15:23:26Z

You need to use --input-format=biblatexml on the second run. I suppose I could auto-detect this from the filename extension.

plk · 2016-05-21T17:27:14Z

@u-fischer - I have added an extended name format for bibtex data sources when using biber. It allows you to specify the name parts explicitly and you can mix and match this with normal bibtex names:

AUTHOR = {Alan Smith and family=Brown, prefix=de, given=Robert}

I'd rather not encourage tex markup in names, hence this format (which has to be handled in biber anyway).

Detection of which parsing routine to use is automatic but you can turn off extended name format parsing with a biber flag in case of issues. It also allows explicit specification of prefices and supports any custom nameparts defined in the data model. See the biber PDF doc and the 93-nameparts.tex which comes with biblatex which uses both biblatexml and this extended bibtex format. This is in 2.6/3.5 dev versions.

u-fischer · 2016-05-21T21:51:50Z

This sounds very good, I will try it tomorrow -- and it is naturally ok that it not a TeX-syntax, I only used it because I'm used to.

plk · 2016-05-23T22:06:05Z

Actually, you are right about the bib<->biblatexml round-trip. There is currently no support for biblatexml->bib, only bib->bib.

EDIT: See below, this is now possible - tool mode can now convert between anything, including the extended name format and normal name format.

Shinoto-github · 2016-05-26T02:22:19Z

I am new to this discussion and cannot really help developing code etc.

But I can speak Japanese and have rudimentary knowledge of Chinese and Korean, and I do write in the humanities in several languages, using multilingual bibliographies (including Western languages). I will be glad to help with comments if you wish so and can test documents. If this is more of a nuisance, do not hesitate to tell me. This is OK for me.

As for author names, the solution of plk AUTHOR = {Alan Smith and family={Brown}, prefix={de}, given={Robert}} looks good. There is another case with generation names in Korean and Chinese; some want them mentioned isolated, others include them into their personal (given) name.

This is all for now.

plk · 2016-05-26T07:37:28Z

This could be helpful when I get time to look more into it. The new name format you mention is already in bibaltex 3.5/biber 2.6 development versions (on sourceforge) and you can define any new nameparts you need to deal with things like generation names. However, the main issue with multilingual support is having multiple copies of the same field in the same bibliography data entry and this is something which is quite hard to implement.

Shinoto-github · 2016-05-26T08:06:15Z

Great to hear about the name format in the development versions!

I see three main problems with Far Eastern sources which I will explain below. If I understand something wrong, please do not take your precious time to correct my view. I would be embarrassed if my comments steal your time rather than help finding a solution.

Different formats in the same bibliography. Some Far Eastern journals request separation of Western and Far East languages; so it is easy to get two different bibliographies and it should be easy to apply a style that is appropriate to the language. But since the style is chosen in the preamble, it must be a universal style for different cases.
Brackets for certain fields (like series titles) are different to those in Western languages. If a package like csquotes can deal with Japanese etc. brackets, and we could choose them in a modification of the style, that would make the bibliography look a lot better.
As for multiple name versions, there are mostly these types: (1) Translations into different languages (for institutions), (2) Transscripts according to different transscript systems into different writing systems, (3) Original writing, and (4) Transcription for identification. The fourth is particularly important in styles similar to authoryear, where the author is identified and mentioned once, whereas each entry might have a different original or transscript or translation. -- But basically, every field can have these variations, a title can be translated or transscribed as well.

For the time being, I am writing my biblatex files with the ID transscript (4) in romanised form into the normal author and editor field, for the other cases I use a field-naming-system that adds the type and the intended language or writing system to the field name. E.g. "author" comes as "author-trsscpt-hepburn" or "author-trslation-de" or "author-orig-ja" etc. -- Whenever I use the entries, for now I use regular expressions to create the fields that the authoryear style recognises in order to get the respective data printed.

moewew · 2016-06-05T16:10:28Z

The new way to give name parts explicitly is really useful. I still think that the biblatexml is a bit too overwhelming for the user, while .bib files are very easy to understand.

Would it also be possible to give the sortnamekeyscheme in this model (per-name)?

plk · 2016-06-05T17:16:48Z

@moewew - yes, that's important. It should work now for per-namelist and per-name scope in bibtex datasources - see the biblatex doc on \DeclareSortingNamekeyScheme.

I agree about biblatexml - until there is some GUI interface to a backend XML format like this, it's not very easy to see things at a glance even though it's conceptually easier and less prone to errors.

moewew · 2016-06-06T07:38:29Z

That works very well, thank you.

I noticed that Biber complains (use of uninitialized value in (.)) if a name does not include a family part. The output is fine, but it seems Biber expects names to have a family part (maybe this is connected to the next observation). With the new name scheme it would probably be necessary to allow for customisation of the uniquename option (per sortnamekeyscheme or something new).

This all came up in Bibtex/Biber: how to cite an author using Ethiopian conventions? on TeX.SX.

MWE (for the use of uninitialized value in (.))

\documentclass{article}
\usepackage{filecontents}

\begin{filecontents*}{\jobname.bib}
@book{james,
  author  = {given=James},
  title   = {Test},
  date    = {1983}
}
\end{filecontents*}

\usepackage[style=authoryear]{biblatex}

\addbibresource{\jobname.bib}

\begin{document}
\textcite{james}
\printbibliography
\end{document}

For me the allure of the .bib format is that you don't need a GUI to work with it comfortably. (Judging by the number of questions on TeX.SX about JabRef and other exporters they can even cause more problems than they solve.) Even though I have a soft spot for XML and agree that it is a nicer format to store the data, the biblatexml format is a bit too verbose for me to work with manually.

plk · 2016-06-06T09:08:16Z

Yes, I need to remove the last traces of assumption that every name has a family name - that's been hard-coded into biblatex/biber for a long time. Then I think uniquename etc. need to be customisable. Looking into it.

simifilm · 2016-06-06T09:33:46Z

Probably a subject for a separate discussion: While I really like the new flexible data model for names, I think biblatex finally reaches a point where we have to think more about how GUIs can handle the changes.

I just added pen names to biblatex-fiwiwhich works fine. But if I have an entry like

Author = {given={William}, family={Atheling}, suffix={Jr.}, truefamily={Blish}, truegiven={James}},

in a .bib file, BibDesk, my GUI of choice, can't handle it anymore. And, of course, the biblatexml can't be handled by any application.

I realize that the biblatex devs can't also take care of the various GUIs, and unfortunately, the BibDesk haven't been very forthcoming about biblatex in the past, but I think some kind of exchange or communication between with some GUI devs could be established, this would be a big boon.

OTOH, if I end up with a .bib file which I can only edit with a text editor, that would, at least for me, be a big step backwards.

maieul · 2016-06-06T09:53:28Z

indeed, the Bibdesk team is not very open to biblatex. For example, they don't want mechanism of nested crossref.

However, may I suggest to use : as separator inside field

Author = {given:{William}, family:{Atheling}, suffix:{Jr.}, truefamily:{Blish}, truegiven:{James}}

That will make the GUI be compatible without any modification

plk · 2016-06-06T10:13:13Z

Good idea - I only use Emacs and so I am not really aware of the GUI situation.

maieul · 2016-06-06T10:21:05Z

I have tested with bibdesk: ":" is working. What should be tested is, I think, Zotero (with https://github.com/retorquere/zotero-better-bibtex) and JabRef.

u-fischer · 2016-06-06T10:21:49Z

However, may I suggest to use : as separator inside field

I can't test it, but why should bibdesk care about the separator (colon or equal sign)? Imho the only thing that should matter in a "normal" bibtex application is the numbers of commas.

plk · 2016-06-06T10:23:49Z

I imagine it confuses it with the = after the field name. I will make the separator configurable.

mariashinoto · 2020-02-12T02:28:52Z

I am not a linguist, but I understand transcription as an effort to write a word in another writing system such that the user of this different writing system can understand the pronunciation in the framework of his or her own language. Thhe second part of this concept is getting less important, as I understand from newer developments. E.g. there is Vygotsky and Wigotski etc., where Vygotsky is definitely not what a German reader would easily understand but is confronted with in newer German texts more frequently nowadays. Originally it was transcribed "Wigotski", which is very easy to pronounce correctly. Still, I would call both "transcriptions" rather than "transliterations". A transliteration tries to mimic the original writing system regardless of how something is pronounced in the any language. At least in my understanding. E.g. the Mount Fuji in Japanese used to be transcribed "Fudschi" in German earlier, now the transcription "Fuji" has become the rule, but a transliteration would be "Huzi", and this is also an official Japanese transcription. I personally would prefer to use the term "transcription" in all cases, so there is no struggle with "what am I dealing with at the moment?". But, I am not the expert. Maria

…

Am 12.02.2020 um 10:44 schrieb Paulo Ney de Souza ***@***.***>: Can we state what is meant by "transcription" and what is meant by "transliteration" in the context of BibLaTeX -- in the manual -- and with examples! -- specially well accepted examples? I share several of the concerns Maria has brought up -- not exactly with the same solution and the root of the problem is probably on the definition of transcription/translitertation. Paulo Ney On Tue, Feb 11, 2020 at 4:34 PM zaw-shinoto ***@***.***> wrote: > Thanks for the sample. As you know I am not "in" the technology, so I just > reply on the language features and the bibliography output. > > > > (A) > > > JOURNALTITLE_transliteration_ja-Latn = {Kumamoto Kofun Kenkyū}, > > This naming of the transliteration is not helpful (maybe this was from an > early stage of development?); we have several transliterations (as in > Korean and Chinese), and different journals ask for different > transliterations. So we need e.g. "hepburn", "JIS somewhat" etc. like this. > > > JOURNALTITLE_transliteration_ja-hepburn = {Kumamoto Kofun Kenkyū}, > > or > > > JOURNALTITLE_transliteration_ja-hepburn-corrupted = {Kumamoto Kofun > Kenkyu}, > > And some are rather transcriptions than transliterations, but this is > nothing of importance as long as the terminology is usable and the results > are OK. > > > (B) > > > > TITLE_transcription_ja-Hira = {けさおたかつかこふんのゆき}, > > Why is this a transcription rather than a transliteration or a writing > variant? Just a question, as long as it works. > > Besides, I do not think that mixing upper case letters and lower case > letters is a good idea. It may be a good idea with the main fiel like > "JOURNALTITLE", I like it personally. But for the variations we should use > only lower case because otherwise the thinking starts "Was this lower case > or upper case?". It does not matter for the transformation if I remember > right, but what if engines start recognizing upper case and lower case. > > Same for underscores and minus. OK, one might agree in separation of > variants for transliterations and transcriptions. But basically, all the > same, either this or that, is easier. > > > (C) > > > AUTHOR = {阿南,*,亨}, > > > Please ask Ulrike Fischer why the star had been introduced. The name > normally would be written 阿南享 or 阿南享, and the colons are at the right > place for Biblatex, since 享 is the personal name. In your bibliography > example, the star leads some problems. > > > (D) About the image of the bibliography > > 1. The star in both versions of the author name should disappear. > > 2. The question marks around Japanese text should be replaced by Japanese > brackets 『』and「」respectively. -- I am testing Zotero and CSL at the moment, > they have nice translations for brackets and fixed terms like "ed." etc. > (But these also are not exactly what the papers require). > > 3. Same (like 1., 2.) about the hiragana version. But we do not really > need it this way, it is basically for sorting, since there are several ways > to sort Japanese beside the alphabetical order. > > > > Thank your very much for your effort! > > Maria > > > > > > > > > Am 12.02.2020 um 04:36 schrieb plk ***@***.***>: > > > > Just to keep this somewhere, here is a somewhat contrived example with > one of your entries, cleaned up to use the correct alternates with the > results currently. This functionality is available in the experimental v4.0 > of biblatex and biber. I am ignoring that biblatex has no japanese strings > localisation currently: > > > > \documentclass[a4paper]{article} > > \usepackage{luatexja-fontspec} > > \setmainfont{TeXGyrePagella-Regular} > > \setmainjfont{IPAexMincho} > > \usepackage[english,main=japanese]{babel} > > \begin{filecontents}[force]{\jobname.bib} > > @Article{Anan_2010_Pfeilkoecher-Kesao, > > LANGID = {japanese}, > > AUTHOR = {阿南,*,亨}, > > AUTHOR_transcription_ja-Hira = {あなん,*,とおる？}, > > AUTHOR_transliteration_ja-Latn = {Anan, Tōru}, > > TITLE = {袈裟尾高塚古墳の靫}, > > TITLE_transcription_ja-Hira = {けさおたかつかこふんのゆき}, > > TITLE_transliteration_ja-Latn = {Kesao takazuka kofun no yuki}, > > TITLE_translation_en = {The quivers of the Kesao tumulus}, > > JOURNALTITLE = {熊本古墳研究}, > > JOURNALTITLE_transcription_ja-Hira = {くまもとこふんけんきゅう}, > > JOURNALTITLE_transliteration_ja-Latn = {Kumamoto Kofun Kenkyū}, > > PAGES = {44-54}, > > VOLUME = 3, > > YEAR = 2010, > > MONTH = 5, > > OWNER = {M. Shinoto}, > > TIMESTAMP = {2014.03.16}, > > WHAT = {Sonderdruck} > > } > > \end{filecontents} > > \usepackage{csquotes} > > \usepackage[style=authoryear,% > > dynamiclabel=true,% > > language=auto,% > > autolang=other,% > > autofieldlang=other]{biblatex} > > \addbibresource{\jobname.bib} > > > > % Temporary, until we have a real babel/polyglossia<->BCP47 mapping > capability > > \maplangtag{ja-latn}{english}% Use babel english for latin script > Japanese > > \maplangtag{ja-hira}{japanese}% Use babel japanese for Hiragana script > Japanese > > \DeclareMsselect{japanese}{ > > \alternate{transcription}{ja-hira} > > \alternate{transliteration}{ja-latn} > > } > > \begin{document} > > \cite{Anan_2010_Pfeilkoecher-Kesao} > > \printbibliography > > \newrefcontext[msform=transcription,mslang=ja-hira] > > \printbibliography > > \newrefcontext[msform=transliteration,mslang=ja-latn] > > \printbibliography > > \end{document} > > > > > > > > — > > You are receiving this because you were mentioned. > > Reply to this email directly, view it on GitHub, or unsubscribe. > > > > — > You are receiving this because you were mentioned. > Reply to this email directly, view it on GitHub > <#416?email_source=notifications&email_token=AAR7WYVFDWQKTWXLNF7KCF3RCM7X7A5CNFSM4CB64K52YY3PNVWWK3TUL52HS4DFVREXG43VMVBW63LNMVXHJKTDN5WW2ZLOORPWSZGOELOYUZA#issuecomment-584944228>, > or unsubscribe > <https://github.com/notifications/unsubscribe-auth/AAR7WYXYZ63J2J5467L5BSLRCM7X7ANCNFSM4CB64K5Q> > . > — You are receiving this because you were mentioned. Reply to this email directly, view it on GitHub, or unsubscribe.

pauloney · 2020-02-12T03:08:38Z

Your understanding is a bit twisted and I would refer you to a simple definition (and differences) at: https://scriptsource.org/cms/scripts/page.php?item_id=entry_detail&uid=gslmka8xq3 as stated there (on the third paragraph) I quote here ...the Devanagari letter is usually transliterated in the Latin script as ‘j’ or ‘ja’, which roughly approximates its pronunciation in most contexts. However, a transcription system (such as the IPA) would transcribe the same character, , as 'dʒə' if the source text was in Hindi, but as 'ɟʝə' or 'zə' if the source text was in Marathi... and this is a very important difference for biblatex -- transliterations are concerned with how a character is pronounced in the new language. Transcriptions depend on the SOURCE language, if it is to be understood and so, its definition in the .bib file should be attached to TWO languages, something like: JOURNALTITLE_transliteration_mr-Deva_en-Latn Paulo Ney On Tue, Feb 11, 2020 at 6:28 PM zaw-shinoto <notifications@github.com> wrote:

…

I am not a linguist, but I understand transcription as an effort to write a word in another writing system such that the user of this different writing system can understand the pronunciation in the framework of his or her own language. Thhe second part of this concept is getting less important, as I understand from newer developments. E.g. there is Vygotsky and Wigotski etc., where Vygotsky is definitely not what a German reader would easily understand but is confronted with in newer German texts more frequently nowadays. Originally it was transcribed "Wigotski", which is very easy to pronounce correctly. Still, I would call both "transcriptions" rather than "transliterations". A transliteration tries to mimic the original writing system regardless of how something is pronounced in the any language. At least in my understanding. E.g. the Mount Fuji in Japanese used to be transcribed "Fudschi" in German earlier, now the transcription "Fuji" has become the rule, but a transliteration would be "Huzi", and this is also an official Japanese transcription. I personally would prefer to use the term "transcription" in all cases, so there is no struggle with "what am I dealing with at the moment?". But, I am not the expert. Maria > Am 12.02.2020 um 10:44 schrieb Paulo Ney de Souza < ***@***.***>: > > Can we state what is meant by "transcription" and what is meant by > "transliteration" in the context of BibLaTeX -- in the manual -- and with > examples! -- specially well accepted examples? I share several of the > concerns Maria has brought up -- not exactly with the same solution and the > root of the problem is probably on the definition of > transcription/translitertation. > > Paulo Ney > > On Tue, Feb 11, 2020 at 4:34 PM zaw-shinoto ***@***.***> > wrote: > > > Thanks for the sample. As you know I am not "in" the technology, so I just > > reply on the language features and the bibliography output. > > > > > > > > (A) > > > > > JOURNALTITLE_transliteration_ja-Latn = {Kumamoto Kofun Kenkyū}, > > > > This naming of the transliteration is not helpful (maybe this was from an > > early stage of development?); we have several transliterations (as in > > Korean and Chinese), and different journals ask for different > > transliterations. So we need e.g. "hepburn", "JIS somewhat" etc. like this. > > > > > JOURNALTITLE_transliteration_ja-hepburn = {Kumamoto Kofun Kenkyū}, > > > > or > > > > > JOURNALTITLE_transliteration_ja-hepburn-corrupted = {Kumamoto Kofun > > Kenkyu}, > > > > And some are rather transcriptions than transliterations, but this is > > nothing of importance as long as the terminology is usable and the results > > are OK. > > > > > > (B) > > > > > > > TITLE_transcription_ja-Hira = {けさおたかつかこふんのゆき}, > > > > Why is this a transcription rather than a transliteration or a writing > > variant? Just a question, as long as it works. > > > > Besides, I do not think that mixing upper case letters and lower case > > letters is a good idea. It may be a good idea with the main fiel like > > "JOURNALTITLE", I like it personally. But for the variations we should use > > only lower case because otherwise the thinking starts "Was this lower case > > or upper case?". It does not matter for the transformation if I remember > > right, but what if engines start recognizing upper case and lower case. > > > > Same for underscores and minus. OK, one might agree in separation of > > variants for transliterations and transcriptions. But basically, all the > > same, either this or that, is easier. > > > > > > (C) > > > > > AUTHOR = {阿南,*,亨}, > > > > > > Please ask Ulrike Fischer why the star had been introduced. The name > > normally would be written 阿南享 or 阿南享, and the colons are at the right > > place for Biblatex, since 享 is the personal name. In your bibliography > > example, the star leads some problems. > > > > > > (D) About the image of the bibliography > > > > 1. The star in both versions of the author name should disappear. > > > > 2. The question marks around Japanese text should be replaced by Japanese > > brackets 『』and「」respectively. -- I am testing Zotero and CSL at the moment, > > they have nice translations for brackets and fixed terms like "ed." etc. > > (But these also are not exactly what the papers require). > > > > 3. Same (like 1., 2.) about the hiragana version. But we do not really > > need it this way, it is basically for sorting, since there are several ways > > to sort Japanese beside the alphabetical order. > > > > > > > > Thank your very much for your effort! > > > > Maria > > > > > > > > > > > > > > > > > Am 12.02.2020 um 04:36 schrieb plk ***@***.***>: > > > > > > Just to keep this somewhere, here is a somewhat contrived example with > > one of your entries, cleaned up to use the correct alternates with the > > results currently. This functionality is available in the experimental v4.0 > > of biblatex and biber. I am ignoring that biblatex has no japanese strings > > localisation currently: > > > > > > \documentclass[a4paper]{article} > > > \usepackage{luatexja-fontspec} > > > \setmainfont{TeXGyrePagella-Regular} > > > \setmainjfont{IPAexMincho} > > > \usepackage[english,main=japanese]{babel} > > > \begin{filecontents}[force]{\jobname.bib} > > > @Article{Anan_2010_Pfeilkoecher-Kesao, > > > LANGID = {japanese}, > > > AUTHOR = {阿南,*,亨}, > > > AUTHOR_transcription_ja-Hira = {あなん,*,とおる？}, > > > AUTHOR_transliteration_ja-Latn = {Anan, Tōru}, > > > TITLE = {袈裟尾高塚古墳の靫}, > > > TITLE_transcription_ja-Hira = {けさおたかつかこふんのゆき}, > > > TITLE_transliteration_ja-Latn = {Kesao takazuka kofun no yuki}, > > > TITLE_translation_en = {The quivers of the Kesao tumulus}, > > > JOURNALTITLE = {熊本古墳研究}, > > > JOURNALTITLE_transcription_ja-Hira = {くまもとこふんけんきゅう}, > > > JOURNALTITLE_transliteration_ja-Latn = {Kumamoto Kofun Kenkyū}, > > > PAGES = {44-54}, > > > VOLUME = 3, > > > YEAR = 2010, > > > MONTH = 5, > > > OWNER = {M. Shinoto}, > > > TIMESTAMP = {2014.03.16}, > > > WHAT = {Sonderdruck} > > > } > > > \end{filecontents} > > > \usepackage{csquotes} > > > \usepackage[style=authoryear,% > > > dynamiclabel=true,% > > > language=auto,% > > > autolang=other,% > > > autofieldlang=other]{biblatex} > > > \addbibresource{\jobname.bib} > > > > > > % Temporary, until we have a real babel/polyglossia<->BCP47 mapping > > capability > > > \maplangtag{ja-latn}{english}% Use babel english for latin script > > Japanese > > > \maplangtag{ja-hira}{japanese}% Use babel japanese for Hiragana script > > Japanese > > > \DeclareMsselect{japanese}{ > > > \alternate{transcription}{ja-hira} > > > \alternate{transliteration}{ja-latn} > > > } > > > \begin{document} > > > \cite{Anan_2010_Pfeilkoecher-Kesao} > > > \printbibliography > > > \newrefcontext[msform=transcription,mslang=ja-hira] > > > \printbibliography > > > \newrefcontext[msform=transliteration,mslang=ja-latn] > > > \printbibliography > > > \end{document} > > > > > > > > > > > > — > > > You are receiving this because you were mentioned. > > > Reply to this email directly, view it on GitHub, or unsubscribe. > > > > > > > — > > You are receiving this because you were mentioned. > > Reply to this email directly, view it on GitHub > > < #416?email_source=notifications&email_token=AAR7WYVFDWQKTWXLNF7KCF3RCM7X7A5CNFSM4CB64K52YY3PNVWWK3TUL52HS4DFVREXG43VMVBW63LNMVXHJKTDN5WW2ZLOORPWSZGOELOYUZA#issuecomment-584944228>, > > or unsubscribe > > < https://github.com/notifications/unsubscribe-auth/AAR7WYXYZ63J2J5467L5BSLRCM7X7ANCNFSM4CB64K5Q> > > . > > > — > You are receiving this because you were mentioned. > Reply to this email directly, view it on GitHub, or unsubscribe. > — You are receiving this because you were mentioned. Reply to this email directly, view it on GitHub <#416?email_source=notifications&email_token=AAR7WYXBTKEDSLSOJDCRDIDRCNNGNA5CNFSM4CB64K52YY3PNVWWK3TUL52HS4DFVREXG43VMVBW63LNMVXHJKTDN5WW2ZLOORPWSZGOELPDHAA#issuecomment-584987520>, or unsubscribe <https://github.com/notifications/unsubscribe-auth/AAR7WYRWCHEEB3IVILYBLULRCNNGNANCNFSM4CB64K5Q> .

norbusan · 2020-02-12T03:26:59Z

Just to chime in a bit, I agree completely with @pauloney , as I did study these kind of things during my (Tibetan/Sanskrit/Ancient Greek) studies.

Transliteration is used to convey a proper pronounciation, something like the current use of Katakana in Japanese (and the use of Hiragana before the reform).

Transcription is just a representation change that is reversible. That means, that a transcription needs to be reversible, that is, one can return from the transcribed text to the original without errors (in theory).

To pick up the example of @zaw-shinoto , actually both "Fuji" and "Huzi" are transcriptions, but different ones, one is Hepburn (revised or not) and one is Nihon-shiki or Kunrei-shiki. To be honest, both of them are actually not "transcriptions", because there is no way to distinguish Hiragana from Katakana from Kanji in it, and thus one cannot return to the original text. But this is a different topic.

pauloney · 2020-02-12T03:53:45Z

Sorry that I had a copy and paste mistake, it should have been: JOURNALTITLE_transcription_mr-Deva_en-Latn Paulo Ney

…

On Tue, Feb 11, 2020 at 7:27 PM Norbert Preining ***@***.***> wrote: Just to chime in a bit, I agree completely with @pauloney <https://github.com/pauloney> , as I did study these kind of things during my (Tibetan/Sanskrit/Ancient Greek) studies. Transliteration is used to convey a proper pronounciation, something like the current use of Katakana in Japanese (and the use of Hiragana before the reform). Transcription is just a representation change that is reversible. That means, that a transcription needs to be reversible, that is, one can return from the transcribed text to the original without errors (in theory). To pick up the example of @zaw-shinoto <https://github.com/zaw-shinoto> , actually both "Fuji" and "Huzi" are transcriptions, but different ones, one is Hepburn (revised or not) and one is Nihon-shiki or Kunrei-shiki. To be honest, both of them are actually not "transcriptions", because there is no way to distinguish Hiragana from Katakana from Kanji in it, and thus one cannot return to the original text. But this is a different topic. — You are receiving this because you were mentioned. Reply to this email directly, view it on GitHub <#416?email_source=notifications&email_token=AAR7WYVEN3UGLWJ66C6J7YDRCNUALA5CNFSM4CB64K52YY3PNVWWK3TUL52HS4DFVREXG43VMVBW63LNMVXHJKTDN5WW2ZLOORPWSZGOELPIKHY#issuecomment-585008415>, or unsubscribe <https://github.com/notifications/unsubscribe-auth/AAR7WYS6XYER4CTJEDNMVCDRCNUALANCNFSM4CB64K5Q> .

mariashinoto · 2020-02-12T04:15:34Z

Thanks, actually, such differentiation does not change the way in which I use biblatex, and using only one term helps me to get rid of such discussions. I had a look on that link and the word "continuum" sprung into my eye, and this is enough for me, I will leave the discussion to those who care. I can understand those that do care. Best, Maria

…

Am 12.02.2020 um 12:08 schrieb Paulo Ney de Souza ***@***.***>: Your understanding is a bit twisted and I would refer you to a simple definition (and differences) at: https://scriptsource.org/cms/scripts/page.php?item_id=entry_detail&uid=gslmka8xq3 as stated there (on the third paragraph) I quote here ...the Devanagari letter is usually transliterated in the Latin script as ‘j’ or ‘ja’, which roughly approximates its pronunciation in most contexts. However, a transcription system (such as the IPA) would transcribe the same character, , as 'dʒə' if the source text was in Hindi, but as 'ɟʝə' or 'zə' if the source text was in Marathi... and this is a very important difference for biblatex -- transliterations are concerned with how a character is pronounced in the new language. Transcriptions depend on the SOURCE language, if it is to be understood and so, its definition in the .bib file should be attached to TWO languages, something like: JOURNALTITLE_transliteration_mr-Deva_en-Latn Paulo Ney On Tue, Feb 11, 2020 at 6:28 PM zaw-shinoto ***@***.***> wrote: > I am not a linguist, but I understand transcription as an effort to write > a word in another writing system such that the user of this different > writing system can understand the pronunciation in the framework of his or > her own language. > > Thhe second part of this concept is getting less important, as I > understand from newer developments. E.g. there is Vygotsky and Wigotski > etc., where Vygotsky is definitely not what a German reader would easily > understand but is confronted with in newer German texts more frequently > nowadays. Originally it was transcribed "Wigotski", which is very easy to > pronounce correctly. Still, I would call both "transcriptions" rather than > "transliterations". > > A transliteration tries to mimic the original writing system regardless of > how something is pronounced in the any language. At least in my > understanding. > > E.g. the Mount Fuji in Japanese used to be transcribed "Fudschi" in German > earlier, now the transcription "Fuji" has become the rule, but a > transliteration would be "Huzi", and this is also an official Japanese > transcription. > > I personally would prefer to use the term "transcription" in all cases, so > there is no struggle with "what am I dealing with at the moment?". > > > But, I am not the expert. > > Maria > > > > > > > Am 12.02.2020 um 10:44 schrieb Paulo Ney de Souza < > ***@***.***>: > > > > Can we state what is meant by "transcription" and what is meant by > > "transliteration" in the context of BibLaTeX -- in the manual -- and > with > > examples! -- specially well accepted examples? I share several of the > > concerns Maria has brought up -- not exactly with the same solution and > the > > root of the problem is probably on the definition of > > transcription/translitertation. > > > > Paulo Ney > > > > On Tue, Feb 11, 2020 at 4:34 PM zaw-shinoto ***@***.***> > > wrote: > > > > > Thanks for the sample. As you know I am not "in" the technology, so I > just > > > reply on the language features and the bibliography output. > > > > > > > > > > > > (A) > > > > > > > JOURNALTITLE_transliteration_ja-Latn = {Kumamoto Kofun Kenkyū}, > > > > > > This naming of the transliteration is not helpful (maybe this was from > an > > > early stage of development?); we have several transliterations (as in > > > Korean and Chinese), and different journals ask for different > > > transliterations. So we need e.g. "hepburn", "JIS somewhat" etc. like > this. > > > > > > > JOURNALTITLE_transliteration_ja-hepburn = {Kumamoto Kofun Kenkyū}, > > > > > > or > > > > > > > JOURNALTITLE_transliteration_ja-hepburn-corrupted = {Kumamoto Kofun > > > Kenkyu}, > > > > > > And some are rather transcriptions than transliterations, but this is > > > nothing of importance as long as the terminology is usable and the > results > > > are OK. > > > > > > > > > (B) > > > > > > > > > > TITLE_transcription_ja-Hira = {けさおたかつかこふんのゆき}, > > > > > > Why is this a transcription rather than a transliteration or a writing > > > variant? Just a question, as long as it works. > > > > > > Besides, I do not think that mixing upper case letters and lower case > > > letters is a good idea. It may be a good idea with the main fiel like > > > "JOURNALTITLE", I like it personally. But for the variations we should > use > > > only lower case because otherwise the thinking starts "Was this lower > case > > > or upper case?". It does not matter for the transformation if I > remember > > > right, but what if engines start recognizing upper case and lower > case. > > > > > > Same for underscores and minus. OK, one might agree in separation of > > > variants for transliterations and transcriptions. But basically, all > the > > > same, either this or that, is easier. > > > > > > > > > (C) > > > > > > > AUTHOR = {阿南,*,亨}, > > > > > > > > > Please ask Ulrike Fischer why the star had been introduced. The name > > > normally would be written 阿南享 or 阿南享, and the colons are at the right > > > place for Biblatex, since 享 is the personal name. In your bibliography > > > example, the star leads some problems. > > > > > > > > > (D) About the image of the bibliography > > > > > > 1. The star in both versions of the author name should disappear. > > > > > > 2. The question marks around Japanese text should be replaced by > Japanese > > > brackets 『』and「」respectively. -- I am testing Zotero and CSL at the > moment, > > > they have nice translations for brackets and fixed terms like "ed." > etc. > > > (But these also are not exactly what the papers require). > > > > > > 3. Same (like 1., 2.) about the hiragana version. But we do not really > > > need it this way, it is basically for sorting, since there are several > ways > > > to sort Japanese beside the alphabetical order. > > > > > > > > > > > > Thank your very much for your effort! > > > > > > Maria > > > > > > > > > > > > > > > > > > > > > > > > > Am 12.02.2020 um 04:36 schrieb plk ***@***.***>: > > > > > > > > Just to keep this somewhere, here is a somewhat contrived example > with > > > one of your entries, cleaned up to use the correct alternates with the > > > results currently. This functionality is available in the experimental > v4.0 > > > of biblatex and biber. I am ignoring that biblatex has no japanese > strings > > > localisation currently: > > > > > > > > \documentclass[a4paper]{article} > > > > \usepackage{luatexja-fontspec} > > > > \setmainfont{TeXGyrePagella-Regular} > > > > \setmainjfont{IPAexMincho} > > > > \usepackage[english,main=japanese]{babel} > > > > \begin{filecontents}[force]{\jobname.bib} > > > > @Article{Anan_2010_Pfeilkoecher-Kesao, > > > > LANGID = {japanese}, > > > > AUTHOR = {阿南,*,亨}, > > > > AUTHOR_transcription_ja-Hira = {あなん,*,とおる？}, > > > > AUTHOR_transliteration_ja-Latn = {Anan, Tōru}, > > > > TITLE = {袈裟尾高塚古墳の靫}, > > > > TITLE_transcription_ja-Hira = {けさおたかつかこふんのゆき}, > > > > TITLE_transliteration_ja-Latn = {Kesao takazuka kofun no yuki}, > > > > TITLE_translation_en = {The quivers of the Kesao tumulus}, > > > > JOURNALTITLE = {熊本古墳研究}, > > > > JOURNALTITLE_transcription_ja-Hira = {くまもとこふんけんきゅう}, > > > > JOURNALTITLE_transliteration_ja-Latn = {Kumamoto Kofun Kenkyū}, > > > > PAGES = {44-54}, > > > > VOLUME = 3, > > > > YEAR = 2010, > > > > MONTH = 5, > > > > OWNER = {M. Shinoto}, > > > > TIMESTAMP = {2014.03.16}, > > > > WHAT = {Sonderdruck} > > > > } > > > > \end{filecontents} > > > > \usepackage{csquotes} > > > > \usepackage[style=authoryear,% > > > > dynamiclabel=true,% > > > > language=auto,% > > > > autolang=other,% > > > > autofieldlang=other]{biblatex} > > > > \addbibresource{\jobname.bib} > > > > > > > > % Temporary, until we have a real babel/polyglossia<->BCP47 mapping > > > capability > > > > \maplangtag{ja-latn}{english}% Use babel english for latin script > > > Japanese > > > > \maplangtag{ja-hira}{japanese}% Use babel japanese for Hiragana > script > > > Japanese > > > > \DeclareMsselect{japanese}{ > > > > \alternate{transcription}{ja-hira} > > > > \alternate{transliteration}{ja-latn} > > > > } > > > > \begin{document} > > > > \cite{Anan_2010_Pfeilkoecher-Kesao} > > > > \printbibliography > > > > \newrefcontext[msform=transcription,mslang=ja-hira] > > > > \printbibliography > > > > \newrefcontext[msform=transliteration,mslang=ja-latn] > > > > \printbibliography > > > > \end{document} > > > > > > > > > > > > > > > > — > > > > You are receiving this because you were mentioned. > > > > Reply to this email directly, view it on GitHub, or unsubscribe. > > > > > > > > > > — > > > You are receiving this because you were mentioned. > > > Reply to this email directly, view it on GitHub > > > < > #416?email_source=notifications&email_token=AAR7WYVFDWQKTWXLNF7KCF3RCM7X7A5CNFSM4CB64K52YY3PNVWWK3TUL52HS4DFVREXG43VMVBW63LNMVXHJKTDN5WW2ZLOORPWSZGOELOYUZA#issuecomment-584944228>, > > > > or unsubscribe > > > < > https://github.com/notifications/unsubscribe-auth/AAR7WYXYZ63J2J5467L5BSLRCM7X7ANCNFSM4CB64K5Q> > > > > . > > > > > — > > You are receiving this because you were mentioned. > > Reply to this email directly, view it on GitHub, or unsubscribe. > > > > — > You are receiving this because you were mentioned. > Reply to this email directly, view it on GitHub > <#416?email_source=notifications&email_token=AAR7WYXBTKEDSLSOJDCRDIDRCNNGNA5CNFSM4CB64K52YY3PNVWWK3TUL52HS4DFVREXG43VMVBW63LNMVXHJKTDN5WW2ZLOORPWSZGOELPDHAA#issuecomment-584987520>, > or unsubscribe > <https://github.com/notifications/unsubscribe-auth/AAR7WYRWCHEEB3IVILYBLULRCNNGNANCNFSM4CB64K5Q> > . > — You are receiving this because you were mentioned. Reply to this email directly, view it on GitHub, or unsubscribe.

pauloney · 2020-02-12T04:47:29Z

Yes, there is a "continuum" of differences between "transliteration" and "transcription", but as far as BibLaTeX is concerned - the continuum does not exists -- is either TWO or THREE -- one needs TWO languages (the language of the source and the one the book is written) and the other needs THREE (the language of the source, the language of the trancription and the language the books is being written on). The language the book is on is always a given, so for the biblatex record we need sometimes ONE sometimes TWO for an accurate record. Paulo Ney On Tue, Feb 11, 2020 at 8:15 PM zaw-shinoto <notifications@github.com> wrote:

…

Thanks, actually, such differentiation does not change the way in which I use biblatex, and using only one term helps me to get rid of such discussions. I had a look on that link and the word "continuum" sprung into my eye, and this is enough for me, I will leave the discussion to those who care. I can understand those that do care. Best, Maria > Am 12.02.2020 um 12:08 schrieb Paulo Ney de Souza < ***@***.***>: > > Your understanding is a bit twisted and I would refer you to a simple > definition (and differences) at: > > > https://scriptsource.org/cms/scripts/page.php?item_id=entry_detail&uid=gslmka8xq3 > > as stated there (on the third paragraph) I quote here > > ...the Devanagari letter is usually transliterated in the Latin script as > ‘j’ or ‘ja’, which roughly approximates its pronunciation in most contexts. > However, a transcription system (such as the IPA) would transcribe the same > character, , as 'dʒə' if the source text was in Hindi, but as 'ɟʝə' or 'zə' > if the source text was in Marathi... > > > and this is a very important difference for biblatex -- transliterations > are concerned with how a character is pronounced in the new language. > Transcriptions depend on the SOURCE language, if it is to be understood and > so, its definition in the .bib file should be attached to TWO languages, > something like: > > JOURNALTITLE_transliteration_mr-Deva_en-Latn > > Paulo Ney > > > > On Tue, Feb 11, 2020 at 6:28 PM zaw-shinoto ***@***.***> > wrote: > > > I am not a linguist, but I understand transcription as an effort to write > > a word in another writing system such that the user of this different > > writing system can understand the pronunciation in the framework of his or > > her own language. > > > > Thhe second part of this concept is getting less important, as I > > understand from newer developments. E.g. there is Vygotsky and Wigotski > > etc., where Vygotsky is definitely not what a German reader would easily > > understand but is confronted with in newer German texts more frequently > > nowadays. Originally it was transcribed "Wigotski", which is very easy to > > pronounce correctly. Still, I would call both "transcriptions" rather than > > "transliterations". > > > > A transliteration tries to mimic the original writing system regardless of > > how something is pronounced in the any language. At least in my > > understanding. > > > > E.g. the Mount Fuji in Japanese used to be transcribed "Fudschi" in German > > earlier, now the transcription "Fuji" has become the rule, but a > > transliteration would be "Huzi", and this is also an official Japanese > > transcription. > > > > I personally would prefer to use the term "transcription" in all cases, so > > there is no struggle with "what am I dealing with at the moment?". > > > > > > But, I am not the expert. > > > > Maria > > > > > > > > > > > > > Am 12.02.2020 um 10:44 schrieb Paulo Ney de Souza < > > ***@***.***>: > > > > > > Can we state what is meant by "transcription" and what is meant by > > > "transliteration" in the context of BibLaTeX -- in the manual -- and > > with > > > examples! -- specially well accepted examples? I share several of the > > > concerns Maria has brought up -- not exactly with the same solution and > > the > > > root of the problem is probably on the definition of > > > transcription/translitertation. > > > > > > Paulo Ney > > > > > > On Tue, Feb 11, 2020 at 4:34 PM zaw-shinoto < ***@***.***> > > > wrote: > > > > > > > Thanks for the sample. As you know I am not "in" the technology, so I > > just > > > > reply on the language features and the bibliography output. > > > > > > > > > > > > > > > > (A) > > > > > > > > > JOURNALTITLE_transliteration_ja-Latn = {Kumamoto Kofun Kenkyū}, > > > > > > > > This naming of the transliteration is not helpful (maybe this was from > > an > > > > early stage of development?); we have several transliterations (as in > > > > Korean and Chinese), and different journals ask for different > > > > transliterations. So we need e.g. "hepburn", "JIS somewhat" etc. like > > this. > > > > > > > > > JOURNALTITLE_transliteration_ja-hepburn = {Kumamoto Kofun Kenkyū}, > > > > > > > > or > > > > > > > > > JOURNALTITLE_transliteration_ja-hepburn-corrupted = {Kumamoto Kofun > > > > Kenkyu}, > > > > > > > > And some are rather transcriptions than transliterations, but this is > > > > nothing of importance as long as the terminology is usable and the > > results > > > > are OK. > > > > > > > > > > > > (B) > > > > > > > > > > > > > TITLE_transcription_ja-Hira = {けさおたかつかこふんのゆき}, > > > > > > > > Why is this a transcription rather than a transliteration or a writing > > > > variant? Just a question, as long as it works. > > > > > > > > Besides, I do not think that mixing upper case letters and lower case > > > > letters is a good idea. It may be a good idea with the main fiel like > > > > "JOURNALTITLE", I like it personally. But for the variations we should > > use > > > > only lower case because otherwise the thinking starts "Was this lower > > case > > > > or upper case?". It does not matter for the transformation if I > > remember > > > > right, but what if engines start recognizing upper case and lower > > case. > > > > > > > > Same for underscores and minus. OK, one might agree in separation of > > > > variants for transliterations and transcriptions. But basically, all > > the > > > > same, either this or that, is easier. > > > > > > > > > > > > (C) > > > > > > > > > AUTHOR = {阿南,*,亨}, > > > > > > > > > > > > Please ask Ulrike Fischer why the star had been introduced. The name > > > > normally would be written 阿南享 or 阿南享, and the colons are at the right > > > > place for Biblatex, since 享 is the personal name. In your bibliography > > > > example, the star leads some problems. > > > > > > > > > > > > (D) About the image of the bibliography > > > > > > > > 1. The star in both versions of the author name should disappear. > > > > > > > > 2. The question marks around Japanese text should be replaced by > > Japanese > > > > brackets 『』and「」respectively. -- I am testing Zotero and CSL at the > > moment, > > > > they have nice translations for brackets and fixed terms like "ed." > > etc. > > > > (But these also are not exactly what the papers require). > > > > > > > > 3. Same (like 1., 2.) about the hiragana version. But we do not really > > > > need it this way, it is basically for sorting, since there are several > > ways > > > > to sort Japanese beside the alphabetical order. > > > > > > > > > > > > > > > > Thank your very much for your effort! > > > > > > > > Maria > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > Am 12.02.2020 um 04:36 schrieb plk ***@***.***>: > > > > > > > > > > Just to keep this somewhere, here is a somewhat contrived example > > with > > > > one of your entries, cleaned up to use the correct alternates with the > > > > results currently. This functionality is available in the experimental > > v4.0 > > > > of biblatex and biber. I am ignoring that biblatex has no japanese > > strings > > > > localisation currently: > > > > > > > > > > \documentclass[a4paper]{article} > > > > > \usepackage{luatexja-fontspec} > > > > > \setmainfont{TeXGyrePagella-Regular} > > > > > \setmainjfont{IPAexMincho} > > > > > \usepackage[english,main=japanese]{babel} > > > > > \begin{filecontents}[force]{\jobname.bib} > > > > > @Article{Anan_2010_Pfeilkoecher-Kesao, > > > > > LANGID = {japanese}, > > > > > AUTHOR = {阿南,*,亨}, > > > > > AUTHOR_transcription_ja-Hira = {あなん,*,とおる？}, > > > > > AUTHOR_transliteration_ja-Latn = {Anan, Tōru}, > > > > > TITLE = {袈裟尾高塚古墳の靫}, > > > > > TITLE_transcription_ja-Hira = {けさおたかつかこふんのゆき}, > > > > > TITLE_transliteration_ja-Latn = {Kesao takazuka kofun no yuki}, > > > > > TITLE_translation_en = {The quivers of the Kesao tumulus}, > > > > > JOURNALTITLE = {熊本古墳研究}, > > > > > JOURNALTITLE_transcription_ja-Hira = {くまもとこふんけんきゅう}, > > > > > JOURNALTITLE_transliteration_ja-Latn = {Kumamoto Kofun Kenkyū}, > > > > > PAGES = {44-54}, > > > > > VOLUME = 3, > > > > > YEAR = 2010, > > > > > MONTH = 5, > > > > > OWNER = {M. Shinoto}, > > > > > TIMESTAMP = {2014.03.16}, > > > > > WHAT = {Sonderdruck} > > > > > } > > > > > \end{filecontents} > > > > > \usepackage{csquotes} > > > > > \usepackage[style=authoryear,% > > > > > dynamiclabel=true,% > > > > > language=auto,% > > > > > autolang=other,% > > > > > autofieldlang=other]{biblatex} > > > > > \addbibresource{\jobname.bib} > > > > > > > > > > % Temporary, until we have a real babel/polyglossia<->BCP47 mapping > > > > capability > > > > > \maplangtag{ja-latn}{english}% Use babel english for latin script > > > > Japanese > > > > > \maplangtag{ja-hira}{japanese}% Use babel japanese for Hiragana > > script > > > > Japanese > > > > > \DeclareMsselect{japanese}{ > > > > > \alternate{transcription}{ja-hira} > > > > > \alternate{transliteration}{ja-latn} > > > > > } > > > > > \begin{document} > > > > > \cite{Anan_2010_Pfeilkoecher-Kesao} > > > > > \printbibliography > > > > > \newrefcontext[msform=transcription,mslang=ja-hira] > > > > > \printbibliography > > > > > \newrefcontext[msform=transliteration,mslang=ja-latn] > > > > > \printbibliography > > > > > \end{document} > > > > > > > > > > > > > > > > > > > > — > > > > > You are receiving this because you were mentioned. > > > > > Reply to this email directly, view it on GitHub, or unsubscribe. > > > > > > > > > > > > > — > > > > You are receiving this because you were mentioned. > > > > Reply to this email directly, view it on GitHub > > > > < > > #416?email_source=notifications&email_token=AAR7WYVFDWQKTWXLNF7KCF3RCM7X7A5CNFSM4CB64K52YY3PNVWWK3TUL52HS4DFVREXG43VMVBW63LNMVXHJKTDN5WW2ZLOORPWSZGOELOYUZA#issuecomment-584944228>, > > > > > > or unsubscribe > > > > < > > https://github.com/notifications/unsubscribe-auth/AAR7WYXYZ63J2J5467L5BSLRCM7X7ANCNFSM4CB64K5Q> > > > > > > . > > > > > > > — > > > You are receiving this because you were mentioned. > > > Reply to this email directly, view it on GitHub, or unsubscribe. > > > > > > > — > > You are receiving this because you were mentioned. > > Reply to this email directly, view it on GitHub > > < #416?email_source=notifications&email_token=AAR7WYXBTKEDSLSOJDCRDIDRCNNGNA5CNFSM4CB64K52YY3PNVWWK3TUL52HS4DFVREXG43VMVBW63LNMVXHJKTDN5WW2ZLOORPWSZGOELPDHAA#issuecomment-584987520>, > > or unsubscribe > > < https://github.com/notifications/unsubscribe-auth/AAR7WYRWCHEEB3IVILYBLULRCNNGNANCNFSM4CB64K5Q> > > . > > > — > You are receiving this because you were mentioned. > Reply to this email directly, view it on GitHub, or unsubscribe. > — You are receiving this because you were mentioned. Reply to this email directly, view it on GitHub <#416?email_source=notifications&email_token=AAR7WYUQZH3MC7JMAROXSN3RCNZWRA5CNFSM4CB64K52YY3PNVWWK3TUL52HS4DFVREXG43VMVBW63LNMVXHJKTDN5WW2ZLOORPWSZGOELPLDOA#issuecomment-585019832>, or unsubscribe <https://github.com/notifications/unsubscribe-auth/AAR7WYXCJFNQF6LQJ5RCK2LRCNZWRANCNFSM4CB64K5Q> .

mariashinoto · 2020-02-12T05:08:21Z

In my long life as a researcher, I do need 4 or mostly more variants of titles etc. in my Bibtex/Biblatex database. I call these fields whatever I like. For publication, I then create a Bibtex file from exactly those fields that I need. For titles in Japanese e.g. I mostly store: Original (Japanese), translation (English and German mostly, I translate once, I do not translate every time I publish a paper), "yomi" for sorting in Japanese publications -- for this I mostly need hepburn or hiragana, so at least 2 variants, and the title in [whatever you call it], transliteration or transcription, anyway, it is mostly in revised hepburn. So, my database has quite a variety of fields, and what I care least about is whether what I need is a transliterations or transcription that I need. I do not deny that this is an important difference in other circumstances. Here is my suggestion: Let us forget about transcription or transliteration, let us call these kinds of fields "writing" or "writ" or "w". Everybody can add custom extensions like "hepburn" or even add something about its character like "transliteration-hepburn". And reg. the continuum between transliteration and transcription, one may even introduce a fuzzy scale like "transliteration-8-hepburn". I would love that ;) So there is - the stem of a field like: TITLE - the main extension explaining the function: writing or translation, shortened if possible - the custom extensions for anybody's need, they are innumerous. E.g. TITLE_trslat_... and TITLE_writ_ ... I hope this is a practical solution. I am so thankful to plk for taking care of this problem, and I think it should be as simple as possible. There is still the enourmous problem of how to transform the custom extensions into bibliography styles. I think of something like namespaces where custom extensions are matched to certain functionalities in the bibliography style. I style could ask for a text document that pairs the name of the field needed with the name of the field in the database. And of course, with a fork for every language, since I do need different re-writings (transcriptions or whatever) for Korean than for Japanese. Best, Maria

…

Am 12.02.2020 um 13:47 schrieb Paulo Ney de Souza ***@***.***>: Yes, there is a "continuum" of differences between "transliteration" and "transcription", but as far as BibLaTeX is concerned - the continuum does not exists -- is either TWO or THREE -- one needs TWO languages (the language of the source and the one the book is written) and the other needs THREE (the language of the source, the language of the trancription and the language the books is being written on). The language the book is on is always a given, so for the biblatex record we need sometimes ONE sometimes TWO for an accurate record. Paulo Ney On Tue, Feb 11, 2020 at 8:15 PM zaw-shinoto ***@***.***> wrote: > Thanks, actually, such differentiation does not change the way in which I > use biblatex, and using only one term helps me to get rid of such > discussions. I had a look on that link and the word "continuum" sprung into > my eye, and this is enough for me, I will leave the discussion to those who > care. I can understand those that do care. > > Best, > Maria > > > > > Am 12.02.2020 um 12:08 schrieb Paulo Ney de Souza < > ***@***.***>: > > > > Your understanding is a bit twisted and I would refer you to a simple > > definition (and differences) at: > > > > > > > https://scriptsource.org/cms/scripts/page.php?item_id=entry_detail&uid=gslmka8xq3 > > > > as stated there (on the third paragraph) I quote here > > > > ...the Devanagari letter is usually transliterated in the Latin script > as > > ‘j’ or ‘ja’, which roughly approximates its pronunciation in most > contexts. > > However, a transcription system (such as the IPA) would transcribe the > same > > character, , as 'dʒə' if the source text was in Hindi, but as 'ɟʝə' or > 'zə' > > if the source text was in Marathi... > > > > > > and this is a very important difference for biblatex -- transliterations > > are concerned with how a character is pronounced in the new language. > > Transcriptions depend on the SOURCE language, if it is to be understood > and > > so, its definition in the .bib file should be attached to TWO languages, > > something like: > > > > JOURNALTITLE_transliteration_mr-Deva_en-Latn > > > > Paulo Ney > > > > > > > > On Tue, Feb 11, 2020 at 6:28 PM zaw-shinoto ***@***.***> > > wrote: > > > > > I am not a linguist, but I understand transcription as an effort to > write > > > a word in another writing system such that the user of this different > > > writing system can understand the pronunciation in the framework of > his or > > > her own language. > > > > > > Thhe second part of this concept is getting less important, as I > > > understand from newer developments. E.g. there is Vygotsky and > Wigotski > > > etc., where Vygotsky is definitely not what a German reader would > easily > > > understand but is confronted with in newer German texts more > frequently > > > nowadays. Originally it was transcribed "Wigotski", which is very easy > to > > > pronounce correctly. Still, I would call both "transcriptions" rather > than > > > "transliterations". > > > > > > A transliteration tries to mimic the original writing system > regardless of > > > how something is pronounced in the any language. At least in my > > > understanding. > > > > > > E.g. the Mount Fuji in Japanese used to be transcribed "Fudschi" in > German > > > earlier, now the transcription "Fuji" has become the rule, but a > > > transliteration would be "Huzi", and this is also an official Japanese > > > transcription. > > > > > > I personally would prefer to use the term "transcription" in all > cases, so > > > there is no struggle with "what am I dealing with at the moment?". > > > > > > > > > But, I am not the expert. > > > > > > Maria > > > > > > > > > > > > > > > > > > > Am 12.02.2020 um 10:44 schrieb Paulo Ney de Souza < > > > ***@***.***>: > > > > > > > > Can we state what is meant by "transcription" and what is meant by > > > > "transliteration" in the context of BibLaTeX -- in the manual -- and > > > with > > > > examples! -- specially well accepted examples? I share several of > the > > > > concerns Maria has brought up -- not exactly with the same solution > and > > > the > > > > root of the problem is probably on the definition of > > > > transcription/translitertation. > > > > > > > > Paulo Ney > > > > > > > > On Tue, Feb 11, 2020 at 4:34 PM zaw-shinoto < > ***@***.***> > > > > wrote: > > > > > > > > > Thanks for the sample. As you know I am not "in" the technology, > so I > > > just > > > > > reply on the language features and the bibliography output. > > > > > > > > > > > > > > > > > > > > (A) > > > > > > > > > > > JOURNALTITLE_transliteration_ja-Latn = {Kumamoto Kofun Kenkyū}, > > > > > > > > > > This naming of the transliteration is not helpful (maybe this was > from > > > an > > > > > early stage of development?); we have several transliterations (as > in > > > > > Korean and Chinese), and different journals ask for different > > > > > transliterations. So we need e.g. "hepburn", "JIS somewhat" etc. > like > > > this. > > > > > > > > > > > JOURNALTITLE_transliteration_ja-hepburn = {Kumamoto Kofun > Kenkyū}, > > > > > > > > > > or > > > > > > > > > > > JOURNALTITLE_transliteration_ja-hepburn-corrupted = {Kumamoto > Kofun > > > > > Kenkyu}, > > > > > > > > > > And some are rather transcriptions than transliterations, but this > is > > > > > nothing of importance as long as the terminology is usable and the > > > results > > > > > are OK. > > > > > > > > > > > > > > > (B) > > > > > > > > > > > > > > > > TITLE_transcription_ja-Hira = {けさおたかつかこふんのゆき}, > > > > > > > > > > Why is this a transcription rather than a transliteration or a > writing > > > > > variant? Just a question, as long as it works. > > > > > > > > > > Besides, I do not think that mixing upper case letters and lower > case > > > > > letters is a good idea. It may be a good idea with the main fiel > like > > > > > "JOURNALTITLE", I like it personally. But for the variations we > should > > > use > > > > > only lower case because otherwise the thinking starts "Was this > lower > > > case > > > > > or upper case?". It does not matter for the transformation if I > > > remember > > > > > right, but what if engines start recognizing upper case and lower > > > case. > > > > > > > > > > Same for underscores and minus. OK, one might agree in separation > of > > > > > variants for transliterations and transcriptions. But basically, > all > > > the > > > > > same, either this or that, is easier. > > > > > > > > > > > > > > > (C) > > > > > > > > > > > AUTHOR = {阿南,*,亨}, > > > > > > > > > > > > > > > Please ask Ulrike Fischer why the star had been introduced. The > name > > > > > normally would be written 阿南享 or 阿南享, and the colons are at the > right > > > > > place for Biblatex, since 享 is the personal name. In your > bibliography > > > > > example, the star leads some problems. > > > > > > > > > > > > > > > (D) About the image of the bibliography > > > > > > > > > > 1. The star in both versions of the author name should disappear. > > > > > > > > > > 2. The question marks around Japanese text should be replaced by > > > Japanese > > > > > brackets 『』and「」respectively. -- I am testing Zotero and CSL at > the > > > moment, > > > > > they have nice translations for brackets and fixed terms like > "ed." > > > etc. > > > > > (But these also are not exactly what the papers require). > > > > > > > > > > 3. Same (like 1., 2.) about the hiragana version. But we do not > really > > > > > need it this way, it is basically for sorting, since there are > several > > > ways > > > > > to sort Japanese beside the alphabetical order. > > > > > > > > > > > > > > > > > > > > Thank your very much for your effort! > > > > > > > > > > Maria > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > Am 12.02.2020 um 04:36 schrieb plk ***@***.***>: > > > > > > > > > > > > Just to keep this somewhere, here is a somewhat contrived > example > > > with > > > > > one of your entries, cleaned up to use the correct alternates with > the > > > > > results currently. This functionality is available in the > experimental > > > v4.0 > > > > > of biblatex and biber. I am ignoring that biblatex has no japanese > > > strings > > > > > localisation currently: > > > > > > > > > > > > \documentclass[a4paper]{article} > > > > > > \usepackage{luatexja-fontspec} > > > > > > \setmainfont{TeXGyrePagella-Regular} > > > > > > \setmainjfont{IPAexMincho} > > > > > > \usepackage[english,main=japanese]{babel} > > > > > > \begin{filecontents}[force]{\jobname.bib} > > > > > > @Article{Anan_2010_Pfeilkoecher-Kesao, > > > > > > LANGID = {japanese}, > > > > > > AUTHOR = {阿南,*,亨}, > > > > > > AUTHOR_transcription_ja-Hira = {あなん,*,とおる？}, > > > > > > AUTHOR_transliteration_ja-Latn = {Anan, Tōru}, > > > > > > TITLE = {袈裟尾高塚古墳の靫}, > > > > > > TITLE_transcription_ja-Hira = {けさおたかつかこふんのゆき}, > > > > > > TITLE_transliteration_ja-Latn = {Kesao takazuka kofun no yuki}, > > > > > > TITLE_translation_en = {The quivers of the Kesao tumulus}, > > > > > > JOURNALTITLE = {熊本古墳研究}, > > > > > > JOURNALTITLE_transcription_ja-Hira = {くまもとこふんけんきゅう}, > > > > > > JOURNALTITLE_transliteration_ja-Latn = {Kumamoto Kofun Kenkyū}, > > > > > > PAGES = {44-54}, > > > > > > VOLUME = 3, > > > > > > YEAR = 2010, > > > > > > MONTH = 5, > > > > > > OWNER = {M. Shinoto}, > > > > > > TIMESTAMP = {2014.03.16}, > > > > > > WHAT = {Sonderdruck} > > > > > > } > > > > > > \end{filecontents} > > > > > > \usepackage{csquotes} > > > > > > \usepackage[style=authoryear,% > > > > > > dynamiclabel=true,% > > > > > > language=auto,% > > > > > > autolang=other,% > > > > > > autofieldlang=other]{biblatex} > > > > > > \addbibresource{\jobname.bib} > > > > > > > > > > > > % Temporary, until we have a real babel/polyglossia<->BCP47 > mapping > > > > > capability > > > > > > \maplangtag{ja-latn}{english}% Use babel english for latin > script > > > > > Japanese > > > > > > \maplangtag{ja-hira}{japanese}% Use babel japanese for Hiragana > > > script > > > > > Japanese > > > > > > \DeclareMsselect{japanese}{ > > > > > > \alternate{transcription}{ja-hira} > > > > > > \alternate{transliteration}{ja-latn} > > > > > > } > > > > > > \begin{document} > > > > > > \cite{Anan_2010_Pfeilkoecher-Kesao} > > > > > > \printbibliography > > > > > > \newrefcontext[msform=transcription,mslang=ja-hira] > > > > > > \printbibliography > > > > > > \newrefcontext[msform=transliteration,mslang=ja-latn] > > > > > > \printbibliography > > > > > > \end{document} > > > > > > > > > > > > > > > > > > > > > > > > — > > > > > > You are receiving this because you were mentioned. > > > > > > Reply to this email directly, view it on GitHub, or unsubscribe. > > > > > > > > > > > > > > > > — > > > > > You are receiving this because you were mentioned. > > > > > Reply to this email directly, view it on GitHub > > > > > < > > > > #416?email_source=notifications&email_token=AAR7WYVFDWQKTWXLNF7KCF3RCM7X7A5CNFSM4CB64K52YY3PNVWWK3TUL52HS4DFVREXG43VMVBW63LNMVXHJKTDN5WW2ZLOORPWSZGOELOYUZA#issuecomment-584944228>, > > > > > > > > > or unsubscribe > > > > > < > > > > https://github.com/notifications/unsubscribe-auth/AAR7WYXYZ63J2J5467L5BSLRCM7X7ANCNFSM4CB64K5Q> > > > > > > > > > . > > > > > > > > > — > > > > You are receiving this because you were mentioned. > > > > Reply to this email directly, view it on GitHub, or unsubscribe. > > > > > > > > > > — > > > You are receiving this because you were mentioned. > > > Reply to this email directly, view it on GitHub > > > < > #416?email_source=notifications&email_token=AAR7WYXBTKEDSLSOJDCRDIDRCNNGNA5CNFSM4CB64K52YY3PNVWWK3TUL52HS4DFVREXG43VMVBW63LNMVXHJKTDN5WW2ZLOORPWSZGOELPDHAA#issuecomment-584987520>, > > > > or unsubscribe > > > < > https://github.com/notifications/unsubscribe-auth/AAR7WYRWCHEEB3IVILYBLULRCNNGNANCNFSM4CB64K5Q> > > > > . > > > > > — > > You are receiving this because you were mentioned. > > Reply to this email directly, view it on GitHub, or unsubscribe. > > > > — > You are receiving this because you were mentioned. > Reply to this email directly, view it on GitHub > <#416?email_source=notifications&email_token=AAR7WYUQZH3MC7JMAROXSN3RCNZWRA5CNFSM4CB64K52YY3PNVWWK3TUL52HS4DFVREXG43VMVBW63LNMVXHJKTDN5WW2ZLOORPWSZGOELPLDOA#issuecomment-585019832>, > or unsubscribe > <https://github.com/notifications/unsubscribe-auth/AAR7WYXCJFNQF6LQJ5RCK2LRCNZWRANCNFSM4CB64K5Q> > . > — You are receiving this because you were mentioned. Reply to this email directly, view it on GitHub, or unsubscribe.

plk · 2020-02-12T07:25:29Z

To be clear, the multiscript alternate forms (translation, transliteration erc.) are not interpreted by biblatex at all. You are free to use them as you will, they are just string identifiers. The multiscript language/script identifiers must be valid BCP47 tag format (this does not enforce case). The separator between the field, form and lang is by default underscore but this is customisable. The dash separator within BCP47 tags is not customisable as it is part of the IETF standard. The example was just to show the mechanisms, not to advocate for data classifications in this example.

retorquere · 2020-02-12T07:33:16Z

Why not standardize separator between field/form/lang? Unless it is configurable in the bib file itself, having it variable means the bib file is harder to parse/produce by other tools (such as Zotero BBT)

mariashinoto · 2020-02-12T11:38:53Z

Thank you for these clarifications and for your efforts anyway. So underscores, no dashes, no problem. If you need more comments on the results, I will be happy to be of assistance if I can. I am not really sure what you are trying to achieve: is it (a) a standard for multilingual reference databases in Biblatex format or just (b) the conversion to several reference styles or (c) both? As I summarized earlier, it is possible to transform any custom field into Biblatex or even Bibtex format that can then be utilized with the common styles that do exist for different journals etc.. This was my two stage management during the last years, and it worked really well. This in mind, why not concentrate on basic fiels in biblatex (e.g. author, author_translation ((for institutions etc.)), author_writing) for every entry; and everybody can add variations with extensions as they like (custom variations). This means, that any use of data from a Biblatex database does need a preliminary step in which one has to decide which of the custom variation will populate the generic field. (A) E.g. should I use title_translation_de for an article in German and thus populate title_translation with the content of title_translation_de; or if it is an English article, I would use title_translation_en to fill into this field. (B) Another problem is that of transcriptions or transliterations which I would call "writing". E.g. there is revised Roman for Korean and Hepburn for Japanese beside many others, so in the preliminary stage there is a decision: If the language is Korean the field "author_writing_kr_revRoman" (custom variation) is used for "author_writing", while "author_writing_ja_hepburn" is used if the language of the entry in Japanese. These cases and fields have to be mapped against each other, with YAML or what else. I used to do it with simple GREP, but this is fragile, so I am looking for more solid solution. It seems to me that this preliminary stage may be another challenge. Best, Maria

…

Am 12.02.2020 um 16:25 schrieb plk ***@***.***>: To be clear, the multiscript alternate forms (translation, transliteration erc.) are not interpreted by biblatex at all. You are free to use them as you will, they are just string identifiers. The multiscript language/script identifiers must be valid BCP47 tag format (this does not enforce case). The separator between the field, form and lang is by default underscore but this is customisable. The dash separator within BCP47 tags is not customisable as it is part of the IETF standard. The example was just to show the mechanisms, not to advocate for data classifications in this example. — You are receiving this because you were mentioned. Reply to this email directly, view it on GitHub, or unsubscribe.

plk · 2020-02-12T11:40:21Z

Well it is simply that there is a biber option to change the default underscore when parsing data, that’s all. It can also be changed on tool mode output.

plk · 2020-02-12T11:46:42Z

The purpose is to provide functionality for defining and using multiple “alternates” of the same field within one entry. The exact forms/langs are in a sense arbitrary as you would use the form/langs as string keys to determine what to print. However, the lang is used to auto-switch babel/polyglossia language even on a per list item basis which means that real multilingual/script support is possible.

mariashinoto · 2020-02-12T11:48:54Z

This sounds really great and like a much more consistent solution!

…

Am 12.02.2020 um 20:46 schrieb plk ***@***.***>: The purpose is to provide functionality for defining and using multiple “alternates” of the same field within one entry. The exact forms/langs are in a sense arbitrary as you would use the form/langs as string keys to determine what to print. However, the lang is used to auto-switch babel/polyglossia language even on a per list item basis which means that real multilingual/script support is possible. — You are receiving this because you were mentioned. Reply to this email directly, view it on GitHub, or unsubscribe.

pauloney · 2020-02-12T13:43:10Z

Can we write most of this stuff to the manual? PN

…

On Wed, Feb 12, 2020, 3:48 AM zaw-shinoto ***@***.***> wrote: This sounds really great and like a much more consistent solution! > Am 12.02.2020 um 20:46 schrieb plk ***@***.***>: > > The purpose is to provide functionality for defining and using multiple “alternates” of the same field within one entry. The exact forms/langs are in a sense arbitrary as you would use the form/langs as string keys to determine what to print. However, the lang is used to auto-switch babel/polyglossia language even on a per list item basis which means that real multilingual/script support is possible. > > — > You are receiving this because you were mentioned. > Reply to this email directly, view it on GitHub, or unsubscribe. > — You are receiving this because you were mentioned. Reply to this email directly, view it on GitHub <#416?email_source=notifications&email_token=AAR7WYWBUWPKW2LIES7LYLTRCPO2PA5CNFSM4CB64K52YY3PNVWWK3TUL52HS4DFVREXG43VMVBW63LNMVXHJKTDN5WW2ZLOORPWSZGOELQPQ2I#issuecomment-585169001>, or unsubscribe <https://github.com/notifications/unsubscribe-auth/AAR7WYQHGFFX46ELO222NWDRCPO2PANCNFSM4CB64K5Q> .

plk · 2020-02-12T13:57:06Z

@pauloney - it's already in the manual for 4.0, with some examples. All of the new experimental functionality is documented.

yannis1962 · 2020-11-25T07:57:37Z

Hi, as I'm now processing a big volume with thousands of bibliographical references I was wondering if there has been a new version of biblatex solving these problems, and whether it is available in the latest TeXlive. (If I have to choose among different syntaxes I would like to take the one that will stay in biblatex for the years to come.) Thanks in advance, Yannis

Début du message réexpédié : De: Yannis Haralambous ***@***.***> Objet: Rép. : [plk/biblatex] Multiscript support in biblatex/biber (#416) Date: 20 octobre 2019 à 18:05:04 UTC+2 À: plk/biblatex ***@***.***> Cc: plk/biblatex ***@***.***>, Mention ***@***.***> IMHO multiscript alternates are needed only for AUTHOR and EDITOR, because these are the fields that serve for the citations. What I need is a reference with the original script following the Latin one between brackets, but the citation should have only the Latin transcription. This is my problem. All other fields can incorporate directly both scripts... Here is an example of a reference I use in the book I'm typesetting: @Article{bib35, AUTHOR_ORIGINAL = {野村雅昭 and 伊藤菊子}, AUTHOR+AN = {1=ja-Japn;2=ja-Japn}, AUTHOR = {Nomura, Masaaki and Itō, Kikuko}, TITLE = {{{\japanese 漢字の表音度} [Phoneticity in kanji]}}, JOURNAL = {{\japanese 計量国語学} [Mathematical Linguistics]}, VOLUME = {11}, PAGES = {306--311}, YEAR = {1978}, } > Le 20 oct. 2019 à 17:31, plk ***@***.*** ***@***.***>> a écrit : > > @duncdrum <https://github.com/duncdrum> - how useful is it in your opinion to be able to choose the multiscript alternates on a per-field basis for sorting? That is, would be enough to say "use transliterations, where available, for all fields in sorting" or can you see a use case where it would be required to select the particular multiscript alternate per-field as in "use transliteration for title but translation for author for sorting" etc.? > > — > You are receiving this because you were mentioned. > Reply to this email directly, view it on GitHub <#416?email_source=notifications&email_token=AAFXC7K7RHCQ3O2AZQ6A5ZTQPR2VTA5CNFSM4CB64K52YY3PNVWWK3TUL52HS4DFVREXG43VMVBW63LNMVXHJKTDN5WW2ZLOORPWSZGOEBYMWMQ#issuecomment-544262962>, or unsubscribe <https://github.com/notifications/unsubscribe-auth/AAFXC7I5UUR6D6JI2PI3IYDQPR2VTANCNFSM4CB64K5Q>. > <http://www.imt-atlantique.fr/> Yannis HARALAMBOUS Professor Computer Science Department UMR CNRS 6285 Lab-STICC <http://perso.telecom-bretagne.eu/yannisharalambous/> <https://twitter.com/y_haralambous> <https://www.linkedin.com/in/yannis-haralambous-5529073?trk=hp-identity-name>Technopôle Brest-Iroise CS 83818 29238 Brest Cedex 3, France Une école de l'IMT <http://www.imt.fr/> — Vous cherchez trop à comprendre, c'est un grave défaut. — J'ai déjà entendu cette phrase. — Vous l'avez écrite. (Jean Cocteau)

<http://www.imt-atlantique.fr/> Yannis HARALAMBOUS Professor Computer Science Department UMR CNRS 6285 Lab-STICC <http://perso.telecom-bretagne.eu/yannisharalambous/> <https://twitter.com/y_haralambous> <https://www.linkedin.com/in/yannis-haralambous-5529073?trk=hp-identity-name>Technopôle Brest-Iroise CS 83818 29238 Brest Cedex 3, France Une école de l'IMT <http://www.imt.fr/> Le tact dans l'audace, c'est de savoir jusqu'où on peut aller trop loin. (Jean Cocteau)

plk · 2020-11-25T10:51:54Z

Version 4.0 of biblatex and 4.0 of biber, in the "experimental" folders on Sourceforge implement the multiscript branch currently. The documentation in that version has all of the details and there is a sample "98-multiscript-biber.pdf" and the source "98-multiscript.tex" which has examples.

yannis1962 · 2020-11-25T15:12:39Z

Thanks for the information. I downloaded biblatex 4, but what about biber 4? If I ask for tlmgr update biber, will I get version 4? Otherwise how can I install it? (Currently I have version 2.13)

Le 25 nov. 2020 à 11:52, plk ***@***.***> a écrit : Version 4.0 of biblatex and 4.0 of biber, in the "experimental" folders on Sourceforge implement the multiscript branch currently. The documentation in that version has all of the details and there is a sample "98-multiscript-biber.pdf" and the source "98-multiscript.tex" which has examples. — You are receiving this because you were mentioned. Reply to this email directly, view it on GitHub <#416 (comment)>, or unsubscribe <https://github.com/notifications/unsubscribe-auth/AAFXC7IWLLQFZ3KKSZPRDFTSRTOVVANCNFSM4CB64K5Q>.

<http://www.imt-atlantique.fr/> Yannis HARALAMBOUS Professor Computer Science Department UMR CNRS 6285 Lab-STICC <http://perso.telecom-bretagne.eu/yannisharalambous/> <https://twitter.com/y_haralambous> <https://www.linkedin.com/in/yannis-haralambous-5529073?trk=hp-identity-name>Technopôle Brest-Iroise CS 83818 29238 Brest Cedex 3, France Une école de l'IMT <http://www.imt.fr/> Les artifices typographiques poussés très loin avec une grande audace ont l'avantage de faire naître un lyrisme visuel qui était presque inconnu avant notre époque. Ces artifices peuvent aller très loin encore et consommer la synthèse des arts, de la musique, de la peinture et de la littérature. (Guillaume Apollinaire)

plk · 2020-11-25T15:33:08Z

Unfortunately, these versions are not in TLMGR and you have to get them from Sourceforge and install manually. You need biblatex 4 and biber 4.

yannis1962 · 2020-12-01T20:10:11Z

For some reason I'm not getting anymore what I had last year. Maybe at that time you made a special cfg file for me? Here is the code I used to write: @INPROCEEDINGS{bib21, AUTHOR = {Kōno, Rokurō and Nagata, Hidemasa and Sasahara, Hiroyuki}, AUTHOR_ORIGINAL = {河野六郎 and 永田英正 and 笹原宏之}, AUTHOR+AN = {1=ja-Japn;2=ja-Japn;3=ja-Japn}, EDITOR = {Kōno, Rokurō and Chino, Eiichi and Nishida, Tatsuo}, EDITOR_ORIGINAL = {河野六郎 and 千野栄一 and 西田龍雄}, EDITOR+AN = {1=ja-Japn;2=ja-Japn;3=ja-Japn}, TITLE = {{{\japanese 漢字} [Kanji]}}, BOOKTITLE = {{{\japanese 世界文字辞典} [Encyclopedia of the World's Scripts]}}, PUBLISHER = {{\japanese 三省堂} [Sanseidō]}, ADDRESS = {Tokyo}, YEAR = {2001}, PAGES = {256--281}, } and the result I used to get was: Kōno, Rokurō [河野六郎], Hidemasa Nagata [永田英正], and Hiroyuki Sasahara [笹原宏之] (2001). “漢字 [Kanji]”. In: 世界文字辞典 [Encyclopedia of the World’s Scripts]. Ed. by Rokurō Kōno [河野六郎], Eiichi Chino [千野栄一], and Tatsuo Nishida [西田龍雄]. Tokyo: 三省堂 [Sanseidō], pp. 256–281. (See for example <http://www.fluxus-editions.fr/gla1-hond.php>) The idea is that author or editor names appear first in Latin script and then in Non-Latin script between brackets, and that the Latin script part is used for sorting. Now from what I see in 98-multiscript.tex there is no "_ORIGINAL" but rather "_TRANSLATION" and "_TRANSLITERATION" (in my case it is the latter) but when I use it I get only the Non-Latin version and not the Latin followed by the Non-Latin between brackets. Can you help me in getting the result I had last year? Thanks in advance, Yannis

Le 25 nov. 2020 à 16:33, plk ***@***.***> a écrit : Unfortunately, these versions are not in TLMGR and you have to get them from Sourceforge and install manually. You need biblatex 4 and biber 4. — You are receiving this because you were mentioned. Reply to this email directly, view it on GitHub <#416 (comment)>, or unsubscribe <https://github.com/notifications/unsubscribe-auth/AAFXC7OJDXU6OS4HUSDQ7WDSRUPUJANCNFSM4CB64K5Q>.

<http://www.imt-atlantique.fr/> Yannis HARALAMBOUS Professor Computer Science Department UMR CNRS 6285 Lab-STICC <http://perso.telecom-bretagne.eu/yannisharalambous/> <https://twitter.com/y_haralambous> <https://www.linkedin.com/in/yannis-haralambous-5529073?trk=hp-identity-name>Technopôle Brest-Iroise CS 83818 29238 Brest Cedex 3, France Une école de l'IMT <http://www.imt.fr/> De l'autre côté de la rade le collège se dressait toujours, trapu, massif, disgracieux, telle une description trop longue dans un roman du dix-neuvième siècle. (Jean-Pierre Ohl)

plk · 2020-12-03T21:39:54Z

Here is a complete example which works with the 4.0 versions of biblatex/biber and which shows the general approach and gives the results you want. Some notes:

This is a slightly strange format as is has Japanese as the primary language for non-names and English/transliterations for names. So, I chose english as the default language and gave japanese as the alternate form.
To be more correct, you would probably want to define a custom name part for japanese in the data model. Here, the japanese name translations are treated as just "family" parts which is obviously semantically incorrect.
More check could be put in to test for the presence of japanese translated variants for fields to avoid spurious output.

\documentclass[a4paper]{article}
\usepackage{luatexja-fontspec}
\setmainfont{TeXGyrePagella-Regular}
\setmainjfont{IPAexMincho}
\usepackage[japanese,main=english]{babel}
\begin{filecontents}[force]{\jobname.bib}
@INPROCEEDINGS{bib21,
  AUTHOR                      = {Kōno, Rokurō and Nagata, Hidemasa and Sasahara, Hiroyuki},
  AUTHOR_translation_ja-jp    = {河野六郎 and 永田英正 and 笹原宏之},
  EDITOR                      = {Kōno, Rokurō and Chino, Eiichi and Nishida, Tatsuo},
  EDITOR_translation_ja-jp    = {河野六郎 and 千野栄一 and 西田龍雄},
  TITLE                       = {Kanji},
  TITLE_translation_ja-jp     = {漢字},
  BOOKTITLE                   = {Encyclopedia of the World's Scripts},
  BOOKTITLE_translation_ja-jp = {世界文字辞典},
  PUBLISHER                   = {Sanseidō},
  PUBLISHER_translation_ja-jp = {三省堂},
  ADDRESS                     = {Tokyo},
  PAGES                       = {256--281},
  YEAR                        = {2001}
}
\end{filecontents}
\usepackage{csquotes}
\usepackage[style=authoryear,%
            dynamiclabel=true,%
            language=auto,%
            autolang=other,%
            autofieldlang=other]{biblatex}
\addbibresource{\jobname.bib}

\def\jpnamevariant#1{%
  \expandafter\ifdefvoid\csname #1translationja-jp\endcsname
    {}
    {\space\mkbibbrackets{\csuse{#1translationja-jp}}}}

\def\jplistvariant#1{%
  \expandafter\ifdefvoid\csname #1translationja-jp\endcsname
    {}
    {\csuse{#1translationja-jp}}}
  
\def\jpfieldvariant#1{%
  \iffieldundef[translation][ja-jp]{#1}
    {}
    {\thefield[translation][ja-jp]{#1}}}

\DeclareNameFormat{given-family}{%
  \ifgiveninits
    {\usebibmacro{name:given-family}
      {\namepartfamily}
      {\namepartgiveni}
      {\namepartprefix}
      {\namepartsuffix}}
    {\usebibmacro{name:given-family}
      {\namepartfamily}
      {\namepartgiven}
      {\namepartprefix}
      {\namepartsuffix}}%
   \jpnamevariant{namepartfamily}%  
   \usebibmacro{name:andothers}}
  
\DeclareNameFormat{family-given/given-family}{%
  \ifnumequal{\value{listcount}}{1}
    {\ifgiveninits
       {\usebibmacro{name:family-given}
         {\namepartfamily}
         {\namepartgiveni}
         {\namepartprefix}
         {\namepartsuffix}}
       {\usebibmacro{name:family-given}
         {\namepartfamily}
         {\namepartgiven}
         {\namepartprefix}
         {\namepartsuffix}}%
     \jpnamevariant{namepartfamily}%
     \ifboolexpe{%
       test {\ifdefvoid\namepartgiven}
       and
       test {\ifdefvoid\namepartprefix}}
       {}
       {\usebibmacro{name:revsdelim}}}
    {\ifgiveninits
       {\usebibmacro{name:given-family}
         {\namepartfamily}
         {\namepartgiveni}
         {\namepartprefix}
         {\namepartsuffix}}
       {\usebibmacro{name:given-family}
         {\namepartfamily}
         {\namepartgiven}
         {\namepartprefix}
         {\namepartsuffix}}%
     \jpnamevariant{namepartfamily}}%
  \usebibmacro{name:andothers}}

\DeclareFieldFormat
  [article,inbook,incollection,inproceedings,patent,thesis,unpublished]
  {title}{\mkbibquote{\jpfieldvariant{title}\addspace\mkbibbrackets{#1}\isdot}}

\DeclareFieldFormat{booktitle}{\jpfieldvariant{booktitle}\addspace\mkbibbrackets{#1}}

\DeclareListFormat{publisher}{%
  \usebibmacro{list:delim}{#1}%
  \usebibmacro{list:langswitchon}%
  \jplistvariant{listitem}\addspace\mkbibbrackets{#1}\isdot
  \usebibmacro{list:langswitchoff}%
  \usebibmacro{list:andothers}}

\begin{document}
\cite{bib21}
\printbibliography
\end{document}

dbitouze · 2021-01-02T13:00:37Z

Here is a complete example which works with the 4.0 versions of biblatex/biber

Sounds very nice, thanks! But where are available "the 4.0 versions of biblatex/biber"?

plk · 2021-01-02T13:53:43Z

In the "Experimental" folders on SF:

https://sourceforge.net/projects/biblatex-biber/files/biblatex-biber/experimental/
https://sourceforge.net/projects/biblatex/files/experimental/

plk · 2022-07-27T19:19:59Z

We are looking into "releasing" the muiltiscript version 4.0 of biblatex/biber as separate packages on CTAN so that they can have wider evaluation. They would be called biblatex-ms and biber-ms and users should be able to install these in parallel. They should be (and are in all current regression tests) backwards-compatible when not using any of the multiscript features and extended .bib syntax. However, they will still be considered "experimental" as they have not had extensive testing with many styles etc. They are slower than the current "standard" version as they have to do a lot more in order to account for multiple scripts in any field.

plk · 2023-03-08T21:34:36Z

biblatex-ms and biber-ms are now in TL for testing and feedback. They are fully documented and should be backwards compatible and up to date with the current 3.19/2.19 versions of biblatex/biber.

plk self-assigned this Apr 26, 2016

plk added the enhancement label Apr 26, 2016

duncdrum mentioned this issue May 4, 2016

Juris-M missing multi-lingual fields retorquere/zotero-better-bibtex#482

Closed

LSinev mentioned this issue Feb 24, 2020

TODO: Подборка сложных случаев bibtex записей AndreyAkinshin/Russian-Phd-LaTeX-Dissertation-Template#168

Open

tobiasdiez mentioned this issue Dec 20, 2020

Chinese author of essay reference section displays extra space JabRef/jabref#7199

Closed

1 task

pcdi mentioned this issue May 5, 2022

Handling of explicit nameparts JabRef/jabref#8760

Closed

plk added this to the v4.0 milestone Mar 8, 2023

bdarcus mentioned this issue Jun 12, 2023

Multilingual support bdarcus/csln#66

Open

Multiscript support in biblatex/biber #416

Multiscript support in biblatex/biber #416

Comments

plk commented Apr 26, 2016

duncdrum commented May 3, 2016

plk commented May 3, 2016

duncdrum commented May 3, 2016 • edited

plk commented May 3, 2016

maieul commented May 6, 2016

u-fischer commented May 19, 2016

plk commented May 19, 2016 • edited

u-fischer commented May 19, 2016

plk commented May 19, 2016

u-fischer commented May 19, 2016

plk commented May 19, 2016

plk commented May 21, 2016 • edited

u-fischer commented May 21, 2016

plk commented May 23, 2016 • edited

Shinoto-github commented May 26, 2016

plk commented May 26, 2016

Shinoto-github commented May 26, 2016

moewew commented Jun 5, 2016

plk commented Jun 5, 2016 • edited

moewew commented Jun 6, 2016

plk commented Jun 6, 2016

simifilm commented Jun 6, 2016

maieul commented Jun 6, 2016

plk commented Jun 6, 2016

maieul commented Jun 6, 2016

u-fischer commented Jun 6, 2016

plk commented Jun 6, 2016

mariashinoto commented Feb 12, 2020 via email

pauloney commented Feb 12, 2020 via email

norbusan commented Feb 12, 2020

pauloney commented Feb 12, 2020 via email

mariashinoto commented Feb 12, 2020 via email

pauloney commented Feb 12, 2020 via email

mariashinoto commented Feb 12, 2020 via email

plk commented Feb 12, 2020

retorquere commented Feb 12, 2020

mariashinoto commented Feb 12, 2020 via email

plk commented Feb 12, 2020

plk commented Feb 12, 2020

mariashinoto commented Feb 12, 2020 via email

pauloney commented Feb 12, 2020 via email

plk commented Feb 12, 2020

yannis1962 commented Nov 25, 2020 via email

plk commented Nov 25, 2020

yannis1962 commented Nov 25, 2020 via email

plk commented Nov 25, 2020

yannis1962 commented Dec 1, 2020 via email

plk commented Dec 3, 2020

dbitouze commented Jan 2, 2021

plk commented Jan 2, 2021

plk commented Jul 27, 2022

plk commented Mar 8, 2023

duncdrum commented May 3, 2016 •

edited

plk commented May 19, 2016 •

edited

plk commented May 21, 2016 •

edited

plk commented May 23, 2016 •

edited

plk commented Jun 5, 2016 •

edited