Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

output-safechars in tool mode #68

Closed
thorade opened this issue May 4, 2015 · 8 comments
Closed

output-safechars in tool mode #68

thorade opened this issue May 4, 2015 · 8 comments

Comments

@thorade
Copy link

thorade commented May 4, 2015

According to http://tex.stackexchange.com/q/57743/8569 accented characters and umlauts in the bib file should be written as {\"a} or {\^e},
but when using biber in tool mode with output_safechars I get
{\"{a}} and {\^{e}}.
Is that the intended behavior?

@plk
Copy link
Owner

plk commented May 4, 2015

Those comments are correct for bibtex but not for biber. biber can handle the \"{a} case and sorting issues with such macro encodings are irrelevant as everything is converted into UTF-8 internally for sorting. So, yes, that's the intended behaviour at the moment.

@thorade
Copy link
Author

thorade commented May 4, 2015

Thanks for the answer, even though I still did not understand why the umlauts are encoded this way with two braces.
As you say this is the inteded behavior, I'll just close this issue and live with it.

PS: I personally use UTF8 and LuaLaTeX with biber and Biblatex,
but in one project we are writing a report using pandoc and markdown (and an Ascii bib file).

@thorade thorade closed this as completed May 4, 2015
@thorade
Copy link
Author

thorade commented May 4, 2015

Probably also intended!?
with the command biber --tool output_encoding=UTF8 references.bib
M{\"u}ller will be converted to M{ü}ller

@plk
Copy link
Owner

plk commented May 4, 2015

The reason for all this is because parsing these macros is very tricky and some standards have to be imposed. However, this last thing you mention is fixed in biber 2.1

@thorade
Copy link
Author

thorade commented May 4, 2015

One more question:
Can I completely disable processing of the data?
In other words, can I use the --output_align and --output-indent=2 and --output_fieldcase=lower but leave the content and encoding as it is, no matter how broken it is?

@plk
Copy link
Owner

plk commented May 4, 2015

Hmm, you mean not touch the TeX macros? Not really, biber needs to parse and convert to UTF-8 in order to sort etc. and outputting exactly what was in the .bib character for character in terms of fields would be very difficult.

@thorade
Copy link
Author

thorade commented May 4, 2015

OK, then just leave everything as it is.
Thanks for answering all my questions, and of course thanks for biber!

@thorade
Copy link
Author

thorade commented Dec 21, 2015

Note to myself: Biber 2.1 fixed the double curly braces not only for UTF-8 output but also for the --output_safechars case

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants