Languages support #281

earth2378 · 2018-05-12T10:17:50Z

I tried to use with Thai language but it was found that:

terminate called after throwing an instance of 'xml::serialization'
what(): xl/sharedStrings.xml: error: invalid UTF-8

tfussell · 2018-06-04T15:58:20Z

thanks for reporting his problem. UTF compatibility is a priority for me. Could you show me the code that caused this problem? Or tell me more about what you were trying to do.

ZM-J · 2018-07-12T04:17:48Z

@tfussell Chinese also got the same fault.
Just substitute

ws.cell("B2").value("Hello world");

in sample code with

ws.cell("B2").value("你好，世界");

and you will get the fault.
I use vs2017 and use xlnt.lib as the linker input. I got the right xlsx file after running the sample code, but got fault immediately when I changed the string to Chinese.

ZM-J · 2018-07-12T05:33:22Z

Finally I solved the problem by noticing issue #215 and using u8"你好，世界". I think @earth2378 might encounter the same problem as mine.

Crzyrndm · 2018-07-12T21:31:43Z

Nice find.

A catch you may want to be aware of if you need portability is that even with the u8 literal, different compilers/editors may still choke and give the incorrect output (relevant stack overflow).

The only way to ensure that everything works as expected anywhere with literals is the unicode escape sequences. It's downright ugly though :(

你好，世界
u8"\u4F60\u597D\uFF0C\u4E16\u754C"

Hex format from https://unicodelookup.com/#%E4%BD%A0%E5%A5%BD%EF%BC%8C%E4%B8%96%E7%95%8C/1

Crzyrndm · 2018-07-28T01:29:47Z

It is likely that using u8 prefix (+ escape sequences if neccesary) will resolve the reported issue for @earth2378 . Confirmation (or not) of this would be appreciated

@tfussell
This is a recurring issue. I wonder if documentation (e.g. where UTF8 compatibility is mentioned, add a note about source literals) or source changes/additions can be made that may prevent this error.

li1553770945 · 2020-08-08T12:09:25Z

Finally I solved the problem by noticing issue #215 and using u8"你好，世界". I think @earth2378 might encounter the same problem as mine.

I try to use your method,but if I want to read a excel,like ws.cell(1,1).to_string() ,if the value of (1,1) is Chinese,it will be messy code.Do you know how can I fix it?

tfussell · 2020-08-20T19:13:51Z

Finally I solved the problem by noticing issue #215 and using u8"你好，世界". I think @earth2378 might encounter the same problem as mine.

I try to use your method,but if I want to read a excel,like ws.cell(1,1).to_string() ,if the value of (1,1) is Chinese,it will be messy code.Do you know how can I fix it?

What are you trying to do with it? It should be a valid UTF-8 string you can use like any other Chinese UTF-8 string from another source.

tfussell added the bug label Jun 4, 2018

Crzyrndm added the awaiting-response label Jul 28, 2018

tfussell added docs and removed awaiting-response bug labels Dec 20, 2019

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Languages support #281

Languages support #281

earth2378 commented May 12, 2018 •

edited

Loading

tfussell commented Jun 4, 2018

ZM-J commented Jul 12, 2018 •

edited

Loading

ZM-J commented Jul 12, 2018

Crzyrndm commented Jul 12, 2018 •

edited

Loading

Crzyrndm commented Jul 28, 2018 •

edited

Loading

li1553770945 commented Aug 8, 2020

tfussell commented Aug 20, 2020

Languages support #281

Languages support #281

Comments

earth2378 commented May 12, 2018 • edited Loading

tfussell commented Jun 4, 2018

ZM-J commented Jul 12, 2018 • edited Loading

ZM-J commented Jul 12, 2018

Crzyrndm commented Jul 12, 2018 • edited Loading

Crzyrndm commented Jul 28, 2018 • edited Loading

li1553770945 commented Aug 8, 2020

tfussell commented Aug 20, 2020

earth2378 commented May 12, 2018 •

edited

Loading

ZM-J commented Jul 12, 2018 •

edited

Loading

Crzyrndm commented Jul 12, 2018 •

edited

Loading

Crzyrndm commented Jul 28, 2018 •

edited

Loading