UTF8 instead of UTF16 for international support #1220

pcrockett-pathway · 2019-09-24T09:48:50Z

Issue description

I appreciate that WACS is friendly for international users and supports Unicode instead of just ASCII. However the choice of UTF16 (AKA Encoding.Unicode) really creates some weird problems. As a trivial example, if I run this command in PowerShell:

.\wacs.exe --help > help.txt

... it produces a text file that looks like this:

PowerShell inserts extra null characters because it assumes each two-byte character is actually two separate one-byte characters.

Parsing output looking for [WARN] and [EROR] also becomes more difficult because of this. In PowerShell, I'd normally expect to be able to do this:

$wacsOutput = .\wacs.exe [command line args]
# Now look for [WARN] or [EROR] messages in $wacsOutput

But PowerShell doesn't do this correctly. I actually have to do this instead:

$wacsOutputTempFile = Join-Path $PSScriptRoot "temp.txt"
Start-Process ".\wacs.exe" -ArgumentList $CommandArgs `
    -Wait `
    -RedirectStandardOutput $wacsOutputTempFile `
    -NoNewWindow
$wacsOutput = Get-Content -Encoding Unicode $wacsOutputTempFile
# Now look for [WARN] or [EROR] messages in $wacsOutput

Would it be possible to use the UTF8 encoding instead? This would give us the best of both worlds: It's still a Unicode encoding with the ability to display international character sets, yet it remains backward-compatible with shells that assume an ASCII encoding.

The text was updated successfully, but these errors were encountered:

WouterTinus · 2019-09-26T18:57:35Z

We've used UTF8 initially for international domain support, but it turned out that at least in a classic cmd window, UTF16 works better for displaying certain Asian language characters while UTF8 falls apart for some reason, I think probably because of legacy stuff in the Microsoft console.

Perhaps it's possible to make Powershell aware of the UTF16 output encoding somehow?

https://stackoverflow.com/questions/49476326/displaying-unicode-in-powershell

pcrockett-pathway · 2019-09-27T07:08:49Z

Thanks for the link, that looks promising. I had not tried setting [console]::InputEncoding or [console]::OutputEncoding yet (though I'm unable to test at the moment).

Another option worth considering is perhaps adding an --encoding parameter, where UTF16 is the default, but a user could easily specify ASCII if they needed it. If this little PowerShell hack works, I could live without that feature, though others might appreciate it. This particular PowerShell quirk isn't the easiest thing to figure out.

woutertinusf19 · 2019-09-27T09:53:15Z

A setting is always an option if it turns out that it's not possible to get Powershell to understand the output stream. Another thing that you might try in your case is to look at the log file instead of capturing the screen output.

pcrockett-pathway · 2019-09-30T10:20:26Z

I was able to try changing the input and output encoding, and that kind of fixed it. It now looks like wacs.exe outputs errors in a different encoding.

Here's an example of what I get when I run it normally:

 [EROR]  parameters cannot be changed during a renewal. Recreate/overwrite the renewal or edit the .json file if you want to make changes.


   [ I N F O ]   A   s i m p l e   W i n d o w s   A C M E v 2   c l i e n t   ( W A C S )

   [ I N F O ]   S o f t w a r e   v e r s i o n   1 . 0 . 0 . 0   ( R E L E A S E )

   [ I N F O ]   I I S   v e r s i o n   8 . 5

   [ I N F O ]   S c h e d u l e d   t a s k   l o o k s   h e a l t h y

   [ I N F O ]   P l e a s e   r e p o r t   i s s u e s   a t   h t t p s : / / g i t h u b . c o m / P K I S h a r p / w i n - a c m e

And here's an example of what I get when I change the output encoding to [Text.Encoding]::Unicode:

嬠剅剏⁝瀠牡浡瑥牥⁳慣湮瑯戠⁥档湡敧⁤畤楲杮愠爠湥睥污‮敒牣慥整漯敶睲楲整琠敨爠湥睥污漠⁲摥瑩琠敨⸠獪湯映汩⁥晩礠畯眠湡⁴潴洠歡⁥档湡敧⹳਍
 [INFO] A simple Windows ACMEv2 client (WACS)
 [INFO] Software version 1.0.0.0 (RELEASE)
 [INFO] IIS version 8.5
 [INFO] Scheduled task looks healthy
 [INFO] Please report issues at https://github.com/PKISharp/win-acme

So it looks like I fixed one thing, but broke another thing.

WouterTinus · 2019-10-02T19:05:11Z

There will be a setting for this in the next release :)

WouterTinus · 2019-11-05T21:10:25Z

Released in 2.1.0

WouterTinus added the question label Sep 26, 2019

WouterTinus added this to the 2.1.0 milestone Oct 2, 2019

WouterTinus added enhancement and removed question labels Oct 2, 2019

WouterTinus closed this as completed Nov 5, 2019

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

UTF8 instead of UTF16 for international support #1220

UTF8 instead of UTF16 for international support #1220

pcrockett-pathway commented Sep 24, 2019 •

edited

WouterTinus commented Sep 26, 2019

pcrockett-pathway commented Sep 27, 2019

woutertinusf19 commented Sep 27, 2019

pcrockett-pathway commented Sep 30, 2019

WouterTinus commented Oct 2, 2019

WouterTinus commented Nov 5, 2019

Navigation Menu

UTF8 instead of UTF16 for international support #1220

UTF8 instead of UTF16 for international support #1220

Comments

pcrockett-pathway commented Sep 24, 2019 • edited

Issue description

WouterTinus commented Sep 26, 2019

pcrockett-pathway commented Sep 27, 2019

woutertinusf19 commented Sep 27, 2019

pcrockett-pathway commented Sep 30, 2019

WouterTinus commented Oct 2, 2019

WouterTinus commented Nov 5, 2019

pcrockett-pathway commented Sep 24, 2019 •

edited