Fix: Windows Unicode encoding issue #662
Merged: paullizer merged 1 commit into microsoft:Development from vivche:windows-unicode-encoding-issue-fix on Jan 30, 2026
- Added explicit UTF-8 encoding when reading file content on Windows
- Prevents UnicodeDecodeError when processing non-ASCII filenames
- Ensures consistent file handling across different operating systems
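The explicit-encoding change described above can be illustrated with a small sketch. This is not the code from this PR: the helper name `read_text_utf8` and the file name `transcript.txt` are hypothetical, and the example assumes the files being read are UTF-8. On Windows, `open()` without an `encoding` argument falls back to the locale encoding (often cp1252), which is what the second read below imitates.

```python
import tempfile
from pathlib import Path


def read_text_utf8(path):
    """Read a text file as UTF-8, ignoring the platform's locale default."""
    # Hypothetical helper: the explicit encoding is the whole point.
    with open(path, "r", encoding="utf-8") as handle:
        return handle.read()


with tempfile.TemporaryDirectory() as tmp:
    sample = Path(tmp) / "transcript.txt"  # hypothetical file name
    sample.write_text("字幕: こんにちは, wörld", encoding="utf-8")

    # Explicit UTF-8 decodes the content the same way on every platform.
    content = read_text_utf8(sample)

    # Forcing cp1252 imitates a typical Windows default; some UTF-8 bytes
    # in this text have no cp1252 mapping, so decoding fails.
    try:
        sample.read_text(encoding="cp1252")
        cp1252_failed = False
    except UnicodeDecodeError:
        cp1252_failed = True
```

Passing `encoding="utf-8"` at every read site is more verbose than relying on the default, but it removes the dependency on the machine's locale settings.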
paullizer
approved these changes
Jan 30, 2026
Contributor
paullizer
left a comment
Nice, straightforward.
paullizer
added a commit
that referenced
this pull request
Jan 30, 2026
* Add custom subdomain support for OpenAI and Speech Service in Terraform
  - Added custom_subdomain_name to OpenAI resource for managed identity authentication
  - Created Speech Service resource with custom subdomain configuration
  - Added RBAC role assignments for Speech Service (Managed Identity and App Service MI)
  - Includes Cognitive Services Speech User and Speech Contributor roles
  - Documentation: Azure Speech managed identity setup guide
* Fix Azure AI Search test connection with managed identity
  Replaced REST API approach with SearchIndexClient SDK to properly handle managed identity authentication in Azure public cloud. The SDK automatically handles token acquisition and endpoint construction, eliminating the 'search_resource_manager is not defined' error that occurred with the REST API approach.
* Fix Azure AI Search test connection with managed identity
  Replaced REST API approach with SearchIndexClient SDK to properly handle managed identity authentication in Azure public cloud. The SDK automatically handles token acquisition and endpoint construction, eliminating the 'search_resource_manager is not defined' error that occurred with the REST API approach.
* Corrected file folder name
* Corrected the version number to reference 0.236.012
* Removed unneeded folder and document
* Revert terraform main.tf to upstream/Development version
* updated the logging logic when running retention delete with archiving enabled (#642)
* Corrected version to 0.236.011 (#645)
* v0.237.001 (#649)
* Use Microsoft python base image
* Add python ENV vars
* Add python ENV vars
* Install deps to systme
* Add temp dir to image and pip conf support
* Add custom-ca-certificates dir
* Logo bug fix (#654)
* release note updating for github coplilot
* fixed logo bug issue
* added 2,3,4,5,6,14 days to rentention policy
* added retention policy time updates
* Rentention policy (#657)
* Critical Retention Policy Deletion Fix
* Create RETENTION_POLICY_NULL_LAST_ACTIVITY_FIX.md
* fixed retention policy runtime bug and sidebar bug (#672)
* Fix: Windows Unicode encoding issue for video uploads (#662)
  - Added explicit UTF-8 encoding when reading file content on Windows
  - Prevents UnicodeDecodeError when processing non-ASCII filenames
  - Ensures consistent file handling across different operating systems
  Co-authored-by: Chen, Vivien <Vivien.Chen+ecolab@ecolab.com>
* Update docs/how-to/azure_speech_managed_identity_manul_setup.md (#675)
  Co-authored-by: vivche <vivche@microsoft.com>
  Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>
* Add custom subdomain support for OpenAI and Speech Service in Terraform (#558)
* Add custom subdomain support for OpenAI and Speech Service in Terraform
  - Added custom_subdomain_name to OpenAI resource for managed identity authentication
  - Created Speech Service resource with custom subdomain configuration
  - Added RBAC role assignments for Speech Service (Managed Identity and App Service MI)
  - Includes Cognitive Services Speech User and Speech Contributor roles
  - Documentation: Azure Speech managed identity setup guide
* Update docs/how-to/azure_speech_managed_identity_manul_setup.md
  Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>
---------
Co-authored-by: Chen, Vivien <Vivien.Chen+ecolab@ecolab.com>
Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>
* 0.237.006 (#676)
* Update chat-sidebar-conversations.js
* 0.237.006
* Update release_notes.md
---------
Co-authored-by: Chen, Vivien <Vivien.Chen+ecolab@ecolab.com>
Co-authored-by: Ed Clark <clarked@microsoft.com>
Co-authored-by: Bionic711 <13358952+Bionic711@users.noreply.github.com>
Co-authored-by: vivche <vivche@microsoft.com>
Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>
paullizer
added a commit
that referenced
this pull request
Jan 30, 2026
* Add custom subdomain support for OpenAI and Speech Service in Terraform
  - Added custom_subdomain_name to OpenAI resource for managed identity authentication
  - Created Speech Service resource with custom subdomain configuration
  - Added RBAC role assignments for Speech Service (Managed Identity and App Service MI)
  - Includes Cognitive Services Speech User and Speech Contributor roles
  - Documentation: Azure Speech managed identity setup guide
* Fix Azure AI Search test connection with managed identity
  Replaced REST API approach with SearchIndexClient SDK to properly handle managed identity authentication in Azure public cloud. The SDK automatically handles token acquisition and endpoint construction, eliminating the 'search_resource_manager is not defined' error that occurred with the REST API approach.
* Fix Azure AI Search test connection with managed identity
  Replaced REST API approach with SearchIndexClient SDK to properly handle managed identity authentication in Azure public cloud. The SDK automatically handles token acquisition and endpoint construction, eliminating the 'search_resource_manager is not defined' error that occurred with the REST API approach.
* Corrected file folder name
* Corrected the version number to reference 0.236.012
* Removed unneeded folder and document
* Revert terraform main.tf to upstream/Development version
* updated the logging logic when running retention delete with archiving enabled (#642)
* Corrected version to 0.236.011 (#645)
* v0.237.001 (#649)
* Use Microsoft python base image
* Add python ENV vars
* Add python ENV vars
* Install deps to systme
* Add temp dir to image and pip conf support
* Add custom-ca-certificates dir
* Logo bug fix (#654)
* release note updating for github coplilot
* fixed logo bug issue
* added 2,3,4,5,6,14 days to rentention policy
* added retention policy time updates
* Rentention policy (#657)
* Critical Retention Policy Deletion Fix
* Create RETENTION_POLICY_NULL_LAST_ACTIVITY_FIX.md
* fixed retention policy runtime bug and sidebar bug (#672)
* Fix: Windows Unicode encoding issue for video uploads (#662)
  - Added explicit UTF-8 encoding when reading file content on Windows
  - Prevents UnicodeDecodeError when processing non-ASCII filenames
  - Ensures consistent file handling across different operating systems
  Co-authored-by: Chen, Vivien <Vivien.Chen+ecolab@ecolab.com>
* Update docs/how-to/azure_speech_managed_identity_manul_setup.md (#675)
  Co-authored-by: vivche <vivche@microsoft.com>
  Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>
* Add custom subdomain support for OpenAI and Speech Service in Terraform (#558)
* Add custom subdomain support for OpenAI and Speech Service in Terraform
  - Added custom_subdomain_name to OpenAI resource for managed identity authentication
  - Created Speech Service resource with custom subdomain configuration
  - Added RBAC role assignments for Speech Service (Managed Identity and App Service MI)
  - Includes Cognitive Services Speech User and Speech Contributor roles
  - Documentation: Azure Speech managed identity setup guide
* Update docs/how-to/azure_speech_managed_identity_manul_setup.md
  Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>
---------
Co-authored-by: Chen, Vivien <Vivien.Chen+ecolab@ecolab.com>
Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>
* 0.237.006 (#676)
* Update chat-sidebar-conversations.js
* 0.237.006
* Update release_notes.md
* fixed sidebar race condition (#679)
---------
Co-authored-by: Chen, Vivien <Vivien.Chen+ecolab@ecolab.com>
Co-authored-by: Ed Clark <clarked@microsoft.com>
Co-authored-by: Bionic711 <13358952+Bionic711@users.noreply.github.com>
Co-authored-by: vivche <vivche@microsoft.com>
Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>
Fixes #644 - Windows Unicode Encoding Issue Report
Problem
The application crashes on Windows when processing or displaying Unicode characters beyond the Western European character set. This critical cross-platform compatibility issue occurs because:
cp1252 encoding for stdout/stderr (limited to 256 Western European characters)
This affects multiple areas including:
Common Error:
UnicodeEncodeError: 'charmap' codec can't encode character '\uXXXX'
Solution
Configured UTF-8 encoding globally at application startup for Windows platforms. This ensures:
Changes
app.py: reconfigure sys.stdout and sys.stderr to UTF-8 on Windows
Testing
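The startup reconfiguration can be exercised without a Windows console. The sketch below is illustrative and is not the code from this PR: `force_utf8_stdio` is a hypothetical name for what `app.py` might call at startup, and the in-memory `console` stream stands in for a cp1252 stdout. It first reproduces the `'charmap'` encode error, then shows the same write succeeding after `reconfigure(encoding="utf-8")`.

```python
import io
import sys


def force_utf8_stdio():
    """Switch stdout/stderr to UTF-8 on Windows; do nothing elsewhere."""
    if sys.platform != "win32":
        return
    for stream in (sys.stdout, sys.stderr):
        # reconfigure() exists on io.TextIOWrapper (Python 3.7+); skip
        # replaced streams that do not provide it.
        if hasattr(stream, "reconfigure"):
            stream.reconfigure(encoding="utf-8")


# Simulate a cp1252 console so the check also runs on non-Windows hosts.
buffer = io.BytesIO()
console = io.TextIOWrapper(buffer, encoding="cp1252")

try:
    console.write("日本語")
    console.flush()
    error_message = None
except UnicodeEncodeError as exc:
    error_message = str(exc)

# Same stream, same text, after switching the encoder to UTF-8.
console.reconfigure(encoding="utf-8")
console.write("日本語")
console.flush()
utf8_bytes = buffer.getvalue()

force_utf8_stdio()  # no-op unless running on Windows
```

Reconfiguring the streams once at startup keeps the fix in one place; the alternative of encoding at each print or log call would have to be repeated everywhere output is produced.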