Fix ASCII encoding issue when updating files with non-ASCII characters by erkinalp · Pull Request #116 · OpenHands/openhands-aci

erkinalp · 2025-05-01T17:57:24Z

Description

This PR fixes an issue with file encoding detection that causes errors when trying to add non-ASCII characters (like Chinese text) to files that were initially created with only ASCII content.

Related Issue

This issue was reported in OpenHands issue #8209.

Motivation and Context

When a file is initially created with only ASCII characters, its encoding is detected as 'ascii'. However, when trying to add non-ASCII characters to this file later, the operation fails with a UnicodeEncodeError because the 'ascii' encoding can't handle these characters.

Error message:

UnicodeEncodeError: 'ascii' codec can't encode characters in position 0-3: ordinal not in range(128)

This fix ensures that files initially containing only ASCII characters can later accept non-ASCII content (such as Chinese, Japanese, or other Unicode characters).

How Has This Been Tested?

Added a test case in tests/test_encoding.py that verifies ASCII files are detected as UTF-8, ensuring they can handle non-ASCII characters when edited later.

Does this PR introduce a breaking change?

No, this PR does not introduce a breaking change. It maintains backward compatibility while fixing the encoding issue.

ryanhoangt

LGTM, thanks!

openhands-agent and others added 3 commits May 1, 2025 17:41

Fix ASCII encoding issue when updating files with non-ASCII characters

31ab5c4

move test to intg

1fcdf92

bump to 0.2.12

30ed4e6

ryanhoangt approved these changes May 2, 2025

View reviewed changes

ryanhoangt merged commit 03a1a97 into OpenHands:main May 2, 2025
4 checks passed

erkinalp deleted the fix-ascii-encoding-issue branch May 2, 2025 17:59

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix ASCII encoding issue when updating files with non-ASCII characters#116

Fix ASCII encoding issue when updating files with non-ASCII characters#116
ryanhoangt merged 3 commits into
OpenHands:mainfrom
erkinalp:fix-ascii-encoding-issue

erkinalp commented May 1, 2025

Uh oh!

ryanhoangt left a comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

erkinalp commented May 1, 2025

Description

Related Issue

Motivation and Context

How Has This Been Tested?

Does this PR introduce a breaking change?

Uh oh!

ryanhoangt left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants