Skip to content

'gbk' codec can't decode #167

@YangAoLib

Description

@YangAoLib

When I encounter an error while using the semble command line

UnicodeDecodeError: 'gbk' codec can't decode byte 0xae in position 1522: illegal multibyte sequence

terminal version

PS D:\project\ke-recruit> pwsh --version
PowerShell 7.6.0

system info

version Windows 10 IoT enterprise LTSC 22H2
install date ‎2025/‎11/‎26
operate system version 19044.6218

semble version

PS D:\project\ke-recruit> uv tool list
semble v0.3.0
- semble

error info

PS D:\project\ke-recruit> chcp
活动代码页: 936
PS D:\project\ke-recruit> semble search "求职者登录"
Traceback (most recent call last):
  File "<frozen runpy>", line 198, in _run_module_as_main
  File "<frozen runpy>", line 88, in _run_code
  File "C:\Users\yangao010\.local\bin\semble.exe\__main__.py", line 10, in <module>
    sys.exit(main())
             ~~~~^^
  File "D:\yangao010\AppData\Roaming\uv\tools\semble\Lib\site-packages\semble\cli.py", line 78, in main
    _cli_main()
    ~~~~~~~~~^^
  File "D:\yangao010\AppData\Roaming\uv\tools\semble\Lib\site-packages\semble\cli.py", line 203, in _cli_main
    _run_search(args.path, args.query, args.top_k, _resolve_content(args.content, args.include_text_files))
    ~~~~~~~~~~~^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "D:\yangao010\AppData\Roaming\uv\tools\semble\Lib\site-packages\semble\cli.py", line 142, in _run_search
    index = _load_index(path, content)
  File "D:\yangao010\AppData\Roaming\uv\tools\semble\Lib\site-packages\semble\cli.py", line 134, in _load_index
    return _build_index(path, content)
  File "D:\yangao010\AppData\Roaming\uv\tools\semble\Lib\site-packages\semble\cli.py", line 38, in _build_index
    else SembleIndex.from_path(path, content=content)
         ~~~~~~~~~~~~~~~~~~~~~^^^^^^^^^^^^^^^^^^^^^^^
  File "D:\yangao010\AppData\Roaming\uv\tools\semble\Lib\site-packages\semble\index\index.py", line 149, in from_path
    cache_path = get_validated_cache(str(path), model_path, normalized)
  File "D:\yangao010\AppData\Roaming\uv\tools\semble\Lib\site-packages\semble\cache.py", line 100, in get_validated_cache
    metadata = json.load(f)
  File "D:\yangao010\AppData\Roaming\uv\python\cpython-3.14-windows-x86_64-none\Lib\json\__init__.py", line 298, in load
    return loads(fp.read(),
                 ~~~~~~~^^
UnicodeDecodeError: 'gbk' codec can't decode byte 0xae in position 1522: illegal multibyte sequence
PS D:\project\ke-recruit> semble search "login"
Traceback (most recent call last):
  File "<frozen runpy>", line 198, in _run_module_as_main
  File "<frozen runpy>", line 88, in _run_code
  File "C:\Users\yangao010\.local\bin\semble.exe\__main__.py", line 10, in <module>
    sys.exit(main())
             ~~~~^^
  File "D:\yangao010\AppData\Roaming\uv\tools\semble\Lib\site-packages\semble\cli.py", line 78, in main
    _cli_main()
    ~~~~~~~~~^^
  File "D:\yangao010\AppData\Roaming\uv\tools\semble\Lib\site-packages\semble\cli.py", line 203, in _cli_main
    _run_search(args.path, args.query, args.top_k, _resolve_content(args.content, args.include_text_files))
    ~~~~~~~~~~~^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "D:\yangao010\AppData\Roaming\uv\tools\semble\Lib\site-packages\semble\cli.py", line 142, in _run_search
    index = _load_index(path, content)
  File "D:\yangao010\AppData\Roaming\uv\tools\semble\Lib\site-packages\semble\cli.py", line 134, in _load_index
    return _build_index(path, content)
  File "D:\yangao010\AppData\Roaming\uv\tools\semble\Lib\site-packages\semble\cli.py", line 38, in _build_index
    else SembleIndex.from_path(path, content=content)
         ~~~~~~~~~~~~~~~~~~~~~^^^^^^^^^^^^^^^^^^^^^^^
  File "D:\yangao010\AppData\Roaming\uv\tools\semble\Lib\site-packages\semble\index\index.py", line 149, in from_path
    cache_path = get_validated_cache(str(path), model_path, normalized)
  File "D:\yangao010\AppData\Roaming\uv\tools\semble\Lib\site-packages\semble\cache.py", line 100, in get_validated_cache
    metadata = json.load(f)
  File "D:\yangao010\AppData\Roaming\uv\python\cpython-3.14-windows-x86_64-none\Lib\json\__init__.py", line 298, in load
    return loads(fp.read(),
                 ~~~~~~~^^
UnicodeDecodeError: 'gbk' codec can't decode byte 0xae in position 1522: illegal multibyte sequence

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type
    No fields configured for issues without a type.

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions