
fix: automatically replace unsupported torch device #2514

Merged
merged 1 commit into chatchat-space:master on Jan 11, 2024

Conversation


@Drincann Drincann (Contributor) commented on Dec 31, 2023

Thank you for your hard work on this fantastic project!

I've been trying to run this project on a Mac Studio with an M2 chip. Following the README instructions, I completed database initialization, model downloading, and other setup steps. However, I encountered a startup failure when launching the service.

The error was raised as AssertionError('Torch not compiled with CUDA enabled'). This is expected, since the M2 chip has no CUDA support. Following the stack trace, I located the issue in server/utils.py, where the 'cuda' device is set:

def get_model_worker_config(model_name: str = None) -> dict:
    from configs.model_config import ONLINE_LLM_MODEL, MODEL_PATH
    from configs.server_config import FSCHAT_MODEL_WORKERS
    from server import model_workers

    config = FSCHAT_MODEL_WORKERS.get("default", {}).copy()
    config.update(ONLINE_LLM_MODEL.get(model_name, {}).copy())
    config.update(FSCHAT_MODEL_WORKERS.get(model_name, {}).copy()) # device: 'cuda' loaded here

    if model_name in ONLINE_LLM_MODEL:
        # ...

    if model_name in MODEL_PATH["llm_model"]:
        # ...
        config["device"] = llm_device(config.get("device"))
    return config

In configs/server_config.py, the 'cuda' device is explicitly set:

FSCHAT_MODEL_WORKERS = {
    # ...

    "chatglm3-6b": {
        "device": "cuda",  # here
    },

    # ...
}

Although the current code supports multiple platforms, this hard-coded 'cuda' setting fails for users on Apple silicon. To make the project more robust and ensure a smooth first startup for Apple-chip users running chatglm3-6b, I suggest adding device detection and fallback logic: when the configured device is not supported, the system should automatically switch to a viable option, such as falling back from 'cuda' to 'mps'.
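For illustration, here is a minimal sketch of the kind of fallback I mean. The detect_device helper and the exact checks are my own illustration, not the project's current llm_device implementation:

import torch

def detect_device() -> str:
    # Pick the best backend the local torch build actually supports.
    if torch.cuda.is_available():
        return "cuda"
    if torch.backends.mps.is_available():
        return "mps"
    return "cpu"

def llm_device(device: str = None) -> str:
    # Keep the configured device when it is usable, otherwise fall back,
    # e.g. from 'cuda' to 'mps' on an Apple M2 machine.
    if device == "cuda" and not torch.cuda.is_available():
        return detect_device()
    if device == "mps" and not torch.backends.mps.is_available():
        return detect_device()
    return device if device in ("cuda", "mps", "cpu") else detect_device()

With logic like this applied in get_model_worker_config, the explicit "cuda" entry in FSCHAT_MODEL_WORKERS would resolve to 'mps' (or 'cpu') on machines without CUDA, and the service could start without manual config changes.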

@dosubot dosubot bot added the size:S This PR changes 10-29 lines, ignoring generated files. label Dec 31, 2023
@zRzRzRzRzRzRzR zRzRzRzRzRzRzR (Collaborator) left a comment


OK, will merge.

@zRzRzRzRzRzRzR zRzRzRzRzRzRzR merged commit e7bba6b into chatchat-space:master Jan 11, 2024