
[WIP] add support for ultra-infer with cuda12.6 and cuda12.9 #4217


Open

wants to merge 5 commits into base: develop

Conversation

zhang-prog
Collaborator

No description provided.

@@ -258,7 +259,25 @@ def _install_hpi_deps(device_type):
package = "ultra-infer-npu-python"

with importlib.resources.path("paddlex", "hpip_links.html") as f:
install_packages([package], pip_install_opts=["--find-links", str(f)])
try:
version = importlib.metadata.version(package)
Member

Suggest using `paddlex.utils.deps.get_dep_version`.
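For context, a minimal sketch of what such a helper might look like. The actual `paddlex.utils.deps.get_dep_version` may differ; this stand-in just mirrors the `importlib.metadata` pattern already used in the diff:

```python
import importlib.metadata


def get_dep_version(package: str):
    # Hypothetical stand-in for paddlex.utils.deps.get_dep_version:
    # return the installed version string, or None when not installed.
    try:
        return importlib.metadata.version(package)
    except importlib.metadata.PackageNotFoundError:
        return None
```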

Collaborator Author

Done

response = input(
f"The package '{package}' (version {version}) is already installed. Do you want to reinstall it? (y/n): "
)
if response.lower() == "y":
Member

Please also support `yes`.
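The suggested change could look like this sketch (the helper name is illustrative, not from the PR):

```python
def should_reinstall(response: str) -> bool:
    # Accept both "y" and "yes", case-insensitively, per the review suggestion.
    return response.strip().lower() in ("y", "yes")
```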

Collaborator Author

Done

else:
return
except importlib.metadata.PackageNotFoundError:
install_packages([package], pip_install_opts=["--find-links", str(f)])

Member

Suggest adding one more check and hint here:

if not is_paddle2onnx_plugin_available():
    logging.info("The Paddle2ONNX plugin is not available. It is recommended to run `paddlex --install paddle2onnx` to install the Paddle2ONNX plugin to use the full functionality of high-performance inference.")

Collaborator Author

Done

try:
version = importlib.metadata.version(package)
response = input(
f"The package '{package}' (version {version}) is already installed. Do you want to reinstall it? (y/n): "
Member

Externally, let's consistently expose the concept of a "plugin" rather than a "package".

Collaborator Author

Done

@@ -71,7 +71,11 @@ else()
message("Cannot compile with onnxruntime-gpu while in linux-aarch64 platform, fallback to onnxruntime-cpu")
set(ONNXRUNTIME_FILENAME "onnxruntime-linux-aarch64-${ONNXRUNTIME_VERSION}.tgz")
else()
set(ONNXRUNTIME_FILENAME "onnxruntime-linux-x64-gpu-${ONNXRUNTIME_VERSION}.tgz")
if(CUDA_VERSION MATCHES "12.6" OR CUDA_VERSION MATCHES "12.9")
Member

Should this match only the major version here, or does it need to match down to the minor version?
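In Python terms, matching only the major version would look like the sketch below (in the CMake hunk itself, the equivalent would be a regex anchored on the major version, e.g. matching `^12\.`; the function name here is illustrative):

```python
def matches_major(cuda_version: str, major: int) -> bool:
    # True when only the major version matches, e.g. CUDA 12.6 and 12.9
    # both count as CUDA 12.
    return int(cuda_version.split(".")[0]) == major
```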

Collaborator Author

Member

It seems this hasn't been changed?

Collaborator Author

Done

install_packages([package], pip_install_opts=["--find-links", str(f)])
else:
response = input(
f"The plugin '{package}' (version {version}) is already installed. Do you want to reinstall it? (y/n): "
Member

Suggested wording here:

The high-performance inference plugin is already installed (version {repr(version)}). Do you want to reinstall it? (y/n) 
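As a sketch, the suggested message built with `repr`-style version formatting (`{version!r}` is equivalent to `repr(version)`; the function name is illustrative):

```python
def reinstall_prompt(version: str) -> str:
    # Plugin-centric wording, quoting the version via repr().
    return (
        f"The high-performance inference plugin is already installed "
        f"(version {version!r}). Do you want to reinstall it? (y/n) "
    )
```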

@@ -258,7 +260,30 @@ def _install_hpi_deps(device_type):
package = "ultra-infer-npu-python"

with importlib.resources.path("paddlex", "hpip_links.html") as f:
install_packages([package], pip_install_opts=["--find-links", str(f)])
version = get_dep_version(package)
Member

Still to be added: logic to check whether other variants are installed, plus adaptation for CUDA 12.
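A possible sketch of that variant check. Only `ultra-infer-npu-python` appears in the diff; the other variant names below are assumptions:

```python
import importlib.metadata

# Assumed ultra-infer variant package names (only the NPU one is confirmed
# by the diff).
VARIANTS = (
    "ultra-infer-python",
    "ultra-infer-gpu-python",
    "ultra-infer-npu-python",
)


def installed_variants():
    # Return the ultra-infer variants already present in the environment,
    # so installing a different variant can warn or uninstall them first.
    found = []
    for name in VARIANTS:
        try:
            importlib.metadata.version(name)
            found.append(name)
        except importlib.metadata.PackageNotFoundError:
            pass
    return found
```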

@zhang-prog zhang-prog added the wip label Jun 16, 2025
@zhang-prog zhang-prog changed the title add support for ultra-infer with cuda12.6 and cuda12.9 [WIP] add support for ultra-infer with cuda12.6 and cuda12.9 Jun 16, 2025
install_packages(
[package],
pip_install_opts=[
"--force-reinstall",
Member

Suggest the standard uninstall-then-install approach, to prevent the local cache from affecting behavior. Also, we shouldn't use `--no-deps`, right? Dependencies may change across versions.
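The uninstall-then-install flow could be sketched like this (command construction only; the function name and the `find_links` argument are illustrative):

```python
import sys


def reinstall_commands(package: str, find_links: str):
    # Uninstall first, then do a clean install: this avoids stale local
    # state and lets pip re-resolve dependencies (no --no-deps, since
    # requirements can change between versions).
    uninstall = [sys.executable, "-m", "pip", "uninstall", "-y", package]
    install = [
        sys.executable, "-m", "pip", "install",
        "--find-links", find_links, package,
    ]
    return uninstall, install
```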

2 participants