Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

nvidia-container-cli: mount error #2

Open
wangzhongliang opened this issue Jan 5, 2024 · 9 comments
Open

nvidia-container-cli: mount error #2

wangzhongliang opened this issue Jan 5, 2024 · 9 comments

Comments

@wangzhongliang
Copy link

after I run the commnad docker-compose up -d
the following error came out
[error ](nvidia-container-cli: mount error: file creation failed: /var/lib/docker/overlay2/72dc9ee5772968b2295c4827d10cc1ad8f74b2f241cae14396bf28f457537848/merged/usr/lib/x86_64-linux-gnu/libnvidia-ml.so.1: file exists: unknown)

image

@wangzhongliang wangzhongliang changed the title nvidia-container-cli: mount error: file creation failed: /var/lib/docker/overlay2/72dc9ee5772968b2295c4827d10cc1ad8f74b2f241cae14396bf28f457537848/merged/usr/lib/x86_64-linux-gnu/libnvidia-ml.so.1: file exists: unknown nvidia-container-cli: mount error Jan 5, 2024
@successren
Copy link
Collaborator

Are you using Windows Docker? Our image is packaged on Linux, and when loaded on Windows, it encounters Nvidia-related errors for which there is currently no solution. We plan to package a Windows version of the Docker image.

@successren
Copy link
Collaborator

Are you using Windows Docker? Our image is packaged on Linux, and when loaded on Windows, it encounters Nvidia-related errors for which there is currently no solution. We plan to package a Windows version of the Docker image.

Or you can refer to NVIDIA/nvidia-container-toolkit#289. You need to delete some Nvidia-related .so files in the container and then repackage the image.

@wangzhongliang
Copy link
Author

Are you using Windows Docker? Our image is packaged on Linux, and when loaded on Windows, it encounters Nvidia-related errors for which there is currently no solution. We plan to package a Windows version of the Docker image.

I use WSL2 on Windows, any solution for WSL2?

@wangzhongliang
Copy link
Author

Are you using Windows Docker? Our image is packaged on Linux, and when loaded on Windows, it encounters Nvidia-related errors for which there is currently no solution. We plan to package a Windows version of the Docker image.

Or you can refer to NVIDIA/nvidia-container-toolkit#289 (comment). You need to delete some Nvidia-related .so files in the container and then repackage the image.

thx, I will try

@wangzhongliang
Copy link
Author

Are you using Windows Docker? Our image is packaged on Linux, and when loaded on Windows, it encounters Nvidia-related errors for which there is currently no solution. We plan to package a Windows version of the Docker image.

Or you can refer to NVIDIA/nvidia-container-toolkit#289 (comment). You need to delete some Nvidia-related .so files in the container and then repackage the image.

I try the solution, and successfully enter the front end page
here is my modification:

  1. rebuild a new image based on freeren/qanything:v1.0.7, rename it as freeren/qanything:v1.0.7-fix
  2. modify the docker-compose.yaml as below (change the image name in qanything_local, and remove the runtime: nvidia)
    image

But new problem came out, all the apis (eg. /api/local_doc_qa/list_knowledge_base) are not working and return 502 error
image

@xixihahaliu
Copy link
Collaborator

This situation is most likely due to the qanything service not starting correctly. You need to check the system logs to see the specific reason. You can paste the key parts from the QAnything/api.log file.
image

@wangzhongliang
Copy link
Author

This situation is most likely due to the qanything service not starting correctly. You need to check the system logs to see the specific reason. You can paste the key parts from the QAnything/api.log file. image

i use docker-compose logs -f qanything_local to get the logs, but there isn't any log
But I can see logs of other services, such as minio, as below
image

maybe the modification I made can't work?

@dj456119
Copy link

You can build a new Dockerfile and remove that files:

# 基于现有镜像
FROM freeren/qanything:v1.0.7

# 执行删除操作
RUN rm -rf /usr/lib/x86_64-linux-gnu/libnvidia-ml.so.1 /usr/lib/x86_64-linux-gnu/libcuda.so.1 /usr/lib/x86_64-linux-gnu/libcudadebugger.so.1 /usr/lib/x86_64-linux-gnu/libnvidia-encode.so.1 /usr/lib/x86_64-linux-gnu/libnvidia-opticalflow.so.1 /usr/lib/x86_64-linux-gnu/libnvcuvid.so.1

@xixihahaliu
Copy link
Collaborator

这种情况很可能是由于 qanything 服务未正确启动造成的。具体原因需要查看系统日志。您可以粘贴 QAnything/api.log 文件中的关键部分。图像

我用来docker-compose logs -f qanything_local获取日志,但没有任何日志 但是我可以看到其他服务的日志,例如minio,如下 图像

也许我所做的修改不起作用?

The installation method for Windows has been updated. You can try again. For details, please refer to the readme file.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

4 participants