docker error #47

Closed
mathpopo opened this issue Oct 20, 2022 · 7 comments

Comments

@mathpopo

(base) chenxin@chenxin-Nitro-AN515-52:~/disk1/github/AITemplate/docker$ ./build.sh cuda
Building CUDA Docker Image with tag ait:latest
unable to prepare context: unable to evaluate symlinks in Dockerfile path: lstat /home/chenxin/disk1/github/AITemplate/docker/docker: no such file or directory

@mathpopo
Author

(base) chenxin@chenxin-Nitro-AN515-52:~/disk1/github/AITemplate/docker$ docker

Usage: docker [OPTIONS] COMMAND

A self-sufficient runtime for containers

Options:
--config string Location of client config files (default
"/home/chenxin/.docker")
-c, --context string Name of the context to use to connect to the
daemon (overrides DOCKER_HOST env var and
default context set with "docker context use")
-D, --debug Enable debug mode
-H, --host list Daemon socket(s) to connect to
-l, --log-level string Set the logging level
("debug"|"info"|"warn"|"error"|"fatal")
(default "info")
--tls Use TLS; implied by --tlsverify
--tlscacert string Trust certs signed only by this CA (default
"/home/chenxin/.docker/ca.pem")
--tlscert string Path to TLS certificate file (default
"/home/chenxin/.docker/cert.pem")
--tlskey string Path to TLS key file (default
"/home/chenxin/.docker/key.pem")
--tlsverify Use TLS and verify the remote
-v, --version Print version information and quit

Management Commands:
builder Manage builds
config Manage Docker configs
container Manage containers
context Manage contexts
image Manage images
manifest Manage Docker image manifests and manifest lists
network Manage networks
node Manage Swarm nodes
plugin Manage plugins
secret Manage Docker secrets
service Manage services
stack Manage Docker stacks
swarm Manage Swarm
system Manage Docker
trust Manage trust on Docker images
volume Manage volumes

Commands:
attach Attach local standard input, output, and error streams to a running container
build Build an image from a Dockerfile
commit Create a new image from a container's changes
cp Copy files/folders between a container and the local filesystem
create Create a new container
diff Inspect changes to files or directories on a container's filesystem
events Get real time events from the server
exec Run a command in a running container
export Export a container's filesystem as a tar archive
history Show the history of an image
images List images
import Import the contents from a tarball to create a filesystem image
info Display system-wide information
inspect Return low-level information on Docker objects
kill Kill one or more running containers
load Load an image from a tar archive or STDIN
login Log in to a Docker registry
logout Log out from a Docker registry
logs Fetch the logs of a container
pause Pause all processes within one or more containers
port List port mappings or a specific mapping for the container
ps List containers
pull Pull an image or a repository from a registry
push Push an image or a repository to a registry
rename Rename a container
restart Restart one or more containers
rm Remove one or more containers
rmi Remove one or more images
run Run a command in a new container
save Save one or more images to a tar archive (streamed to STDOUT by default)
search Search the Docker Hub for images
start Start one or more stopped containers
stats Display a live stream of container(s) resource usage statistics
stop Stop one or more running containers
tag Create a tag TARGET_IMAGE that refers to SOURCE_IMAGE
top Display the running processes of a container
unpause Unpause all processes within one or more containers
update Update configuration of one or more containers
version Show the Docker version information
wait Block until one or more containers stop, then print their exit codes

Run 'docker COMMAND --help' for more information on a command.

To get more help with docker, check out our guides at https://docs.docker.com/go/guides/

@antinucleon
Contributor

You are currently in ~/disk1/github/AITemplate/docker.
You need to run ./docker/build.sh cuda from the repository root, ~/disk1/github/AITemplate.
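
The doubled `docker/docker` in the error message is consistent with a relative `docker` path being resolved against the current working directory. Here is a minimal Python sketch of that path arithmetic. It assumes the build script refers to a relative `docker` path, which this thread does not confirm, and `resolve_from` is a hypothetical helper used only for illustration:

```python
from pathlib import PurePosixPath

# Repository root as shown in the shell prompt of the original report.
REPO_ROOT = PurePosixPath("/home/chenxin/disk1/github/AITemplate")


def resolve_from(cwd: PurePosixPath, relative: str = "docker") -> PurePosixPath:
    """Return where a relative path points when resolved from cwd."""
    return cwd / relative


# Run from inside docker/: the relative path gains a second "docker" component.
print(resolve_from(REPO_ROOT / "docker"))

# Run from the repository root: the path points at the real docker/ directory.
print(resolve_from(REPO_ROOT))
```

The first result matches the path in the `lstat` error, which is why running the script from the repository root avoids it.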

@mathpopo
Author

@antinucleon Thank you for your help.
I have created the Docker image, but when I run "sudo docker run -it -v /usr/bin/:/apps ait:latest", I get:
root@2ebbd1eec037:/apps# ./nvidia-smi
NVIDIA-SMI couldn't find libnvidia-ml.so library in your system. Please make sure that the NVIDIA Display Driver is properly installed and present in your system.
Please also try adding directory that contains libnvidia-ml.so to your system PATH.
root@2ebbd1eec037:/apps# ldd ./nvidia-smi
linux-vdso.so.1 (0x00007ffefe5d1000)
libpthread.so.0 => /lib/x86_64-linux-gnu/libpthread.so.0 (0x00007f5c6e5ea000)
libdl.so.2 => /lib/x86_64-linux-gnu/libdl.so.2 (0x00007f5c6e5e4000)
libc.so.6 => /lib/x86_64-linux-gnu/libc.so.6 (0x00007f5c6e3f2000)
librt.so.1 => /lib/x86_64-linux-gnu/librt.so.1 (0x00007f5c6e3e8000)
/lib64/ld-linux-x86-64.so.2 (0x00007f5c6e618000)

@mathpopo
Author

root@01246f0349b6:/# nvcc -V
nvcc: NVIDIA (R) Cuda compiler driver
Copyright (c) 2005-2022 NVIDIA Corporation
Built on Tue_Mar__8_18:18:20_PST_2022
Cuda compilation tools, release 11.6, V11.6.124
Build cuda_11.6.r11.6/compiler.31057947_0

@mathpopo
Author

(base) chenxin@chenxin-Nitro-AN515-52:~/disk1/github/AITemplate$ nvidia-smi
Thu Oct 20 21:12:12 2022
+-----------------------------------------------------------------------------+
| NVIDIA-SMI 470.141.03 Driver Version: 470.141.03 CUDA Version: 11.4 |
|-------------------------------+----------------------+----------------------+
| GPU Name Persistence-M| Bus-Id Disp.A | Volatile Uncorr. ECC |
| Fan Temp Perf Pwr:Usage/Cap| Memory-Usage | GPU-Util Compute M. |
| | | MIG M. |
|===============================+======================+======================|
| 0 NVIDIA GeForce ... Off | 00000000:01:00.0 Off | N/A |
| N/A 47C P8 3W / N/A | 11MiB / 6078MiB | 0% Default |
| | | N/A |
+-------------------------------+----------------------+----------------------+

+-----------------------------------------------------------------------------+
| Processes: |
| GPU GI CI PID Type Process name GPU Memory |
| ID ID Usage |
|=============================================================================|
| 0 N/A N/A 1145 G /usr/lib/xorg/Xorg 4MiB |
| 0 N/A N/A 2085 G /usr/lib/xorg/Xorg 4MiB |
+-----------------------------------------------------------------------------+

@mathpopo
Author

1. How can I use "nvidia-smi"?
2. Do I need to update the driver version to match CUDA 11.6?

@antinucleon
Contributor

I don't know the details of the NVIDIA Docker driver requirements; please check the NVIDIA documentation for reference.
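
On the first question, bind-mounting `/usr/bin` brings only the `nvidia-smi` binary into the container, not `libnvidia-ml.so`, which matches the error reported earlier. One commonly used route is Docker's `--gpus` flag, which depends on the NVIDIA Container Toolkit being installed on the host. The sketch below assumes that toolkit is set up, which this thread does not confirm. `gpu_run_command` is a hypothetical helper that only assembles the argument list and does not start a container:

```python
import shlex
from typing import List


def gpu_run_command(image: str = "ait:latest", gpus: str = "all") -> List[str]:
    """Build a `docker run` argv that requests GPU access via --gpus.

    Assumption: the host has the NVIDIA Container Toolkit installed, so the
    driver libraries are injected into the container at start-up.
    """
    return ["docker", "run", "--rm", "-it", "--gpus", gpus, image, "nvidia-smi"]


cmd = gpu_run_command()
# Print the command as a single shell-safe line for copy and paste.
print(shlex.join(cmd))
```

If the host setup differs, for example when Docker needs `sudo` as in the command above, the printed line would need to be adjusted accordingly.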

tissue3 pushed a commit to tissue3/AITemplate-1 that referenced this issue Feb 7, 2023
* benchmark + fix

* Add benchmark for fp16 accumulation.

* Make the profiler be aware of fp16.

2 participants