Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Unable to setup CLP in Debian OS #135

Closed
mantrimahitha opened this issue Jun 19, 2023 · 5 comments
Closed

Unable to setup CLP in Debian OS #135

mantrimahitha opened this issue Jun 19, 2023 · 5 comments
Assignees

Comments

@mantrimahitha
Copy link

Hi Team,
I am trying to follow the steps mentioned in package-template, when I am trying to start clp, it is failing with this error. The system has enough memory. Can you please edit the readme.md and post all the required softwares.

Traceback (most recent call last):
  File "/root/clp/tools/packager/out/clp-package-ubuntu-focal-x86_64-v0.0.1/etc/../sbin/start-clp", line 462, in main
    start_queue(instance_id, clp_config)
  File "/root/clp/tools/packager/out/clp-package-ubuntu-focal-x86_64-v0.0.1/etc/../sbin/start-clp", line 271, in start_queue
    subprocess.run(cmd, stdout=subprocess.DEVNULL, check=True)
  File "/opt/conda/default/lib/python3.10/subprocess.py", line 526, in run
    raise CalledProcessError(retcode, process.args,
subprocess.CalledProcessError: Command '['docker', 'exec', '-it', 'clp-queue-209e', 'rabbitmqctl', 'wait', '/tmp/rabbitmq.pid']' returned non-zero exit status 137.
@kirkrodrigues
Copy link
Member

Hi mantrimahitha,

Sorry, I haven't seen that issue before. From my own tests, the container in your error message uses a maximum of about 200MB during startup.

Could you give us the following details about your system to help us debug?

  • Amount of RAM
  • Docker version
  • OS version
  • clp commit hash that you're building from

Thanks!

@kirkrodrigues kirkrodrigues added bug Something isn't working and removed bug Something isn't working labels Jun 19, 2023
@bossjie
Copy link

bossjie commented Nov 15, 2023

@kirkrodrigues I met similar issue when running cmd "python3 start-clp"
here are my server info.

RAM: 64GB
Docker version 24.0.7
Operating System: CentOS Linux 7 (Core)
CPE OS Name: cpe:/o:centos:centos:7
Kernel: Linux 3.10.0-1160.49.1.el7.x86_64
Architecture: x86-64
clp last commit 2b884bd ( I believe so as the package tool pulls newest code from github)

error log:
Traceback (most recent call last):
File "start-clp", line 462, in main
start_queue(instance_id, clp_config)
File "start-clp", line 271, in start_queue
subprocess.run(cmd, stdout=subprocess.DEVNULL, check=True)
File "/usr/lib64/python3.6/subprocess.py", line 438, in run
output=stdout, stderr=stderr)
subprocess.CalledProcessError: Command '['docker', 'exec', '-it', 'clp-queue-938f', 'rabbitmqctl', 'wait', '/tmp/rabbitmq.pid']' returned non-zero exit status 137.

And I consider the compress is seperate with clp process(Hope I'm wrong). so I run "python3 compress xxl-service.log"
and got below error. I'm heading to install using single node. the experience is like a blackbox touching by me. If we have better doc to explain the components( I see it trys to run docker images to support it) and install flow, it would be great. As the log info is limited.

docker: Error response from daemon: invalid mount config for type "bind": bind source path does not exist: /vdb/clp-package-ubuntu-focal-x86_64-v0.0.2/var/data/archives.
See 'docker run --help'.
Traceback (most recent call last):
File "compress", line 168, in
sys.exit(main(sys.argv))
File "compress", line 159, in main
subprocess.run(cmd, check=True)
File "/usr/lib64/python3.6/subprocess.py", line 438, in run
output=stdout, stderr=stderr)
subprocess.CalledProcessError: Command '['docker', 'run', '-i', '--rm', '--network', 'host', '-w', '/opt/clp', '-u', '0:0', '--name', 'clp-compressor-502b', '--mount', 'type=bind,src=/vdb/clp-package-ubuntu-focal-x86_64-v0.0.2,dst=/opt/clp', '--mount', 'type=bind,src=/,dst=/mnt/logs,readonly', '--mount', 'type=bind,src=/vdb/clp-package-ubuntu-focal-x86_64-v0.0.2/var/data/somehostname.company.com,dst=/opt/clp/var/data', '--mount', 'type=bind,src=/vdb/clp-package-ubuntu-focal-x86_64-v0.0.2/var/log/somehostname.company.com,dst=/opt/clp/var/log', '--mount', 'type=bind,src=/vdb/clp-package-ubuntu-focal-x86_64-v0.0.2/var/data/archives,dst=/mnt/archive-output', 'ghcr.io/y-scope/clp/clp-execution-x86-ubuntu-focal:main', '/opt/clp/sbin/native/compress', '--config', '/opt/clp/var/log/.clp-compressor-502b-config.yml', '--remove-path-prefix', '/mnt/logs', '/mnt/logs/vdb/clp-package-ubuntu-focal-x86_64-v0.0.2/sbin/xxl-service.log']' returned non-zero exit status 125.

@bossjie
Copy link

bossjie commented Nov 16, 2023

by downloading from https://github.com/y-scope/clp/releases/download/v0.0.2/clp-package-ubuntu-focal-x86_64-v0.0.2.tar.gz and install in same type of centos server, run pip3 install -r requirements-pre-3.7.txt successfully but
it's not working

[root@hostname clp-package-ubuntu-focal-x86_64-v0.0.2]# sbin/start-clp

Status: Downloaded newer image for rabbitmq:3.9.8
2023-11-16 09:17:04,807 [INFO] [clp] Stopping clp-db-bffd...
2023-11-16 09:17:05,254 [INFO] [clp] Stopped clp-db-bffd.
2023-11-16 09:17:05,283 [ERROR] [clp] Failed to start CLP.
Traceback (most recent call last):
File "sbin/start-clp", line 462, in main
start_queue(instance_id, clp_config)
File "sbin/start-clp", line 271, in start_queue
subprocess.run(cmd, stdout=subprocess.DEVNULL, check=True)
File "/usr/lib64/python3.6/subprocess.py", line 438, in run
output=stdout, stderr=stderr)
subprocess.CalledProcessError: Command '['docker', 'exec', '-it', 'clp-queue-bffd', 'rabbitmqctl', 'wait', '/tmp/rabbitmq.pid']'

@bossjie
Copy link

bossjie commented Nov 16, 2023

I even upgrade python3 to 3.7 and retry, it fails also.
2023-11-16 09:44:10,263 [INFO] [clp] Starting database...
2023-11-16 09:44:11,904 [INFO] [clp] Started database.
2023-11-16 09:44:11,904 [INFO] [clp] Creating database tables...
2023-11-16 09:44:12,617 [INFO] [clp] Created database tables.
2023-11-16 09:44:12,617 [INFO] [clp] Starting queue...
2023-11-16 09:44:16,752 [INFO] [clp] Stopping clp-db-0862...
2023-11-16 09:44:17,159 [INFO] [clp] Stopped clp-db-0862.
2023-11-16 09:44:17,179 [ERROR] [clp] Failed to start CLP.
Traceback (most recent call last):
File "sbin/start-clp", line 462, in main
start_queue(instance_id, clp_config)
File "sbin/start-clp", line 271, in start_queue
subprocess.run(cmd, stdout=subprocess.DEVNULL, check=True)
File "/usr/local/lib/python3.7/subprocess.py", line 468, in run
output=stdout, stderr=stderr)
subprocess.CalledProcessError: Command '['docker', 'exec', '-it', 'clp-queue-0862', 'rabbitmqctl', 'wait', '/tmp/rabbitmq.pid']' returned non-zero exit status 137.

@jackluo923 jackluo923 self-assigned this Nov 17, 2023
@jackluo923
Copy link
Member

jackluo923 commented Nov 17, 2023

After chatting offline with @bossjie, several issues were identified which prevented the successful startup of CLP package.

  1. Docker bug in CentOS 7 which prevents the MariaDB SQL database and rabbitmq container image from being pulled and started from docker-hub
  • solution: upgrade to a recent docker release such as docker-ce-24.0.7-1
  1. Permission issues with rabbitmq container's log file when started with the root user
  • temporary solution: use another user to start CLP package
  • permanent solution: a PR will be pushed out to resolve the underlying permission issue

After the two issues above are addressed, we were able to successfully start CLP package on CentOS 7.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

4 participants