Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

AI post-attachment operations failure #376

Closed
Godbefore opened this issue Feb 24, 2020 · 3 comments
Closed

AI post-attachment operations failure #376

Godbefore opened this issue Feb 24, 2020 · 3 comments

Comments

@Godbefore
Copy link

I tried ./all_run.sh -e SPECRUNID -s kmeans_baseline to test my openstack
but failed:Error: KMeans baseline phase

WARNING measure Problem creating the application: AI object 3FB6C742-61F3-5D7F-A22C-28D3D86581E7 (named "ai_3") could not be attached to this experiment: AI post-attachment operations failure: Parallel VM configuration for ai_3 failure (1495): Failure while executing generic post_boot configuration on on all VMs beloging to ai_3 (3FB6C742-61F3-5D7F-A22C-28D3D86581E7): Parallel run os command operation failure: Giving up on executing command "~/cbtool/scripts/common/cb_post_boot.sh" on hostname 11.128.14.14. Too many attempts (3).

Then,I ssh ai-vm and I found that sudo is available(sudo: /usr/bin/sudo must be owned by uid 0 and have the setuid bit set)
because the owner of /usr has changed from root to cbuser.
I wonder why the owner of /usr will be changed to cbuser after the kmeans_baseline start.
This could be why generic script failed.

cbuser@cb-cbuser-myopenstack-vm15-hadoopslave-ai-3:/$ ll
total 100
drwxr-xr-x 24 root root 4096 Feb 23 22:20 ./
drwxr-xr-x 24 root root 4096 Feb 23 22:20 ../
drwxr-xr-x 3 root root 4096 Feb 22 23:06 app/
drwxr-xr-x 2 root root 4096 Aug 8 2017 bin/
drwxr-xr-x 3 root root 4096 Aug 8 2017 boot/
drwxr-xr-x 18 root root 3700 Feb 23 22:20 dev/
drwxr-xr-x 106 root root 4096 Feb 23 22:21 etc/
drwxr-xr-x 4 root root 4096 Feb 19 15:25 home/
lrwxrwxrwx 1 root root 32 Aug 8 2017 initrd.img -> boot/initrd.img-4.4.0-87-generic
drwxr-xr-x 22 root root 4096 Feb 19 15:32 lib/
drwxr-xr-x 2 root root 4096 Feb 19 15:31 lib64/
drwx------ 2 root root 16384 Aug 8 2017 lost+found/
drwxr-xr-x 3 root root 4096 Aug 8 2017 media/
drwxr-xr-x 2 root root 4096 Aug 1 2017 mnt/
drwxr-xr-x 2 root root 4096 Aug 1 2017 opt/
dr-xr-xr-x 184 root root 0 Feb 23 22:20 proc/
drwx------ 5 root root 4096 Feb 19 15:42 root/
drwxr-xr-x 27 root root 980 Feb 23 22:23 run/
drwxr-xr-x 2 root root 12288 Aug 8 2017 sbin/
drwxr-xr-x 2 root root 4096 Apr 29 2017 snap/
drwxr-xr-x 2 root root 4096 Aug 1 2017 srv/
dr-xr-xr-x 13 root root 0 Feb 23 22:20 sys/
drwxrwxrwt 8 root root 4096 Feb 23 22:21 tmp/
drwxr-xr-x 10 cbuser cbuser 4096 Aug 8 2017 usr/
drwxr-xr-x 14 root root 4096 Feb 19 16:24 var/
lrwxrwxrwx 1 root root 29 Aug 8 2017 vmlinuz -> boot/vmlinuz-4.4.0-87-generic

@Godbefore
Copy link
Author

/results/perf

cb_common.sh (2096): Changing ownership of /usr/local/hadoop_store/hdfs/namenode with command "sudo chown -R cbuser:cbuser /usr"

common.sh

function change_directory_ownership {
CHOWN_USER=$1
CHOWN_GROUP=$2
CHOWN_DIR=$3

ACTUAL_CHOWN_DIR="/"$(echo "${CHOWN_DIR}" | cut -d '/' -f 2)
CMD="sudo chown -R ${CHOWN_USER}:${CHOWN_GROUP} ${ACTUAL_CHOWN_DIR}"
syslog_netcat "Changing ownership of ${CHOWN_DIR} with command "$CMD""
$CMD
}

@maugustosilva
Copy link
Collaborator

It looks like you're trying to run SPECCloud (which itself uses CB, or course). Which version of the SPECCloud kit are you using. This particular issue that you're pointing is fixed on the latest SPECCloud version (spec_cloud_iaas_2018_2019-08-21-15-03-22)

@Godbefore
Copy link
Author

So that's it
Thanks for your comment

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants