This repository has been archived by the owner on Jul 25, 2022. It is now read-only.

mount.lustre failed: Cannot send after transport endpoint shutdown #107

Closed


tanabarr
Contributor

@tanabarr tanabarr commented Aug 22, 2017

This is being tracked in: https://jira.hpdd.intel.com/browse/LU-9838

The pertinent failure string seems to be:
mount.lustre: mount /dev/sdb at /mnt/testfs-OST0001 failed: Cannot send after transport endpoint shutdown

The failure is repeatable and can be seen on SSI runs 501, 502 and 504.

The lustre package versions are as follows:
lustre-client-2.9.59_35_gc1d70a4
lustre-2.9.59_35_gc1d70a4

http://jenkins.lotus.hpdd.lab.intel.com/job/integration-tests-shared-storage-configuration/arch=x86_64,distro=el7/501//consoleFull

ERROR
Traceback (most recent call last):
  File "/usr/share/chroma-manager/tests/integration/shared_storage_configuration/test_hsm.py", line 84, in setUp
    filesystem_id = self.create_filesystem_standard(self.TEST_SERVERS, hsm = True)
  File "/usr/share/chroma-manager/tests/integration/core/chroma_integration_testcase.py", line 368, in create_filesystem_standard
    'conf_params': {}})
  File "/usr/share/chroma-manager/tests/integration/core/chroma_integration_testcase.py", line 432, in create_filesystem
    timeout = LONG_TEST_TIMEOUT
  File "/usr/share/chroma-manager/tests/integration/core/api_testcase_with_test_reset.py", line 336, in wait_for_command
    self.assertFalse(command['errored'] or command['cancelled'], command)
AssertionError: {u'jobs': [u'/api/job/57/', u'/api/job/58/', u'/api/job/59/', u'/api/job/60/', u'/api/job/61/', u'/api/job/62/', u'/api/job/63/', u'/api/job/64/', u'/api/job/65/', u'/api/job/66/', u'/api/job/67/', u'/api/job/68/', u'/api/job/69/', u'/api/job/70/', u'/api/job/71/', u'/api/job/72/', u'/api/job/73/', u'/api/job/74/'], u'complete': True, u'created_at': u'2017-06-27T22:59:07.145542+00:00', u'message': u'Creating filesystem testfs', u'cancelled': True, u'errored': True, u'resource_uri': u'/api/command/17/', u'id': u'17', u'logs': 
step_count: 2
console: modprobe osd_ldiskfs: 0
mount -t lustre /dev/disk/by-id/scsi-0QEMU_QEMU_HARDDISK_disk14 /mnt/testfs-OST0001: 108
mount.lustre: increased /sys/block/sdb/queue/max_sectors_kb from 512 to 16384
mount.lustre: mount /dev/sdb at /mnt/testfs-OST0001 failed: Cannot send after transport endpoint shutdown
description: RegisterTargetStep: {'device_path': u'/dev/disk/by-id/scsi-0QEMU_QEMU_HARDDISK_disk14', 'primary_host': <ManagedHost: lotus-55vm18.lotus.hpdd.lab.intel.com>, 'mount_point': u'/mnt/testfs-OST0001', 'backfstype': 'ldiskfs', 'target': <ManagedTarget: testfs-OST0001>}
class_name: RegisterTargetStep
backtrace: Traceback (most recent call last):
  File "/usr/lib/python2.7/site-packages/chroma_agent/device_plugins/action_runner.py", line 164, in run
  File "/usr/lib/python2.7/site-packages/chroma_agent/plugin_manager.py", line 305, in run
  File "/usr/lib/python2.7/site-packages/chroma_agent/action_plugins/manage_targets.py", line 282, in register_target
  File "/usr/lib/python2.7/site-packages/chroma_agent/chroma_common/filesystems/filesystem_ldiskfs.py", line 61, in mount
RuntimeError: Error (108) mounting '/mnt/testfs-OST0001': '' 'mount.lustre: increased /sys/block/sdb/queue/max_sectors_kb from 512 to 16384
mount.lustre: mount /dev/sdb at /mnt/testfs-OST0001 failed: Cannot send after transport endpoint shutdown
created_at: 2017-06-27T22:59:52.549857+00:00
args: {u'device_path': u'/dev/disk/by-id/scsi-0QEMU_QEMU_HARDDISK_disk14', u'primary_host': u'lotus-55vm18.lotus.hpdd.lab.intel.com', u'mount_point': u'/mnt/testfs-OST0001', u'backfstype': u'ldiskfs', u'target': u'testfs-OST0001'}
modified_at: 2017-06-27T22:59:54.742321+00:00
step_index: 0
state: failed
result: 
resource_uri: /api/step/118/
id: 118
log: 

@tanabarr
Contributor Author

Strange that the version of Lustre is updated during the test run:

Jun 27 09:36:26 Installed: kernel-devel-3.10.0-514.21.1.el7_lustre.x86_64
Jun 27 09:36:26 Updated: libcom_err-1.42.13.wc6-7.el7.x86_64
Jun 27 09:36:26 Installed: libuutil1-0.6.5.9-1.el7.x86_64
Jun 27 09:36:26 Installed: libnvpair1-0.6.5.9-1.el7.x86_64
Jun 27 09:36:26 Installed: dkms-2.3-5.20170523git8c3065c.el7.noarch
Jun 27 09:37:37 Installed: spl-dkms-0.6.5.9-1.el7.noarch
Jun 27 09:40:25 Installed: zfs-dkms-0.6.5.9-1.el7.noarch
Jun 27 09:40:25 Installed: libzpool2-0.6.5.9-1.el7.x86_64
Jun 27 09:40:25 Installed: libzfs2-0.6.5.9-1.el7.x86_64
Jun 27 09:40:25 Updated: e2fsprogs-libs-1.42.13.wc6-7.el7.x86_64
Jun 27 09:40:25 Installed: lm_sensors-libs-3.4.0-4.20160601gitf9185e5.el7.x86_64
Jun 27 09:40:25 Installed: lustre-osd-ldiskfs-mount-2.9.59_35_gc1d70a4-1.el7.x86_64
Jun 27 09:40:32 Installed: kernel-3.10.0-514.21.1.el7_lustre.x86_64
Jun 27 09:40:32 Installed: 1:net-snmp-agent-libs-5.7.2-24.el7_3.2.x86_64
Jun 27 09:40:32 Installed: lustre-osd-zfs-mount-2.9.59_35_gc1d70a4-1.el7.x86_64
Jun 27 09:40:32 Installed: spl-0.6.5.9-1.el7.x86_64
Jun 27 09:40:32 Updated: libss-1.42.13.wc6-7.el7.x86_64
Jun 27 09:40:32 Updated: e2fsprogs-1.42.13.wc6-7.el7.x86_64
Jun 27 09:40:32 Installed: expect-5.45-14.el7_1.x86_64
Jun 27 09:46:29 Installed: lustre-dkms-2.9.59_35_gc1d70a4-1.el7.noarch
Jun 27 09:46:29 Installed: lustre-2.9.59_35_gc1d70a4-1.el7.x86_64
Jun 27 09:46:31 Installed: kmod-lustre-osd-ldiskfs-2.9.59_35_gc1d70a4-1.el7.x86_64
Jun 27 09:46:31 Installed: zfs-0.6.5.9-1.el7.x86_64
Jun 27 09:46:34 Installed: kmod-lustre-2.9.59_35_gc1d70a4-1.el7.x86_64
Jun 27 13:50:00 Updated: lustre-osd-zfs-mount-2.10.50-1.el7.x86_64
Jun 27 13:56:02 Updated: lustre-dkms-2.10.50-1.el7.noarch
Jun 27 13:56:02 Updated: lustre-osd-ldiskfs-mount-2.10.50-1.el7.x86_64
Jun 27 13:56:35 Updated: kmod-lustre-osd-ldiskfs-2.10.50-1.el7.x86_64
Jun 27 13:56:35 Updated: lustre-2.10.50-1.el7.x86_64
Jun 27 13:57:10 Updated: kmod-lustre-2.10.50-1.el7.x86_64

Could this be because the version on the public Lustre repo changed mid-test?

@brianjmurrell
Contributor

> could this be because the version on the public lustre repo changed mid test?

Yes, that is the case: https://build.hpdd.intel.com/job/lustre-master/3608/changes.

We also now have: https://git.hpdd.intel.com/gitweb?p=fs/lustre-release.git;a=commit;h=a9d45ef39a471595709a31b3d60b0c67b7af0c91 which is on a new branch.

However! Lustre is supposed to be backward compatible through a range/selection of previous releases, so the kind of "upgrade" that this test run experienced should be supported.

So looking at the test failure...

The first "Cannot send after transport endpoint shutdown" happened during the "test_copytool_remove" test, which according to the messages log on lotus-55vm18 ran during the window 15:45:17 to 16:00:18, but the Lustre "upgrade" happened at ~13:56:35, almost 2 hours earlier. I think the Lustre upgrade is a red herring.
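
To make the timing argument concrete, here is a small sketch that computes the gap between the package update and the test window from the syslog-style timestamps quoted in this thread. The helper name `gap_seconds` is made up for illustration, and the timestamps are assumed to fall on the same day in the same year.

```python
from datetime import datetime

SYSLOG_FORMAT = "%b %d %H:%M:%S"  # syslog timestamps carry no year


def gap_seconds(earlier, later):
    """Seconds between two syslog timestamps (assumed to be in the same year)."""
    start = datetime.strptime(earlier, SYSLOG_FORMAT)
    end = datetime.strptime(later, SYSLOG_FORMAT)
    return int((end - start).total_seconds())


upgrade = "Jun 27 13:56:35"        # kmod-lustre-osd-ldiskfs and lustre updated
window_start = "Jun 27 15:45:17"   # start of the test window on lotus-55vm18
mount_failure = "Jun 27 15:59:53"  # rc = -108 logged on lotus-55vm18

print(gap_seconds(upgrade, window_start))   # 6522 s, about 1 h 49 min
print(gap_seconds(upgrade, mount_failure))  # 7398 s, about 2 h 3 min
```

Either way the update finished well over an hour and a half before the failing mount.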

So looking at the failure more closely...

Jun 27 15:59:11 lotus-55vm18.lotus.hpdd.lab.intel.com kernel: LDISKFS-fs (sdb): file extents enabled, maximum tree depth=5
Jun 27 15:59:11 lotus-55vm18.lotus.hpdd.lab.intel.com kernel: LDISKFS-fs (sdb): mounted filesystem with ordered data mode. Opts: errors=remount-ro
Jun 27 15:59:53 lotus-55vm18.lotus.hpdd.lab.intel.com kernel: LDISKFS-fs (sdb): file extents enabled, maximum tree depth=5
Jun 27 15:59:53 lotus-55vm18.lotus.hpdd.lab.intel.com kernel: LDISKFS-fs (sdb): mounted filesystem with ordered data mode. Opts: errors=remount-ro
Jun 27 15:59:53 lotus-55vm18.lotus.hpdd.lab.intel.com kernel: LDISKFS-fs (sdb): file extents enabled, maximum tree depth=5
Jun 27 15:59:53 lotus-55vm18.lotus.hpdd.lab.intel.com kernel: LDISKFS-fs (sdb): mounted filesystem with ordered data mode. Opts: ,errors=remount-ro,no_mbcache,nodelalloc
Jun 27 15:59:53 lotus-55vm18.lotus.hpdd.lab.intel.com kernel: LustreError: 15f-b: testfs-OST0001: cannot register this server with the MGS: rc = -108. Is the MGS running?
Jun 27 15:59:53 lotus-55vm18.lotus.hpdd.lab.intel.com kernel: LustreError: 12529:0:(obd_mount_server.c:1846:server_fill_super()) Unable to start targets: -108
Jun 27 15:59:53 lotus-55vm18.lotus.hpdd.lab.intel.com kernel: LustreError: 12529:0:(obd_mount_server.c:1560:server_put_super()) no obd testfs-OST0001
Jun 27 15:59:53 lotus-55vm18.lotus.hpdd.lab.intel.com kernel: LustreError: 12529:0:(obd_mount_server.c:135:server_deregister_mount()) testfs-OST0001 not registered
Jun 27 15:59:53 lotus-55vm18.lotus.hpdd.lab.intel.com kernel: Lustre: server umount testfs-OST0001 complete
Jun 27 15:59:53 lotus-55vm18.lotus.hpdd.lab.intel.com kernel: LustreError: 12529:0:(obd_mount.c:1505:lustre_fill_super()) Unable to mount  (-108)
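
As a side note on the `rc = -108` in the messages above: kernel code conventionally returns negated errno values, and 108 is the Linux value of ESHUTDOWN. A minimal sketch for decoding such a code with the Python standard library (the helper name `decode_rc` is made up here, and the numeric value is Linux-specific):

```python
import errno
import os


def decode_rc(rc):
    """Return (symbolic name, message) for a kernel-style negative return code."""
    code = abs(rc)
    # errno.errorcode maps the numeric value to its symbolic name on this platform.
    return errno.errorcode.get(code, "UNKNOWN"), os.strerror(code)


name, message = decode_rc(-108)  # 108 is ESHUTDOWN on Linux; other platforms differ
print(name, "-", message)
```

On a Linux host this should report ESHUTDOWN with the same "Cannot send after transport endpoint shutdown" text that mount.lustre printed.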

Error is suggesting the MGS is not running, which it needs to be to complete the registration. Looking at lotus-55vm15's messages file during the same time window:

Jun 27 15:59:10 lotus-55vm15.lotus.hpdd.lab.intel.com kernel: LDISKFS-fs (sdc): mounted filesystem with ordered data mode. Opts: errors=remount-ro
Jun 27 15:59:16 lotus-55vm15.lotus.hpdd.lab.intel.com firewalld: ERROR: ALREADY_ENABLED: '988:tcp' already in 'public'
Jun 27 15:59:17 lotus-55vm15.lotus.hpdd.lab.intel.com firewalld: ERROR: ALREADY_ENABLED: 988:tcp
Jun 27 15:59:21 lotus-55vm15.lotus.hpdd.lab.intel.com crmd[26675]:  notice: State transition S_IDLE -> S_POLICY_ENGINE
Jun 27 15:59:21 lotus-55vm15.lotus.hpdd.lab.intel.com stonith-ng[26671]:  notice: On loss of CCM Quorum: Ignore
Jun 27 15:59:21 lotus-55vm15.lotus.hpdd.lab.intel.com pengine[26674]:  notice: On loss of CCM Quorum: Ignore
Jun 27 15:59:21 lotus-55vm15.lotus.hpdd.lab.intel.com pengine[26674]:  notice: Calculated transition 14, saving inputs in /var/lib/pacemaker/pengine/pe-input-499.bz2
Jun 27 15:59:21 lotus-55vm15.lotus.hpdd.lab.intel.com crmd[26675]:  notice: Initiating monitor operation MGS_2c732b_monitor_0 on lotus-55vm16.lotus.hpdd.lab.intel.com
Jun 27 15:59:21 lotus-55vm15.lotus.hpdd.lab.intel.com crmd[26675]:  notice: Initiating monitor operation MGS_2c732b_monitor_0 locally on lotus-55vm15.lotus.hpdd.lab.intel.com
Jun 27 15:59:22 lotus-55vm15.lotus.hpdd.lab.intel.com crmd[26675]:  notice: Result of probe operation for MGS_2c732b on lotus-55vm15.lotus.hpdd.lab.intel.com: 7 (not running)
Jun 27 15:59:22 lotus-55vm15.lotus.hpdd.lab.intel.com crmd[26675]:  notice: Transition 14 (Complete=2, Pending=0, Fired=0, Skipped=0, Incomplete=0, Source=/var/lib/pacemaker/pengine/pe-input-499.bz2): Complete
Jun 27 15:59:22 lotus-55vm15.lotus.hpdd.lab.intel.com crmd[26675]:  notice: State transition S_TRANSITION_ENGINE -> S_IDLE
Jun 27 15:59:22 lotus-55vm15.lotus.hpdd.lab.intel.com crmd[26675]:  notice: State transition S_IDLE -> S_POLICY_ENGINE
Jun 27 15:59:22 lotus-55vm15.lotus.hpdd.lab.intel.com pengine[26674]:  notice: On loss of CCM Quorum: Ignore
Jun 27 15:59:22 lotus-55vm15.lotus.hpdd.lab.intel.com pengine[26674]:  notice: Calculated transition 15, saving inputs in /var/lib/pacemaker/pengine/pe-input-500.bz2
Jun 27 15:59:22 lotus-55vm15.lotus.hpdd.lab.intel.com crmd[26675]:  notice: Transition 15 (Complete=0, Pending=0, Fired=0, Skipped=0, Incomplete=0, Source=/var/lib/pacemaker/pengine/pe-input-500.bz2): Complete
Jun 27 15:59:22 lotus-55vm15.lotus.hpdd.lab.intel.com crmd[26675]:  notice: State transition S_TRANSITION_ENGINE -> S_IDLE
Jun 27 15:59:24 lotus-55vm15.lotus.hpdd.lab.intel.com crmd[26675]:  notice: State transition S_IDLE -> S_POLICY_ENGINE
Jun 27 15:59:24 lotus-55vm15.lotus.hpdd.lab.intel.com pengine[26674]:  notice: On loss of CCM Quorum: Ignore
Jun 27 15:59:24 lotus-55vm15.lotus.hpdd.lab.intel.com pengine[26674]:  notice: Calculated transition 16, saving inputs in /var/lib/pacemaker/pengine/pe-input-501.bz2
Jun 27 15:59:24 lotus-55vm15.lotus.hpdd.lab.intel.com crmd[26675]:  notice: Transition 16 (Complete=0, Pending=0, Fired=0, Skipped=0, Incomplete=0, Source=/var/lib/pacemaker/pengine/pe-input-501.bz2): Complete
Jun 27 15:59:24 lotus-55vm15.lotus.hpdd.lab.intel.com crmd[26675]:  notice: State transition S_TRANSITION_ENGINE -> S_IDLE
Jun 27 15:59:25 lotus-55vm15.lotus.hpdd.lab.intel.com crmd[26675]:  notice: State transition S_IDLE -> S_POLICY_ENGINE
Jun 27 15:59:25 lotus-55vm15.lotus.hpdd.lab.intel.com stonith-ng[26671]:  notice: On loss of CCM Quorum: Ignore
Jun 27 15:59:25 lotus-55vm15.lotus.hpdd.lab.intel.com pengine[26674]:  notice: On loss of CCM Quorum: Ignore
Jun 27 15:59:25 lotus-55vm15.lotus.hpdd.lab.intel.com pengine[26674]:  notice: Start   MGS_2c732b#011(lotus-55vm15.lotus.hpdd.lab.intel.com)
Jun 27 15:59:25 lotus-55vm15.lotus.hpdd.lab.intel.com pengine[26674]:  notice: Calculated transition 17, saving inputs in /var/lib/pacemaker/pengine/pe-input-502.bz2
Jun 27 15:59:25 lotus-55vm15.lotus.hpdd.lab.intel.com crmd[26675]:  notice: Initiating start operation MGS_2c732b_start_0 locally on lotus-55vm15.lotus.hpdd.lab.intel.com
Jun 27 15:59:27 lotus-55vm15.lotus.hpdd.lab.intel.com kernel: LDISKFS-fs (sdc): mounted filesystem with ordered data mode. Opts: errors=remount-ro
Jun 27 15:59:27 lotus-55vm15.lotus.hpdd.lab.intel.com kernel: LDISKFS-fs (sdc): mounted filesystem with ordered data mode. Opts: user_xattr,errors=remount-ro,no_mbcache,nodelalloc
Jun 27 15:59:27 lotus-55vm15.lotus.hpdd.lab.intel.com kernel: Lustre: 27861:0:(osd_handler.c:7007:osd_mount()) MGS-osd: device /dev/sdc was upgraded from Lustre-1.x without enabling the dirdata feature. If you do not want to downgrade to Lustre-1.x again, you can enable it via 'tune2fs -O dirdata device'
Jun 27 15:59:27 lotus-55vm15.lotus.hpdd.lab.intel.com kernel: Lustre: MGS: Connection restored to 5c28efa9-6d05-c86e-9d96-45224052a8c2 (at 0@lo)
Jun 27 15:59:27 lotus-55vm15.lotus.hpdd.lab.intel.com kernel: Lustre: Skipped 1 previous similar message
Jun 27 15:59:28 lotus-55vm15.lotus.hpdd.lab.intel.com lrmd[26672]:  notice: MGS_2c732b_start_0:27836:stderr [ [ ]
Jun 27 15:59:28 lotus-55vm15.lotus.hpdd.lab.intel.com lrmd[26672]:  notice: MGS_2c732b_start_0:27836:stderr [   { ]
Jun 27 15:59:28 lotus-55vm15.lotus.hpdd.lab.intel.com lrmd[26672]:  notice: MGS_2c732b_start_0:27836:stderr [     "args": [ ]
Jun 27 15:59:28 lotus-55vm15.lotus.hpdd.lab.intel.com lrmd[26672]:  notice: MGS_2c732b_start_0:27836:stderr [       "modprobe",  ]
Jun 27 15:59:28 lotus-55vm15.lotus.hpdd.lab.intel.com lrmd[26672]:  notice: MGS_2c732b_start_0:27836:stderr [       "osd_ldiskfs" ]
Jun 27 15:59:28 lotus-55vm15.lotus.hpdd.lab.intel.com lrmd[26672]:  notice: MGS_2c732b_start_0:27836:stderr [     ],  ]
Jun 27 15:59:28 lotus-55vm15.lotus.hpdd.lab.intel.com lrmd[26672]:  notice: MGS_2c732b_start_0:27836:stderr [     "rc": 0,  ]
Jun 27 15:59:28 lotus-55vm15.lotus.hpdd.lab.intel.com lrmd[26672]:  notice: MGS_2c732b_start_0:27836:stderr [     "stderr": "",  ]
Jun 27 15:59:28 lotus-55vm15.lotus.hpdd.lab.intel.com lrmd[26672]:  notice: MGS_2c732b_start_0:27836:stderr [     "stdout": "" ]
Jun 27 15:59:28 lotus-55vm15.lotus.hpdd.lab.intel.com lrmd[26672]:  notice: MGS_2c732b_start_0:27836:stderr [   },  ]
Jun 27 15:59:28 lotus-55vm15.lotus.hpdd.lab.intel.com lrmd[26672]:  notice: MGS_2c732b_start_0:27836:stderr [   { ]
Jun 27 15:59:28 lotus-55vm15.lotus.hpdd.lab.intel.com lrmd[26672]:  notice: MGS_2c732b_start_0:27836:stderr [     "args": [ ]
Jun 27 15:59:28 lotus-55vm15.lotus.hpdd.lab.intel.com lrmd[26672]:  notice: MGS_2c732b_start_0:27836:stderr [       "mount",  ]
Jun 27 15:59:28 lotus-55vm15.lotus.hpdd.lab.intel.com lrmd[26672]:  notice: MGS_2c732b_start_0:27836:stderr [       "-t",  ]
Jun 27 15:59:28 lotus-55vm15.lotus.hpdd.lab.intel.com lrmd[26672]:  notice: MGS_2c732b_start_0:27836:stderr [       "lustre",  ]
Jun 27 15:59:28 lotus-55vm15.lotus.hpdd.lab.intel.com lrmd[26672]:  notice: MGS_2c732b_start_0:27836:stderr [       "/dev/disk/by-id/scsi-0QEMU_QEMU_HARDDISK_disk13",  ]
Jun 27 15:59:28 lotus-55vm15.lotus.hpdd.lab.intel.com lrmd[26672]:  notice: MGS_2c732b_start_0:27836:stderr [       "/mnt/MGS" ]
Jun 27 15:59:28 lotus-55vm15.lotus.hpdd.lab.intel.com lrmd[26672]:  notice: MGS_2c732b_start_0:27836:stderr [     ],  ]
Jun 27 15:59:28 lotus-55vm15.lotus.hpdd.lab.intel.com lrmd[26672]:  notice: MGS_2c732b_start_0:27836:stderr [     "rc": 0,  ]
Jun 27 15:59:28 lotus-55vm15.lotus.hpdd.lab.intel.com lrmd[26672]:  notice: MGS_2c732b_start_0:27836:stderr [     "stderr": "mount.lustre: increased /sys/block/sdc/queue/max_sectors_kb from 512 to 16384\n",  ]
Jun 27 15:59:28 lotus-55vm15.lotus.hpdd.lab.intel.com lrmd[26672]:  notice: MGS_2c732b_start_0:27836:stderr [     "stdout": "" ]
Jun 27 15:59:28 lotus-55vm15.lotus.hpdd.lab.intel.com lrmd[26672]:  notice: MGS_2c732b_start_0:27836:stderr [   } ]
Jun 27 15:59:28 lotus-55vm15.lotus.hpdd.lab.intel.com lrmd[26672]:  notice: MGS_2c732b_start_0:27836:stderr [ ] ]
Jun 27 15:59:28 lotus-55vm15.lotus.hpdd.lab.intel.com lrmd[26672]:  notice: MGS_2c732b_start_0:27836:stderr [  ]
Jun 27 15:59:28 lotus-55vm15.lotus.hpdd.lab.intel.com crmd[26675]:  notice: Result of start operation for MGS_2c732b on lotus-55vm15.lotus.hpdd.lab.intel.com: 0 (ok)
Jun 27 15:59:28 lotus-55vm15.lotus.hpdd.lab.intel.com crmd[26675]:  notice: Initiating monitor operation MGS_2c732b_monitor_5000 locally on lotus-55vm15.lotus.hpdd.lab.intel.com
Jun 27 15:59:28 lotus-55vm15.lotus.hpdd.lab.intel.com crmd[26675]:  notice: Transition 17 (Complete=2, Pending=0, Fired=0, Skipped=0, Incomplete=0, Source=/var/lib/pacemaker/pengine/pe-input-502.bz2): Complete
Jun 27 15:59:28 lotus-55vm15.lotus.hpdd.lab.intel.com crmd[26675]:  notice: State transition S_TRANSITION_ENGINE -> S_IDLE
Jun 27 15:59:33 lotus-55vm15.lotus.hpdd.lab.intel.com kernel: Lustre: MGS: Connection restored to c7547940-8b66-9e67-9f49-79141907f91f (at 10.14.83.21@tcp)
Jun 27 15:59:39 lotus-55vm15.lotus.hpdd.lab.intel.com firewalld: ERROR: ALREADY_ENABLED: '988:tcp' already in 'public'
Jun 27 15:59:40 lotus-55vm15.lotus.hpdd.lab.intel.com firewalld: ERROR: ALREADY_ENABLED: 988:tcp
Jun 27 15:59:42 lotus-55vm15.lotus.hpdd.lab.intel.com crmd[26675]:  notice: State transition S_IDLE -> S_POLICY_ENGINE
Jun 27 15:59:42 lotus-55vm15.lotus.hpdd.lab.intel.com stonith-ng[26671]:  notice: On loss of CCM Quorum: Ignore
Jun 27 15:59:42 lotus-55vm15.lotus.hpdd.lab.intel.com pengine[26674]:  notice: On loss of CCM Quorum: Ignore
Jun 27 15:59:42 lotus-55vm15.lotus.hpdd.lab.intel.com pengine[26674]:  notice: Calculated transition 18, saving inputs in /var/lib/pacemaker/pengine/pe-input-503.bz2
Jun 27 15:59:42 lotus-55vm15.lotus.hpdd.lab.intel.com crmd[26675]:  notice: Initiating monitor operation testfs-MDT0000_50c63c_monitor_0 on lotus-55vm16.lotus.hpdd.lab.intel.com
Jun 27 15:59:42 lotus-55vm15.lotus.hpdd.lab.intel.com crmd[26675]:  notice: Initiating monitor operation testfs-MDT0000_50c63c_monitor_0 locally on lotus-55vm15.lotus.hpdd.lab.intel.com
Jun 27 15:59:42 lotus-55vm15.lotus.hpdd.lab.intel.com crmd[26675]:  notice: Result of probe operation for testfs-MDT0000_50c63c on lotus-55vm15.lotus.hpdd.lab.intel.com: 7 (not running)
Jun 27 15:59:42 lotus-55vm15.lotus.hpdd.lab.intel.com crmd[26675]:  notice: Transition 18 (Complete=2, Pending=0, Fired=0, Skipped=0, Incomplete=0, Source=/var/lib/pacemaker/pengine/pe-input-503.bz2): Complete
Jun 27 15:59:42 lotus-55vm15.lotus.hpdd.lab.intel.com crmd[26675]:  notice: State transition S_TRANSITION_ENGINE -> S_IDLE
Jun 27 15:59:43 lotus-55vm15.lotus.hpdd.lab.intel.com crmd[26675]:  notice: State transition S_IDLE -> S_POLICY_ENGINE
Jun 27 15:59:43 lotus-55vm15.lotus.hpdd.lab.intel.com pengine[26674]:  notice: On loss of CCM Quorum: Ignore
Jun 27 15:59:43 lotus-55vm15.lotus.hpdd.lab.intel.com pengine[26674]:  notice: Calculated transition 19, saving inputs in /var/lib/pacemaker/pengine/pe-input-504.bz2
Jun 27 15:59:43 lotus-55vm15.lotus.hpdd.lab.intel.com crmd[26675]:  notice: Transition 19 (Complete=0, Pending=0, Fired=0, Skipped=0, Incomplete=0, Source=/var/lib/pacemaker/pengine/pe-input-504.bz2): Complete
Jun 27 15:59:43 lotus-55vm15.lotus.hpdd.lab.intel.com crmd[26675]:  notice: State transition S_TRANSITION_ENGINE -> S_IDLE
Jun 27 15:59:44 lotus-55vm15.lotus.hpdd.lab.intel.com crmd[26675]:  notice: State transition S_IDLE -> S_POLICY_ENGINE
Jun 27 15:59:44 lotus-55vm15.lotus.hpdd.lab.intel.com pengine[26674]:  notice: On loss of CCM Quorum: Ignore
Jun 27 15:59:44 lotus-55vm15.lotus.hpdd.lab.intel.com pengine[26674]:  notice: Calculated transition 20, saving inputs in /var/lib/pacemaker/pengine/pe-input-505.bz2
Jun 27 15:59:44 lotus-55vm15.lotus.hpdd.lab.intel.com crmd[26675]:  notice: Transition 20 (Complete=0, Pending=0, Fired=0, Skipped=0, Incomplete=0, Source=/var/lib/pacemaker/pengine/pe-input-505.bz2): Complete
Jun 27 15:59:44 lotus-55vm15.lotus.hpdd.lab.intel.com crmd[26675]:  notice: State transition S_TRANSITION_ENGINE -> S_IDLE
Jun 27 15:59:46 lotus-55vm15.lotus.hpdd.lab.intel.com crmd[26675]:  notice: State transition S_IDLE -> S_POLICY_ENGINE
Jun 27 15:59:46 lotus-55vm15.lotus.hpdd.lab.intel.com stonith-ng[26671]:  notice: On loss of CCM Quorum: Ignore
Jun 27 15:59:46 lotus-55vm15.lotus.hpdd.lab.intel.com pengine[26674]:  notice: On loss of CCM Quorum: Ignore
Jun 27 15:59:46 lotus-55vm15.lotus.hpdd.lab.intel.com pengine[26674]:  notice: Start   testfs-MDT0000_50c63c#011(lotus-55vm16.lotus.hpdd.lab.intel.com)
Jun 27 15:59:46 lotus-55vm15.lotus.hpdd.lab.intel.com pengine[26674]:  notice: Calculated transition 21, saving inputs in /var/lib/pacemaker/pengine/pe-input-506.bz2
Jun 27 15:59:46 lotus-55vm15.lotus.hpdd.lab.intel.com crmd[26675]:  notice: Initiating start operation testfs-MDT0000_50c63c_start_0 on lotus-55vm16.lotus.hpdd.lab.intel.com
Jun 27 15:59:46 lotus-55vm15.lotus.hpdd.lab.intel.com kernel: Lustre: MGS: Connection restored to c7547940-8b66-9e67-9f49-79141907f91f (at 10.14.83.21@tcp)
Jun 27 15:59:47 lotus-55vm15.lotus.hpdd.lab.intel.com crmd[26675]:  notice: Initiating monitor operation testfs-MDT0000_50c63c_monitor_5000 on lotus-55vm16.lotus.hpdd.lab.intel.com
Jun 27 15:59:48 lotus-55vm15.lotus.hpdd.lab.intel.com crmd[26675]:  notice: Transition 21 (Complete=2, Pending=0, Fired=0, Skipped=0, Incomplete=0, Source=/var/lib/pacemaker/pengine/pe-input-506.bz2): Complete
Jun 27 15:59:48 lotus-55vm15.lotus.hpdd.lab.intel.com crmd[26675]:  notice: State transition S_TRANSITION_ENGINE -> S_IDLE
Jun 27 15:59:52 lotus-55vm15.lotus.hpdd.lab.intel.com kernel: Lustre: Setting parameter testfs-MDT0000.mdt.hsm_control in log testfs-MDT0000
Jun 27 15:59:52 lotus-55vm15.lotus.hpdd.lab.intel.com kernel: Lustre: MGS: Connection restored to fe1fce7d-c89d-ab3e-870f-b1e6d37c8900 (at 10.14.83.22@tcp)
Jun 27 16:00:06 lotus-55vm15.lotus.hpdd.lab.intel.com kernel: Lustre: MGS: Connection restored to fe1fce7d-c89d-ab3e-870f-b1e6d37c8900 (at 10.14.83.22@tcp)

We can see that the MGS was started at 15:59:10 but that even as late as 15:59:52 it was still restoring connections. It restored connections to lotus-55vm17 and lotus-55vm16 but never to lotus-55vm18. I'm not positive it should have, but it's a data point to consider.
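
For anyone repeating this check on other runs, a rough sketch of pulling the "Connection restored" peers out of a messages file follows. The regular expression is inferred only from the log lines quoted above, so treat the exact message format as an assumption; the function name `restored_nids` is made up for illustration.

```python
import re

# Pattern inferred from the MGS lines quoted in this thread (an assumption).
RESTORED = re.compile(r"Lustre: MGS: Connection restored to (\S+) \(at (\S+)\)")


def restored_nids(lines):
    """Return the distinct NIDs the MGS reports restoring, in first-seen order."""
    seen = []
    for line in lines:
        match = RESTORED.search(line)
        if match and match.group(2) not in seen:
            seen.append(match.group(2))
    return seen


sample = [
    "Jun 27 15:59:27 lotus-55vm15.lotus.hpdd.lab.intel.com kernel: Lustre: MGS: Connection restored to 5c28efa9-6d05-c86e-9d96-45224052a8c2 (at 0@lo)",
    "Jun 27 15:59:33 lotus-55vm15.lotus.hpdd.lab.intel.com kernel: Lustre: MGS: Connection restored to c7547940-8b66-9e67-9f49-79141907f91f (at 10.14.83.21@tcp)",
    "Jun 27 15:59:52 lotus-55vm15.lotus.hpdd.lab.intel.com kernel: Lustre: MGS: Connection restored to fe1fce7d-c89d-ab3e-870f-b1e6d37c8900 (at 10.14.83.22@tcp)",
]
print(restored_nids(sample))  # ['0@lo', '10.14.83.21@tcp', '10.14.83.22@tcp']
```

The excerpt does not show which host owns which NID, so mapping these back to hostnames still has to be done by hand.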

It might be worth looking this test over to see if we have some kind of race, or another timing assumption that is sometimes invalid. Maybe we are assuming that the MGS is immediately available just because we started it, and perhaps we need an availability test before trying to register a new target. If we did find that to be the case, however, I would strongly suspect a Lustre bug.
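
A minimal sketch of what such an availability test could look like, purely as an illustration and not what chroma-agent does today: poll a readiness check with a timeout before registering the target. Using `lctl ping` against the MGS NID as the probe is an assumption, and it only shows LNet reachability, not that the MGS is ready to accept registrations. The names `wait_until` and `mgs_reachable` are hypothetical.

```python
import subprocess
import time


def wait_until(check, timeout=60.0, interval=2.0, sleep=time.sleep, clock=time.time):
    """Poll check() until it returns True or timeout seconds have elapsed."""
    deadline = clock() + timeout
    while True:
        if check():
            return True
        if clock() >= deadline:
            return False
        sleep(interval)


def mgs_reachable(nid):
    """Assumed probe: True if `lctl ping <nid>` exits 0 (LNet reachability only)."""
    try:
        return subprocess.call(["lctl", "ping", nid]) == 0
    except OSError:
        # lctl is not installed or not executable on this host.
        return False


# Demonstration with a fake check that succeeds on the third attempt.
attempts = iter([False, False, True])
print(wait_until(lambda: next(attempts), timeout=5.0, interval=0.0))  # True
```

In the registration path this might be called as `wait_until(lambda: mgs_reachable(mgs_nid))` just before the mount, with `mgs_nid` supplied by the caller. If a wait like this made the failure go away, that would support the race theory.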

@tanabarr
Contributor Author

The failure still appears on subsequent runs with the Lustre packages at version 2.10.50 from the start:
http://jenkins.lotus.hpdd.lab.intel.com/job/integration-tests-shared-storage-configuration/516/arch=x86_64,distro=el7//consoleFull

@tanabarr
Contributor Author

@tanabarr tanabarr self-assigned this Jun 29, 2017
@tanabarr tanabarr changed the title from "Repeatable failure on SSI with IML updated to use Lustre master" to "mount.lustre failed: Cannot send after transport endpoint shutdown" Jun 29, 2017
@tanabarr tanabarr added the bug label Jul 2, 2017
@brianjmurrell
Contributor

Here's another occurrence:

test_nids (tests.integration.shared_storage_configuration.test_lnet_functionality.TestLNetFunctionality) ... ERROR
ERROR
Traceback (most recent call last):
  File "/usr/share/chroma-manager/tests/integration/shared_storage_configuration/test_lnet_functionality.py", line 18, in setUp
    self.create_filesystem_standard(self.TEST_SERVERS)
  File "/usr/share/chroma-manager/tests/integration/core/chroma_integration_testcase.py", line 368, in create_filesystem_standard
    'conf_params': {}})
  File "/usr/share/chroma-manager/tests/integration/core/chroma_integration_testcase.py", line 432, in create_filesystem
    timeout = LONG_TEST_TIMEOUT
  File "/usr/share/chroma-manager/tests/integration/core/api_testcase_with_test_reset.py", line 336, in wait_for_command
    self.assertFalse(command['errored'] or command['cancelled'], command)
AssertionError: {u'jobs': [u'/api/job/57/', u'/api/job/58/', u'/api/job/59/', u'/api/job/60/', u'/api/job/61/', u'/api/job/62/', u'/api/job/63/', u'/api/job/64/', u'/api/job/65/', u'/api/job/66/', u'/api/job/67/', u'/api/job/68/', u'/api/job/69/', u'/api/job/70/', u'/api/job/71/', u'/api/job/72/', u'/api/job/73/'], u'complete': True, u'created_at': u'2017-07-28T08:51:43.584792', u'message': u'Creating filesystem testfs', u'cancelled': True, u'errored': True, u'resource_uri': u'/api/command/17/', u'id': 17, u'logs': u'\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n'}
-------------------- >> begin captured stdout << ---------------------
{u'status': u'configured-ha', u'kind': u'ZfsPool', u'storage_resource': u'/api/storage_resource/47/', u'volume_nodes': [{u'use': False, u'volume_id': 1, u'primary': False, u'host': u'/api/host/4/', u'path': u'zfs_pool_scsi0QEMU_QEMU_HARDDISK_disk15', u'host_id': 4, u'resource_uri': u'/api/volume_node/3/', u'id': 3, u'host_label': u'lotus-9vm18.lotus.hpdd.lab.intel.com'}, {u'use': True, u'volume_id': 1, u'primary': True, u'host': u'/api/host/1/', u'path': u'zfs_pool_scsi0QEMU_QEMU_HARDDISK_disk15', u'host_id': 1, u'resource_uri': u'/api/volume_node/9/', u'id': 9, u'host_label': u'lotus-9vm15.lotus.hpdd.lab.intel.com'}, {u'use': False, u'volume_id': 1, u'primary': False, u'host': u'/api/host/3/', u'path': u'zfs_pool_scsi0QEMU_QEMU_HARDDISK_disk15', u'host_id': 3, u'resource_uri': u'/api/volume_node/14/', u'id': 14, u'host_label': u'lotus-9vm17.lotus.hpdd.lab.intel.com'}, {u'use': True, u'volume_id': 1, u'primary': False, u'host': u'/api/host/2/', u'path': u'zfs_pool_scsi0QEMU_QEMU_HARDDISK_disk15', u'host_id': 2, u'resource_uri': u'/api/volume_node/19/', u'id': 19, u'host_label': u'lotus-9vm16.lotus.hpdd.lab.intel.com'}], u'filesystem_type': None, u'label': u'zfs_pool_scsi0QEMU_QEMU_HARDDISK_disk15', u'usable_for_lustre': True, u'resource_uri': u'/api/volume/1/', u'id': 1, u'size': u'10672993730'}
{u'status': u'configured-ha', u'kind': u'SCSI device', u'storage_resource': u'/api/storage_resource/26/', u'volume_nodes': [{u'use': False, u'volume_id': 2, u'primary': False, u'host': u'/api/host/4/', u'path': u'/dev/disk/by-id/scsi-0QEMU_QEMU_HARDDISK_disk12', u'host_id': 4, u'resource_uri': u'/api/volume_node/2/', u'id': 2, u'host_label': u'lotus-9vm18.lotus.hpdd.lab.intel.com'}, {u'use': True, u'volume_id': 2, u'primary': True, u'host': u'/api/host/1/', u'path': u'/dev/disk/by-id/scsi-0QEMU_QEMU_HARDDISK_disk12', u'host_id': 1, u'resource_uri': u'/api/volume_node/7/', u'id': 7, u'host_label': u'lotus-9vm15.lotus.hpdd.lab.intel.com'}, {u'use': False, u'volume_id': 2, u'primary': False, u'host': u'/api/host/3/', u'path': u'/dev/disk/by-id/scsi-0QEMU_QEMU_HARDDISK_disk12', u'host_id': 3, u'resource_uri': u'/api/volume_node/12/', u'id': 12, u'host_label': u'lotus-9vm17.lotus.hpdd.lab.intel.com'}, {u'use': True, u'volume_id': 2, u'primary': False, u'host': u'/api/host/2/', u'path': u'/dev/disk/by-id/scsi-0QEMU_QEMU_HARDDISK_disk12', u'host_id': 2, u'resource_uri': u'/api/volume_node/17/', u'id': 17, u'host_label': u'lotus-9vm16.lotus.hpdd.lab.intel.com'}], u'filesystem_type': None, u'label': u'disk12', u'usable_for_lustre': True, u'resource_uri': u'/api/volume/2/', u'id': 2, u'size': u'10737418240'}
{u'status': u'configured-ha', u'kind': u'ZfsPool', u'storage_resource': u'/api/storage_resource/49/', u'volume_nodes': [{u'use': True, u'volume_id': 3, u'primary': True, u'host': u'/api/host/4/', u'path': u'zfs_pool_scsi0QEMU_QEMU_HARDDISK_disk13', u'host_id': 4, u'resource_uri': u'/api/volume_node/4/', u'id': 4, u'host_label': u'lotus-9vm18.lotus.hpdd.lab.intel.com'}, {u'use': False, u'volume_id': 3, u'primary': False, u'host': u'/api/host/1/', u'path': u'zfs_pool_scsi0QEMU_QEMU_HARDDISK_disk13', u'host_id': 1, u'resource_uri': u'/api/volume_node/10/', u'id': 10, u'host_label': u'lotus-9vm15.lotus.hpdd.lab.intel.com'}, {u'use': True, u'volume_id': 3, u'primary': False, u'host': u'/api/host/3/', u'path': u'zfs_pool_scsi0QEMU_QEMU_HARDDISK_disk13', u'host_id': 3, u'resource_uri': u'/api/volume_node/15/', u'id': 15, u'host_label': u'lotus-9vm17.lotus.hpdd.lab.intel.com'}, {u'use': False, u'volume_id': 3, u'primary': False, u'host': u'/api/host/2/', u'path': u'zfs_pool_scsi0QEMU_QEMU_HARDDISK_disk13', u'host_id': 2, u'resource_uri': u'/api/volume_node/20/', u'id': 20, u'host_label': u'lotus-9vm16.lotus.hpdd.lab.intel.com'}], u'filesystem_type': None, u'label': u'zfs_pool_scsi0QEMU_QEMU_HARDDISK_disk13', u'usable_for_lustre': True, u'resource_uri': u'/api/volume/3/', u'id': 3, u'size': u'10672993730'}
{u'status': u'configured-ha', u'kind': u'SCSI device', u'storage_resource': u'/api/storage_resource/23/', u'volume_nodes': [{u'use': False, u'volume_id': 4, u'primary': False, u'host': u'/api/host/4/', u'path': u'/dev/disk/by-id/scsi-0QEMU_QEMU_HARDDISK_disk14', u'host_id': 4, u'resource_uri': u'/api/volume_node/1/', u'id': 1, u'host_label': u'lotus-9vm18.lotus.hpdd.lab.intel.com'}, {u'use': True, u'volume_id': 4, u'primary': False, u'host': u'/api/host/1/', u'path': u'/dev/disk/by-id/scsi-0QEMU_QEMU_HARDDISK_disk14', u'host_id': 1, u'resource_uri': u'/api/volume_node/6/', u'id': 6, u'host_label': u'lotus-9vm15.lotus.hpdd.lab.intel.com'}, {u'use': False, u'volume_id': 4, u'primary': False, u'host': u'/api/host/3/', u'path': u'/dev/disk/by-id/scsi-0QEMU_QEMU_HARDDISK_disk14', u'host_id': 3, u'resource_uri': u'/api/volume_node/11/', u'id': 11, u'host_label': u'lotus-9vm17.lotus.hpdd.lab.intel.com'}, {u'use': True, u'volume_id': 4, u'primary': True, u'host': u'/api/host/2/', u'path': u'/dev/disk/by-id/scsi-0QEMU_QEMU_HARDDISK_disk14', u'host_id': 2, u'resource_uri': u'/api/volume_node/16/', u'id': 16, u'host_label': u'lotus-9vm16.lotus.hpdd.lab.intel.com'}], u'filesystem_type': None, u'label': u'disk14', u'usable_for_lustre': True, u'resource_uri': u'/api/volume/4/', u'id': 4, u'size': u'10737418240'}
{u'status': u'configured-noha', u'kind': u'ZfsPool', u'storage_resource': u'/api/storage_resource/82/', u'volume_nodes': [{u'use': False, u'volume_id': 6, u'primary': False, u'host': u'/api/host/1/', u'path': u'zfs_pool_scsi0QEMU_QEMU_HARDDISK_disk11', u'host_id': 1, u'resource_uri': u'/api/volume_node/8/', u'id': 8, u'host_label': u'lotus-9vm15.lotus.hpdd.lab.intel.com'}, {u'use': True, u'volume_id': 6, u'primary': True, u'host': u'/api/host/3/', u'path': u'zfs_pool_scsi0QEMU_QEMU_HARDDISK_disk11', u'host_id': 3, u'resource_uri': u'/api/volume_node/13/', u'id': 13, u'host_label': u'lotus-9vm17.lotus.hpdd.lab.intel.com'}, {u'use': False, u'volume_id': 6, u'primary': False, u'host': u'/api/host/2/', u'path': u'zfs_pool_scsi0QEMU_QEMU_HARDDISK_disk11', u'host_id': 2, u'resource_uri': u'/api/volume_node/18/', u'id': 18, u'host_label': u'lotus-9vm16.lotus.hpdd.lab.intel.com'}], u'filesystem_type': None, u'label': u'zfs_pool_scsi0QEMU_QEMU_HARDDISK_disk11', u'usable_for_lustre': True, u'resource_uri': u'/api/volume/6/', u'id': 6, u'size': u'10672993730'}
{u'status': u'configured-ha', u'kind': u'ZfsPool', u'storage_resource': u'/api/storage_resource/47/', u'volume_nodes': [{u'use': False, u'volume_id': 1, u'primary': False, u'host': u'/api/host/4/', u'path': u'zfs_pool_scsi0QEMU_QEMU_HARDDISK_disk15', u'host_id': 4, u'resource_uri': u'/api/volume_node/3/', u'id': 3, u'host_label': u'lotus-9vm18.lotus.hpdd.lab.intel.com'}, {u'use': True, u'volume_id': 1, u'primary': True, u'host': u'/api/host/1/', u'path': u'zfs_pool_scsi0QEMU_QEMU_HARDDISK_disk15', u'host_id': 1, u'resource_uri': u'/api/volume_node/9/', u'id': 9, u'host_label': u'lotus-9vm15.lotus.hpdd.lab.intel.com'}, {u'use': False, u'volume_id': 1, u'primary': False, u'host': u'/api/host/3/', u'path': u'zfs_pool_scsi0QEMU_QEMU_HARDDISK_disk15', u'host_id': 3, u'resource_uri': u'/api/volume_node/14/', u'id': 14, u'host_label': u'lotus-9vm17.lotus.hpdd.lab.intel.com'}, {u'use': True, u'volume_id': 1, u'primary': False, u'host': u'/api/host/2/', u'path': u'zfs_pool_scsi0QEMU_QEMU_HARDDISK_disk15', u'host_id': 2, u'resource_uri': u'/api/volume_node/19/', u'id': 19, u'host_label': u'lotus-9vm16.lotus.hpdd.lab.intel.com'}], u'filesystem_type': None, u'label': u'zfs_pool_scsi0QEMU_QEMU_HARDDISK_disk15', u'usable_for_lustre': True, u'resource_uri': u'/api/volume/1/', u'id': 1, u'size': u'10672993730'}
{u'status': u'configured-ha', u'kind': u'SCSI device', u'storage_resource': u'/api/storage_resource/26/', u'volume_nodes': [{u'use': False, u'volume_id': 2, u'primary': False, u'host': u'/api/host/4/', u'path': u'/dev/disk/by-id/scsi-0QEMU_QEMU_HARDDISK_disk12', u'host_id': 4, u'resource_uri': u'/api/volume_node/2/', u'id': 2, u'host_label': u'lotus-9vm18.lotus.hpdd.lab.intel.com'}, {u'use': True, u'volume_id': 2, u'primary': True, u'host': u'/api/host/1/', u'path': u'/dev/disk/by-id/scsi-0QEMU_QEMU_HARDDISK_disk12', u'host_id': 1, u'resource_uri': u'/api/volume_node/7/', u'id': 7, u'host_label': u'lotus-9vm15.lotus.hpdd.lab.intel.com'}, {u'use': False, u'volume_id': 2, u'primary': False, u'host': u'/api/host/3/', u'path': u'/dev/disk/by-id/scsi-0QEMU_QEMU_HARDDISK_disk12', u'host_id': 3, u'resource_uri': u'/api/volume_node/12/', u'id': 12, u'host_label': u'lotus-9vm17.lotus.hpdd.lab.intel.com'}, {u'use': True, u'volume_id': 2, u'primary': False, u'host': u'/api/host/2/', u'path': u'/dev/disk/by-id/scsi-0QEMU_QEMU_HARDDISK_disk12', u'host_id': 2, u'resource_uri': u'/api/volume_node/17/', u'id': 17, u'host_label': u'lotus-9vm16.lotus.hpdd.lab.intel.com'}], u'filesystem_type': None, u'label': u'disk12', u'usable_for_lustre': True, u'resource_uri': u'/api/volume/2/', u'id': 2, u'size': u'10737418240'}
{u'status': u'configured-ha', u'kind': u'ZfsPool', u'storage_resource': u'/api/storage_resource/49/', u'volume_nodes': [{u'use': True, u'volume_id': 3, u'primary': True, u'host': u'/api/host/4/', u'path': u'zfs_pool_scsi0QEMU_QEMU_HARDDISK_disk13', u'host_id': 4, u'resource_uri': u'/api/volume_node/4/', u'id': 4, u'host_label': u'lotus-9vm18.lotus.hpdd.lab.intel.com'}, {u'use': False, u'volume_id': 3, u'primary': False, u'host': u'/api/host/1/', u'path': u'zfs_pool_scsi0QEMU_QEMU_HARDDISK_disk13', u'host_id': 1, u'resource_uri': u'/api/volume_node/10/', u'id': 10, u'host_label': u'lotus-9vm15.lotus.hpdd.lab.intel.com'}, {u'use': True, u'volume_id': 3, u'primary': False, u'host': u'/api/host/3/', u'path': u'zfs_pool_scsi0QEMU_QEMU_HARDDISK_disk13', u'host_id': 3, u'resource_uri': u'/api/volume_node/15/', u'id': 15, u'host_label': u'lotus-9vm17.lotus.hpdd.lab.intel.com'}, {u'use': False, u'volume_id': 3, u'primary': False, u'host': u'/api/host/2/', u'path': u'zfs_pool_scsi0QEMU_QEMU_HARDDISK_disk13', u'host_id': 2, u'resource_uri': u'/api/volume_node/20/', u'id': 20, u'host_label': u'lotus-9vm16.lotus.hpdd.lab.intel.com'}], u'filesystem_type': None, u'label': u'zfs_pool_scsi0QEMU_QEMU_HARDDISK_disk13', u'usable_for_lustre': True, u'resource_uri': u'/api/volume/3/', u'id': 3, u'size': u'10672993730'}
{u'status': u'configured-ha', u'kind': u'SCSI device', u'storage_resource': u'/api/storage_resource/23/', u'volume_nodes': [{u'use': False, u'volume_id': 4, u'primary': False, u'host': u'/api/host/4/', u'path': u'/dev/disk/by-id/scsi-0QEMU_QEMU_HARDDISK_disk14', u'host_id': 4, u'resource_uri': u'/api/volume_node/1/', u'id': 1, u'host_label': u'lotus-9vm18.lotus.hpdd.lab.intel.com'}, {u'use': True, u'volume_id': 4, u'primary': False, u'host': u'/api/host/1/', u'path': u'/dev/disk/by-id/scsi-0QEMU_QEMU_HARDDISK_disk14', u'host_id': 1, u'resource_uri': u'/api/volume_node/6/', u'id': 6, u'host_label': u'lotus-9vm15.lotus.hpdd.lab.intel.com'}, {u'use': False, u'volume_id': 4, u'primary': False, u'host': u'/api/host/3/', u'path': u'/dev/disk/by-id/scsi-0QEMU_QEMU_HARDDISK_disk14', u'host_id': 3, u'resource_uri': u'/api/volume_node/11/', u'id': 11, u'host_label': u'lotus-9vm17.lotus.hpdd.lab.intel.com'}, {u'use': True, u'volume_id': 4, u'primary': True, u'host': u'/api/host/2/', u'path': u'/dev/disk/by-id/scsi-0QEMU_QEMU_HARDDISK_disk14', u'host_id': 2, u'resource_uri': u'/api/volume_node/16/', u'id': 16, u'host_label': u'lotus-9vm16.lotus.hpdd.lab.intel.com'}], u'filesystem_type': None, u'label': u'disk14', u'usable_for_lustre': True, u'resource_uri': u'/api/volume/4/', u'id': 4, u'size': u'10737418240'}
{u'status': u'configured-noha', u'kind': u'ZfsPool', u'storage_resource': u'/api/storage_resource/82/', u'volume_nodes': [{u'use': False, u'volume_id': 6, u'primary': False, u'host': u'/api/host/1/', u'path': u'zfs_pool_scsi0QEMU_QEMU_HARDDISK_disk11', u'host_id': 1, u'resource_uri': u'/api/volume_node/8/', u'id': 8, u'host_label': u'lotus-9vm15.lotus.hpdd.lab.intel.com'}, {u'use': True, u'volume_id': 6, u'primary': True, u'host': u'/api/host/3/', u'path': u'zfs_pool_scsi0QEMU_QEMU_HARDDISK_disk11', u'host_id': 3, u'resource_uri': u'/api/volume_node/13/', u'id': 13, u'host_label': u'lotus-9vm17.lotus.hpdd.lab.intel.com'}, {u'use': False, u'volume_id': 6, u'primary': False, u'host': u'/api/host/2/', u'path': u'zfs_pool_scsi0QEMU_QEMU_HARDDISK_disk11', u'host_id': 2, u'resource_uri': u'/api/volume_node/18/', u'id': 18, u'host_label': u'lotus-9vm16.lotus.hpdd.lab.intel.com'}], u'filesystem_type': None, u'label': u'zfs_pool_scsi0QEMU_QEMU_HARDDISK_disk11', u'usable_for_lustre': True, u'resource_uri': u'/api/volume/6/', u'id': 6, u'size': u'10672993730'}
COMMAND 17: FAILED
-----------------------------------------------------------
{u'jobs': [u'/api/job/57/', u'/api/job/58/', u'/api/job/59/', u'/api/job/60/', u'/api/job/61/', u'/api/job/62/', u'/api/job/63/', u'/api/job/64/', u'/api/job/65/', u'/api/job/66/', u'/api/job/67/', u'/api/job/68/', u'/api/job/69/', u'/api/job/70/', u'/api/job/71/', u'/api/job/72/', u'/api/job/73/'], u'complete': True, u'created_at': u'2017-07-28T08:51:43.584792', u'message': u'Creating filesystem testfs', u'cancelled': True, u'errored': True, u'resource_uri': u'/api/command/17/', u'id': 17, u'logs': u'\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n'}

Job 67 Errored (Register testfs-OST0001):
{u'commands': [u'/api/command/17/'], u'write_locks': [{u'locked_item_content_type_id': 91, u'locked_item_id': 4, u'locked_item_uri': u'/api/target/4/', u'resource_uri': u''}], u'description': u'Register testfs-OST0001', u'read_locks': [{u'locked_item_content_type_id': 10, u'locked_item_id': 4, u'locked_item_uri': u'/api/lnet_configuration/4/', u'resource_uri': u''}, {u'locked_item_content_type_id': 93, u'locked_item_id': 1, u'locked_item_uri': u'/api/target/1/', u'resource_uri': u''}, {u'locked_item_content_type_id': 92, u'locked_item_id': 2, u'locked_item_uri': u'/api/target/2/', u'resource_uri': u''}, {u'locked_item_content_type_id': 109, u'locked_item_id': 1, u'locked_item_uri': u'/api/filesystem/1/', u'resource_uri': u''}, {u'locked_item_content_type_id': 10, u'locked_item_id': 4, u'locked_item_uri': u'/api/lnet_configuration/4/', u'resource_uri': u''}, {u'locked_item_content_type_id': 49, u'locked_item_id': 4, u'locked_item_uri': u'/api/pacemaker_configuration/4/', u'resource_uri': u''}, {u'locked_item_content_type_id': 10, u'locked_item_id': 3, u'locked_item_uri': u'/api/lnet_configuration/3/', u'resource_uri': u''}, {u'locked_item_content_type_id': 49, u'locked_item_id': 3, u'locked_item_uri': u'/api/pacemaker_configuration/3/', u'resource_uri': u''}, {u'locked_item_content_type_id': 109, u'locked_item_id': 1, u'locked_item_uri': u'/api/filesystem/1/', u'resource_uri': u''}, {u'locked_item_content_type_id': 10, u'locked_item_id': 4, u'locked_item_uri': u'/api/lnet_configuration/4/', u'resource_uri': u''}, {u'locked_item_content_type_id': 49, u'locked_item_id': 4, u'locked_item_uri': u'/api/pacemaker_configuration/4/', u'resource_uri': u''}, {u'locked_item_content_type_id': 10, u'locked_item_id': 3, u'locked_item_uri': u'/api/lnet_configuration/3/', u'resource_uri': u''}, {u'locked_item_content_type_id': 49, u'locked_item_id': 3, u'locked_item_uri': u'/api/pacemaker_configuration/3/', u'resource_uri': u''}], u'class_name': u'RegisterTargetJob', 
u'step_results': {u'/api/step/121/': None}, u'created_at': u'2017-07-28T08:51:43.863873', u'modified_at': u'2017-07-28T08:51:43.863845', u'available_transitions': [], u'state': u'complete', u'steps': [u'/api/step/121/'], u'cancelled': False, u'errored': True, u'wait_for': [u'/api/job/66/', u'/api/job/59/', u'/api/job/63/'], u'id': 67, u'resource_uri': u'/api/job/67/'}

Step 121 failed:
step_count: 2
console: modprobe osd_ldiskfs: 0


mount -t lustre /dev/disk/by-id/scsi-0QEMU_QEMU_HARDDISK_disk14 /mnt/testfs-OST0001: 108

mount.lustre: increased /sys/block/sdb/queue/max_sectors_kb from 512 to 16384
mount.lustre: mount /dev/sdb at /mnt/testfs-OST0001 failed: Cannot send after transport endpoint shutdown


description: RegisterTargetStep: {'device_path': u'/dev/disk/by-id/scsi-0QEMU_QEMU_HARDDISK_disk14', 'primary_host': <ManagedHost: lotus-9vm18.lotus.hpdd.lab.intel.com>, 'mount_point': u'/mnt/testfs-OST0001', 'backfstype': 'ldiskfs', 'target': <ManagedTarget: testfs-OST0001>}
class_name: RegisterTargetStep
backtrace: Traceback (most recent call last):

  File "/usr/lib/python2.7/site-packages/chroma_agent/device_plugins/action_runner.py", line 164, in run

  File "/usr/lib/python2.7/site-packages/chroma_agent/plugin_manager.py", line 305, in run

  File "/usr/lib/python2.7/site-packages/chroma_agent/action_plugins/manage_targets.py", line 282, in register_target

  File "/usr/lib/python2.7/site-packages/chroma_agent/chroma_common/filesystems/filesystem_ldiskfs.py", line 61, in mount

RuntimeError: Error (108) mounting '/mnt/testfs-OST0001': '' 'mount.lustre: increased /sys/block/sdb/queue/max_sectors_kb from 512 to 16384
mount.lustre: mount /dev/sdb at /mnt/testfs-OST0001 failed: Cannot send after transport endpoint shutdown
'

created_at: 2017-07-28T08:53:43.477014
args: {u'device_path': u'/dev/disk/by-id/scsi-0QEMU_QEMU_HARDDISK_disk14', u'primary_host': u'lotus-9vm18.lotus.hpdd.lab.intel.com', u'mount_point': u'/mnt/testfs-OST0001', u'backfstype': u'ldiskfs', u'target': u'testfs-OST0001'}
modified_at: 2017-07-28T08:53:45.727514
step_index: 0
state: failed
result: 
resource_uri: /api/step/121/
id: 121
log: 


--------------------- >> end captured stdout << ----------------------

where this time there is nothing new on the Lustre branch we are pulling from, so the upgrading-during-the-test theory is definitely a red herring.
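For reference, the `108` in `Error (108) mounting` above appears to be an errno value: on Linux, 108 is `ESHUTDOWN`, whose standard message matches the failure string in the log. A minimal sketch for decoding such return codes follows; `describe_rc` is a hypothetical helper name, not something from chroma_agent, and the numeric value of `ESHUTDOWN` is platform-specific.

```python
import errno
import os


def describe_rc(rc):
    """Return (symbolic name, message) for an errno-style return code.

    Hypothetical helper for reading logs; not part of chroma_agent.
    """
    # errno.errorcode maps numeric errno values to names such as 'ESHUTDOWN'.
    name = errno.errorcode.get(rc, "UNKNOWN")
    # os.strerror gives the platform's message text for that code.
    return name, os.strerror(rc)
```

Calling `describe_rc(108)` on a Linux host should give the `ESHUTDOWN` name alongside the platform's message text; other platforms may number the error differently.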

@brianjmurrell
Contributor

Another:

test_create_filesystem_with_failover_mgs (tests.integration.shared_storage_configuration.test_managed_filesystem_with_failover.TestManagedFilesystemWithFailover) ... FAIL
FAIL
Traceback (most recent call last):
  File "/usr/share/chroma-manager/tests/integration/shared_storage_configuration/test_managed_filesystem_with_failover.py", line 45, in test_create_filesystem_with_failover_mgs
    filesystem_id, volumes_expected_hosts_in_normal_state = self._test_create_filesystem_with_failover()
  File "/usr/share/chroma-manager/tests/integration/shared_storage_configuration/test_managed_filesystem_with_failover.py", line 15, in _test_create_filesystem_with_failover
    filesystem_id = self.create_filesystem_standard(self.TEST_SERVERS)
  File "/usr/share/chroma-manager/tests/integration/core/chroma_integration_testcase.py", line 368, in create_filesystem_standard
    'conf_params': {}})
  File "/usr/share/chroma-manager/tests/integration/core/chroma_integration_testcase.py", line 432, in create_filesystem
    timeout = LONG_TEST_TIMEOUT
  File "/usr/share/chroma-manager/tests/integration/core/api_testcase_with_test_reset.py", line 336, in wait_for_command
    self.assertFalse(command['errored'] or command['cancelled'], command)
AssertionError: {u'jobs': [u'/api/job/57/', u'/api/job/58/', u'/api/job/59/', u'/api/job/60/', u'/api/job/61/', u'/api/job/62/', u'/api/job/63/', u'/api/job/64/', u'/api/job/65/', u'/api/job/66/', u'/api/job/67/', u'/api/job/68/', u'/api/job/69/', u'/api/job/70/', u'/api/job/71/', u'/api/job/72/', u'/api/job/73/'], u'complete': True, u'created_at': u'2017-07-29T05:18:15.129969', u'message': u'Creating filesystem testfs', u'cancelled': True, u'errored': True, u'resource_uri': u'/api/command/17/', u'id': 17, u'logs': u'\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n'}
-------------------- >> begin captured stdout << ---------------------
{u'status': u'configured-ha', u'kind': u'ZfsPool', u'storage_resource': u'/api/storage_resource/47/', u'volume_nodes': [{u'use': False, u'volume_id': 1, u'primary': False, u'host': u'/api/host/1/', u'path': u'zfs_pool_scsi0QEMU_QEMU_HARDDISK_disk11', u'host_id': 1, u'resource_uri': u'/api/volume_node/3/', u'id': 3, u'host_label': u'lotus-58vm15.lotus.hpdd.lab.intel.com'}, {u'use': True, u'volume_id': 1, u'primary': True, u'host': u'/api/host/3/', u'path': u'zfs_pool_scsi0QEMU_QEMU_HARDDISK_disk11', u'host_id': 3, u'resource_uri': u'/api/volume_node/8/', u'id': 8, u'host_label': u'lotus-58vm17.lotus.hpdd.lab.intel.com'}, {u'use': True, u'volume_id': 1, u'primary': False, u'host': u'/api/host/4/', u'path': u'zfs_pool_scsi0QEMU_QEMU_HARDDISK_disk11', u'host_id': 4, u'resource_uri': u'/api/volume_node/13/', u'id': 13, u'host_label': u'lotus-58vm18.lotus.hpdd.lab.intel.com'}, {u'use': False, u'volume_id': 1, u'primary': False, u'host': u'/api/host/2/', u'path': u'zfs_pool_scsi0QEMU_QEMU_HARDDISK_disk11', u'host_id': 2, u'resource_uri': u'/api/volume_node/18/', u'id': 18, u'host_label': u'lotus-58vm16.lotus.hpdd.lab.intel.com'}], u'filesystem_type': None, u'label': u'zfs_pool_scsi0QEMU_QEMU_HARDDISK_disk11', u'usable_for_lustre': True, u'resource_uri': u'/api/volume/1/', u'id': 1, u'size': u'10672993730'}
{u'status': u'configured-ha', u'kind': u'SCSI device', u'storage_resource': u'/api/storage_resource/26/', u'volume_nodes': [{u'use': True, u'volume_id': 2, u'primary': True, u'host': u'/api/host/1/', u'path': u'/dev/disk/by-id/scsi-0QEMU_QEMU_HARDDISK_disk12', u'host_id': 1, u'resource_uri': u'/api/volume_node/2/', u'id': 2, u'host_label': u'lotus-58vm15.lotus.hpdd.lab.intel.com'}, {u'use': False, u'volume_id': 2, u'primary': False, u'host': u'/api/host/3/', u'path': u'/dev/disk/by-id/scsi-0QEMU_QEMU_HARDDISK_disk12', u'host_id': 3, u'resource_uri': u'/api/volume_node/7/', u'id': 7, u'host_label': u'lotus-58vm17.lotus.hpdd.lab.intel.com'}, {u'use': False, u'volume_id': 2, u'primary': False, u'host': u'/api/host/4/', u'path': u'/dev/disk/by-id/scsi-0QEMU_QEMU_HARDDISK_disk12', u'host_id': 4, u'resource_uri': u'/api/volume_node/12/', u'id': 12, u'host_label': u'lotus-58vm18.lotus.hpdd.lab.intel.com'}, {u'use': True, u'volume_id': 2, u'primary': False, u'host': u'/api/host/2/', u'path': u'/dev/disk/by-id/scsi-0QEMU_QEMU_HARDDISK_disk12', u'host_id': 2, u'resource_uri': u'/api/volume_node/17/', u'id': 17, u'host_label': u'lotus-58vm16.lotus.hpdd.lab.intel.com'}], u'filesystem_type': None, u'label': u'disk12', u'usable_for_lustre': True, u'resource_uri': u'/api/volume/2/', u'id': 2, u'size': u'10737418240'}
{u'status': u'configured-ha', u'kind': u'ZfsPool', u'storage_resource': u'/api/storage_resource/49/', u'volume_nodes': [{u'use': True, u'volume_id': 3, u'primary': True, u'host': u'/api/host/1/', u'path': u'zfs_pool_scsi0QEMU_QEMU_HARDDISK_disk13', u'host_id': 1, u'resource_uri': u'/api/volume_node/4/', u'id': 4, u'host_label': u'lotus-58vm15.lotus.hpdd.lab.intel.com'}, {u'use': False, u'volume_id': 3, u'primary': False, u'host': u'/api/host/3/', u'path': u'zfs_pool_scsi0QEMU_QEMU_HARDDISK_disk13', u'host_id': 3, u'resource_uri': u'/api/volume_node/9/', u'id': 9, u'host_label': u'lotus-58vm17.lotus.hpdd.lab.intel.com'}, {u'use': True, u'volume_id': 3, u'primary': False, u'host': u'/api/host/2/', u'path': u'zfs_pool_scsi0QEMU_QEMU_HARDDISK_disk13', u'host_id': 2, u'resource_uri': u'/api/volume_node/19/', u'id': 19, u'host_label': u'lotus-58vm16.lotus.hpdd.lab.intel.com'}], u'filesystem_type': None, u'label': u'zfs_pool_scsi0QEMU_QEMU_HARDDISK_disk13', u'usable_for_lustre': True, u'resource_uri': u'/api/volume/3/', u'id': 3, u'size': u'10672993730'}
{u'status': u'configured-ha', u'kind': u'SCSI device', u'storage_resource': u'/api/storage_resource/23/', u'volume_nodes': [{u'use': True, u'volume_id': 4, u'primary': False, u'host': u'/api/host/1/', u'path': u'/dev/disk/by-id/scsi-0QEMU_QEMU_HARDDISK_disk14', u'host_id': 1, u'resource_uri': u'/api/volume_node/1/', u'id': 1, u'host_label': u'lotus-58vm15.lotus.hpdd.lab.intel.com'}, {u'use': False, u'volume_id': 4, u'primary': False, u'host': u'/api/host/3/', u'path': u'/dev/disk/by-id/scsi-0QEMU_QEMU_HARDDISK_disk14', u'host_id': 3, u'resource_uri': u'/api/volume_node/6/', u'id': 6, u'host_label': u'lotus-58vm17.lotus.hpdd.lab.intel.com'}, {u'use': False, u'volume_id': 4, u'primary': False, u'host': u'/api/host/4/', u'path': u'/dev/disk/by-id/scsi-0QEMU_QEMU_HARDDISK_disk14', u'host_id': 4, u'resource_uri': u'/api/volume_node/11/', u'id': 11, u'host_label': u'lotus-58vm18.lotus.hpdd.lab.intel.com'}, {u'use': True, u'volume_id': 4, u'primary': True, u'host': u'/api/host/2/', u'path': u'/dev/disk/by-id/scsi-0QEMU_QEMU_HARDDISK_disk14', u'host_id': 2, u'resource_uri': u'/api/volume_node/16/', u'id': 16, u'host_label': u'lotus-58vm16.lotus.hpdd.lab.intel.com'}], u'filesystem_type': None, u'label': u'disk14', u'usable_for_lustre': True, u'resource_uri': u'/api/volume/4/', u'id': 4, u'size': u'10737418240'}
{u'status': u'configured-ha', u'kind': u'ZfsPool', u'storage_resource': u'/api/storage_resource/51/', u'volume_nodes': [{u'use': False, u'volume_id': 5, u'primary': False, u'host': u'/api/host/1/', u'path': u'zfs_pool_scsi0QEMU_QEMU_HARDDISK_disk15', u'host_id': 1, u'resource_uri': u'/api/volume_node/5/', u'id': 5, u'host_label': u'lotus-58vm15.lotus.hpdd.lab.intel.com'}, {u'use': True, u'volume_id': 5, u'primary': False, u'host': u'/api/host/3/', u'path': u'zfs_pool_scsi0QEMU_QEMU_HARDDISK_disk15', u'host_id': 3, u'resource_uri': u'/api/volume_node/10/', u'id': 10, u'host_label': u'lotus-58vm17.lotus.hpdd.lab.intel.com'}, {u'use': True, u'volume_id': 5, u'primary': True, u'host': u'/api/host/4/', u'path': u'zfs_pool_scsi0QEMU_QEMU_HARDDISK_disk15', u'host_id': 4, u'resource_uri': u'/api/volume_node/14/', u'id': 14, u'host_label': u'lotus-58vm18.lotus.hpdd.lab.intel.com'}, {u'use': False, u'volume_id': 5, u'primary': False, u'host': u'/api/host/2/', u'path': u'zfs_pool_scsi0QEMU_QEMU_HARDDISK_disk15', u'host_id': 2, u'resource_uri': u'/api/volume_node/20/', u'id': 20, u'host_label': u'lotus-58vm16.lotus.hpdd.lab.intel.com'}], u'filesystem_type': None, u'label': u'zfs_pool_scsi0QEMU_QEMU_HARDDISK_disk15', u'usable_for_lustre': True, u'resource_uri': u'/api/volume/5/', u'id': 5, u'size': u'10672993730'}
{u'status': u'configured-ha', u'kind': u'ZfsPool', u'storage_resource': u'/api/storage_resource/47/', u'volume_nodes': [{u'use': False, u'volume_id': 1, u'primary': False, u'host': u'/api/host/1/', u'path': u'zfs_pool_scsi0QEMU_QEMU_HARDDISK_disk11', u'host_id': 1, u'resource_uri': u'/api/volume_node/3/', u'id': 3, u'host_label': u'lotus-58vm15.lotus.hpdd.lab.intel.com'}, {u'use': True, u'volume_id': 1, u'primary': True, u'host': u'/api/host/3/', u'path': u'zfs_pool_scsi0QEMU_QEMU_HARDDISK_disk11', u'host_id': 3, u'resource_uri': u'/api/volume_node/8/', u'id': 8, u'host_label': u'lotus-58vm17.lotus.hpdd.lab.intel.com'}, {u'use': True, u'volume_id': 1, u'primary': False, u'host': u'/api/host/4/', u'path': u'zfs_pool_scsi0QEMU_QEMU_HARDDISK_disk11', u'host_id': 4, u'resource_uri': u'/api/volume_node/13/', u'id': 13, u'host_label': u'lotus-58vm18.lotus.hpdd.lab.intel.com'}, {u'use': False, u'volume_id': 1, u'primary': False, u'host': u'/api/host/2/', u'path': u'zfs_pool_scsi0QEMU_QEMU_HARDDISK_disk11', u'host_id': 2, u'resource_uri': u'/api/volume_node/18/', u'id': 18, u'host_label': u'lotus-58vm16.lotus.hpdd.lab.intel.com'}], u'filesystem_type': None, u'label': u'zfs_pool_scsi0QEMU_QEMU_HARDDISK_disk11', u'usable_for_lustre': True, u'resource_uri': u'/api/volume/1/', u'id': 1, u'size': u'10672993730'}
{u'status': u'configured-ha', u'kind': u'SCSI device', u'storage_resource': u'/api/storage_resource/26/', u'volume_nodes': [{u'use': True, u'volume_id': 2, u'primary': True, u'host': u'/api/host/1/', u'path': u'/dev/disk/by-id/scsi-0QEMU_QEMU_HARDDISK_disk12', u'host_id': 1, u'resource_uri': u'/api/volume_node/2/', u'id': 2, u'host_label': u'lotus-58vm15.lotus.hpdd.lab.intel.com'}, {u'use': False, u'volume_id': 2, u'primary': False, u'host': u'/api/host/3/', u'path': u'/dev/disk/by-id/scsi-0QEMU_QEMU_HARDDISK_disk12', u'host_id': 3, u'resource_uri': u'/api/volume_node/7/', u'id': 7, u'host_label': u'lotus-58vm17.lotus.hpdd.lab.intel.com'}, {u'use': False, u'volume_id': 2, u'primary': False, u'host': u'/api/host/4/', u'path': u'/dev/disk/by-id/scsi-0QEMU_QEMU_HARDDISK_disk12', u'host_id': 4, u'resource_uri': u'/api/volume_node/12/', u'id': 12, u'host_label': u'lotus-58vm18.lotus.hpdd.lab.intel.com'}, {u'use': True, u'volume_id': 2, u'primary': False, u'host': u'/api/host/2/', u'path': u'/dev/disk/by-id/scsi-0QEMU_QEMU_HARDDISK_disk12', u'host_id': 2, u'resource_uri': u'/api/volume_node/17/', u'id': 17, u'host_label': u'lotus-58vm16.lotus.hpdd.lab.intel.com'}], u'filesystem_type': None, u'label': u'disk12', u'usable_for_lustre': True, u'resource_uri': u'/api/volume/2/', u'id': 2, u'size': u'10737418240'}
{u'status': u'configured-ha', u'kind': u'ZfsPool', u'storage_resource': u'/api/storage_resource/49/', u'volume_nodes': [{u'use': True, u'volume_id': 3, u'primary': True, u'host': u'/api/host/1/', u'path': u'zfs_pool_scsi0QEMU_QEMU_HARDDISK_disk13', u'host_id': 1, u'resource_uri': u'/api/volume_node/4/', u'id': 4, u'host_label': u'lotus-58vm15.lotus.hpdd.lab.intel.com'}, {u'use': False, u'volume_id': 3, u'primary': False, u'host': u'/api/host/3/', u'path': u'zfs_pool_scsi0QEMU_QEMU_HARDDISK_disk13', u'host_id': 3, u'resource_uri': u'/api/volume_node/9/', u'id': 9, u'host_label': u'lotus-58vm17.lotus.hpdd.lab.intel.com'}, {u'use': True, u'volume_id': 3, u'primary': False, u'host': u'/api/host/2/', u'path': u'zfs_pool_scsi0QEMU_QEMU_HARDDISK_disk13', u'host_id': 2, u'resource_uri': u'/api/volume_node/19/', u'id': 19, u'host_label': u'lotus-58vm16.lotus.hpdd.lab.intel.com'}], u'filesystem_type': None, u'label': u'zfs_pool_scsi0QEMU_QEMU_HARDDISK_disk13', u'usable_for_lustre': True, u'resource_uri': u'/api/volume/3/', u'id': 3, u'size': u'10672993730'}
{u'status': u'configured-ha', u'kind': u'SCSI device', u'storage_resource': u'/api/storage_resource/23/', u'volume_nodes': [{u'use': True, u'volume_id': 4, u'primary': False, u'host': u'/api/host/1/', u'path': u'/dev/disk/by-id/scsi-0QEMU_QEMU_HARDDISK_disk14', u'host_id': 1, u'resource_uri': u'/api/volume_node/1/', u'id': 1, u'host_label': u'lotus-58vm15.lotus.hpdd.lab.intel.com'}, {u'use': False, u'volume_id': 4, u'primary': False, u'host': u'/api/host/3/', u'path': u'/dev/disk/by-id/scsi-0QEMU_QEMU_HARDDISK_disk14', u'host_id': 3, u'resource_uri': u'/api/volume_node/6/', u'id': 6, u'host_label': u'lotus-58vm17.lotus.hpdd.lab.intel.com'}, {u'use': False, u'volume_id': 4, u'primary': False, u'host': u'/api/host/4/', u'path': u'/dev/disk/by-id/scsi-0QEMU_QEMU_HARDDISK_disk14', u'host_id': 4, u'resource_uri': u'/api/volume_node/11/', u'id': 11, u'host_label': u'lotus-58vm18.lotus.hpdd.lab.intel.com'}, {u'use': True, u'volume_id': 4, u'primary': True, u'host': u'/api/host/2/', u'path': u'/dev/disk/by-id/scsi-0QEMU_QEMU_HARDDISK_disk14', u'host_id': 2, u'resource_uri': u'/api/volume_node/16/', u'id': 16, u'host_label': u'lotus-58vm16.lotus.hpdd.lab.intel.com'}], u'filesystem_type': None, u'label': u'disk14', u'usable_for_lustre': True, u'resource_uri': u'/api/volume/4/', u'id': 4, u'size': u'10737418240'}
{u'status': u'configured-ha', u'kind': u'ZfsPool', u'storage_resource': u'/api/storage_resource/51/', u'volume_nodes': [{u'use': False, u'volume_id': 5, u'primary': False, u'host': u'/api/host/1/', u'path': u'zfs_pool_scsi0QEMU_QEMU_HARDDISK_disk15', u'host_id': 1, u'resource_uri': u'/api/volume_node/5/', u'id': 5, u'host_label': u'lotus-58vm15.lotus.hpdd.lab.intel.com'}, {u'use': True, u'volume_id': 5, u'primary': False, u'host': u'/api/host/3/', u'path': u'zfs_pool_scsi0QEMU_QEMU_HARDDISK_disk15', u'host_id': 3, u'resource_uri': u'/api/volume_node/10/', u'id': 10, u'host_label': u'lotus-58vm17.lotus.hpdd.lab.intel.com'}, {u'use': True, u'volume_id': 5, u'primary': True, u'host': u'/api/host/4/', u'path': u'zfs_pool_scsi0QEMU_QEMU_HARDDISK_disk15', u'host_id': 4, u'resource_uri': u'/api/volume_node/14/', u'id': 14, u'host_label': u'lotus-58vm18.lotus.hpdd.lab.intel.com'}, {u'use': False, u'volume_id': 5, u'primary': False, u'host': u'/api/host/2/', u'path': u'zfs_pool_scsi0QEMU_QEMU_HARDDISK_disk15', u'host_id': 2, u'resource_uri': u'/api/volume_node/20/', u'id': 20, u'host_label': u'lotus-58vm16.lotus.hpdd.lab.intel.com'}], u'filesystem_type': None, u'label': u'zfs_pool_scsi0QEMU_QEMU_HARDDISK_disk15', u'usable_for_lustre': True, u'resource_uri': u'/api/volume/5/', u'id': 5, u'size': u'10672993730'}
COMMAND 17: FAILED
-----------------------------------------------------------
{u'jobs': [u'/api/job/57/', u'/api/job/58/', u'/api/job/59/', u'/api/job/60/', u'/api/job/61/', u'/api/job/62/', u'/api/job/63/', u'/api/job/64/', u'/api/job/65/', u'/api/job/66/', u'/api/job/67/', u'/api/job/68/', u'/api/job/69/', u'/api/job/70/', u'/api/job/71/', u'/api/job/72/', u'/api/job/73/'], u'complete': True, u'created_at': u'2017-07-29T05:18:15.129969', u'message': u'Creating filesystem testfs', u'cancelled': True, u'errored': True, u'resource_uri': u'/api/command/17/', u'id': 17, u'logs': u'\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n'}

Job 67 Errored (Register testfs-OST0001):
{u'commands': [u'/api/command/17/'], u'write_locks': [{u'locked_item_content_type_id': 91, u'locked_item_id': 4, u'locked_item_uri': u'/api/target/4/', u'resource_uri': u''}], u'description': u'Register testfs-OST0001', u'read_locks': [{u'locked_item_content_type_id': 10, u'locked_item_id': 4, u'locked_item_uri': u'/api/lnet_configuration/4/', u'resource_uri': u''}, {u'locked_item_content_type_id': 93, u'locked_item_id': 1, u'locked_item_uri': u'/api/target/1/', u'resource_uri': u''}, {u'locked_item_content_type_id': 92, u'locked_item_id': 2, u'locked_item_uri': u'/api/target/2/', u'resource_uri': u''}, {u'locked_item_content_type_id': 109, u'locked_item_id': 1, u'locked_item_uri': u'/api/filesystem/1/', u'resource_uri': u''}, {u'locked_item_content_type_id': 10, u'locked_item_id': 4, u'locked_item_uri': u'/api/lnet_configuration/4/', u'resource_uri': u''}, {u'locked_item_content_type_id': 49, u'locked_item_id': 4, u'locked_item_uri': u'/api/pacemaker_configuration/4/', u'resource_uri': u''}, {u'locked_item_content_type_id': 10, u'locked_item_id': 3, u'locked_item_uri': u'/api/lnet_configuration/3/', u'resource_uri': u''}, {u'locked_item_content_type_id': 49, u'locked_item_id': 3, u'locked_item_uri': u'/api/pacemaker_configuration/3/', u'resource_uri': u''}, {u'locked_item_content_type_id': 109, u'locked_item_id': 1, u'locked_item_uri': u'/api/filesystem/1/', u'resource_uri': u''}, {u'locked_item_content_type_id': 10, u'locked_item_id': 4, u'locked_item_uri': u'/api/lnet_configuration/4/', u'resource_uri': u''}, {u'locked_item_content_type_id': 49, u'locked_item_id': 4, u'locked_item_uri': u'/api/pacemaker_configuration/4/', u'resource_uri': u''}, {u'locked_item_content_type_id': 10, u'locked_item_id': 3, u'locked_item_uri': u'/api/lnet_configuration/3/', u'resource_uri': u''}, {u'locked_item_content_type_id': 49, u'locked_item_id': 3, u'locked_item_uri': u'/api/pacemaker_configuration/3/', u'resource_uri': u''}], u'class_name': u'RegisterTargetJob', 
u'step_results': {u'/api/step/122/': None}, u'created_at': u'2017-07-29T05:18:15.395979', u'modified_at': u'2017-07-29T05:18:15.395946', u'available_transitions': [], u'state': u'complete', u'steps': [u'/api/step/122/'], u'cancelled': False, u'errored': True, u'wait_for': [u'/api/job/66/', u'/api/job/59/', u'/api/job/63/'], u'id': 67, u'resource_uri': u'/api/job/67/'}

Step 122 failed:
step_count: 2
console: modprobe osd_zfs: 0


modprobe zfs: 0


mount -t lustre zfs_pool_scsi0QEMU_QEMU_HARDDISK_disk15/testfs-OST0001 /mnt/testfs-OST0001: 108

mount.lustre: mount zfs_pool_scsi0QEMU_QEMU_HARDDISK_disk15/testfs-OST0001 at /mnt/testfs-OST0001 failed: Cannot send after transport endpoint shutdown


description: RegisterTargetStep: {'device_path': u'zfs_pool_scsi0QEMU_QEMU_HARDDISK_disk15/testfs-OST0001', 'primary_host': <ManagedHost: lotus-58vm18.lotus.hpdd.lab.intel.com>, 'mount_point': u'/mnt/testfs-OST0001', 'backfstype': 'zfs', 'target': <ManagedTarget: testfs-OST0001>}
class_name: RegisterTargetStep
backtrace: Traceback (most recent call last):

  File "/usr/lib/python2.7/site-packages/chroma_agent/device_plugins/action_runner.py", line 164, in run

  File "/usr/lib/python2.7/site-packages/chroma_agent/plugin_manager.py", line 305, in run

  File "/usr/lib/python2.7/site-packages/chroma_agent/action_plugins/manage_targets.py", line 282, in register_target

  File "/usr/lib/python2.7/site-packages/chroma_agent/chroma_common/filesystems/filesystem.py", line 77, in mount

  File "/usr/lib/python2.7/site-packages/chroma_agent/lib/shell.py", line 106, in try_run

CommandExecutionError: Error (108) running 'mount -t lustre zfs_pool_scsi0QEMU_QEMU_HARDDISK_disk15/testfs-OST0001 /mnt/testfs-OST0001': '' 'mount.lustre: mount zfs_pool_scsi0QEMU_QEMU_HARDDISK_disk15/testfs-OST0001 at /mnt/testfs-OST0001 failed: Cannot send after transport endpoint shutdown
'

created_at: 2017-07-29T05:21:12.097790
args: {u'device_path': u'zfs_pool_scsi0QEMU_QEMU_HARDDISK_disk15/testfs-OST0001', u'primary_host': u'lotus-58vm18.lotus.hpdd.lab.intel.com', u'mount_point': u'/mnt/testfs-OST0001', u'backfstype': u'zfs', u'target': u'testfs-OST0001'}
modified_at: 2017-07-29T05:21:24.339173
step_index: 0
state: failed
result: 
resource_uri: /api/step/122/
id: 122
log: 


--------------------- >> end captured stdout << ----------------------

@tanabarr
Contributor Author

tanabarr commented Jul 30, 2017

Summary: we've discounted the upgrade-during-test cause, and we think the failure is related to MGS availability/state. We can see that the MGS mount command returns rc=0 and stderr=mount.lustre: increased /sys/block/sdb/queue/max_sectors_kb from 512 to 16384

It should be noted that we don't actually register the MGS (steps are empty for RegisterTargetJob with target=MGS).

As @brianjmurrell has suggested it might be prudent to verify availability of MGS when registering other targets. During RegisterTargetJob.get_deps() we do already verify the MGS is in a mounted state when registering filesystem member targets, but maybe just verifying mounted state is not sufficient.

@brianjmurrell
Contributor

> stderr=mount.lustre: increased /sys/block/sdb/queue/max_sectors_kb from 512 to 16384

This is informational and benign.

> As @brianjmurrell has suggested it might be prudent to verify availability of MGS when registering other targets.

No. That's not what I was saying. We really shouldn't be adding more "sanity checks" (of stuff that should be dependably working) to the process as sanity checks are what make it take so long already.

What I was suggesting was, as a temporary debugging measure to figure out what is going wrong here, to verify that the MGS format/registration/etc. succeeds before we move on to other steps.
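As a rough illustration of that kind of temporary debugging check, the sketch below parses `/proc/mounts`-style text and raises if no Lustre mount exists at a given mount point. The function names and the `/mnt/MGS` mount point are assumptions made for the example, not existing chroma_agent code, and a mounted state alone may still not prove that the MGS is reachable.

```python
def lustre_mounts(mounts_text):
    """Parse /proc/mounts-style text; return {mount_point: device} for lustre.

    Hypothetical debug helper; not part of chroma_agent.
    """
    found = {}
    for line in mounts_text.splitlines():
        fields = line.split()
        # /proc/mounts fields: device, mount point, fstype, options, dump, pass
        if len(fields) >= 3 and fields[2] == "lustre":
            found[fields[1]] = fields[0]
    return found


def assert_target_mounted(mounts_text, mount_point):
    """Raise RuntimeError if no lustre mount exists at mount_point."""
    mounts = lustre_mounts(mounts_text)
    if mount_point not in mounts:
        raise RuntimeError("%s is not mounted as lustre" % mount_point)
    return mounts[mount_point]
```

On the MGS host this could be run against the live table, for example `assert_target_mounted(open("/proc/mounts").read(), "/mnt/MGS")`, just before the other targets are registered, and removed again once the failure is understood.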

@brianjmurrell
Contributor

Another:

test_create_filesystem_with_failover_oss_chroma_controlled (tests.integration.shared_storage_configuration.test_managed_filesystem_with_failover.TestManagedFilesystemWithFailover) ... FAIL
FAIL
Traceback (most recent call last):
  File "/usr/share/chroma-manager/tests/integration/shared_storage_configuration/test_managed_filesystem_with_failover.py", line 134, in test_create_filesystem_with_failover_oss_chroma_controlled
    filesystem_id, volumes_expected_hosts_in_normal_state = self._test_create_filesystem_with_failover()
  File "/usr/share/chroma-manager/tests/integration/shared_storage_configuration/test_managed_filesystem_with_failover.py", line 15, in _test_create_filesystem_with_failover
    filesystem_id = self.create_filesystem_standard(self.TEST_SERVERS)
  File "/usr/share/chroma-manager/tests/integration/core/chroma_integration_testcase.py", line 368, in create_filesystem_standard
    'conf_params': {}})
  File "/usr/share/chroma-manager/tests/integration/core/chroma_integration_testcase.py", line 432, in create_filesystem
    timeout = LONG_TEST_TIMEOUT
  File "/usr/share/chroma-manager/tests/integration/core/api_testcase_with_test_reset.py", line 336, in wait_for_command
    self.assertFalse(command['errored'] or command['cancelled'], command)
AssertionError: {u'jobs': [u'/api/job/59/', u'/api/job/60/', u'/api/job/61/', u'/api/job/62/', u'/api/job/63/', u'/api/job/64/', u'/api/job/65/', u'/api/job/66/', u'/api/job/67/', u'/api/job/68/', u'/api/job/69/', u'/api/job/70/', u'/api/job/71/', u'/api/job/72/', u'/api/job/73/', u'/api/job/74/', u'/api/job/75/'], u'complete': True, u'created_at': u'2017-07-31T03:18:54.967552', u'message': u'Creating filesystem testfs', u'cancelled': True, u'errored': True, u'resource_uri': u'/api/command/19/', u'id': 19, u'logs': u'\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n'}
-------------------- >> begin captured stdout << ---------------------
{u'status': u'configured-ha', u'kind': u'ZfsPool', u'storage_resource': u'/api/storage_resource/47/', u'volume_nodes': [{u'use': True, u'volume_id': 1, u'primary': True, u'host': u'/api/host/3/', u'path': u'zfs_pool_scsi0QEMU_QEMU_HARDDISK_disk11', u'host_id': 3, u'resource_uri': u'/api/volume_node/3/', u'id': 3, u'host_label': u'lotus-55vm17.lotus.hpdd.lab.intel.com'}, {u'use': False, u'volume_id': 1, u'primary': False, u'host': u'/api/host/2/', u'path': u'zfs_pool_scsi0QEMU_QEMU_HARDDISK_disk11', u'host_id': 2, u'resource_uri': u'/api/volume_node/8/', u'id': 8, u'host_label': u'lotus-55vm16.lotus.hpdd.lab.intel.com'}, {u'use': False, u'volume_id': 1, u'primary': False, u'host': u'/api/host/1/', u'path': u'zfs_pool_scsi0QEMU_QEMU_HARDDISK_disk11', u'host_id': 1, u'resource_uri': u'/api/volume_node/13/', u'id': 13, u'host_label': u'lotus-55vm15.lotus.hpdd.lab.intel.com'}, {u'use': True, u'volume_id': 1, u'primary': False, u'host': u'/api/host/4/', u'path': u'zfs_pool_scsi0QEMU_QEMU_HARDDISK_disk11', u'host_id': 4, u'resource_uri': u'/api/volume_node/18/', u'id': 18, u'host_label': u'lotus-55vm18.lotus.hpdd.lab.intel.com'}], u'filesystem_type': None, u'label': u'zfs_pool_scsi0QEMU_QEMU_HARDDISK_disk11', u'usable_for_lustre': True, u'resource_uri': u'/api/volume/1/', u'id': 1, u'size': u'10672993730'}
{u'status': u'configured-ha', u'kind': u'SCSI device', u'storage_resource': u'/api/storage_resource/26/', u'volume_nodes': [{u'use': False, u'volume_id': 2, u'primary': False, u'host': u'/api/host/3/', u'path': u'/dev/disk/by-id/scsi-0QEMU_QEMU_HARDDISK_disk12', u'host_id': 3, u'resource_uri': u'/api/volume_node/2/', u'id': 2, u'host_label': u'lotus-55vm17.lotus.hpdd.lab.intel.com'}, {u'use': True, u'volume_id': 2, u'primary': False, u'host': u'/api/host/2/', u'path': u'/dev/disk/by-id/scsi-0QEMU_QEMU_HARDDISK_disk12', u'host_id': 2, u'resource_uri': u'/api/volume_node/7/', u'id': 7, u'host_label': u'lotus-55vm16.lotus.hpdd.lab.intel.com'}, {u'use': True, u'volume_id': 2, u'primary': True, u'host': u'/api/host/1/', u'path': u'/dev/disk/by-id/scsi-0QEMU_QEMU_HARDDISK_disk12', u'host_id': 1, u'resource_uri': u'/api/volume_node/12/', u'id': 12, u'host_label': u'lotus-55vm15.lotus.hpdd.lab.intel.com'}, {u'use': False, u'volume_id': 2, u'primary': False, u'host': u'/api/host/4/', u'path': u'/dev/disk/by-id/scsi-0QEMU_QEMU_HARDDISK_disk12', u'host_id': 4, u'resource_uri': u'/api/volume_node/17/', u'id': 17, u'host_label': u'lotus-55vm18.lotus.hpdd.lab.intel.com'}], u'filesystem_type': None, u'label': u'disk12', u'usable_for_lustre': True, u'resource_uri': u'/api/volume/2/', u'id': 2, u'size': u'10737418240'}
{u'status': u'configured-ha', u'kind': u'ZfsPool', u'storage_resource': u'/api/storage_resource/49/', u'volume_nodes': [{u'use': False, u'volume_id': 3, u'primary': False, u'host': u'/api/host/3/', u'path': u'zfs_pool_scsi0QEMU_QEMU_HARDDISK_disk15', u'host_id': 3, u'resource_uri': u'/api/volume_node/4/', u'id': 4, u'host_label': u'lotus-55vm17.lotus.hpdd.lab.intel.com'}, {u'use': True, u'volume_id': 3, u'primary': False, u'host': u'/api/host/2/', u'path': u'zfs_pool_scsi0QEMU_QEMU_HARDDISK_disk15', u'host_id': 2, u'resource_uri': u'/api/volume_node/9/', u'id': 9, u'host_label': u'lotus-55vm16.lotus.hpdd.lab.intel.com'}, {u'use': True, u'volume_id': 3, u'primary': True, u'host': u'/api/host/1/', u'path': u'zfs_pool_scsi0QEMU_QEMU_HARDDISK_disk15', u'host_id': 1, u'resource_uri': u'/api/volume_node/14/', u'id': 14, u'host_label': u'lotus-55vm15.lotus.hpdd.lab.intel.com'}], u'filesystem_type': None, u'label': u'zfs_pool_scsi0QEMU_QEMU_HARDDISK_disk15', u'usable_for_lustre': True, u'resource_uri': u'/api/volume/3/', u'id': 3, u'size': u'10672993730'}
{u'status': u'configured-ha', u'kind': u'SCSI device', u'storage_resource': u'/api/storage_resource/23/', u'volume_nodes': [{u'use': False, u'volume_id': 4, u'primary': False, u'host': u'/api/host/3/', u'path': u'/dev/disk/by-id/scsi-0QEMU_QEMU_HARDDISK_disk14', u'host_id': 3, u'resource_uri': u'/api/volume_node/1/', u'id': 1, u'host_label': u'lotus-55vm17.lotus.hpdd.lab.intel.com'}, {u'use': True, u'volume_id': 4, u'primary': True, u'host': u'/api/host/2/', u'path': u'/dev/disk/by-id/scsi-0QEMU_QEMU_HARDDISK_disk14', u'host_id': 2, u'resource_uri': u'/api/volume_node/6/', u'id': 6, u'host_label': u'lotus-55vm16.lotus.hpdd.lab.intel.com'}, {u'use': True, u'volume_id': 4, u'primary': False, u'host': u'/api/host/1/', u'path': u'/dev/disk/by-id/scsi-0QEMU_QEMU_HARDDISK_disk14', u'host_id': 1, u'resource_uri': u'/api/volume_node/11/', u'id': 11, u'host_label': u'lotus-55vm15.lotus.hpdd.lab.intel.com'}, {u'use': False, u'volume_id': 4, u'primary': False, u'host': u'/api/host/4/', u'path': u'/dev/disk/by-id/scsi-0QEMU_QEMU_HARDDISK_disk14', u'host_id': 4, u'resource_uri': u'/api/volume_node/16/', u'id': 16, u'host_label': u'lotus-55vm18.lotus.hpdd.lab.intel.com'}], u'filesystem_type': None, u'label': u'disk14', u'usable_for_lustre': True, u'resource_uri': u'/api/volume/4/', u'id': 4, u'size': u'10737418240'}
{u'status': u'configured-ha', u'kind': u'ZfsPool', u'storage_resource': u'/api/storage_resource/51/', u'volume_nodes': [{u'use': False, u'volume_id': 5, u'primary': False, u'host': u'/api/host/2/', u'path': u'zfs_pool_scsi0QEMU_QEMU_HARDDISK_disk13', u'host_id': 2, u'resource_uri': u'/api/volume_node/10/', u'id': 10, u'host_label': u'lotus-55vm16.lotus.hpdd.lab.intel.com'}, {u'use': False, u'volume_id': 5, u'primary': False, u'host': u'/api/host/1/', u'path': u'zfs_pool_scsi0QEMU_QEMU_HARDDISK_disk13', u'host_id': 1, u'resource_uri': u'/api/volume_node/15/', u'id': 15, u'host_label': u'lotus-55vm15.lotus.hpdd.lab.intel.com'}, {u'use': True, u'volume_id': 5, u'primary': True, u'host': u'/api/host/4/', u'path': u'zfs_pool_scsi0QEMU_QEMU_HARDDISK_disk13', u'host_id': 4, u'resource_uri': u'/api/volume_node/21/', u'id': 21, u'host_label': u'lotus-55vm18.lotus.hpdd.lab.intel.com'}, {u'use': True, u'volume_id': 5, u'primary': False, u'host': u'/api/host/3/', u'path': u'zfs_pool_scsi0QEMU_QEMU_HARDDISK_disk13', u'host_id': 3, u'resource_uri': u'/api/volume_node/22/', u'id': 22, u'host_label': u'lotus-55vm17.lotus.hpdd.lab.intel.com'}], u'filesystem_type': None, u'label': u'zfs_pool_scsi0QEMU_QEMU_HARDDISK_disk13', u'usable_for_lustre': True, u'resource_uri': u'/api/volume/5/', u'id': 5, u'size': u'10672993730'}
{u'status': u'configured-ha', u'kind': u'ZfsPool', u'storage_resource': u'/api/storage_resource/47/', u'volume_nodes': [{u'use': True, u'volume_id': 1, u'primary': True, u'host': u'/api/host/3/', u'path': u'zfs_pool_scsi0QEMU_QEMU_HARDDISK_disk11', u'host_id': 3, u'resource_uri': u'/api/volume_node/3/', u'id': 3, u'host_label': u'lotus-55vm17.lotus.hpdd.lab.intel.com'}, {u'use': False, u'volume_id': 1, u'primary': False, u'host': u'/api/host/2/', u'path': u'zfs_pool_scsi0QEMU_QEMU_HARDDISK_disk11', u'host_id': 2, u'resource_uri': u'/api/volume_node/8/', u'id': 8, u'host_label': u'lotus-55vm16.lotus.hpdd.lab.intel.com'}, {u'use': False, u'volume_id': 1, u'primary': False, u'host': u'/api/host/1/', u'path': u'zfs_pool_scsi0QEMU_QEMU_HARDDISK_disk11', u'host_id': 1, u'resource_uri': u'/api/volume_node/13/', u'id': 13, u'host_label': u'lotus-55vm15.lotus.hpdd.lab.intel.com'}, {u'use': True, u'volume_id': 1, u'primary': False, u'host': u'/api/host/4/', u'path': u'zfs_pool_scsi0QEMU_QEMU_HARDDISK_disk11', u'host_id': 4, u'resource_uri': u'/api/volume_node/18/', u'id': 18, u'host_label': u'lotus-55vm18.lotus.hpdd.lab.intel.com'}], u'filesystem_type': None, u'label': u'zfs_pool_scsi0QEMU_QEMU_HARDDISK_disk11', u'usable_for_lustre': True, u'resource_uri': u'/api/volume/1/', u'id': 1, u'size': u'10672993730'}
{u'status': u'configured-ha', u'kind': u'SCSI device', u'storage_resource': u'/api/storage_resource/26/', u'volume_nodes': [{u'use': False, u'volume_id': 2, u'primary': False, u'host': u'/api/host/3/', u'path': u'/dev/disk/by-id/scsi-0QEMU_QEMU_HARDDISK_disk12', u'host_id': 3, u'resource_uri': u'/api/volume_node/2/', u'id': 2, u'host_label': u'lotus-55vm17.lotus.hpdd.lab.intel.com'}, {u'use': True, u'volume_id': 2, u'primary': False, u'host': u'/api/host/2/', u'path': u'/dev/disk/by-id/scsi-0QEMU_QEMU_HARDDISK_disk12', u'host_id': 2, u'resource_uri': u'/api/volume_node/7/', u'id': 7, u'host_label': u'lotus-55vm16.lotus.hpdd.lab.intel.com'}, {u'use': True, u'volume_id': 2, u'primary': True, u'host': u'/api/host/1/', u'path': u'/dev/disk/by-id/scsi-0QEMU_QEMU_HARDDISK_disk12', u'host_id': 1, u'resource_uri': u'/api/volume_node/12/', u'id': 12, u'host_label': u'lotus-55vm15.lotus.hpdd.lab.intel.com'}, {u'use': False, u'volume_id': 2, u'primary': False, u'host': u'/api/host/4/', u'path': u'/dev/disk/by-id/scsi-0QEMU_QEMU_HARDDISK_disk12', u'host_id': 4, u'resource_uri': u'/api/volume_node/17/', u'id': 17, u'host_label': u'lotus-55vm18.lotus.hpdd.lab.intel.com'}], u'filesystem_type': None, u'label': u'disk12', u'usable_for_lustre': True, u'resource_uri': u'/api/volume/2/', u'id': 2, u'size': u'10737418240'}
{u'status': u'configured-ha', u'kind': u'ZfsPool', u'storage_resource': u'/api/storage_resource/49/', u'volume_nodes': [{u'use': False, u'volume_id': 3, u'primary': False, u'host': u'/api/host/3/', u'path': u'zfs_pool_scsi0QEMU_QEMU_HARDDISK_disk15', u'host_id': 3, u'resource_uri': u'/api/volume_node/4/', u'id': 4, u'host_label': u'lotus-55vm17.lotus.hpdd.lab.intel.com'}, {u'use': True, u'volume_id': 3, u'primary': False, u'host': u'/api/host/2/', u'path': u'zfs_pool_scsi0QEMU_QEMU_HARDDISK_disk15', u'host_id': 2, u'resource_uri': u'/api/volume_node/9/', u'id': 9, u'host_label': u'lotus-55vm16.lotus.hpdd.lab.intel.com'}, {u'use': True, u'volume_id': 3, u'primary': True, u'host': u'/api/host/1/', u'path': u'zfs_pool_scsi0QEMU_QEMU_HARDDISK_disk15', u'host_id': 1, u'resource_uri': u'/api/volume_node/14/', u'id': 14, u'host_label': u'lotus-55vm15.lotus.hpdd.lab.intel.com'}], u'filesystem_type': None, u'label': u'zfs_pool_scsi0QEMU_QEMU_HARDDISK_disk15', u'usable_for_lustre': True, u'resource_uri': u'/api/volume/3/', u'id': 3, u'size': u'10672993730'}
{u'status': u'configured-ha', u'kind': u'SCSI device', u'storage_resource': u'/api/storage_resource/23/', u'volume_nodes': [{u'use': False, u'volume_id': 4, u'primary': False, u'host': u'/api/host/3/', u'path': u'/dev/disk/by-id/scsi-0QEMU_QEMU_HARDDISK_disk14', u'host_id': 3, u'resource_uri': u'/api/volume_node/1/', u'id': 1, u'host_label': u'lotus-55vm17.lotus.hpdd.lab.intel.com'}, {u'use': True, u'volume_id': 4, u'primary': True, u'host': u'/api/host/2/', u'path': u'/dev/disk/by-id/scsi-0QEMU_QEMU_HARDDISK_disk14', u'host_id': 2, u'resource_uri': u'/api/volume_node/6/', u'id': 6, u'host_label': u'lotus-55vm16.lotus.hpdd.lab.intel.com'}, {u'use': True, u'volume_id': 4, u'primary': False, u'host': u'/api/host/1/', u'path': u'/dev/disk/by-id/scsi-0QEMU_QEMU_HARDDISK_disk14', u'host_id': 1, u'resource_uri': u'/api/volume_node/11/', u'id': 11, u'host_label': u'lotus-55vm15.lotus.hpdd.lab.intel.com'}, {u'use': False, u'volume_id': 4, u'primary': False, u'host': u'/api/host/4/', u'path': u'/dev/disk/by-id/scsi-0QEMU_QEMU_HARDDISK_disk14', u'host_id': 4, u'resource_uri': u'/api/volume_node/16/', u'id': 16, u'host_label': u'lotus-55vm18.lotus.hpdd.lab.intel.com'}], u'filesystem_type': None, u'label': u'disk14', u'usable_for_lustre': True, u'resource_uri': u'/api/volume/4/', u'id': 4, u'size': u'10737418240'}
{u'status': u'configured-ha', u'kind': u'ZfsPool', u'storage_resource': u'/api/storage_resource/51/', u'volume_nodes': [{u'use': False, u'volume_id': 5, u'primary': False, u'host': u'/api/host/2/', u'path': u'zfs_pool_scsi0QEMU_QEMU_HARDDISK_disk13', u'host_id': 2, u'resource_uri': u'/api/volume_node/10/', u'id': 10, u'host_label': u'lotus-55vm16.lotus.hpdd.lab.intel.com'}, {u'use': False, u'volume_id': 5, u'primary': False, u'host': u'/api/host/1/', u'path': u'zfs_pool_scsi0QEMU_QEMU_HARDDISK_disk13', u'host_id': 1, u'resource_uri': u'/api/volume_node/15/', u'id': 15, u'host_label': u'lotus-55vm15.lotus.hpdd.lab.intel.com'}, {u'use': True, u'volume_id': 5, u'primary': True, u'host': u'/api/host/4/', u'path': u'zfs_pool_scsi0QEMU_QEMU_HARDDISK_disk13', u'host_id': 4, u'resource_uri': u'/api/volume_node/21/', u'id': 21, u'host_label': u'lotus-55vm18.lotus.hpdd.lab.intel.com'}, {u'use': True, u'volume_id': 5, u'primary': False, u'host': u'/api/host/3/', u'path': u'zfs_pool_scsi0QEMU_QEMU_HARDDISK_disk13', u'host_id': 3, u'resource_uri': u'/api/volume_node/22/', u'id': 22, u'host_label': u'lotus-55vm17.lotus.hpdd.lab.intel.com'}], u'filesystem_type': None, u'label': u'zfs_pool_scsi0QEMU_QEMU_HARDDISK_disk13', u'usable_for_lustre': True, u'resource_uri': u'/api/volume/5/', u'id': 5, u'size': u'10672993730'}
COMMAND 19: FAILED
-----------------------------------------------------------
{u'jobs': [u'/api/job/59/', u'/api/job/60/', u'/api/job/61/', u'/api/job/62/', u'/api/job/63/', u'/api/job/64/', u'/api/job/65/', u'/api/job/66/', u'/api/job/67/', u'/api/job/68/', u'/api/job/69/', u'/api/job/70/', u'/api/job/71/', u'/api/job/72/', u'/api/job/73/', u'/api/job/74/', u'/api/job/75/'], u'complete': True, u'created_at': u'2017-07-31T03:18:54.967552', u'message': u'Creating filesystem testfs', u'cancelled': True, u'errored': True, u'resource_uri': u'/api/command/19/', u'id': 19, u'logs': u'\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n'}

Job 69 Errored (Register testfs-OST0001):
{u'commands': [u'/api/command/19/'], u'write_locks': [{u'locked_item_content_type_id': 91, u'locked_item_id': 4, u'locked_item_uri': u'/api/target/4/', u'resource_uri': u''}], u'description': u'Register testfs-OST0001', u'read_locks': [{u'locked_item_content_type_id': 10, u'locked_item_id': 4, u'locked_item_uri': u'/api/lnet_configuration/4/', u'resource_uri': u''}, {u'locked_item_content_type_id': 93, u'locked_item_id': 1, u'locked_item_uri': u'/api/target/1/', u'resource_uri': u''}, {u'locked_item_content_type_id': 92, u'locked_item_id': 2, u'locked_item_uri': u'/api/target/2/', u'resource_uri': u''}, {u'locked_item_content_type_id': 109, u'locked_item_id': 1, u'locked_item_uri': u'/api/filesystem/1/', u'resource_uri': u''}, {u'locked_item_content_type_id': 10, u'locked_item_id': 4, u'locked_item_uri': u'/api/lnet_configuration/4/', u'resource_uri': u''}, {u'locked_item_content_type_id': 49, u'locked_item_id': 4, u'locked_item_uri': u'/api/pacemaker_configuration/4/', u'resource_uri': u''}, {u'locked_item_content_type_id': 10, u'locked_item_id': 3, u'locked_item_uri': u'/api/lnet_configuration/3/', u'resource_uri': u''}, {u'locked_item_content_type_id': 49, u'locked_item_id': 3, u'locked_item_uri': u'/api/pacemaker_configuration/3/', u'resource_uri': u''}, {u'locked_item_content_type_id': 109, u'locked_item_id': 1, u'locked_item_uri': u'/api/filesystem/1/', u'resource_uri': u''}, {u'locked_item_content_type_id': 10, u'locked_item_id': 4, u'locked_item_uri': u'/api/lnet_configuration/4/', u'resource_uri': u''}, {u'locked_item_content_type_id': 49, u'locked_item_id': 4, u'locked_item_uri': u'/api/pacemaker_configuration/4/', u'resource_uri': u''}, {u'locked_item_content_type_id': 10, u'locked_item_id': 3, u'locked_item_uri': u'/api/lnet_configuration/3/', u'resource_uri': u''}, {u'locked_item_content_type_id': 49, u'locked_item_id': 3, u'locked_item_uri': u'/api/pacemaker_configuration/3/', u'resource_uri': u''}], u'class_name': u'RegisterTargetJob', 
u'step_results': {u'/api/step/124/': None}, u'created_at': u'2017-07-31T03:18:55.253325', u'modified_at': u'2017-07-31T03:18:55.253281', u'available_transitions': [], u'state': u'complete', u'steps': [u'/api/step/124/'], u'cancelled': False, u'errored': True, u'wait_for': [u'/api/job/65/', u'/api/job/68/', u'/api/job/61/'], u'id': 69, u'resource_uri': u'/api/job/69/'}

Step 124 failed:
step_count: 2
console: modprobe osd_zfs: 0


modprobe zfs: 0


mount -t lustre zfs_pool_scsi0QEMU_QEMU_HARDDISK_disk13/testfs-OST0001 /mnt/testfs-OST0001: 108

mount.lustre: mount zfs_pool_scsi0QEMU_QEMU_HARDDISK_disk13/testfs-OST0001 at /mnt/testfs-OST0001 failed: Cannot send after transport endpoint shutdown


description: RegisterTargetStep: {'device_path': u'zfs_pool_scsi0QEMU_QEMU_HARDDISK_disk13/testfs-OST0001', 'primary_host': <ManagedHost: lotus-55vm18.lotus.hpdd.lab.intel.com>, 'mount_point': u'/mnt/testfs-OST0001', 'backfstype': 'zfs', 'target': <ManagedTarget: testfs-OST0001>}
class_name: RegisterTargetStep
backtrace: Traceback (most recent call last):

  File "/usr/lib/python2.7/site-packages/chroma_agent/device_plugins/action_runner.py", line 164, in run

  File "/usr/lib/python2.7/site-packages/chroma_agent/plugin_manager.py", line 305, in run

  File "/usr/lib/python2.7/site-packages/chroma_agent/action_plugins/manage_targets.py", line 282, in register_target

  File "/usr/lib/python2.7/site-packages/chroma_agent/chroma_common/filesystems/filesystem.py", line 77, in mount

  File "/usr/lib/python2.7/site-packages/chroma_agent/lib/shell.py", line 106, in try_run

CommandExecutionError: Error (108) running 'mount -t lustre zfs_pool_scsi0QEMU_QEMU_HARDDISK_disk13/testfs-OST0001 /mnt/testfs-OST0001': '' 'mount.lustre: mount zfs_pool_scsi0QEMU_QEMU_HARDDISK_disk13/testfs-OST0001 at /mnt/testfs-OST0001 failed: Cannot send after transport endpoint shutdown
'

created_at: 2017-07-31T03:22:06.849509
args: {u'device_path': u'zfs_pool_scsi0QEMU_QEMU_HARDDISK_disk13/testfs-OST0001', u'primary_host': u'lotus-55vm18.lotus.hpdd.lab.intel.com', u'mount_point': u'/mnt/testfs-OST0001', u'backfstype': u'zfs', u'target': u'testfs-OST0001'}
modified_at: 2017-07-31T03:22:29.086932
step_index: 0
state: failed
result: 
resource_uri: /api/step/124/
id: 124
log: 


--------------------- >> end captured stdout << ----------------------
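For triage across runs, the exit status can be pulled out of the `CommandExecutionError` text. The sketch below is hypothetical (`parse_mount_failure` and `LINUX_ERRNO_NAMES` are not part of chroma-agent) and assumes the message keeps the `Error (<rc>) running '<command>'` shape seen above. It also assumes the exit status is a Linux errno value: 108 is `ESHUTDOWN` on Linux, whose description is "Cannot send after transport endpoint shutdown", which matches the message here.

```python
import re

# Linux errno numbers of interest. Assumption: the agent runs on Linux;
# errno values differ on other platforms.
LINUX_ERRNO_NAMES = {108: 'ESHUTDOWN'}

_ERROR_RE = re.compile(r"Error \((\d+)\) running '([^']*)'")


def parse_mount_failure(message):
    """Extract rc, errno name and command from a CommandExecutionError string.

    Returns None if the message does not have the expected shape.
    """
    match = _ERROR_RE.search(message)
    if match is None:
        return None
    rc = int(match.group(1))
    return {
        'rc': rc,
        'errno_name': LINUX_ERRNO_NAMES.get(rc, 'unknown'),
        'command': match.group(2),
    }


message = ("Error (108) running 'mount -t lustre "
           "zfs_pool_scsi0QEMU_QEMU_HARDDISK_disk13/testfs-OST0001 "
           "/mnt/testfs-OST0001': '' 'mount.lustre: mount failed'")
print(parse_mount_failure(message))
```

Grouping failures by `errno_name` and `command` would make it easier to confirm that the repeated SSI failures are all the same `ESHUTDOWN` case on an OST registration mount.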

@brianjmurrell
Contributor

Another:

test_create_filesystem_with_failover_mgs (tests.integration.shared_storage_configuration.test_managed_filesystem_with_failover.TestManagedFilesystemWithFailover) ... FAIL
FAIL
Traceback (most recent call last):
  File "/usr/share/chroma-manager/tests/integration/shared_storage_configuration/test_managed_filesystem_with_failover.py", line 45, in test_create_filesystem_with_failover_mgs
    filesystem_id, volumes_expected_hosts_in_normal_state = self._test_create_filesystem_with_failover()
  File "/usr/share/chroma-manager/tests/integration/shared_storage_configuration/test_managed_filesystem_with_failover.py", line 15, in _test_create_filesystem_with_failover
    filesystem_id = self.create_filesystem_standard(self.TEST_SERVERS)
  File "/usr/share/chroma-manager/tests/integration/core/chroma_integration_testcase.py", line 368, in create_filesystem_standard
    'conf_params': {}})
  File "/usr/share/chroma-manager/tests/integration/core/chroma_integration_testcase.py", line 432, in create_filesystem
    timeout = LONG_TEST_TIMEOUT
  File "/usr/share/chroma-manager/tests/integration/core/api_testcase_with_test_reset.py", line 336, in wait_for_command
    self.assertFalse(command['errored'] or command['cancelled'], command)
AssertionError: {u'jobs': [u'/api/job/58/', u'/api/job/59/', u'/api/job/60/', u'/api/job/61/', u'/api/job/62/', u'/api/job/63/', u'/api/job/64/', u'/api/job/65/', u'/api/job/66/', u'/api/job/67/', u'/api/job/68/', u'/api/job/69/', u'/api/job/70/', u'/api/job/71/', u'/api/job/72/', u'/api/job/73/', u'/api/job/74/'], u'complete': True, u'created_at': u'2017-08-02T03:04:01.514840', u'message': u'Creating filesystem testfs', u'cancelled': True, u'errored': True, u'resource_uri': u'/api/command/18/', u'id': 18, u'logs': u'\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n'}
-------------------- >> begin captured stdout << ---------------------
{u'status': u'configured-ha', u'kind': u'ZfsPool', u'storage_resource': u'/api/storage_resource/47/', u'volume_nodes': [{u'use': False, u'volume_id': 1, u'primary': False, u'host': u'/api/host/2/', u'path': u'zfs_pool_scsi0QEMU_QEMU_HARDDISK_disk1', u'host_id': 2, u'resource_uri': u'/api/volume_node/3/', u'id': 3, u'host_label': u'lotus-56vm6.lotus.hpdd.lab.intel.com'}, {u'use': False, u'volume_id': 1, u'primary': False, u'host': u'/api/host/1/', u'path': u'zfs_pool_scsi0QEMU_QEMU_HARDDISK_disk1', u'host_id': 1, u'resource_uri': u'/api/volume_node/8/', u'id': 8, u'host_label': u'lotus-56vm5.lotus.hpdd.lab.intel.com'}, {u'use': True, u'volume_id': 1, u'primary': False, u'host': u'/api/host/4/', u'path': u'zfs_pool_scsi0QEMU_QEMU_HARDDISK_disk1', u'host_id': 4, u'resource_uri': u'/api/volume_node/13/', u'id': 13, u'host_label': u'lotus-56vm8.lotus.hpdd.lab.intel.com'}, {u'use': True, u'volume_id': 1, u'primary': True, u'host': u'/api/host/3/', u'path': u'zfs_pool_scsi0QEMU_QEMU_HARDDISK_disk1', u'host_id': 3, u'resource_uri': u'/api/volume_node/18/', u'id': 18, u'host_label': u'lotus-56vm7.lotus.hpdd.lab.intel.com'}], u'filesystem_type': None, u'label': u'zfs_pool_scsi0QEMU_QEMU_HARDDISK_disk1', u'usable_for_lustre': True, u'resource_uri': u'/api/volume/1/', u'id': 1, u'size': u'10672993730'}
{u'status': u'configured-ha', u'kind': u'SCSI device', u'storage_resource': u'/api/storage_resource/26/', u'volume_nodes': [{u'use': True, u'volume_id': 2, u'primary': True, u'host': u'/api/host/2/', u'path': u'/dev/disk/by-id/scsi-0QEMU_QEMU_HARDDISK_disk4', u'host_id': 2, u'resource_uri': u'/api/volume_node/2/', u'id': 2, u'host_label': u'lotus-56vm6.lotus.hpdd.lab.intel.com'}, {u'use': True, u'volume_id': 2, u'primary': False, u'host': u'/api/host/1/', u'path': u'/dev/disk/by-id/scsi-0QEMU_QEMU_HARDDISK_disk4', u'host_id': 1, u'resource_uri': u'/api/volume_node/7/', u'id': 7, u'host_label': u'lotus-56vm5.lotus.hpdd.lab.intel.com'}, {u'use': False, u'volume_id': 2, u'primary': False, u'host': u'/api/host/4/', u'path': u'/dev/disk/by-id/scsi-0QEMU_QEMU_HARDDISK_disk4', u'host_id': 4, u'resource_uri': u'/api/volume_node/12/', u'id': 12, u'host_label': u'lotus-56vm8.lotus.hpdd.lab.intel.com'}, {u'use': False, u'volume_id': 2, u'primary': False, u'host': u'/api/host/3/', u'path': u'/dev/disk/by-id/scsi-0QEMU_QEMU_HARDDISK_disk4', u'host_id': 3, u'resource_uri': u'/api/volume_node/17/', u'id': 17, u'host_label': u'lotus-56vm7.lotus.hpdd.lab.intel.com'}], u'filesystem_type': None, u'label': u'disk4', u'usable_for_lustre': True, u'resource_uri': u'/api/volume/2/', u'id': 2, u'size': u'10737418240'}
{u'status': u'configured-ha', u'kind': u'ZfsPool', u'storage_resource': u'/api/storage_resource/49/', u'volume_nodes': [{u'use': False, u'volume_id': 3, u'primary': False, u'host': u'/api/host/2/', u'path': u'zfs_pool_scsi0QEMU_QEMU_HARDDISK_disk3', u'host_id': 2, u'resource_uri': u'/api/volume_node/4/', u'id': 4, u'host_label': u'lotus-56vm6.lotus.hpdd.lab.intel.com'}, {u'use': False, u'volume_id': 3, u'primary': False, u'host': u'/api/host/1/', u'path': u'zfs_pool_scsi0QEMU_QEMU_HARDDISK_disk3', u'host_id': 1, u'resource_uri': u'/api/volume_node/9/', u'id': 9, u'host_label': u'lotus-56vm5.lotus.hpdd.lab.intel.com'}, {u'use': True, u'volume_id': 3, u'primary': False, u'host': u'/api/host/3/', u'path': u'zfs_pool_scsi0QEMU_QEMU_HARDDISK_disk3', u'host_id': 3, u'resource_uri': u'/api/volume_node/19/', u'id': 19, u'host_label': u'lotus-56vm7.lotus.hpdd.lab.intel.com'}, {u'use': True, u'volume_id': 3, u'primary': True, u'host': u'/api/host/4/', u'path': u'zfs_pool_scsi0QEMU_QEMU_HARDDISK_disk3', u'host_id': 4, u'resource_uri': u'/api/volume_node/21/', u'id': 21, u'host_label': u'lotus-56vm8.lotus.hpdd.lab.intel.com'}], u'filesystem_type': None, u'label': u'zfs_pool_scsi0QEMU_QEMU_HARDDISK_disk3', u'usable_for_lustre': True, u'resource_uri': u'/api/volume/3/', u'id': 3, u'size': u'10672993730'}
{u'status': u'configured-ha', u'kind': u'ZfsPool', u'storage_resource': u'/api/storage_resource/51/', u'volume_nodes': [{u'use': True, u'volume_id': 4, u'primary': False, u'host': u'/api/host/2/', u'path': u'zfs_pool_scsi0QEMU_QEMU_HARDDISK_disk5', u'host_id': 2, u'resource_uri': u'/api/volume_node/5/', u'id': 5, u'host_label': u'lotus-56vm6.lotus.hpdd.lab.intel.com'}, {u'use': True, u'volume_id': 4, u'primary': True, u'host': u'/api/host/1/', u'path': u'zfs_pool_scsi0QEMU_QEMU_HARDDISK_disk5', u'host_id': 1, u'resource_uri': u'/api/volume_node/10/', u'id': 10, u'host_label': u'lotus-56vm5.lotus.hpdd.lab.intel.com'}, {u'use': False, u'volume_id': 4, u'primary': False, u'host': u'/api/host/3/', u'path': u'zfs_pool_scsi0QEMU_QEMU_HARDDISK_disk5', u'host_id': 3, u'resource_uri': u'/api/volume_node/20/', u'id': 20, u'host_label': u'lotus-56vm7.lotus.hpdd.lab.intel.com'}], u'filesystem_type': None, u'label': u'zfs_pool_scsi0QEMU_QEMU_HARDDISK_disk5', u'usable_for_lustre': True, u'resource_uri': u'/api/volume/4/', u'id': 4, u'size': u'10672993730'}
{u'status': u'configured-ha', u'kind': u'SCSI device', u'storage_resource': u'/api/storage_resource/24/', u'volume_nodes': [{u'use': True, u'volume_id': 5, u'primary': False, u'host': u'/api/host/2/', u'path': u'/dev/disk/by-id/scsi-0QEMU_QEMU_HARDDISK_disk2', u'host_id': 2, u'resource_uri': u'/api/volume_node/1/', u'id': 1, u'host_label': u'lotus-56vm6.lotus.hpdd.lab.intel.com'}, {u'use': True, u'volume_id': 5, u'primary': True, u'host': u'/api/host/1/', u'path': u'/dev/disk/by-id/scsi-0QEMU_QEMU_HARDDISK_disk2', u'host_id': 1, u'resource_uri': u'/api/volume_node/6/', u'id': 6, u'host_label': u'lotus-56vm5.lotus.hpdd.lab.intel.com'}, {u'use': False, u'volume_id': 5, u'primary': False, u'host': u'/api/host/4/', u'path': u'/dev/disk/by-id/scsi-0QEMU_QEMU_HARDDISK_disk2', u'host_id': 4, u'resource_uri': u'/api/volume_node/11/', u'id': 11, u'host_label': u'lotus-56vm8.lotus.hpdd.lab.intel.com'}, {u'use': False, u'volume_id': 5, u'primary': False, u'host': u'/api/host/3/', u'path': u'/dev/disk/by-id/scsi-0QEMU_QEMU_HARDDISK_disk2', u'host_id': 3, u'resource_uri': u'/api/volume_node/16/', u'id': 16, u'host_label': u'lotus-56vm7.lotus.hpdd.lab.intel.com'}], u'filesystem_type': None, u'label': u'disk2', u'usable_for_lustre': True, u'resource_uri': u'/api/volume/5/', u'id': 5, u'size': u'10737418240'}
{u'status': u'configured-ha', u'kind': u'ZfsPool', u'storage_resource': u'/api/storage_resource/47/', u'volume_nodes': [{u'use': False, u'volume_id': 1, u'primary': False, u'host': u'/api/host/2/', u'path': u'zfs_pool_scsi0QEMU_QEMU_HARDDISK_disk1', u'host_id': 2, u'resource_uri': u'/api/volume_node/3/', u'id': 3, u'host_label': u'lotus-56vm6.lotus.hpdd.lab.intel.com'}, {u'use': False, u'volume_id': 1, u'primary': False, u'host': u'/api/host/1/', u'path': u'zfs_pool_scsi0QEMU_QEMU_HARDDISK_disk1', u'host_id': 1, u'resource_uri': u'/api/volume_node/8/', u'id': 8, u'host_label': u'lotus-56vm5.lotus.hpdd.lab.intel.com'}, {u'use': True, u'volume_id': 1, u'primary': False, u'host': u'/api/host/4/', u'path': u'zfs_pool_scsi0QEMU_QEMU_HARDDISK_disk1', u'host_id': 4, u'resource_uri': u'/api/volume_node/13/', u'id': 13, u'host_label': u'lotus-56vm8.lotus.hpdd.lab.intel.com'}, {u'use': True, u'volume_id': 1, u'primary': True, u'host': u'/api/host/3/', u'path': u'zfs_pool_scsi0QEMU_QEMU_HARDDISK_disk1', u'host_id': 3, u'resource_uri': u'/api/volume_node/18/', u'id': 18, u'host_label': u'lotus-56vm7.lotus.hpdd.lab.intel.com'}], u'filesystem_type': None, u'label': u'zfs_pool_scsi0QEMU_QEMU_HARDDISK_disk1', u'usable_for_lustre': True, u'resource_uri': u'/api/volume/1/', u'id': 1, u'size': u'10672993730'}
{u'status': u'configured-ha', u'kind': u'SCSI device', u'storage_resource': u'/api/storage_resource/26/', u'volume_nodes': [{u'use': True, u'volume_id': 2, u'primary': True, u'host': u'/api/host/2/', u'path': u'/dev/disk/by-id/scsi-0QEMU_QEMU_HARDDISK_disk4', u'host_id': 2, u'resource_uri': u'/api/volume_node/2/', u'id': 2, u'host_label': u'lotus-56vm6.lotus.hpdd.lab.intel.com'}, {u'use': True, u'volume_id': 2, u'primary': False, u'host': u'/api/host/1/', u'path': u'/dev/disk/by-id/scsi-0QEMU_QEMU_HARDDISK_disk4', u'host_id': 1, u'resource_uri': u'/api/volume_node/7/', u'id': 7, u'host_label': u'lotus-56vm5.lotus.hpdd.lab.intel.com'}, {u'use': False, u'volume_id': 2, u'primary': False, u'host': u'/api/host/4/', u'path': u'/dev/disk/by-id/scsi-0QEMU_QEMU_HARDDISK_disk4', u'host_id': 4, u'resource_uri': u'/api/volume_node/12/', u'id': 12, u'host_label': u'lotus-56vm8.lotus.hpdd.lab.intel.com'}, {u'use': False, u'volume_id': 2, u'primary': False, u'host': u'/api/host/3/', u'path': u'/dev/disk/by-id/scsi-0QEMU_QEMU_HARDDISK_disk4', u'host_id': 3, u'resource_uri': u'/api/volume_node/17/', u'id': 17, u'host_label': u'lotus-56vm7.lotus.hpdd.lab.intel.com'}], u'filesystem_type': None, u'label': u'disk4', u'usable_for_lustre': True, u'resource_uri': u'/api/volume/2/', u'id': 2, u'size': u'10737418240'}
{u'status': u'configured-ha', u'kind': u'ZfsPool', u'storage_resource': u'/api/storage_resource/49/', u'volume_nodes': [{u'use': False, u'volume_id': 3, u'primary': False, u'host': u'/api/host/2/', u'path': u'zfs_pool_scsi0QEMU_QEMU_HARDDISK_disk3', u'host_id': 2, u'resource_uri': u'/api/volume_node/4/', u'id': 4, u'host_label': u'lotus-56vm6.lotus.hpdd.lab.intel.com'}, {u'use': False, u'volume_id': 3, u'primary': False, u'host': u'/api/host/1/', u'path': u'zfs_pool_scsi0QEMU_QEMU_HARDDISK_disk3', u'host_id': 1, u'resource_uri': u'/api/volume_node/9/', u'id': 9, u'host_label': u'lotus-56vm5.lotus.hpdd.lab.intel.com'}, {u'use': True, u'volume_id': 3, u'primary': False, u'host': u'/api/host/3/', u'path': u'zfs_pool_scsi0QEMU_QEMU_HARDDISK_disk3', u'host_id': 3, u'resource_uri': u'/api/volume_node/19/', u'id': 19, u'host_label': u'lotus-56vm7.lotus.hpdd.lab.intel.com'}, {u'use': True, u'volume_id': 3, u'primary': True, u'host': u'/api/host/4/', u'path': u'zfs_pool_scsi0QEMU_QEMU_HARDDISK_disk3', u'host_id': 4, u'resource_uri': u'/api/volume_node/21/', u'id': 21, u'host_label': u'lotus-56vm8.lotus.hpdd.lab.intel.com'}], u'filesystem_type': None, u'label': u'zfs_pool_scsi0QEMU_QEMU_HARDDISK_disk3', u'usable_for_lustre': True, u'resource_uri': u'/api/volume/3/', u'id': 3, u'size': u'10672993730'}
{u'status': u'configured-ha', u'kind': u'ZfsPool', u'storage_resource': u'/api/storage_resource/51/', u'volume_nodes': [{u'use': True, u'volume_id': 4, u'primary': False, u'host': u'/api/host/2/', u'path': u'zfs_pool_scsi0QEMU_QEMU_HARDDISK_disk5', u'host_id': 2, u'resource_uri': u'/api/volume_node/5/', u'id': 5, u'host_label': u'lotus-56vm6.lotus.hpdd.lab.intel.com'}, {u'use': True, u'volume_id': 4, u'primary': True, u'host': u'/api/host/1/', u'path': u'zfs_pool_scsi0QEMU_QEMU_HARDDISK_disk5', u'host_id': 1, u'resource_uri': u'/api/volume_node/10/', u'id': 10, u'host_label': u'lotus-56vm5.lotus.hpdd.lab.intel.com'}, {u'use': False, u'volume_id': 4, u'primary': False, u'host': u'/api/host/3/', u'path': u'zfs_pool_scsi0QEMU_QEMU_HARDDISK_disk5', u'host_id': 3, u'resource_uri': u'/api/volume_node/20/', u'id': 20, u'host_label': u'lotus-56vm7.lotus.hpdd.lab.intel.com'}], u'filesystem_type': None, u'label': u'zfs_pool_scsi0QEMU_QEMU_HARDDISK_disk5', u'usable_for_lustre': True, u'resource_uri': u'/api/volume/4/', u'id': 4, u'size': u'10672993730'}
{u'status': u'configured-ha', u'kind': u'SCSI device', u'storage_resource': u'/api/storage_resource/24/', u'volume_nodes': [{u'use': True, u'volume_id': 5, u'primary': False, u'host': u'/api/host/2/', u'path': u'/dev/disk/by-id/scsi-0QEMU_QEMU_HARDDISK_disk2', u'host_id': 2, u'resource_uri': u'/api/volume_node/1/', u'id': 1, u'host_label': u'lotus-56vm6.lotus.hpdd.lab.intel.com'}, {u'use': True, u'volume_id': 5, u'primary': True, u'host': u'/api/host/1/', u'path': u'/dev/disk/by-id/scsi-0QEMU_QEMU_HARDDISK_disk2', u'host_id': 1, u'resource_uri': u'/api/volume_node/6/', u'id': 6, u'host_label': u'lotus-56vm5.lotus.hpdd.lab.intel.com'}, {u'use': False, u'volume_id': 5, u'primary': False, u'host': u'/api/host/4/', u'path': u'/dev/disk/by-id/scsi-0QEMU_QEMU_HARDDISK_disk2', u'host_id': 4, u'resource_uri': u'/api/volume_node/11/', u'id': 11, u'host_label': u'lotus-56vm8.lotus.hpdd.lab.intel.com'}, {u'use': False, u'volume_id': 5, u'primary': False, u'host': u'/api/host/3/', u'path': u'/dev/disk/by-id/scsi-0QEMU_QEMU_HARDDISK_disk2', u'host_id': 3, u'resource_uri': u'/api/volume_node/16/', u'id': 16, u'host_label': u'lotus-56vm7.lotus.hpdd.lab.intel.com'}], u'filesystem_type': None, u'label': u'disk2', u'usable_for_lustre': True, u'resource_uri': u'/api/volume/5/', u'id': 5, u'size': u'10737418240'}
COMMAND 18: FAILED
-----------------------------------------------------------
{u'jobs': [u'/api/job/58/', u'/api/job/59/', u'/api/job/60/', u'/api/job/61/', u'/api/job/62/', u'/api/job/63/', u'/api/job/64/', u'/api/job/65/', u'/api/job/66/', u'/api/job/67/', u'/api/job/68/', u'/api/job/69/', u'/api/job/70/', u'/api/job/71/', u'/api/job/72/', u'/api/job/73/', u'/api/job/74/'], u'complete': True, u'created_at': u'2017-08-02T03:04:01.514840', u'message': u'Creating filesystem testfs', u'cancelled': True, u'errored': True, u'resource_uri': u'/api/command/18/', u'id': 18, u'logs': u'\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n'}

Job 68 Errored (Register testfs-OST0001):
{u'commands': [u'/api/command/18/'], u'write_locks': [{u'locked_item_content_type_id': 91, u'locked_item_id': 4, u'locked_item_uri': u'/api/target/4/', u'resource_uri': u''}], u'description': u'Register testfs-OST0001', u'read_locks': [{u'locked_item_content_type_id': 10, u'locked_item_id': 4, u'locked_item_uri': u'/api/lnet_configuration/4/', u'resource_uri': u''}, {u'locked_item_content_type_id': 93, u'locked_item_id': 1, u'locked_item_uri': u'/api/target/1/', u'resource_uri': u''}, {u'locked_item_content_type_id': 92, u'locked_item_id': 2, u'locked_item_uri': u'/api/target/2/', u'resource_uri': u''}, {u'locked_item_content_type_id': 109, u'locked_item_id': 1, u'locked_item_uri': u'/api/filesystem/1/', u'resource_uri': u''}, {u'locked_item_content_type_id': 10, u'locked_item_id': 4, u'locked_item_uri': u'/api/lnet_configuration/4/', u'resource_uri': u''}, {u'locked_item_content_type_id': 49, u'locked_item_id': 4, u'locked_item_uri': u'/api/pacemaker_configuration/4/', u'resource_uri': u''}, {u'locked_item_content_type_id': 10, u'locked_item_id': 3, u'locked_item_uri': u'/api/lnet_configuration/3/', u'resource_uri': u''}, {u'locked_item_content_type_id': 49, u'locked_item_id': 3, u'locked_item_uri': u'/api/pacemaker_configuration/3/', u'resource_uri': u''}, {u'locked_item_content_type_id': 109, u'locked_item_id': 1, u'locked_item_uri': u'/api/filesystem/1/', u'resource_uri': u''}, {u'locked_item_content_type_id': 10, u'locked_item_id': 4, u'locked_item_uri': u'/api/lnet_configuration/4/', u'resource_uri': u''}, {u'locked_item_content_type_id': 49, u'locked_item_id': 4, u'locked_item_uri': u'/api/pacemaker_configuration/4/', u'resource_uri': u''}, {u'locked_item_content_type_id': 10, u'locked_item_id': 3, u'locked_item_uri': u'/api/lnet_configuration/3/', u'resource_uri': u''}, {u'locked_item_content_type_id': 49, u'locked_item_id': 3, u'locked_item_uri': u'/api/pacemaker_configuration/3/', u'resource_uri': u''}], u'class_name': u'RegisterTargetJob', 
u'step_results': {u'/api/step/123/': None}, u'created_at': u'2017-08-02T03:04:01.765676', u'modified_at': u'2017-08-02T03:04:01.765648', u'available_transitions': [], u'state': u'complete', u'steps': [u'/api/step/123/'], u'cancelled': False, u'errored': True, u'wait_for': [u'/api/job/64/', u'/api/job/67/', u'/api/job/60/'], u'id': 68, u'resource_uri': u'/api/job/68/'}

Step 123 failed:
step_count: 2
console: modprobe osd_ldiskfs: 0


mount -t lustre /dev/disk/by-id/scsi-0QEMU_QEMU_HARDDISK_disk2 /mnt/testfs-OST0001: 108

mount.lustre: increased /sys/block/sde/queue/max_sectors_kb from 512 to 16384
mount.lustre: mount /dev/sde at /mnt/testfs-OST0001 failed: Cannot send after transport endpoint shutdown


description: RegisterTargetStep: {'device_path': u'/dev/disk/by-id/scsi-0QEMU_QEMU_HARDDISK_disk2', 'primary_host': <ManagedHost: lotus-56vm8.lotus.hpdd.lab.intel.com>, 'mount_point': u'/mnt/testfs-OST0001', 'backfstype': 'ldiskfs', 'target': <ManagedTarget: testfs-OST0001>}
class_name: RegisterTargetStep
backtrace: Traceback (most recent call last):

  File "/usr/lib/python2.7/site-packages/chroma_agent/device_plugins/action_runner.py", line 164, in run

  File "/usr/lib/python2.7/site-packages/chroma_agent/plugin_manager.py", line 305, in run

  File "/usr/lib/python2.7/site-packages/chroma_agent/action_plugins/manage_targets.py", line 282, in register_target

  File "/usr/lib/python2.7/site-packages/chroma_agent/chroma_common/filesystems/filesystem_ldiskfs.py", line 61, in mount

RuntimeError: Error (108) mounting '/mnt/testfs-OST0001': '' 'mount.lustre: increased /sys/block/sde/queue/max_sectors_kb from 512 to 16384
mount.lustre: mount /dev/sde at /mnt/testfs-OST0001 failed: Cannot send after transport endpoint shutdown
'

created_at: 2017-08-02T03:07:04.184570
args: {u'device_path': u'/dev/disk/by-id/scsi-0QEMU_QEMU_HARDDISK_disk2', u'primary_host': u'lotus-56vm8.lotus.hpdd.lab.intel.com', u'mount_point': u'/mnt/testfs-OST0001', u'backfstype': u'ldiskfs', u'target': u'testfs-OST0001'}
modified_at: 2017-08-02T03:07:10.343301
step_index: 0
state: failed
result: 
resource_uri: /api/step/123/
id: 123
log: 


--------------------- >> end captured stdout << ----------------------

@jgrund
Member

jgrund commented Aug 14, 2017

Another

Step 122 failed:
step_count: 2
console: modprobe osd_ldiskfs: 0


mount -t lustre /dev/disk/by-id/scsi-0QEMU_QEMU_HARDDISK_disk14 /mnt/testfs-OST0001: 108

mount.lustre: increased /sys/block/sdb/queue/max_sectors_kb from 512 to 16384
mount.lustre: mount /dev/sdb at /mnt/testfs-OST0001 failed: Cannot send after transport endpoint shutdown


description: RegisterTargetStep: {'device_path': u'/dev/disk/by-id/scsi-0QEMU_QEMU_HARDDISK_disk14', 'primary_host': <ManagedHost: lotus-48vm18.lotus.hpdd.lab.intel.com>, 'mount_point': u'/mnt/testfs-OST0001', 'backfstype': 'ldiskfs', 'target': <ManagedTarget: testfs-OST0001>}
class_name: RegisterTargetStep
backtrace: Traceback (most recent call last):

  File "/usr/lib/python2.7/site-packages/chroma_agent/device_plugins/action_runner.py", line 164, in run

  File "/usr/lib/python2.7/site-packages/chroma_agent/plugin_manager.py", line 305, in run

  File "/usr/lib/python2.7/site-packages/chroma_agent/action_plugins/manage_targets.py", line 282, in register_target

  File "/usr/lib/python2.7/site-packages/iml_common/filesystems/filesystem_ldiskfs.py", line 61, in mount
    raise RuntimeError("Error (%s) mounting '%s': '%s' '%s'" % (result.rc, mount_point, result.stdout, result.stderr))

RuntimeError: Error (108) mounting '/mnt/testfs-OST0001': '' 'mount.lustre: increased /sys/block/sdb/queue/max_sectors_kb from 512 to 16384
mount.lustre: mount /dev/sdb at /mnt/testfs-OST0001 failed: Cannot send after transport endpoint shutdown
'

created_at: 2017-08-12T05:09:58.451289
args: {u'device_path': u'/dev/disk/by-id/scsi-0QEMU_QEMU_HARDDISK_disk14', u'primary_host': u'lotus-48vm18.lotus.hpdd.lab.intel.com', u'mount_point': u'/mnt/testfs-OST0001', u'backfstype': u'ldiskfs', u'target': u'testfs-OST0001'}
modified_at: 2017-08-12T05:10:04.290235
step_index: 0
state: failed
result: 
resource_uri: /api/step/122/
id: 122
log: 


--------------------- >> end captured stdout << ----------------------
-------------------- >> begin captured logging << --------------------

@jgrund
Member

jgrund commented Aug 14, 2017

This is being tracked in: https://jira.hpdd.intel.com/browse/LU-9838
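
For reference, the return code 108 reported by the failed mount appears to correspond to the Linux errno `ESHUTDOWN`, whose standard message is the "Cannot send after transport endpoint shutdown" string in the output. A minimal sketch for looking this up, assuming a Linux host (errno numbers and message text are platform-specific, and `describe_rc` is a hypothetical helper, not part of chroma-agent):

```python
import errno
import os

# Return code reported by the failed mount step in the logs.
# Assumption: the mount helper exits with an errno-style value.
MOUNT_RC = 108


def describe_rc(rc):
    """Return (symbolic errno name, OS message) for an errno-style code."""
    name = errno.errorcode.get(rc, "UNKNOWN")
    return name, os.strerror(rc)


if __name__ == "__main__":
    # On Linux this is expected to name ESHUTDOWN; other platforms may differ.
    print(describe_rc(MOUNT_RC))
```

This only decodes the number; it says nothing about why the transport endpoint was shut down during target registration.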

@brianjmurrell
Contributor

Another:

test_lnet_reverse_dependencies (tests.integration.shared_storage_configuration.test_lnet_functionality.TestLNetFunctionality) ... ERROR
ERROR
Traceback (most recent call last):
  File "/usr/share/chroma-manager/tests/integration/shared_storage_configuration/test_lnet_functionality.py", line 18, in setUp
    self.create_filesystem_standard(self.TEST_SERVERS)
  File "/usr/share/chroma-manager/tests/integration/core/chroma_integration_testcase.py", line 368, in create_filesystem_standard
    'conf_params': {}})
  File "/usr/share/chroma-manager/tests/integration/core/chroma_integration_testcase.py", line 432, in create_filesystem
    timeout = LONG_TEST_TIMEOUT
  File "/usr/share/chroma-manager/tests/integration/core/api_testcase_with_test_reset.py", line 336, in wait_for_command
    self.assertFalse(command['errored'] or command['cancelled'], command)
AssertionError: {u'jobs': [u'/api/job/58/', u'/api/job/59/', u'/api/job/60/', u'/api/job/61/', u'/api/job/62/', u'/api/job/63/', u'/api/job/64/', u'/api/job/65/', u'/api/job/66/', u'/api/job/67/', u'/api/job/68/', u'/api/job/69/', u'/api/job/70/', u'/api/job/71/', u'/api/job/72/', u'/api/job/73/', u'/api/job/74/'], u'complete': True, u'created_at': u'2017-08-12T09:50:56.617974', u'message': u'Creating filesystem testfs', u'cancelled': True, u'errored': True, u'resource_uri': u'/api/command/18/', u'id': 18, u'logs': u'\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n'}
-------------------- >> begin captured stdout << ---------------------
{u'status': u'configured-ha', u'kind': u'ZfsPool', u'storage_resource': u'/api/storage_resource/47/', u'volume_nodes': [{u'use': True, u'volume_id': 1, u'primary': True, u'host': u'/api/host/4/', u'path': u'zfs_pool_scsi0QEMU_QEMU_HARDDISK_disk3', u'host_id': 4, u'resource_uri': u'/api/volume_node/3/', u'id': 3, u'host_label': u'lotus-57vm8.lotus.hpdd.lab.intel.com'}, {u'use': True, u'volume_id': 1, u'primary': False, u'host': u'/api/host/3/', u'path': u'zfs_pool_scsi0QEMU_QEMU_HARDDISK_disk3', u'host_id': 3, u'resource_uri': u'/api/volume_node/8/', u'id': 8, u'host_label': u'lotus-57vm7.lotus.hpdd.lab.intel.com'}, {u'use': False, u'volume_id': 1, u'primary': False, u'host': u'/api/host/2/', u'path': u'zfs_pool_scsi0QEMU_QEMU_HARDDISK_disk3', u'host_id': 2, u'resource_uri': u'/api/volume_node/13/', u'id': 13, u'host_label': u'lotus-57vm6.lotus.hpdd.lab.intel.com'}, {u'use': False, u'volume_id': 1, u'primary': False, u'host': u'/api/host/1/', u'path': u'zfs_pool_scsi0QEMU_QEMU_HARDDISK_disk3', u'host_id': 1, u'resource_uri': u'/api/volume_node/21/', u'id': 21, u'host_label': u'lotus-57vm5.lotus.hpdd.lab.intel.com'}], u'filesystem_type': None, u'label': u'zfs_pool_scsi0QEMU_QEMU_HARDDISK_disk3', u'usable_for_lustre': True, u'resource_uri': u'/api/volume/1/', u'id': 1, u'size': u'10672993730'}
{u'status': u'configured-ha', u'kind': u'SCSI device', u'storage_resource': u'/api/storage_resource/26/', u'volume_nodes': [{u'use': False, u'volume_id': 2, u'primary': False, u'host': u'/api/host/4/', u'path': u'/dev/disk/by-id/scsi-0QEMU_QEMU_HARDDISK_disk4', u'host_id': 4, u'resource_uri': u'/api/volume_node/2/', u'id': 2, u'host_label': u'lotus-57vm8.lotus.hpdd.lab.intel.com'}, {u'use': False, u'volume_id': 2, u'primary': False, u'host': u'/api/host/3/', u'path': u'/dev/disk/by-id/scsi-0QEMU_QEMU_HARDDISK_disk4', u'host_id': 3, u'resource_uri': u'/api/volume_node/7/', u'id': 7, u'host_label': u'lotus-57vm7.lotus.hpdd.lab.intel.com'}, {u'use': True, u'volume_id': 2, u'primary': True, u'host': u'/api/host/2/', u'path': u'/dev/disk/by-id/scsi-0QEMU_QEMU_HARDDISK_disk4', u'host_id': 2, u'resource_uri': u'/api/volume_node/12/', u'id': 12, u'host_label': u'lotus-57vm6.lotus.hpdd.lab.intel.com'}, {u'use': True, u'volume_id': 2, u'primary': False, u'host': u'/api/host/1/', u'path': u'/dev/disk/by-id/scsi-0QEMU_QEMU_HARDDISK_disk4', u'host_id': 1, u'resource_uri': u'/api/volume_node/17/', u'id': 17, u'host_label': u'lotus-57vm5.lotus.hpdd.lab.intel.com'}], u'filesystem_type': None, u'label': u'disk4', u'usable_for_lustre': True, u'resource_uri': u'/api/volume/2/', u'id': 2, u'size': u'10737418240'}
{u'status': u'configured-ha', u'kind': u'ZfsPool', u'storage_resource': u'/api/storage_resource/49/', u'volume_nodes': [{u'use': False, u'volume_id': 3, u'primary': False, u'host': u'/api/host/4/', u'path': u'zfs_pool_scsi0QEMU_QEMU_HARDDISK_disk5', u'host_id': 4, u'resource_uri': u'/api/volume_node/4/', u'id': 4, u'host_label': u'lotus-57vm8.lotus.hpdd.lab.intel.com'}, {u'use': False, u'volume_id': 3, u'primary': False, u'host': u'/api/host/3/', u'path': u'zfs_pool_scsi0QEMU_QEMU_HARDDISK_disk5', u'host_id': 3, u'resource_uri': u'/api/volume_node/9/', u'id': 9, u'host_label': u'lotus-57vm7.lotus.hpdd.lab.intel.com'}, {u'use': True, u'volume_id': 3, u'primary': False, u'host': u'/api/host/2/', u'path': u'zfs_pool_scsi0QEMU_QEMU_HARDDISK_disk5', u'host_id': 2, u'resource_uri': u'/api/volume_node/14/', u'id': 14, u'host_label': u'lotus-57vm6.lotus.hpdd.lab.intel.com'}, {u'use': True, u'volume_id': 3, u'primary': True, u'host': u'/api/host/1/', u'path': u'zfs_pool_scsi0QEMU_QEMU_HARDDISK_disk5', u'host_id': 1, u'resource_uri': u'/api/volume_node/19/', u'id': 19, u'host_label': u'lotus-57vm5.lotus.hpdd.lab.intel.com'}], u'filesystem_type': None, u'label': u'zfs_pool_scsi0QEMU_QEMU_HARDDISK_disk5', u'usable_for_lustre': True, u'resource_uri': u'/api/volume/3/', u'id': 3, u'size': u'10672993730'}
{u'status': u'configured-ha', u'kind': u'SCSI device', u'storage_resource': u'/api/storage_resource/24/', u'volume_nodes': [{u'use': False, u'volume_id': 5, u'primary': False, u'host': u'/api/host/4/', u'path': u'/dev/disk/by-id/scsi-0QEMU_QEMU_HARDDISK_disk2', u'host_id': 4, u'resource_uri': u'/api/volume_node/1/', u'id': 1, u'host_label': u'lotus-57vm8.lotus.hpdd.lab.intel.com'}, {u'use': False, u'volume_id': 5, u'primary': False, u'host': u'/api/host/3/', u'path': u'/dev/disk/by-id/scsi-0QEMU_QEMU_HARDDISK_disk2', u'host_id': 3, u'resource_uri': u'/api/volume_node/6/', u'id': 6, u'host_label': u'lotus-57vm7.lotus.hpdd.lab.intel.com'}, {u'use': True, u'volume_id': 5, u'primary': False, u'host': u'/api/host/2/', u'path': u'/dev/disk/by-id/scsi-0QEMU_QEMU_HARDDISK_disk2', u'host_id': 2, u'resource_uri': u'/api/volume_node/11/', u'id': 11, u'host_label': u'lotus-57vm6.lotus.hpdd.lab.intel.com'}, {u'use': True, u'volume_id': 5, u'primary': True, u'host': u'/api/host/1/', u'path': u'/dev/disk/by-id/scsi-0QEMU_QEMU_HARDDISK_disk2', u'host_id': 1, u'resource_uri': u'/api/volume_node/16/', u'id': 16, u'host_label': u'lotus-57vm5.lotus.hpdd.lab.intel.com'}], u'filesystem_type': None, u'label': u'disk2', u'usable_for_lustre': True, u'resource_uri': u'/api/volume/5/', u'id': 5, u'size': u'10737418240'}
{u'status': u'configured-noha', u'kind': u'ZfsPool', u'storage_resource': u'/api/storage_resource/84/', u'volume_nodes': [{u'use': True, u'volume_id': 6, u'primary': True, u'host': u'/api/host/3/', u'path': u'zfs_pool_scsi0QEMU_QEMU_HARDDISK_disk1', u'host_id': 3, u'resource_uri': u'/api/volume_node/10/', u'id': 10, u'host_label': u'lotus-57vm7.lotus.hpdd.lab.intel.com'}, {u'use': False, u'volume_id': 6, u'primary': False, u'host': u'/api/host/2/', u'path': u'zfs_pool_scsi0QEMU_QEMU_HARDDISK_disk1', u'host_id': 2, u'resource_uri': u'/api/volume_node/15/', u'id': 15, u'host_label': u'lotus-57vm6.lotus.hpdd.lab.intel.com'}, {u'use': False, u'volume_id': 6, u'primary': False, u'host': u'/api/host/1/', u'path': u'zfs_pool_scsi0QEMU_QEMU_HARDDISK_disk1', u'host_id': 1, u'resource_uri': u'/api/volume_node/20/', u'id': 20, u'host_label': u'lotus-57vm5.lotus.hpdd.lab.intel.com'}], u'filesystem_type': None, u'label': u'zfs_pool_scsi0QEMU_QEMU_HARDDISK_disk1', u'usable_for_lustre': True, u'resource_uri': u'/api/volume/6/', u'id': 6, u'size': u'10672993730'}
{u'status': u'configured-ha', u'kind': u'ZfsPool', u'storage_resource': u'/api/storage_resource/47/', u'volume_nodes': [{u'use': True, u'volume_id': 1, u'primary': True, u'host': u'/api/host/4/', u'path': u'zfs_pool_scsi0QEMU_QEMU_HARDDISK_disk3', u'host_id': 4, u'resource_uri': u'/api/volume_node/3/', u'id': 3, u'host_label': u'lotus-57vm8.lotus.hpdd.lab.intel.com'}, {u'use': True, u'volume_id': 1, u'primary': False, u'host': u'/api/host/3/', u'path': u'zfs_pool_scsi0QEMU_QEMU_HARDDISK_disk3', u'host_id': 3, u'resource_uri': u'/api/volume_node/8/', u'id': 8, u'host_label': u'lotus-57vm7.lotus.hpdd.lab.intel.com'}, {u'use': False, u'volume_id': 1, u'primary': False, u'host': u'/api/host/2/', u'path': u'zfs_pool_scsi0QEMU_QEMU_HARDDISK_disk3', u'host_id': 2, u'resource_uri': u'/api/volume_node/13/', u'id': 13, u'host_label': u'lotus-57vm6.lotus.hpdd.lab.intel.com'}, {u'use': False, u'volume_id': 1, u'primary': False, u'host': u'/api/host/1/', u'path': u'zfs_pool_scsi0QEMU_QEMU_HARDDISK_disk3', u'host_id': 1, u'resource_uri': u'/api/volume_node/21/', u'id': 21, u'host_label': u'lotus-57vm5.lotus.hpdd.lab.intel.com'}], u'filesystem_type': None, u'label': u'zfs_pool_scsi0QEMU_QEMU_HARDDISK_disk3', u'usable_for_lustre': True, u'resource_uri': u'/api/volume/1/', u'id': 1, u'size': u'10672993730'}
{u'status': u'configured-ha', u'kind': u'SCSI device', u'storage_resource': u'/api/storage_resource/26/', u'volume_nodes': [{u'use': False, u'volume_id': 2, u'primary': False, u'host': u'/api/host/4/', u'path': u'/dev/disk/by-id/scsi-0QEMU_QEMU_HARDDISK_disk4', u'host_id': 4, u'resource_uri': u'/api/volume_node/2/', u'id': 2, u'host_label': u'lotus-57vm8.lotus.hpdd.lab.intel.com'}, {u'use': False, u'volume_id': 2, u'primary': False, u'host': u'/api/host/3/', u'path': u'/dev/disk/by-id/scsi-0QEMU_QEMU_HARDDISK_disk4', u'host_id': 3, u'resource_uri': u'/api/volume_node/7/', u'id': 7, u'host_label': u'lotus-57vm7.lotus.hpdd.lab.intel.com'}, {u'use': True, u'volume_id': 2, u'primary': True, u'host': u'/api/host/2/', u'path': u'/dev/disk/by-id/scsi-0QEMU_QEMU_HARDDISK_disk4', u'host_id': 2, u'resource_uri': u'/api/volume_node/12/', u'id': 12, u'host_label': u'lotus-57vm6.lotus.hpdd.lab.intel.com'}, {u'use': True, u'volume_id': 2, u'primary': False, u'host': u'/api/host/1/', u'path': u'/dev/disk/by-id/scsi-0QEMU_QEMU_HARDDISK_disk4', u'host_id': 1, u'resource_uri': u'/api/volume_node/17/', u'id': 17, u'host_label': u'lotus-57vm5.lotus.hpdd.lab.intel.com'}], u'filesystem_type': None, u'label': u'disk4', u'usable_for_lustre': True, u'resource_uri': u'/api/volume/2/', u'id': 2, u'size': u'10737418240'}
{u'status': u'configured-ha', u'kind': u'ZfsPool', u'storage_resource': u'/api/storage_resource/49/', u'volume_nodes': [{u'use': False, u'volume_id': 3, u'primary': False, u'host': u'/api/host/4/', u'path': u'zfs_pool_scsi0QEMU_QEMU_HARDDISK_disk5', u'host_id': 4, u'resource_uri': u'/api/volume_node/4/', u'id': 4, u'host_label': u'lotus-57vm8.lotus.hpdd.lab.intel.com'}, {u'use': False, u'volume_id': 3, u'primary': False, u'host': u'/api/host/3/', u'path': u'zfs_pool_scsi0QEMU_QEMU_HARDDISK_disk5', u'host_id': 3, u'resource_uri': u'/api/volume_node/9/', u'id': 9, u'host_label': u'lotus-57vm7.lotus.hpdd.lab.intel.com'}, {u'use': True, u'volume_id': 3, u'primary': False, u'host': u'/api/host/2/', u'path': u'zfs_pool_scsi0QEMU_QEMU_HARDDISK_disk5', u'host_id': 2, u'resource_uri': u'/api/volume_node/14/', u'id': 14, u'host_label': u'lotus-57vm6.lotus.hpdd.lab.intel.com'}, {u'use': True, u'volume_id': 3, u'primary': True, u'host': u'/api/host/1/', u'path': u'zfs_pool_scsi0QEMU_QEMU_HARDDISK_disk5', u'host_id': 1, u'resource_uri': u'/api/volume_node/19/', u'id': 19, u'host_label': u'lotus-57vm5.lotus.hpdd.lab.intel.com'}], u'filesystem_type': None, u'label': u'zfs_pool_scsi0QEMU_QEMU_HARDDISK_disk5', u'usable_for_lustre': True, u'resource_uri': u'/api/volume/3/', u'id': 3, u'size': u'10672993730'}
{u'status': u'configured-ha', u'kind': u'SCSI device', u'storage_resource': u'/api/storage_resource/24/', u'volume_nodes': [{u'use': False, u'volume_id': 5, u'primary': False, u'host': u'/api/host/4/', u'path': u'/dev/disk/by-id/scsi-0QEMU_QEMU_HARDDISK_disk2', u'host_id': 4, u'resource_uri': u'/api/volume_node/1/', u'id': 1, u'host_label': u'lotus-57vm8.lotus.hpdd.lab.intel.com'}, {u'use': False, u'volume_id': 5, u'primary': False, u'host': u'/api/host/3/', u'path': u'/dev/disk/by-id/scsi-0QEMU_QEMU_HARDDISK_disk2', u'host_id': 3, u'resource_uri': u'/api/volume_node/6/', u'id': 6, u'host_label': u'lotus-57vm7.lotus.hpdd.lab.intel.com'}, {u'use': True, u'volume_id': 5, u'primary': False, u'host': u'/api/host/2/', u'path': u'/dev/disk/by-id/scsi-0QEMU_QEMU_HARDDISK_disk2', u'host_id': 2, u'resource_uri': u'/api/volume_node/11/', u'id': 11, u'host_label': u'lotus-57vm6.lotus.hpdd.lab.intel.com'}, {u'use': True, u'volume_id': 5, u'primary': True, u'host': u'/api/host/1/', u'path': u'/dev/disk/by-id/scsi-0QEMU_QEMU_HARDDISK_disk2', u'host_id': 1, u'resource_uri': u'/api/volume_node/16/', u'id': 16, u'host_label': u'lotus-57vm5.lotus.hpdd.lab.intel.com'}], u'filesystem_type': None, u'label': u'disk2', u'usable_for_lustre': True, u'resource_uri': u'/api/volume/5/', u'id': 5, u'size': u'10737418240'}
{u'status': u'configured-noha', u'kind': u'ZfsPool', u'storage_resource': u'/api/storage_resource/84/', u'volume_nodes': [{u'use': True, u'volume_id': 6, u'primary': True, u'host': u'/api/host/3/', u'path': u'zfs_pool_scsi0QEMU_QEMU_HARDDISK_disk1', u'host_id': 3, u'resource_uri': u'/api/volume_node/10/', u'id': 10, u'host_label': u'lotus-57vm7.lotus.hpdd.lab.intel.com'}, {u'use': False, u'volume_id': 6, u'primary': False, u'host': u'/api/host/2/', u'path': u'zfs_pool_scsi0QEMU_QEMU_HARDDISK_disk1', u'host_id': 2, u'resource_uri': u'/api/volume_node/15/', u'id': 15, u'host_label': u'lotus-57vm6.lotus.hpdd.lab.intel.com'}, {u'use': False, u'volume_id': 6, u'primary': False, u'host': u'/api/host/1/', u'path': u'zfs_pool_scsi0QEMU_QEMU_HARDDISK_disk1', u'host_id': 1, u'resource_uri': u'/api/volume_node/20/', u'id': 20, u'host_label': u'lotus-57vm5.lotus.hpdd.lab.intel.com'}], u'filesystem_type': None, u'label': u'zfs_pool_scsi0QEMU_QEMU_HARDDISK_disk1', u'usable_for_lustre': True, u'resource_uri': u'/api/volume/6/', u'id': 6, u'size': u'10672993730'}
COMMAND 18: FAILED
-----------------------------------------------------------
{u'jobs': [u'/api/job/58/', u'/api/job/59/', u'/api/job/60/', u'/api/job/61/', u'/api/job/62/', u'/api/job/63/', u'/api/job/64/', u'/api/job/65/', u'/api/job/66/', u'/api/job/67/', u'/api/job/68/', u'/api/job/69/', u'/api/job/70/', u'/api/job/71/', u'/api/job/72/', u'/api/job/73/', u'/api/job/74/'], u'complete': True, u'created_at': u'2017-08-12T09:50:56.617974', u'message': u'Creating filesystem testfs', u'cancelled': True, u'errored': True, u'resource_uri': u'/api/command/18/', u'id': 18, u'logs': u'\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n'}

Job 68 Errored (Register testfs-OST0001):
{u'commands': [u'/api/command/18/'], u'write_locks': [{u'locked_item_content_type_id': 91, u'locked_item_id': 4, u'locked_item_uri': u'/api/target/4/', u'resource_uri': u''}], u'description': u'Register testfs-OST0001', u'read_locks': [{u'locked_item_content_type_id': 10, u'locked_item_id': 4, u'locked_item_uri': u'/api/lnet_configuration/4/', u'resource_uri': u''}, {u'locked_item_content_type_id': 93, u'locked_item_id': 1, u'locked_item_uri': u'/api/target/1/', u'resource_uri': u''}, {u'locked_item_content_type_id': 92, u'locked_item_id': 2, u'locked_item_uri': u'/api/target/2/', u'resource_uri': u''}, {u'locked_item_content_type_id': 109, u'locked_item_id': 1, u'locked_item_uri': u'/api/filesystem/1/', u'resource_uri': u''}, {u'locked_item_content_type_id': 10, u'locked_item_id': 4, u'locked_item_uri': u'/api/lnet_configuration/4/', u'resource_uri': u''}, {u'locked_item_content_type_id': 49, u'locked_item_id': 4, u'locked_item_uri': u'/api/pacemaker_configuration/4/', u'resource_uri': u''}, {u'locked_item_content_type_id': 10, u'locked_item_id': 3, u'locked_item_uri': u'/api/lnet_configuration/3/', u'resource_uri': u''}, {u'locked_item_content_type_id': 49, u'locked_item_id': 3, u'locked_item_uri': u'/api/pacemaker_configuration/3/', u'resource_uri': u''}, {u'locked_item_content_type_id': 109, u'locked_item_id': 1, u'locked_item_uri': u'/api/filesystem/1/', u'resource_uri': u''}, {u'locked_item_content_type_id': 10, u'locked_item_id': 4, u'locked_item_uri': u'/api/lnet_configuration/4/', u'resource_uri': u''}, {u'locked_item_content_type_id': 49, u'locked_item_id': 4, u'locked_item_uri': u'/api/pacemaker_configuration/4/', u'resource_uri': u''}, {u'locked_item_content_type_id': 10, u'locked_item_id': 3, u'locked_item_uri': u'/api/lnet_configuration/3/', u'resource_uri': u''}, {u'locked_item_content_type_id': 49, u'locked_item_id': 3, u'locked_item_uri': u'/api/pacemaker_configuration/3/', u'resource_uri': u''}], u'class_name': u'RegisterTargetJob', 
u'step_results': {u'/api/step/124/': None}, u'created_at': u'2017-08-12T09:50:56.904910', u'modified_at': u'2017-08-12T09:50:56.904879', u'available_transitions': [], u'state': u'complete', u'steps': [u'/api/step/124/'], u'cancelled': False, u'errored': True, u'wait_for': [u'/api/job/64/', u'/api/job/67/', u'/api/job/60/'], u'id': 68, u'resource_uri': u'/api/job/68/'}

Step 124 failed:
step_count: 2
console: modprobe osd_ldiskfs: 0


mount -t lustre /dev/disk/by-id/scsi-0QEMU_QEMU_HARDDISK_disk2 /mnt/testfs-OST0001: 108

mount.lustre: increased /sys/block/sde/queue/max_sectors_kb from 512 to 16384
mount.lustre: mount /dev/sde at /mnt/testfs-OST0001 failed: Cannot send after transport endpoint shutdown


description: RegisterTargetStep: {'device_path': u'/dev/disk/by-id/scsi-0QEMU_QEMU_HARDDISK_disk2', 'primary_host': <ManagedHost: lotus-57vm8.lotus.hpdd.lab.intel.com>, 'mount_point': u'/mnt/testfs-OST0001', 'backfstype': 'ldiskfs', 'target': <ManagedTarget: testfs-OST0001>}
class_name: RegisterTargetStep
backtrace: Traceback (most recent call last):

  File "/usr/lib/python2.7/site-packages/chroma_agent/device_plugins/action_runner.py", line 164, in run

  File "/usr/lib/python2.7/site-packages/chroma_agent/plugin_manager.py", line 305, in run

  File "/usr/lib/python2.7/site-packages/chroma_agent/action_plugins/manage_targets.py", line 282, in register_target

  File "/usr/lib/python2.7/site-packages/iml_common/filesystems/filesystem_ldiskfs.py", line 61, in mount
    raise RuntimeError("Error (%s) mounting '%s': '%s' '%s'" % (result.rc, mount_point, result.stdout, result.stderr))

RuntimeError: Error (108) mounting '/mnt/testfs-OST0001': '' 'mount.lustre: increased /sys/block/sde/queue/max_sectors_kb from 512 to 16384
mount.lustre: mount /dev/sde at /mnt/testfs-OST0001 failed: Cannot send after transport endpoint shutdown
'

created_at: 2017-08-12T09:52:50.965767
args: {u'device_path': u'/dev/disk/by-id/scsi-0QEMU_QEMU_HARDDISK_disk2', u'primary_host': u'lotus-57vm8.lotus.hpdd.lab.intel.com', u'mount_point': u'/mnt/testfs-OST0001', u'backfstype': u'ldiskfs', u'target': u'testfs-OST0001'}
modified_at: 2017-08-12T09:53:27.233453
step_index: 0
state: failed
result: 
resource_uri: /api/step/124/
id: 124
log: 


--------------------- >> end captured stdout << ----------------------
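
The failed steps above all contain the same `mount.lustre: mount ... failed:` line, differing only in the block device. As a rough sketch for spotting that line in captured console output, the following pulls out the device, mount point and reason. The function name and the regular expression are illustrative assumptions based only on the log lines quoted in this thread, not on any documented mount.lustre output format:

```python
import re

# Pattern inferred from the failure lines quoted in this thread (an assumption).
MOUNT_FAILURE = re.compile(
    r"^mount\.lustre: mount (?P<device>\S+) at (?P<mount_point>\S+) "
    r"failed: (?P<reason>.+)$",
    re.MULTILINE,
)


def find_mount_failures(console_text):
    """Return one dict per mount.lustre failure line found in console_text."""
    return [m.groupdict() for m in MOUNT_FAILURE.finditer(console_text)]


# Console text copied from one of the failed steps above.
SAMPLE = (
    "mount.lustre: increased /sys/block/sde/queue/max_sectors_kb from 512 to 16384\n"
    "mount.lustre: mount /dev/sde at /mnt/testfs-OST0001 failed: "
    "Cannot send after transport endpoint shutdown\n"
)

if __name__ == "__main__":
    for failure in find_mount_failures(SAMPLE):
        print(failure)
```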

@brianjmurrell
Contributor

Another:

test_lnet_reverse_dependencies (tests.integration.shared_storage_configuration.test_lnet_functionality.TestLNetFunctionality) ... ERROR
ERROR
Traceback (most recent call last):
  File "/usr/share/chroma-manager/tests/integration/shared_storage_configuration/test_lnet_functionality.py", line 18, in setUp
    self.create_filesystem_standard(self.TEST_SERVERS)
  File "/usr/share/chroma-manager/tests/integration/core/chroma_integration_testcase.py", line 368, in create_filesystem_standard
    'conf_params': {}})
  File "/usr/share/chroma-manager/tests/integration/core/chroma_integration_testcase.py", line 432, in create_filesystem
    timeout = LONG_TEST_TIMEOUT
  File "/usr/share/chroma-manager/tests/integration/core/api_testcase_with_test_reset.py", line 336, in wait_for_command
    self.assertFalse(command['errored'] or command['cancelled'], command)
AssertionError: {u'jobs': [u'/api/job/57/', u'/api/job/58/', u'/api/job/59/', u'/api/job/60/', u'/api/job/61/', u'/api/job/62/', u'/api/job/63/', u'/api/job/64/', u'/api/job/65/', u'/api/job/66/', u'/api/job/67/', u'/api/job/68/', u'/api/job/69/', u'/api/job/70/', u'/api/job/71/', u'/api/job/72/', u'/api/job/73/'], u'complete': True, u'created_at': u'2017-08-12T13:25:05.140326', u'message': u'Creating filesystem testfs', u'cancelled': True, u'errored': True, u'resource_uri': u'/api/command/17/', u'id': 17, u'logs': u'\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n'}
-------------------- >> begin captured stdout << ---------------------
{u'status': u'configured-ha', u'kind': u'ZfsPool', u'storage_resource': u'/api/storage_resource/47/', u'volume_nodes': [{u'use': False, u'volume_id': 1, u'primary': False, u'host': u'/api/host/4/', u'path': u'zfs_pool_scsi0QEMU_QEMU_HARDDISK_disk15', u'host_id': 4, u'resource_uri': u'/api/volume_node/3/', u'id': 3, u'host_label': u'lotus-58vm18.lotus.hpdd.lab.intel.com'}, {u'use': True, u'volume_id': 1, u'primary': True, u'host': u'/api/host/1/', u'path': u'zfs_pool_scsi0QEMU_QEMU_HARDDISK_disk15', u'host_id': 1, u'resource_uri': u'/api/volume_node/8/', u'id': 8, u'host_label': u'lotus-58vm15.lotus.hpdd.lab.intel.com'}, {u'use': False, u'volume_id': 1, u'primary': False, u'host': u'/api/host/3/', u'path': u'zfs_pool_scsi0QEMU_QEMU_HARDDISK_disk15', u'host_id': 3, u'resource_uri': u'/api/volume_node/13/', u'id': 13, u'host_label': u'lotus-58vm17.lotus.hpdd.lab.intel.com'}, {u'use': True, u'volume_id': 1, u'primary': False, u'host': u'/api/host/2/', u'path': u'zfs_pool_scsi0QEMU_QEMU_HARDDISK_disk15', u'host_id': 2, u'resource_uri': u'/api/volume_node/18/', u'id': 18, u'host_label': u'lotus-58vm16.lotus.hpdd.lab.intel.com'}], u'filesystem_type': None, u'label': u'zfs_pool_scsi0QEMU_QEMU_HARDDISK_disk15', u'usable_for_lustre': True, u'resource_uri': u'/api/volume/1/', u'id': 1, u'size': u'10672993730'}
{u'status': u'configured-ha', u'kind': u'SCSI device', u'storage_resource': u'/api/storage_resource/26/', u'volume_nodes': [{u'use': False, u'volume_id': 2, u'primary': False, u'host': u'/api/host/4/', u'path': u'/dev/disk/by-id/scsi-0QEMU_QEMU_HARDDISK_disk12', u'host_id': 4, u'resource_uri': u'/api/volume_node/2/', u'id': 2, u'host_label': u'lotus-58vm18.lotus.hpdd.lab.intel.com'}, {u'use': True, u'volume_id': 2, u'primary': True, u'host': u'/api/host/1/', u'path': u'/dev/disk/by-id/scsi-0QEMU_QEMU_HARDDISK_disk12', u'host_id': 1, u'resource_uri': u'/api/volume_node/7/', u'id': 7, u'host_label': u'lotus-58vm15.lotus.hpdd.lab.intel.com'}, {u'use': False, u'volume_id': 2, u'primary': False, u'host': u'/api/host/3/', u'path': u'/dev/disk/by-id/scsi-0QEMU_QEMU_HARDDISK_disk12', u'host_id': 3, u'resource_uri': u'/api/volume_node/12/', u'id': 12, u'host_label': u'lotus-58vm17.lotus.hpdd.lab.intel.com'}, {u'use': True, u'volume_id': 2, u'primary': False, u'host': u'/api/host/2/', u'path': u'/dev/disk/by-id/scsi-0QEMU_QEMU_HARDDISK_disk12', u'host_id': 2, u'resource_uri': u'/api/volume_node/17/', u'id': 17, u'host_label': u'lotus-58vm16.lotus.hpdd.lab.intel.com'}], u'filesystem_type': None, u'label': u'disk12', u'usable_for_lustre': True, u'resource_uri': u'/api/volume/2/', u'id': 2, u'size': u'10737418240'}
{u'status': u'configured-ha', u'kind': u'ZfsPool', u'storage_resource': u'/api/storage_resource/49/', u'volume_nodes': [{u'use': True, u'volume_id': 3, u'primary': True, u'host': u'/api/host/4/', u'path': u'zfs_pool_scsi0QEMU_QEMU_HARDDISK_disk13', u'host_id': 4, u'resource_uri': u'/api/volume_node/4/', u'id': 4, u'host_label': u'lotus-58vm18.lotus.hpdd.lab.intel.com'}, {u'use': False, u'volume_id': 3, u'primary': False, u'host': u'/api/host/1/', u'path': u'zfs_pool_scsi0QEMU_QEMU_HARDDISK_disk13', u'host_id': 1, u'resource_uri': u'/api/volume_node/9/', u'id': 9, u'host_label': u'lotus-58vm15.lotus.hpdd.lab.intel.com'}, {u'use': True, u'volume_id': 3, u'primary': False, u'host': u'/api/host/3/', u'path': u'zfs_pool_scsi0QEMU_QEMU_HARDDISK_disk13', u'host_id': 3, u'resource_uri': u'/api/volume_node/14/', u'id': 14, u'host_label': u'lotus-58vm17.lotus.hpdd.lab.intel.com'}, {u'use': False, u'volume_id': 3, u'primary': False, u'host': u'/api/host/2/', u'path': u'zfs_pool_scsi0QEMU_QEMU_HARDDISK_disk13', u'host_id': 2, u'resource_uri': u'/api/volume_node/19/', u'id': 19, u'host_label': u'lotus-58vm16.lotus.hpdd.lab.intel.com'}], u'filesystem_type': None, u'label': u'zfs_pool_scsi0QEMU_QEMU_HARDDISK_disk13', u'usable_for_lustre': True, u'resource_uri': u'/api/volume/3/', u'id': 3, u'size': u'10672993730'}
{u'status': u'configured-ha', u'kind': u'SCSI device', u'storage_resource': u'/api/storage_resource/23/', u'volume_nodes': [{u'use': False, u'volume_id': 4, u'primary': False, u'host': u'/api/host/4/', u'path': u'/dev/disk/by-id/scsi-0QEMU_QEMU_HARDDISK_disk14', u'host_id': 4, u'resource_uri': u'/api/volume_node/1/', u'id': 1, u'host_label': u'lotus-58vm18.lotus.hpdd.lab.intel.com'}, {u'use': True, u'volume_id': 4, u'primary': False, u'host': u'/api/host/1/', u'path': u'/dev/disk/by-id/scsi-0QEMU_QEMU_HARDDISK_disk14', u'host_id': 1, u'resource_uri': u'/api/volume_node/6/', u'id': 6, u'host_label': u'lotus-58vm15.lotus.hpdd.lab.intel.com'}, {u'use': False, u'volume_id': 4, u'primary': False, u'host': u'/api/host/3/', u'path': u'/dev/disk/by-id/scsi-0QEMU_QEMU_HARDDISK_disk14', u'host_id': 3, u'resource_uri': u'/api/volume_node/11/', u'id': 11, u'host_label': u'lotus-58vm17.lotus.hpdd.lab.intel.com'}, {u'use': True, u'volume_id': 4, u'primary': True, u'host': u'/api/host/2/', u'path': u'/dev/disk/by-id/scsi-0QEMU_QEMU_HARDDISK_disk14', u'host_id': 2, u'resource_uri': u'/api/volume_node/16/', u'id': 16, u'host_label': u'lotus-58vm16.lotus.hpdd.lab.intel.com'}], u'filesystem_type': None, u'label': u'disk14', u'usable_for_lustre': True, u'resource_uri': u'/api/volume/4/', u'id': 4, u'size': u'10737418240'}
{u'status': u'configured-noha', u'kind': u'ZfsPool', u'storage_resource': u'/api/storage_resource/84/', u'volume_nodes': [{u'use': False, u'volume_id': 6, u'primary': False, u'host': u'/api/host/1/', u'path': u'zfs_pool_scsi0QEMU_QEMU_HARDDISK_disk11', u'host_id': 1, u'resource_uri': u'/api/volume_node/10/', u'id': 10, u'host_label': u'lotus-58vm15.lotus.hpdd.lab.intel.com'}, {u'use': True, u'volume_id': 6, u'primary': True, u'host': u'/api/host/3/', u'path': u'zfs_pool_scsi0QEMU_QEMU_HARDDISK_disk11', u'host_id': 3, u'resource_uri': u'/api/volume_node/15/', u'id': 15, u'host_label': u'lotus-58vm17.lotus.hpdd.lab.intel.com'}, {u'use': False, u'volume_id': 6, u'primary': False, u'host': u'/api/host/2/', u'path': u'zfs_pool_scsi0QEMU_QEMU_HARDDISK_disk11', u'host_id': 2, u'resource_uri': u'/api/volume_node/20/', u'id': 20, u'host_label': u'lotus-58vm16.lotus.hpdd.lab.intel.com'}], u'filesystem_type': None, u'label': u'zfs_pool_scsi0QEMU_QEMU_HARDDISK_disk11', u'usable_for_lustre': True, u'resource_uri': u'/api/volume/6/', u'id': 6, u'size': u'10672993730'}
{u'status': u'configured-ha', u'kind': u'ZfsPool', u'storage_resource': u'/api/storage_resource/47/', u'volume_nodes': [{u'use': False, u'volume_id': 1, u'primary': False, u'host': u'/api/host/4/', u'path': u'zfs_pool_scsi0QEMU_QEMU_HARDDISK_disk15', u'host_id': 4, u'resource_uri': u'/api/volume_node/3/', u'id': 3, u'host_label': u'lotus-58vm18.lotus.hpdd.lab.intel.com'}, {u'use': True, u'volume_id': 1, u'primary': True, u'host': u'/api/host/1/', u'path': u'zfs_pool_scsi0QEMU_QEMU_HARDDISK_disk15', u'host_id': 1, u'resource_uri': u'/api/volume_node/8/', u'id': 8, u'host_label': u'lotus-58vm15.lotus.hpdd.lab.intel.com'}, {u'use': False, u'volume_id': 1, u'primary': False, u'host': u'/api/host/3/', u'path': u'zfs_pool_scsi0QEMU_QEMU_HARDDISK_disk15', u'host_id': 3, u'resource_uri': u'/api/volume_node/13/', u'id': 13, u'host_label': u'lotus-58vm17.lotus.hpdd.lab.intel.com'}, {u'use': True, u'volume_id': 1, u'primary': False, u'host': u'/api/host/2/', u'path': u'zfs_pool_scsi0QEMU_QEMU_HARDDISK_disk15', u'host_id': 2, u'resource_uri': u'/api/volume_node/18/', u'id': 18, u'host_label': u'lotus-58vm16.lotus.hpdd.lab.intel.com'}], u'filesystem_type': None, u'label': u'zfs_pool_scsi0QEMU_QEMU_HARDDISK_disk15', u'usable_for_lustre': True, u'resource_uri': u'/api/volume/1/', u'id': 1, u'size': u'10672993730'}
COMMAND 17: FAILED
-----------------------------------------------------------
{u'jobs': [u'/api/job/57/', u'/api/job/58/', u'/api/job/59/', u'/api/job/60/', u'/api/job/61/', u'/api/job/62/', u'/api/job/63/', u'/api/job/64/', u'/api/job/65/', u'/api/job/66/', u'/api/job/67/', u'/api/job/68/', u'/api/job/69/', u'/api/job/70/', u'/api/job/71/', u'/api/job/72/', u'/api/job/73/'], u'complete': True, u'created_at': u'2017-08-12T13:25:05.140326', u'message': u'Creating filesystem testfs', u'cancelled': True, u'errored': True, u'resource_uri': u'/api/command/17/', u'id': 17, u'logs': u'\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n'}

Job 67 Errored (Register testfs-OST0001):
{u'commands': [u'/api/command/17/'], u'write_locks': [{u'locked_item_content_type_id': 91, u'locked_item_id': 4, u'locked_item_uri': u'/api/target/4/', u'resource_uri': u''}], u'description': u'Register testfs-OST0001', u'read_locks': [{u'locked_item_content_type_id': 10, u'locked_item_id': 4, u'locked_item_uri': u'/api/lnet_configuration/4/', u'resource_uri': u''}, {u'locked_item_content_type_id': 93, u'locked_item_id': 1, u'locked_item_uri': u'/api/target/1/', u'resource_uri': u''}, {u'locked_item_content_type_id': 92, u'locked_item_id': 2, u'locked_item_uri': u'/api/target/2/', u'resource_uri': u''}, {u'locked_item_content_type_id': 109, u'locked_item_id': 1, u'locked_item_uri': u'/api/filesystem/1/', u'resource_uri': u''}, {u'locked_item_content_type_id': 10, u'locked_item_id': 4, u'locked_item_uri': u'/api/lnet_configuration/4/', u'resource_uri': u''}, {u'locked_item_content_type_id': 49, u'locked_item_id': 4, u'locked_item_uri': u'/api/pacemaker_configuration/4/', u'resource_uri': u''}, {u'locked_item_content_type_id': 10, u'locked_item_id': 3, u'locked_item_uri': u'/api/lnet_configuration/3/', u'resource_uri': u''}, {u'locked_item_content_type_id': 49, u'locked_item_id': 3, u'locked_item_uri': u'/api/pacemaker_configuration/3/', u'resource_uri': u''}, {u'locked_item_content_type_id': 109, u'locked_item_id': 1, u'locked_item_uri': u'/api/filesystem/1/', u'resource_uri': u''}, {u'locked_item_content_type_id': 10, u'locked_item_id': 4, u'locked_item_uri': u'/api/lnet_configuration/4/', u'resource_uri': u''}, {u'locked_item_content_type_id': 49, u'locked_item_id': 4, u'locked_item_uri': u'/api/pacemaker_configuration/4/', u'resource_uri': u''}, {u'locked_item_content_type_id': 10, u'locked_item_id': 3, u'locked_item_uri': u'/api/lnet_configuration/3/', u'resource_uri': u''}, {u'locked_item_content_type_id': 49, u'locked_item_id': 3, u'locked_item_uri': u'/api/pacemaker_configuration/3/', u'resource_uri': u''}], u'class_name': u'RegisterTargetJob', 
u'step_results': {u'/api/step/122/': None}, u'created_at': u'2017-08-12T13:25:05.424552', u'modified_at': u'2017-08-12T13:25:05.424523', u'available_transitions': [], u'state': u'complete', u'steps': [u'/api/step/122/'], u'cancelled': False, u'errored': True, u'wait_for': [u'/api/job/66/', u'/api/job/59/', u'/api/job/63/'], u'id': 67, u'resource_uri': u'/api/job/67/'}

Step 122 failed:
step_count: 2
console: modprobe osd_ldiskfs: 0


mount -t lustre /dev/disk/by-id/scsi-0QEMU_QEMU_HARDDISK_disk14 /mnt/testfs-OST0001: 108

mount.lustre: increased /sys/block/sdb/queue/max_sectors_kb from 512 to 16384
mount.lustre: mount /dev/sdb at /mnt/testfs-OST0001 failed: Cannot send after transport endpoint shutdown


description: RegisterTargetStep: {'device_path': u'/dev/disk/by-id/scsi-0QEMU_QEMU_HARDDISK_disk14', 'primary_host': <ManagedHost: lotus-58vm18.lotus.hpdd.lab.intel.com>, 'mount_point': u'/mnt/testfs-OST0001', 'backfstype': 'ldiskfs', 'target': <ManagedTarget: testfs-OST0001>}
class_name: RegisterTargetStep
backtrace: Traceback (most recent call last):

  File "/usr/lib/python2.7/site-packages/chroma_agent/device_plugins/action_runner.py", line 164, in run

  File "/usr/lib/python2.7/site-packages/chroma_agent/plugin_manager.py", line 305, in run

  File "/usr/lib/python2.7/site-packages/chroma_agent/action_plugins/manage_targets.py", line 282, in register_target

  File "/usr/lib/python2.7/site-packages/iml_common/filesystems/filesystem_ldiskfs.py", line 61, in mount
    raise RuntimeError("Error (%s) mounting '%s': '%s' '%s'" % (result.rc, mount_point, result.stdout, result.stderr))

RuntimeError: Error (108) mounting '/mnt/testfs-OST0001': '' 'mount.lustre: increased /sys/block/sdb/queue/max_sectors_kb from 512 to 16384
mount.lustre: mount /dev/sdb at /mnt/testfs-OST0001 failed: Cannot send after transport endpoint shutdown
'

created_at: 2017-08-12T13:27:41.916142
args: {u'device_path': u'/dev/disk/by-id/scsi-0QEMU_QEMU_HARDDISK_disk14', u'primary_host': u'lotus-58vm18.lotus.hpdd.lab.intel.com', u'mount_point': u'/mnt/testfs-OST0001', u'backfstype': u'ldiskfs', u'target': u'testfs-OST0001'}
modified_at: 2017-08-12T13:27:45.161623
step_index: 0
state: failed
result: 
resource_uri: /api/step/122/
id: 122
log: 


--------------------- >> end captured stdout << ----------------------
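
The exit status 108 reported for the mount command above is the Linux errno value behind the message that mount.lustre prints. A minimal sketch for decoding it with the Python standard library, assuming it is run on a Linux host (errno numbering is platform-specific, and the variable names here are illustrative only):

```python
import errno
import os

# Exit status reported for "mount -t lustre ..." in the step console above.
rc = 108

# Map the number to its symbolic name; on Linux 108 is ESHUTDOWN.
name = errno.errorcode.get(rc, "unknown")

# Ask the C library for the human-readable text of that errno.
message = os.strerror(rc)

print(rc, name, message)
```

On a glibc-based Linux system the text should match the "Cannot send after transport endpoint shutdown" string in the log; other C libraries may word it differently.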

and:

test_nids (tests.integration.shared_storage_configuration.test_lnet_functionality.TestLNetFunctionality) ... ERROR
ERROR
Traceback (most recent call last):
  File "/usr/share/chroma-manager/tests/integration/shared_storage_configuration/test_lnet_functionality.py", line 18, in setUp
    self.create_filesystem_standard(self.TEST_SERVERS)
  File "/usr/share/chroma-manager/tests/integration/core/chroma_integration_testcase.py", line 368, in create_filesystem_standard
    'conf_params': {}})
  File "/usr/share/chroma-manager/tests/integration/core/chroma_integration_testcase.py", line 432, in create_filesystem
    timeout = LONG_TEST_TIMEOUT
  File "/usr/share/chroma-manager/tests/integration/core/api_testcase_with_test_reset.py", line 336, in wait_for_command
    self.assertFalse(command['errored'] or command['cancelled'], command)
AssertionError: {u'jobs': [u'/api/job/57/', u'/api/job/58/', u'/api/job/59/', u'/api/job/60/', u'/api/job/61/', u'/api/job/62/', u'/api/job/63/', u'/api/job/64/', u'/api/job/65/', u'/api/job/66/', u'/api/job/67/', u'/api/job/68/', u'/api/job/69/', u'/api/job/70/', u'/api/job/71/', u'/api/job/72/', u'/api/job/73/'], u'complete': True, u'created_at': u'2017-08-12T13:51:16.352983', u'message': u'Creating filesystem testfs', u'cancelled': True, u'errored': True, u'resource_uri': u'/api/command/17/', u'id': 17, u'logs': u'\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n'}
-------------------- >> begin captured stdout << ---------------------
{u'status': u'configured-ha', u'kind': u'ZfsPool', u'storage_resource': u'/api/storage_resource/47/', u'volume_nodes': [{u'use': True, u'volume_id': 1, u'primary': False, u'host': u'/api/host/3/', u'path': u'zfs_pool_scsi0QEMU_QEMU_HARDDISK_disk15', u'host_id': 3, u'resource_uri': u'/api/volume_node/3/', u'id': 3, u'host_label': u'lotus-58vm17.lotus.hpdd.lab.intel.com'}, {u'use': False, u'volume_id': 1, u'primary': False, u'host': u'/api/host/1/', u'path': u'zfs_pool_scsi0QEMU_QEMU_HARDDISK_disk15', u'host_id': 1, u'resource_uri': u'/api/volume_node/8/', u'id': 8, u'host_label': u'lotus-58vm15.lotus.hpdd.lab.intel.com'}, {u'use': False, u'volume_id': 1, u'primary': False, u'host': u'/api/host/2/', u'path': u'zfs_pool_scsi0QEMU_QEMU_HARDDISK_disk15', u'host_id': 2, u'resource_uri': u'/api/volume_node/13/', u'id': 13, u'host_label': u'lotus-58vm16.lotus.hpdd.lab.intel.com'}, {u'use': True, u'volume_id': 1, u'primary': True, u'host': u'/api/host/4/', u'path': u'zfs_pool_scsi0QEMU_QEMU_HARDDISK_disk15', u'host_id': 4, u'resource_uri': u'/api/volume_node/18/', u'id': 18, u'host_label': u'lotus-58vm18.lotus.hpdd.lab.intel.com'}], u'filesystem_type': None, u'label': u'zfs_pool_scsi0QEMU_QEMU_HARDDISK_disk15', u'usable_for_lustre': True, u'resource_uri': u'/api/volume/1/', u'id': 1, u'size': u'10672993730'}
{u'status': u'configured-ha', u'kind': u'SCSI device', u'storage_resource': u'/api/storage_resource/26/', u'volume_nodes': [{u'use': False, u'volume_id': 2, u'primary': False, u'host': u'/api/host/3/', u'path': u'/dev/disk/by-id/scsi-0QEMU_QEMU_HARDDISK_disk12', u'host_id': 3, u'resource_uri': u'/api/volume_node/2/', u'id': 2, u'host_label': u'lotus-58vm17.lotus.hpdd.lab.intel.com'}, {u'use': True, u'volume_id': 2, u'primary': True, u'host': u'/api/host/1/', u'path': u'/dev/disk/by-id/scsi-0QEMU_QEMU_HARDDISK_disk12', u'host_id': 1, u'resource_uri': u'/api/volume_node/7/', u'id': 7, u'host_label': u'lotus-58vm15.lotus.hpdd.lab.intel.com'}, {u'use': True, u'volume_id': 2, u'primary': False, u'host': u'/api/host/2/', u'path': u'/dev/disk/by-id/scsi-0QEMU_QEMU_HARDDISK_disk12', u'host_id': 2, u'resource_uri': u'/api/volume_node/12/', u'id': 12, u'host_label': u'lotus-58vm16.lotus.hpdd.lab.intel.com'}, {u'use': False, u'volume_id': 2, u'primary': False, u'host': u'/api/host/4/', u'path': u'/dev/disk/by-id/scsi-0QEMU_QEMU_HARDDISK_disk12', u'host_id': 4, u'resource_uri': u'/api/volume_node/17/', u'id': 17, u'host_label': u'lotus-58vm18.lotus.hpdd.lab.intel.com'}], u'filesystem_type': None, u'label': u'disk12', u'usable_for_lustre': True, u'resource_uri': u'/api/volume/2/', u'id': 2, u'size': u'10737418240'}
{u'status': u'configured-ha', u'kind': u'ZfsPool', u'storage_resource': u'/api/storage_resource/49/', u'volume_nodes': [{u'use': False, u'volume_id': 3, u'primary': False, u'host': u'/api/host/3/', u'path': u'zfs_pool_scsi0QEMU_QEMU_HARDDISK_disk13', u'host_id': 3, u'resource_uri': u'/api/volume_node/4/', u'id': 4, u'host_label': u'lotus-58vm17.lotus.hpdd.lab.intel.com'}, {u'use': True, u'volume_id': 3, u'primary': True, u'host': u'/api/host/1/', u'path': u'zfs_pool_scsi0QEMU_QEMU_HARDDISK_disk13', u'host_id': 1, u'resource_uri': u'/api/volume_node/9/', u'id': 9, u'host_label': u'lotus-58vm15.lotus.hpdd.lab.intel.com'}, {u'use': True, u'volume_id': 3, u'primary': False, u'host': u'/api/host/2/', u'path': u'zfs_pool_scsi0QEMU_QEMU_HARDDISK_disk13', u'host_id': 2, u'resource_uri': u'/api/volume_node/14/', u'id': 14, u'host_label': u'lotus-58vm16.lotus.hpdd.lab.intel.com'}], u'filesystem_type': None, u'label': u'zfs_pool_scsi0QEMU_QEMU_HARDDISK_disk13', u'usable_for_lustre': True, u'resource_uri': u'/api/volume/3/', u'id': 3, u'size': u'10672993730'}
{u'status': u'configured-ha', u'kind': u'SCSI device', u'storage_resource': u'/api/storage_resource/23/', u'volume_nodes': [{u'use': False, u'volume_id': 4, u'primary': False, u'host': u'/api/host/3/', u'path': u'/dev/disk/by-id/scsi-0QEMU_QEMU_HARDDISK_disk14', u'host_id': 3, u'resource_uri': u'/api/volume_node/1/', u'id': 1, u'host_label': u'lotus-58vm17.lotus.hpdd.lab.intel.com'}, {u'use': True, u'volume_id': 4, u'primary': False, u'host': u'/api/host/1/', u'path': u'/dev/disk/by-id/scsi-0QEMU_QEMU_HARDDISK_disk14', u'host_id': 1, u'resource_uri': u'/api/volume_node/6/', u'id': 6, u'host_label': u'lotus-58vm15.lotus.hpdd.lab.intel.com'}, {u'use': True, u'volume_id': 4, u'primary': True, u'host': u'/api/host/2/', u'path': u'/dev/disk/by-id/scsi-0QEMU_QEMU_HARDDISK_disk14', u'host_id': 2, u'resource_uri': u'/api/volume_node/11/', u'id': 11, u'host_label': u'lotus-58vm16.lotus.hpdd.lab.intel.com'}, {u'use': False, u'volume_id': 4, u'primary': False, u'host': u'/api/host/4/', u'path': u'/dev/disk/by-id/scsi-0QEMU_QEMU_HARDDISK_disk14', u'host_id': 4, u'resource_uri': u'/api/volume_node/16/', u'id': 16, u'host_label': u'lotus-58vm18.lotus.hpdd.lab.intel.com'}], u'filesystem_type': None, u'label': u'disk14', u'usable_for_lustre': True, u'resource_uri': u'/api/volume/4/', u'id': 4, u'size': u'10737418240'}
{u'status': u'configured-ha', u'kind': u'ZfsPool', u'storage_resource': u'/api/storage_resource/51/', u'volume_nodes': [{u'use': True, u'volume_id': 5, u'primary': True, u'host': u'/api/host/3/', u'path': u'zfs_pool_scsi0QEMU_QEMU_HARDDISK_disk11', u'host_id': 3, u'resource_uri': u'/api/volume_node/5/', u'id': 5, u'host_label': u'lotus-58vm17.lotus.hpdd.lab.intel.com'}, {u'use': False, u'volume_id': 5, u'primary': False, u'host': u'/api/host/1/', u'path': u'zfs_pool_scsi0QEMU_QEMU_HARDDISK_disk11', u'host_id': 1, u'resource_uri': u'/api/volume_node/10/', u'id': 10, u'host_label': u'lotus-58vm15.lotus.hpdd.lab.intel.com'}, {u'use': False, u'volume_id': 5, u'primary': False, u'host': u'/api/host/2/', u'path': u'zfs_pool_scsi0QEMU_QEMU_HARDDISK_disk11', u'host_id': 2, u'resource_uri': u'/api/volume_node/15/', u'id': 15, u'host_label': u'lotus-58vm16.lotus.hpdd.lab.intel.com'}, {u'use': True, u'volume_id': 5, u'primary': False, u'host': u'/api/host/4/', u'path': u'zfs_pool_scsi0QEMU_QEMU_HARDDISK_disk11', u'host_id': 4, u'resource_uri': u'/api/volume_node/19/', u'id': 19, u'host_label': u'lotus-58vm18.lotus.hpdd.lab.intel.com'}], u'filesystem_type': None, u'label': u'zfs_pool_scsi0QEMU_QEMU_HARDDISK_disk11', u'usable_for_lustre': True, u'resource_uri': u'/api/volume/5/', u'id': 5, u'size': u'10672993730'}
COMMAND 17: FAILED
-----------------------------------------------------------
{u'jobs': [u'/api/job/57/', u'/api/job/58/', u'/api/job/59/', u'/api/job/60/', u'/api/job/61/', u'/api/job/62/', u'/api/job/63/', u'/api/job/64/', u'/api/job/65/', u'/api/job/66/', u'/api/job/67/', u'/api/job/68/', u'/api/job/69/', u'/api/job/70/', u'/api/job/71/', u'/api/job/72/', u'/api/job/73/'], u'complete': True, u'created_at': u'2017-08-12T13:51:16.352983', u'message': u'Creating filesystem testfs', u'cancelled': True, u'errored': True, u'resource_uri': u'/api/command/17/', u'id': 17, u'logs': u'\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n'}

Job 67 Errored (Register testfs-OST0001):
{u'commands': [u'/api/command/17/'], u'write_locks': [{u'locked_item_content_type_id': 91, u'locked_item_id': 4, u'locked_item_uri': u'/api/target/4/', u'resource_uri': u''}], u'description': u'Register testfs-OST0001', u'read_locks': [{u'locked_item_content_type_id': 10, u'locked_item_id': 4, u'locked_item_uri': u'/api/lnet_configuration/4/', u'resource_uri': u''}, {u'locked_item_content_type_id': 93, u'locked_item_id': 1, u'locked_item_uri': u'/api/target/1/', u'resource_uri': u''}, {u'locked_item_content_type_id': 92, u'locked_item_id': 2, u'locked_item_uri': u'/api/target/2/', u'resource_uri': u''}, {u'locked_item_content_type_id': 109, u'locked_item_id': 1, u'locked_item_uri': u'/api/filesystem/1/', u'resource_uri': u''}, {u'locked_item_content_type_id': 10, u'locked_item_id': 4, u'locked_item_uri': u'/api/lnet_configuration/4/', u'resource_uri': u''}, {u'locked_item_content_type_id': 49, u'locked_item_id': 4, u'locked_item_uri': u'/api/pacemaker_configuration/4/', u'resource_uri': u''}, {u'locked_item_content_type_id': 10, u'locked_item_id': 3, u'locked_item_uri': u'/api/lnet_configuration/3/', u'resource_uri': u''}, {u'locked_item_content_type_id': 49, u'locked_item_id': 3, u'locked_item_uri': u'/api/pacemaker_configuration/3/', u'resource_uri': u''}, {u'locked_item_content_type_id': 109, u'locked_item_id': 1, u'locked_item_uri': u'/api/filesystem/1/', u'resource_uri': u''}, {u'locked_item_content_type_id': 10, u'locked_item_id': 4, u'locked_item_uri': u'/api/lnet_configuration/4/', u'resource_uri': u''}, {u'locked_item_content_type_id': 49, u'locked_item_id': 4, u'locked_item_uri': u'/api/pacemaker_configuration/4/', u'resource_uri': u''}, {u'locked_item_content_type_id': 10, u'locked_item_id': 3, u'locked_item_uri': u'/api/lnet_configuration/3/', u'resource_uri': u''}, {u'locked_item_content_type_id': 49, u'locked_item_id': 3, u'locked_item_uri': u'/api/pacemaker_configuration/3/', u'resource_uri': u''}], u'class_name': u'RegisterTargetJob', 
u'step_results': {u'/api/step/122/': None}, u'created_at': u'2017-08-12T13:51:16.656750', u'modified_at': u'2017-08-12T13:51:16.656719', u'available_transitions': [], u'state': u'complete', u'steps': [u'/api/step/122/'], u'cancelled': False, u'errored': True, u'wait_for': [u'/api/job/66/', u'/api/job/59/', u'/api/job/63/'], u'id': 67, u'resource_uri': u'/api/job/67/'}

Step 122 failed:
step_count: 2
console: modprobe osd_zfs: 0


modprobe zfs: 0


mount -t lustre zfs_pool_scsi0QEMU_QEMU_HARDDISK_disk11/testfs-OST0001 /mnt/testfs-OST0001: 108

mount.lustre: mount zfs_pool_scsi0QEMU_QEMU_HARDDISK_disk11/testfs-OST0001 at /mnt/testfs-OST0001 failed: Cannot send after transport endpoint shutdown


description: RegisterTargetStep: {'device_path': u'zfs_pool_scsi0QEMU_QEMU_HARDDISK_disk11/testfs-OST0001', 'primary_host': <ManagedHost: lotus-58vm18.lotus.hpdd.lab.intel.com>, 'mount_point': u'/mnt/testfs-OST0001', 'backfstype': 'zfs', 'target': <ManagedTarget: testfs-OST0001>}
class_name: RegisterTargetStep
backtrace: Traceback (most recent call last):

  File "/usr/lib/python2.7/site-packages/chroma_agent/device_plugins/action_runner.py", line 164, in run

  File "/usr/lib/python2.7/site-packages/chroma_agent/plugin_manager.py", line 305, in run

  File "/usr/lib/python2.7/site-packages/chroma_agent/action_plugins/manage_targets.py", line 282, in register_target

  File "/usr/lib/python2.7/site-packages/iml_common/filesystems/filesystem.py", line 77, in mount
    return shell.Shell.try_run(["mount", "-t", "lustre", "%s" % self._device_path, mount_point])

  File "/usr/lib/python2.7/site-packages/chroma_agent/lib/shell.py", line 106, in try_run

CommandExecutionError: Error (108) running 'mount -t lustre zfs_pool_scsi0QEMU_QEMU_HARDDISK_disk11/testfs-OST0001 /mnt/testfs-OST0001': '' 'mount.lustre: mount zfs_pool_scsi0QEMU_QEMU_HARDDISK_disk11/testfs-OST0001 at /mnt/testfs-OST0001 failed: Cannot send after transport endpoint shutdown
'

created_at: 2017-08-12T13:54:22.220730
args: {u'device_path': u'zfs_pool_scsi0QEMU_QEMU_HARDDISK_disk11/testfs-OST0001', u'primary_host': u'lotus-58vm18.lotus.hpdd.lab.intel.com', u'mount_point': u'/mnt/testfs-OST0001', u'backfstype': u'zfs', u'target': u'testfs-OST0001'}
modified_at: 2017-08-12T13:54:25.477759
step_index: 0
state: failed
result: 
resource_uri: /api/step/122/
id: 122
log: 


--------------------- >> end captured stdout << ----------------------

and:

test_create_filesystem_with_failover_mgs (tests.integration.shared_storage_configuration.test_managed_filesystem_with_failover.TestManagedFilesystemWithFailover) ... FAIL
FAIL
Traceback (most recent call last):
  File "/usr/share/chroma-manager/tests/integration/shared_storage_configuration/test_managed_filesystem_with_failover.py", line 45, in test_create_filesystem_with_failover_mgs
    filesystem_id, volumes_expected_hosts_in_normal_state = self._test_create_filesystem_with_failover()
  File "/usr/share/chroma-manager/tests/integration/shared_storage_configuration/test_managed_filesystem_with_failover.py", line 15, in _test_create_filesystem_with_failover
    filesystem_id = self.create_filesystem_standard(self.TEST_SERVERS)
  File "/usr/share/chroma-manager/tests/integration/core/chroma_integration_testcase.py", line 368, in create_filesystem_standard
    'conf_params': {}})
  File "/usr/share/chroma-manager/tests/integration/core/chroma_integration_testcase.py", line 432, in create_filesystem
    timeout = LONG_TEST_TIMEOUT
  File "/usr/share/chroma-manager/tests/integration/core/api_testcase_with_test_reset.py", line 336, in wait_for_command
    self.assertFalse(command['errored'] or command['cancelled'], command)
AssertionError: {u'jobs': [u'/api/job/58/', u'/api/job/59/', u'/api/job/60/', u'/api/job/61/', u'/api/job/62/', u'/api/job/63/', u'/api/job/64/', u'/api/job/65/', u'/api/job/66/', u'/api/job/67/', u'/api/job/68/', u'/api/job/69/', u'/api/job/70/', u'/api/job/71/', u'/api/job/72/', u'/api/job/73/', u'/api/job/74/'], u'complete': True, u'created_at': u'2017-08-12T16:08:22.964494', u'message': u'Creating filesystem testfs', u'cancelled': True, u'errored': True, u'resource_uri': u'/api/command/18/', u'id': 18, u'logs': u'\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n'}
-------------------- >> begin captured stdout << ---------------------
{u'status': u'configured-ha', u'kind': u'ZfsPool', u'storage_resource': u'/api/storage_resource/47/', u'volume_nodes': [{u'use': True, u'volume_id': 1, u'primary': False, u'host': u'/api/host/4/', u'path': u'zfs_pool_scsi0QEMU_QEMU_HARDDISK_disk11', u'host_id': 4, u'resource_uri': u'/api/volume_node/3/', u'id': 3, u'host_label': u'lotus-58vm18.lotus.hpdd.lab.intel.com'}, {u'use': True, u'volume_id': 1, u'primary': True, u'host': u'/api/host/3/', u'path': u'zfs_pool_scsi0QEMU_QEMU_HARDDISK_disk11', u'host_id': 3, u'resource_uri': u'/api/volume_node/8/', u'id': 8, u'host_label': u'lotus-58vm17.lotus.hpdd.lab.intel.com'}, {u'use': False, u'volume_id': 1, u'primary': False, u'host': u'/api/host/2/', u'path': u'zfs_pool_scsi0QEMU_QEMU_HARDDISK_disk11', u'host_id': 2, u'resource_uri': u'/api/volume_node/13/', u'id': 13, u'host_label': u'lotus-58vm16.lotus.hpdd.lab.intel.com'}, {u'use': False, u'volume_id': 1, u'primary': False, u'host': u'/api/host/1/', u'path': u'zfs_pool_scsi0QEMU_QEMU_HARDDISK_disk11', u'host_id': 1, u'resource_uri': u'/api/volume_node/19/', u'id': 19, u'host_label': u'lotus-58vm15.lotus.hpdd.lab.intel.com'}], u'filesystem_type': None, u'label': u'zfs_pool_scsi0QEMU_QEMU_HARDDISK_disk11', u'usable_for_lustre': True, u'resource_uri': u'/api/volume/1/', u'id': 1, u'size': u'10672993730'}
{u'status': u'configured-ha', u'kind': u'SCSI device', u'storage_resource': u'/api/storage_resource/26/', u'volume_nodes': [{u'use': False, u'volume_id': 2, u'primary': False, u'host': u'/api/host/4/', u'path': u'/dev/disk/by-id/scsi-0QEMU_QEMU_HARDDISK_disk12', u'host_id': 4, u'resource_uri': u'/api/volume_node/2/', u'id': 2, u'host_label': u'lotus-58vm18.lotus.hpdd.lab.intel.com'}, {u'use': False, u'volume_id': 2, u'primary': False, u'host': u'/api/host/3/', u'path': u'/dev/disk/by-id/scsi-0QEMU_QEMU_HARDDISK_disk12', u'host_id': 3, u'resource_uri': u'/api/volume_node/7/', u'id': 7, u'host_label': u'lotus-58vm17.lotus.hpdd.lab.intel.com'}, {u'use': True, u'volume_id': 2, u'primary': False, u'host': u'/api/host/2/', u'path': u'/dev/disk/by-id/scsi-0QEMU_QEMU_HARDDISK_disk12', u'host_id': 2, u'resource_uri': u'/api/volume_node/12/', u'id': 12, u'host_label': u'lotus-58vm16.lotus.hpdd.lab.intel.com'}, {u'use': True, u'volume_id': 2, u'primary': True, u'host': u'/api/host/1/', u'path': u'/dev/disk/by-id/scsi-0QEMU_QEMU_HARDDISK_disk12', u'host_id': 1, u'resource_uri': u'/api/volume_node/18/', u'id': 18, u'host_label': u'lotus-58vm15.lotus.hpdd.lab.intel.com'}], u'filesystem_type': None, u'label': u'disk12', u'usable_for_lustre': True, u'resource_uri': u'/api/volume/2/', u'id': 2, u'size': u'10737418240'}
{u'status': u'configured-ha', u'kind': u'ZfsPool', u'storage_resource': u'/api/storage_resource/49/', u'volume_nodes': [{u'use': True, u'volume_id': 3, u'primary': True, u'host': u'/api/host/4/', u'path': u'zfs_pool_scsi0QEMU_QEMU_HARDDISK_disk13', u'host_id': 4, u'resource_uri': u'/api/volume_node/4/', u'id': 4, u'host_label': u'lotus-58vm18.lotus.hpdd.lab.intel.com'}, {u'use': False, u'volume_id': 3, u'primary': False, u'host': u'/api/host/2/', u'path': u'zfs_pool_scsi0QEMU_QEMU_HARDDISK_disk13', u'host_id': 2, u'resource_uri': u'/api/volume_node/15/', u'id': 15, u'host_label': u'lotus-58vm16.lotus.hpdd.lab.intel.com'}, {u'use': True, u'volume_id': 3, u'primary': False, u'host': u'/api/host/3/', u'path': u'zfs_pool_scsi0QEMU_QEMU_HARDDISK_disk13', u'host_id': 3, u'resource_uri': u'/api/volume_node/16/', u'id': 16, u'host_label': u'lotus-58vm17.lotus.hpdd.lab.intel.com'}, {u'use': False, u'volume_id': 3, u'primary': False, u'host': u'/api/host/1/', u'path': u'zfs_pool_scsi0QEMU_QEMU_HARDDISK_disk13', u'host_id': 1, u'resource_uri': u'/api/volume_node/21/', u'id': 21, u'host_label': u'lotus-58vm15.lotus.hpdd.lab.intel.com'}], u'filesystem_type': None, u'label': u'zfs_pool_scsi0QEMU_QEMU_HARDDISK_disk13', u'usable_for_lustre': True, u'resource_uri': u'/api/volume/3/', u'id': 3, u'size': u'10672993730'}
{u'status': u'configured-ha', u'kind': u'SCSI device', u'storage_resource': u'/api/storage_resource/23/', u'volume_nodes': [{u'use': False, u'volume_id': 4, u'primary': False, u'host': u'/api/host/4/', u'path': u'/dev/disk/by-id/scsi-0QEMU_QEMU_HARDDISK_disk14', u'host_id': 4, u'resource_uri': u'/api/volume_node/1/', u'id': 1, u'host_label': u'lotus-58vm18.lotus.hpdd.lab.intel.com'}, {u'use': False, u'volume_id': 4, u'primary': False, u'host': u'/api/host/3/', u'path': u'/dev/disk/by-id/scsi-0QEMU_QEMU_HARDDISK_disk14', u'host_id': 3, u'resource_uri': u'/api/volume_node/6/', u'id': 6, u'host_label': u'lotus-58vm17.lotus.hpdd.lab.intel.com'}, {u'use': True, u'volume_id': 4, u'primary': True, u'host': u'/api/host/2/', u'path': u'/dev/disk/by-id/scsi-0QEMU_QEMU_HARDDISK_disk14', u'host_id': 2, u'resource_uri': u'/api/volume_node/11/', u'id': 11, u'host_label': u'lotus-58vm16.lotus.hpdd.lab.intel.com'}, {u'use': True, u'volume_id': 4, u'primary': False, u'host': u'/api/host/1/', u'path': u'/dev/disk/by-id/scsi-0QEMU_QEMU_HARDDISK_disk14', u'host_id': 1, u'resource_uri': u'/api/volume_node/17/', u'id': 17, u'host_label': u'lotus-58vm15.lotus.hpdd.lab.intel.com'}], u'filesystem_type': None, u'label': u'disk14', u'usable_for_lustre': True, u'resource_uri': u'/api/volume/4/', u'id': 4, u'size': u'10737418240'}
{u'status': u'configured-ha', u'kind': u'ZfsPool', u'storage_resource': u'/api/storage_resource/83/', u'volume_nodes': [{u'use': False, u'volume_id': 6, u'primary': False, u'host': u'/api/host/3/', u'path': u'zfs_pool_scsi0QEMU_QEMU_HARDDISK_disk15', u'host_id': 3, u'resource_uri': u'/api/volume_node/9/', u'id': 9, u'host_label': u'lotus-58vm17.lotus.hpdd.lab.intel.com'}, {u'use': True, u'volume_id': 6, u'primary': False, u'host': u'/api/host/2/', u'path': u'zfs_pool_scsi0QEMU_QEMU_HARDDISK_disk15', u'host_id': 2, u'resource_uri': u'/api/volume_node/14/', u'id': 14, u'host_label': u'lotus-58vm16.lotus.hpdd.lab.intel.com'}, {u'use': True, u'volume_id': 6, u'primary': True, u'host': u'/api/host/1/', u'path': u'zfs_pool_scsi0QEMU_QEMU_HARDDISK_disk15', u'host_id': 1, u'resource_uri': u'/api/volume_node/20/', u'id': 20, u'host_label': u'lotus-58vm15.lotus.hpdd.lab.intel.com'}], u'filesystem_type': None, u'label': u'zfs_pool_scsi0QEMU_QEMU_HARDDISK_disk15', u'usable_for_lustre': True, u'resource_uri': u'/api/volume/6/', u'id': 6, u'size': u'10672993730'}
{u'status': u'configured-ha', u'kind': u'ZfsPool', u'storage_resource': u'/api/storage_resource/47/', u'volume_nodes': [{u'use': True, u'volume_id': 1, u'primary': False, u'host': u'/api/host/4/', u'path': u'zfs_pool_scsi0QEMU_QEMU_HARDDISK_disk11', u'host_id': 4, u'resource_uri': u'/api/volume_node/3/', u'id': 3, u'host_label': u'lotus-58vm18.lotus.hpdd.lab.intel.com'}, {u'use': True, u'volume_id': 1, u'primary': True, u'host': u'/api/host/3/', u'path': u'zfs_pool_scsi0QEMU_QEMU_HARDDISK_disk11', u'host_id': 3, u'resource_uri': u'/api/volume_node/8/', u'id': 8, u'host_label': u'lotus-58vm17.lotus.hpdd.lab.intel.com'}, {u'use': False, u'volume_id': 1, u'primary': False, u'host': u'/api/host/2/', u'path': u'zfs_pool_scsi0QEMU_QEMU_HARDDISK_disk11', u'host_id': 2, u'resource_uri': u'/api/volume_node/13/', u'id': 13, u'host_label': u'lotus-58vm16.lotus.hpdd.lab.intel.com'}, {u'use': False, u'volume_id': 1, u'primary': False, u'host': u'/api/host/1/', u'path': u'zfs_pool_scsi0QEMU_QEMU_HARDDISK_disk11', u'host_id': 1, u'resource_uri': u'/api/volume_node/19/', u'id': 19, u'host_label': u'lotus-58vm15.lotus.hpdd.lab.intel.com'}], u'filesystem_type': None, u'label': u'zfs_pool_scsi0QEMU_QEMU_HARDDISK_disk11', u'usable_for_lustre': True, u'resource_uri': u'/api/volume/1/', u'id': 1, u'size': u'10672993730'}
{u'status': u'configured-ha', u'kind': u'SCSI device', u'storage_resource': u'/api/storage_resource/26/', u'volume_nodes': [{u'use': False, u'volume_id': 2, u'primary': False, u'host': u'/api/host/4/', u'path': u'/dev/disk/by-id/scsi-0QEMU_QEMU_HARDDISK_disk12', u'host_id': 4, u'resource_uri': u'/api/volume_node/2/', u'id': 2, u'host_label': u'lotus-58vm18.lotus.hpdd.lab.intel.com'}, {u'use': False, u'volume_id': 2, u'primary': False, u'host': u'/api/host/3/', u'path': u'/dev/disk/by-id/scsi-0QEMU_QEMU_HARDDISK_disk12', u'host_id': 3, u'resource_uri': u'/api/volume_node/7/', u'id': 7, u'host_label': u'lotus-58vm17.lotus.hpdd.lab.intel.com'}, {u'use': True, u'volume_id': 2, u'primary': False, u'host': u'/api/host/2/', u'path': u'/dev/disk/by-id/scsi-0QEMU_QEMU_HARDDISK_disk12', u'host_id': 2, u'resource_uri': u'/api/volume_node/12/', u'id': 12, u'host_label': u'lotus-58vm16.lotus.hpdd.lab.intel.com'}, {u'use': True, u'volume_id': 2, u'primary': True, u'host': u'/api/host/1/', u'path': u'/dev/disk/by-id/scsi-0QEMU_QEMU_HARDDISK_disk12', u'host_id': 1, u'resource_uri': u'/api/volume_node/18/', u'id': 18, u'host_label': u'lotus-58vm15.lotus.hpdd.lab.intel.com'}], u'filesystem_type': None, u'label': u'disk12', u'usable_for_lustre': True, u'resource_uri': u'/api/volume/2/', u'id': 2, u'size': u'10737418240'}
{u'status': u'configured-ha', u'kind': u'ZfsPool', u'storage_resource': u'/api/storage_resource/49/', u'volume_nodes': [{u'use': True, u'volume_id': 3, u'primary': True, u'host': u'/api/host/4/', u'path': u'zfs_pool_scsi0QEMU_QEMU_HARDDISK_disk13', u'host_id': 4, u'resource_uri': u'/api/volume_node/4/', u'id': 4, u'host_label': u'lotus-58vm18.lotus.hpdd.lab.intel.com'}, {u'use': False, u'volume_id': 3, u'primary': False, u'host': u'/api/host/2/', u'path': u'zfs_pool_scsi0QEMU_QEMU_HARDDISK_disk13', u'host_id': 2, u'resource_uri': u'/api/volume_node/15/', u'id': 15, u'host_label': u'lotus-58vm16.lotus.hpdd.lab.intel.com'}, {u'use': True, u'volume_id': 3, u'primary': False, u'host': u'/api/host/3/', u'path': u'zfs_pool_scsi0QEMU_QEMU_HARDDISK_disk13', u'host_id': 3, u'resource_uri': u'/api/volume_node/16/', u'id': 16, u'host_label': u'lotus-58vm17.lotus.hpdd.lab.intel.com'}, {u'use': False, u'volume_id': 3, u'primary': False, u'host': u'/api/host/1/', u'path': u'zfs_pool_scsi0QEMU_QEMU_HARDDISK_disk13', u'host_id': 1, u'resource_uri': u'/api/volume_node/21/', u'id': 21, u'host_label': u'lotus-58vm15.lotus.hpdd.lab.intel.com'}], u'filesystem_type': None, u'label': u'zfs_pool_scsi0QEMU_QEMU_HARDDISK_disk13', u'usable_for_lustre': True, u'resource_uri': u'/api/volume/3/', u'id': 3, u'size': u'10672993730'}
{u'status': u'configured-ha', u'kind': u'SCSI device', u'storage_resource': u'/api/storage_resource/23/', u'volume_nodes': [{u'use': False, u'volume_id': 4, u'primary': False, u'host': u'/api/host/4/', u'path': u'/dev/disk/by-id/scsi-0QEMU_QEMU_HARDDISK_disk14', u'host_id': 4, u'resource_uri': u'/api/volume_node/1/', u'id': 1, u'host_label': u'lotus-58vm18.lotus.hpdd.lab.intel.com'}, {u'use': False, u'volume_id': 4, u'primary': False, u'host': u'/api/host/3/', u'path': u'/dev/disk/by-id/scsi-0QEMU_QEMU_HARDDISK_disk14', u'host_id': 3, u'resource_uri': u'/api/volume_node/6/', u'id': 6, u'host_label': u'lotus-58vm17.lotus.hpdd.lab.intel.com'}, {u'use': True, u'volume_id': 4, u'primary': True, u'host': u'/api/host/2/', u'path': u'/dev/disk/by-id/scsi-0QEMU_QEMU_HARDDISK_disk14', u'host_id': 2, u'resource_uri': u'/api/volume_node/11/', u'id': 11, u'host_label': u'lotus-58vm16.lotus.hpdd.lab.intel.com'}, {u'use': True, u'volume_id': 4, u'primary': False, u'host': u'/api/host/1/', u'path': u'/dev/disk/by-id/scsi-0QEMU_QEMU_HARDDISK_disk14', u'host_id': 1, u'resource_uri': u'/api/volume_node/17/', u'id': 17, u'host_label': u'lotus-58vm15.lotus.hpdd.lab.intel.com'}], u'filesystem_type': None, u'label': u'disk14', u'usable_for_lustre': True, u'resource_uri': u'/api/volume/4/', u'id': 4, u'size': u'10737418240'}
{u'status': u'configured-ha', u'kind': u'ZfsPool', u'storage_resource': u'/api/storage_resource/83/', u'volume_nodes': [{u'use': False, u'volume_id': 6, u'primary': False, u'host': u'/api/host/3/', u'path': u'zfs_pool_scsi0QEMU_QEMU_HARDDISK_disk15', u'host_id': 3, u'resource_uri': u'/api/volume_node/9/', u'id': 9, u'host_label': u'lotus-58vm17.lotus.hpdd.lab.intel.com'}, {u'use': True, u'volume_id': 6, u'primary': False, u'host': u'/api/host/2/', u'path': u'zfs_pool_scsi0QEMU_QEMU_HARDDISK_disk15', u'host_id': 2, u'resource_uri': u'/api/volume_node/14/', u'id': 14, u'host_label': u'lotus-58vm16.lotus.hpdd.lab.intel.com'}, {u'use': True, u'volume_id': 6, u'primary': True, u'host': u'/api/host/1/', u'path': u'zfs_pool_scsi0QEMU_QEMU_HARDDISK_disk15', u'host_id': 1, u'resource_uri': u'/api/volume_node/20/', u'id': 20, u'host_label': u'lotus-58vm15.lotus.hpdd.lab.intel.com'}], u'filesystem_type': None, u'label': u'zfs_pool_scsi0QEMU_QEMU_HARDDISK_disk15', u'usable_for_lustre': True, u'resource_uri': u'/api/volume/6/', u'id': 6, u'size': u'10672993730'}
COMMAND 18: FAILED
-----------------------------------------------------------
{u'jobs': [u'/api/job/58/', u'/api/job/59/', u'/api/job/60/', u'/api/job/61/', u'/api/job/62/', u'/api/job/63/', u'/api/job/64/', u'/api/job/65/', u'/api/job/66/', u'/api/job/67/', u'/api/job/68/', u'/api/job/69/', u'/api/job/70/', u'/api/job/71/', u'/api/job/72/', u'/api/job/73/', u'/api/job/74/'], u'complete': True, u'created_at': u'2017-08-12T16:08:22.964494', u'message': u'Creating filesystem testfs', u'cancelled': True, u'errored': True, u'resource_uri': u'/api/command/18/', u'id': 18, u'logs': u'\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n'}

Job 68 Errored (Register testfs-OST0001):
{u'commands': [u'/api/command/18/'], u'write_locks': [{u'locked_item_content_type_id': 91, u'locked_item_id': 4, u'locked_item_uri': u'/api/target/4/', u'resource_uri': u''}], u'description': u'Register testfs-OST0001', u'read_locks': [{u'locked_item_content_type_id': 10, u'locked_item_id': 4, u'locked_item_uri': u'/api/lnet_configuration/4/', u'resource_uri': u''}, {u'locked_item_content_type_id': 93, u'locked_item_id': 1, u'locked_item_uri': u'/api/target/1/', u'resource_uri': u''}, {u'locked_item_content_type_id': 92, u'locked_item_id': 2, u'locked_item_uri': u'/api/target/2/', u'resource_uri': u''}, {u'locked_item_content_type_id': 109, u'locked_item_id': 1, u'locked_item_uri': u'/api/filesystem/1/', u'resource_uri': u''}, {u'locked_item_content_type_id': 10, u'locked_item_id': 4, u'locked_item_uri': u'/api/lnet_configuration/4/', u'resource_uri': u''}, {u'locked_item_content_type_id': 49, u'locked_item_id': 4, u'locked_item_uri': u'/api/pacemaker_configuration/4/', u'resource_uri': u''}, {u'locked_item_content_type_id': 10, u'locked_item_id': 3, u'locked_item_uri': u'/api/lnet_configuration/3/', u'resource_uri': u''}, {u'locked_item_content_type_id': 49, u'locked_item_id': 3, u'locked_item_uri': u'/api/pacemaker_configuration/3/', u'resource_uri': u''}, {u'locked_item_content_type_id': 109, u'locked_item_id': 1, u'locked_item_uri': u'/api/filesystem/1/', u'resource_uri': u''}, {u'locked_item_content_type_id': 10, u'locked_item_id': 4, u'locked_item_uri': u'/api/lnet_configuration/4/', u'resource_uri': u''}, {u'locked_item_content_type_id': 49, u'locked_item_id': 4, u'locked_item_uri': u'/api/pacemaker_configuration/4/', u'resource_uri': u''}, {u'locked_item_content_type_id': 10, u'locked_item_id': 3, u'locked_item_uri': u'/api/lnet_configuration/3/', u'resource_uri': u''}, {u'locked_item_content_type_id': 49, u'locked_item_id': 3, u'locked_item_uri': u'/api/pacemaker_configuration/3/', u'resource_uri': u''}], u'class_name': u'RegisterTargetJob', 
u'step_results': {u'/api/step/124/': None}, u'created_at': u'2017-08-12T16:08:23.251948', u'modified_at': u'2017-08-12T16:08:23.251918', u'available_transitions': [], u'state': u'complete', u'steps': [u'/api/step/124/'], u'cancelled': False, u'errored': True, u'wait_for': [u'/api/job/64/', u'/api/job/67/', u'/api/job/60/'], u'id': 68, u'resource_uri': u'/api/job/68/'}

Step 124 failed:
step_count: 2
console: modprobe osd_ldiskfs: 0


mount -t lustre /dev/disk/by-id/scsi-0QEMU_QEMU_HARDDISK_disk14 /mnt/testfs-OST0001: 108

mount.lustre: increased /sys/block/sdb/queue/max_sectors_kb from 512 to 16384
mount.lustre: mount /dev/sdb at /mnt/testfs-OST0001 failed: Cannot send after transport endpoint shutdown


description: RegisterTargetStep: {'device_path': u'/dev/disk/by-id/scsi-0QEMU_QEMU_HARDDISK_disk14', 'primary_host': <ManagedHost: lotus-58vm18.lotus.hpdd.lab.intel.com>, 'mount_point': u'/mnt/testfs-OST0001', 'backfstype': 'ldiskfs', 'target': <ManagedTarget: testfs-OST0001>}
class_name: RegisterTargetStep
backtrace: Traceback (most recent call last):

  File "/usr/lib/python2.7/site-packages/chroma_agent/device_plugins/action_runner.py", line 164, in run

  File "/usr/lib/python2.7/site-packages/chroma_agent/plugin_manager.py", line 305, in run

  File "/usr/lib/python2.7/site-packages/chroma_agent/action_plugins/manage_targets.py", line 282, in register_target

  File "/usr/lib/python2.7/site-packages/iml_common/filesystems/filesystem_ldiskfs.py", line 61, in mount
    raise RuntimeError("Error (%s) mounting '%s': '%s' '%s'" % (result.rc, mount_point, result.stdout, result.stderr))

RuntimeError: Error (108) mounting '/mnt/testfs-OST0001': '' 'mount.lustre: increased /sys/block/sdb/queue/max_sectors_kb from 512 to 16384
mount.lustre: mount /dev/sdb at /mnt/testfs-OST0001 failed: Cannot send after transport endpoint shutdown
'

created_at: 2017-08-12T16:10:31.353876
args: {u'device_path': u'/dev/disk/by-id/scsi-0QEMU_QEMU_HARDDISK_disk14', u'primary_host': u'lotus-58vm18.lotus.hpdd.lab.intel.com', u'mount_point': u'/mnt/testfs-OST0001', u'backfstype': u'ldiskfs', u'target': u'testfs-OST0001'}
modified_at: 2017-08-12T16:10:33.503356
step_index: 0
state: failed
result: 
resource_uri: /api/step/124/
id: 124
log: 


--------------------- >> end captured stdout << ----------------------
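
For reference, both failures above report the same exit status (108) from `mount -t lustre`, once through the zfs path and once through the ldiskfs path. On Linux, 108 is ESHUTDOWN, which is where the "Cannot send after transport endpoint shutdown" text comes from. A minimal sketch for decoding such a status is below; the helper name `describe_rc` is hypothetical and not part of chroma_agent or iml_common, and it assumes the exit status is a plain errno value, as it appears to be in these logs.

```python
import errno
import os


def describe_rc(rc):
    """Return (symbolic_name, message) for an errno-style exit status.

    Hypothetical helper for reading the logs above; it only assumes that
    mount.lustre exits with an errno value, as the 108 in the console
    output suggests.
    """
    # errno.errorcode maps numeric codes to names on the current platform.
    name = errno.errorcode.get(rc, "UNKNOWN")
    return name, os.strerror(rc)


if __name__ == "__main__":
    # 108 is the status reported for both failed mounts in the logs above.
    name, message = describe_rc(108)
    print("%s: %s" % (name, message))
```

Errno numbering is platform-specific, so this should be run on a Linux node (for example one of the storage servers) for the 108 mapping to match the logs.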

@brianjmurrell
Contributor

Another:

test_copytool_start_stop (tests.integration.shared_storage_configuration.test_hsm.TestHsmCopytoolManagement) ... ERROR
ERROR
Traceback (most recent call last):
  File "/usr/share/chroma-manager/tests/integration/shared_storage_configuration/test_hsm.py", line 84, in setUp
    filesystem_id = self.create_filesystem_standard(self.TEST_SERVERS, hsm = True)
  File "/usr/share/chroma-manager/tests/integration/core/chroma_integration_testcase.py", line 368, in create_filesystem_standard
    'conf_params': {}})
  File "/usr/share/chroma-manager/tests/integration/core/chroma_integration_testcase.py", line 432, in create_filesystem
    timeout = LONG_TEST_TIMEOUT
  File "/usr/share/chroma-manager/tests/integration/core/api_testcase_with_test_reset.py", line 336, in wait_for_command
    self.assertFalse(command['errored'] or command['cancelled'], command)
AssertionError: {u'jobs': [u'/api/job/60/', u'/api/job/61/', u'/api/job/62/', u'/api/job/63/', u'/api/job/64/', u'/api/job/65/', u'/api/job/66/', u'/api/job/67/', u'/api/job/68/', u'/api/job/69/', u'/api/job/70/', u'/api/job/71/', u'/api/job/72/', u'/api/job/73/', u'/api/job/74/', u'/api/job/75/', u'/api/job/76/', u'/api/job/81/'], u'complete': True, u'created_at': u'2017-08-12T22:57:49.529124', u'message': u'Creating filesystem testfs', u'cancelled': True, u'errored': True, u'resource_uri': u'/api/command/20/', u'id': 20, u'logs': u'\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n'}
-------------------- >> begin captured stdout << ---------------------
{u'status': u'configured-ha', u'kind': u'ZfsPool', u'storage_resource': u'/api/storage_resource/47/', u'volume_nodes': [{u'use': True, u'volume_id': 1, u'primary': False, u'host': u'/api/host/4/', u'path': u'zfs_pool_scsi0QEMU_QEMU_HARDDISK_disk1', u'host_id': 4, u'resource_uri': u'/api/volume_node/3/', u'id': 3, u'host_label': u'lotus-9vm8.lotus.hpdd.lab.intel.com'}, {u'use': False, u'volume_id': 1, u'primary': False, u'host': u'/api/host/2/', u'path': u'zfs_pool_scsi0QEMU_QEMU_HARDDISK_disk1', u'host_id': 2, u'resource_uri': u'/api/volume_node/8/', u'id': 8, u'host_label': u'lotus-9vm6.lotus.hpdd.lab.intel.com'}, {u'use': True, u'volume_id': 1, u'primary': True, u'host': u'/api/host/3/', u'path': u'zfs_pool_scsi0QEMU_QEMU_HARDDISK_disk1', u'host_id': 3, u'resource_uri': u'/api/volume_node/13/', u'id': 13, u'host_label': u'lotus-9vm7.lotus.hpdd.lab.intel.com'}, {u'use': False, u'volume_id': 1, u'primary': False, u'host': u'/api/host/1/', u'path': u'zfs_pool_scsi0QEMU_QEMU_HARDDISK_disk1', u'host_id': 1, u'resource_uri': u'/api/volume_node/18/', u'id': 18, u'host_label': u'lotus-9vm5.lotus.hpdd.lab.intel.com'}], u'filesystem_type': None, u'label': u'zfs_pool_scsi0QEMU_QEMU_HARDDISK_disk1', u'usable_for_lustre': True, u'resource_uri': u'/api/volume/1/', u'id': 1, u'size': u'10672993730'}
{u'status': u'configured-ha', u'kind': u'SCSI device', u'storage_resource': u'/api/storage_resource/26/', u'volume_nodes': [{u'use': False, u'volume_id': 2, u'primary': False, u'host': u'/api/host/4/', u'path': u'/dev/disk/by-id/scsi-0QEMU_QEMU_HARDDISK_disk4', u'host_id': 4, u'resource_uri': u'/api/volume_node/2/', u'id': 2, u'host_label': u'lotus-9vm8.lotus.hpdd.lab.intel.com'}, {u'use': True, u'volume_id': 2, u'primary': True, u'host': u'/api/host/2/', u'path': u'/dev/disk/by-id/scsi-0QEMU_QEMU_HARDDISK_disk4', u'host_id': 2, u'resource_uri': u'/api/volume_node/7/', u'id': 7, u'host_label': u'lotus-9vm6.lotus.hpdd.lab.intel.com'}, {u'use': False, u'volume_id': 2, u'primary': False, u'host': u'/api/host/3/', u'path': u'/dev/disk/by-id/scsi-0QEMU_QEMU_HARDDISK_disk4', u'host_id': 3, u'resource_uri': u'/api/volume_node/12/', u'id': 12, u'host_label': u'lotus-9vm7.lotus.hpdd.lab.intel.com'}, {u'use': True, u'volume_id': 2, u'primary': False, u'host': u'/api/host/1/', u'path': u'/dev/disk/by-id/scsi-0QEMU_QEMU_HARDDISK_disk4', u'host_id': 1, u'resource_uri': u'/api/volume_node/17/', u'id': 17, u'host_label': u'lotus-9vm5.lotus.hpdd.lab.intel.com'}], u'filesystem_type': None, u'label': u'disk4', u'usable_for_lustre': True, u'resource_uri': u'/api/volume/2/', u'id': 2, u'size': u'10737418240'}
{u'status': u'configured-ha', u'kind': u'ZfsPool', u'storage_resource': u'/api/storage_resource/49/', u'volume_nodes': [{u'use': False, u'volume_id': 3, u'primary': False, u'host': u'/api/host/1/', u'path': u'zfs_pool_scsi0QEMU_QEMU_HARDDISK_disk3', u'host_id': 1, u'resource_uri': u'/api/volume_node/20/', u'id': 20, u'host_label': u'lotus-9vm5.lotus.hpdd.lab.intel.com'}, {u'use': True, u'volume_id': 3, u'primary': True, u'host': u'/api/host/4/', u'path': u'zfs_pool_scsi0QEMU_QEMU_HARDDISK_disk3', u'host_id': 4, u'resource_uri': u'/api/volume_node/21/', u'id': 21, u'host_label': u'lotus-9vm8.lotus.hpdd.lab.intel.com'}, {u'use': False, u'volume_id': 3, u'primary': False, u'host': u'/api/host/2/', u'path': u'zfs_pool_scsi0QEMU_QEMU_HARDDISK_disk3', u'host_id': 2, u'resource_uri': u'/api/volume_node/22/', u'id': 22, u'host_label': u'lotus-9vm6.lotus.hpdd.lab.intel.com'}, {u'use': True, u'volume_id': 3, u'primary': False, u'host': u'/api/host/3/', u'path': u'zfs_pool_scsi0QEMU_QEMU_HARDDISK_disk3', u'host_id': 3, u'resource_uri': u'/api/volume_node/23/', u'id': 23, u'host_label': u'lotus-9vm7.lotus.hpdd.lab.intel.com'}], u'filesystem_type': None, u'label': u'zfs_pool_scsi0QEMU_QEMU_HARDDISK_disk3', u'usable_for_lustre': True, u'resource_uri': u'/api/volume/3/', u'id': 3, u'size': u'10672993730'}
{u'status': u'configured-ha', u'kind': u'SCSI device', u'storage_resource': u'/api/storage_resource/24/', u'volume_nodes': [{u'use': False, u'volume_id': 5, u'primary': False, u'host': u'/api/host/4/', u'path': u'/dev/disk/by-id/scsi-0QEMU_QEMU_HARDDISK_disk2', u'host_id': 4, u'resource_uri': u'/api/volume_node/1/', u'id': 1, u'host_label': u'lotus-9vm8.lotus.hpdd.lab.intel.com'}, {u'use': True, u'volume_id': 5, u'primary': False, u'host': u'/api/host/2/', u'path': u'/dev/disk/by-id/scsi-0QEMU_QEMU_HARDDISK_disk2', u'host_id': 2, u'resource_uri': u'/api/volume_node/6/', u'id': 6, u'host_label': u'lotus-9vm6.lotus.hpdd.lab.intel.com'}, {u'use': False, u'volume_id': 5, u'primary': False, u'host': u'/api/host/3/', u'path': u'/dev/disk/by-id/scsi-0QEMU_QEMU_HARDDISK_disk2', u'host_id': 3, u'resource_uri': u'/api/volume_node/11/', u'id': 11, u'host_label': u'lotus-9vm7.lotus.hpdd.lab.intel.com'}, {u'use': True, u'volume_id': 5, u'primary': True, u'host': u'/api/host/1/', u'path': u'/dev/disk/by-id/scsi-0QEMU_QEMU_HARDDISK_disk2', u'host_id': 1, u'resource_uri': u'/api/volume_node/16/', u'id': 16, u'host_label': u'lotus-9vm5.lotus.hpdd.lab.intel.com'}], u'filesystem_type': None, u'label': u'disk2', u'usable_for_lustre': True, u'resource_uri': u'/api/volume/5/', u'id': 5, u'size': u'10737418240'}
{u'status': u'configured-ha', u'kind': u'ZfsPool', u'storage_resource': u'/api/storage_resource/83/', u'volume_nodes': [{u'use': True, u'volume_id': 6, u'primary': False, u'host': u'/api/host/2/', u'path': u'zfs_pool_scsi0QEMU_QEMU_HARDDISK_disk5', u'host_id': 2, u'resource_uri': u'/api/volume_node/9/', u'id': 9, u'host_label': u'lotus-9vm6.lotus.hpdd.lab.intel.com'}, {u'use': False, u'volume_id': 6, u'primary': False, u'host': u'/api/host/3/', u'path': u'zfs_pool_scsi0QEMU_QEMU_HARDDISK_disk5', u'host_id': 3, u'resource_uri': u'/api/volume_node/14/', u'id': 14, u'host_label': u'lotus-9vm7.lotus.hpdd.lab.intel.com'}, {u'use': True, u'volume_id': 6, u'primary': True, u'host': u'/api/host/1/', u'path': u'zfs_pool_scsi0QEMU_QEMU_HARDDISK_disk5', u'host_id': 1, u'resource_uri': u'/api/volume_node/19/', u'id': 19, u'host_label': u'lotus-9vm5.lotus.hpdd.lab.intel.com'}], u'filesystem_type': None, u'label': u'zfs_pool_scsi0QEMU_QEMU_HARDDISK_disk5', u'usable_for_lustre': True, u'resource_uri': u'/api/volume/6/', u'id': 6, u'size': u'10672993730'}
{u'status': u'configured-ha', u'kind': u'ZfsPool', u'storage_resource': u'/api/storage_resource/47/', u'volume_nodes': [{u'use': True, u'volume_id': 1, u'primary': False, u'host': u'/api/host/4/', u'path': u'zfs_pool_scsi0QEMU_QEMU_HARDDISK_disk1', u'host_id': 4, u'resource_uri': u'/api/volume_node/3/', u'id': 3, u'host_label': u'lotus-9vm8.lotus.hpdd.lab.intel.com'}, {u'use': False, u'volume_id': 1, u'primary': False, u'host': u'/api/host/2/', u'path': u'zfs_pool_scsi0QEMU_QEMU_HARDDISK_disk1', u'host_id': 2, u'resource_uri': u'/api/volume_node/8/', u'id': 8, u'host_label': u'lotus-9vm6.lotus.hpdd.lab.intel.com'}, {u'use': True, u'volume_id': 1, u'primary': True, u'host': u'/api/host/3/', u'path': u'zfs_pool_scsi0QEMU_QEMU_HARDDISK_disk1', u'host_id': 3, u'resource_uri': u'/api/volume_node/13/', u'id': 13, u'host_label': u'lotus-9vm7.lotus.hpdd.lab.intel.com'}, {u'use': False, u'volume_id': 1, u'primary': False, u'host': u'/api/host/1/', u'path': u'zfs_pool_scsi0QEMU_QEMU_HARDDISK_disk1', u'host_id': 1, u'resource_uri': u'/api/volume_node/18/', u'id': 18, u'host_label': u'lotus-9vm5.lotus.hpdd.lab.intel.com'}], u'filesystem_type': None, u'label': u'zfs_pool_scsi0QEMU_QEMU_HARDDISK_disk1', u'usable_for_lustre': True, u'resource_uri': u'/api/volume/1/', u'id': 1, u'size': u'10672993730'}
{u'status': u'configured-ha', u'kind': u'SCSI device', u'storage_resource': u'/api/storage_resource/26/', u'volume_nodes': [{u'use': False, u'volume_id': 2, u'primary': False, u'host': u'/api/host/4/', u'path': u'/dev/disk/by-id/scsi-0QEMU_QEMU_HARDDISK_disk4', u'host_id': 4, u'resource_uri': u'/api/volume_node/2/', u'id': 2, u'host_label': u'lotus-9vm8.lotus.hpdd.lab.intel.com'}, {u'use': True, u'volume_id': 2, u'primary': True, u'host': u'/api/host/2/', u'path': u'/dev/disk/by-id/scsi-0QEMU_QEMU_HARDDISK_disk4', u'host_id': 2, u'resource_uri': u'/api/volume_node/7/', u'id': 7, u'host_label': u'lotus-9vm6.lotus.hpdd.lab.intel.com'}, {u'use': False, u'volume_id': 2, u'primary': False, u'host': u'/api/host/3/', u'path': u'/dev/disk/by-id/scsi-0QEMU_QEMU_HARDDISK_disk4', u'host_id': 3, u'resource_uri': u'/api/volume_node/12/', u'id': 12, u'host_label': u'lotus-9vm7.lotus.hpdd.lab.intel.com'}, {u'use': True, u'volume_id': 2, u'primary': False, u'host': u'/api/host/1/', u'path': u'/dev/disk/by-id/scsi-0QEMU_QEMU_HARDDISK_disk4', u'host_id': 1, u'resource_uri': u'/api/volume_node/17/', u'id': 17, u'host_label': u'lotus-9vm5.lotus.hpdd.lab.intel.com'}], u'filesystem_type': None, u'label': u'disk4', u'usable_for_lustre': True, u'resource_uri': u'/api/volume/2/', u'id': 2, u'size': u'10737418240'}
{u'status': u'configured-ha', u'kind': u'ZfsPool', u'storage_resource': u'/api/storage_resource/49/', u'volume_nodes': [{u'use': False, u'volume_id': 3, u'primary': False, u'host': u'/api/host/1/', u'path': u'zfs_pool_scsi0QEMU_QEMU_HARDDISK_disk3', u'host_id': 1, u'resource_uri': u'/api/volume_node/20/', u'id': 20, u'host_label': u'lotus-9vm5.lotus.hpdd.lab.intel.com'}, {u'use': True, u'volume_id': 3, u'primary': True, u'host': u'/api/host/4/', u'path': u'zfs_pool_scsi0QEMU_QEMU_HARDDISK_disk3', u'host_id': 4, u'resource_uri': u'/api/volume_node/21/', u'id': 21, u'host_label': u'lotus-9vm8.lotus.hpdd.lab.intel.com'}, {u'use': False, u'volume_id': 3, u'primary': False, u'host': u'/api/host/2/', u'path': u'zfs_pool_scsi0QEMU_QEMU_HARDDISK_disk3', u'host_id': 2, u'resource_uri': u'/api/volume_node/22/', u'id': 22, u'host_label': u'lotus-9vm6.lotus.hpdd.lab.intel.com'}, {u'use': True, u'volume_id': 3, u'primary': False, u'host': u'/api/host/3/', u'path': u'zfs_pool_scsi0QEMU_QEMU_HARDDISK_disk3', u'host_id': 3, u'resource_uri': u'/api/volume_node/23/', u'id': 23, u'host_label': u'lotus-9vm7.lotus.hpdd.lab.intel.com'}], u'filesystem_type': None, u'label': u'zfs_pool_scsi0QEMU_QEMU_HARDDISK_disk3', u'usable_for_lustre': True, u'resource_uri': u'/api/volume/3/', u'id': 3, u'size': u'10672993730'}
{u'status': u'configured-ha', u'kind': u'SCSI device', u'storage_resource': u'/api/storage_resource/24/', u'volume_nodes': [{u'use': False, u'volume_id': 5, u'primary': False, u'host': u'/api/host/4/', u'path': u'/dev/disk/by-id/scsi-0QEMU_QEMU_HARDDISK_disk2', u'host_id': 4, u'resource_uri': u'/api/volume_node/1/', u'id': 1, u'host_label': u'lotus-9vm8.lotus.hpdd.lab.intel.com'}, {u'use': True, u'volume_id': 5, u'primary': False, u'host': u'/api/host/2/', u'path': u'/dev/disk/by-id/scsi-0QEMU_QEMU_HARDDISK_disk2', u'host_id': 2, u'resource_uri': u'/api/volume_node/6/', u'id': 6, u'host_label': u'lotus-9vm6.lotus.hpdd.lab.intel.com'}, {u'use': False, u'volume_id': 5, u'primary': False, u'host': u'/api/host/3/', u'path': u'/dev/disk/by-id/scsi-0QEMU_QEMU_HARDDISK_disk2', u'host_id': 3, u'resource_uri': u'/api/volume_node/11/', u'id': 11, u'host_label': u'lotus-9vm7.lotus.hpdd.lab.intel.com'}, {u'use': True, u'volume_id': 5, u'primary': True, u'host': u'/api/host/1/', u'path': u'/dev/disk/by-id/scsi-0QEMU_QEMU_HARDDISK_disk2', u'host_id': 1, u'resource_uri': u'/api/volume_node/16/', u'id': 16, u'host_label': u'lotus-9vm5.lotus.hpdd.lab.intel.com'}], u'filesystem_type': None, u'label': u'disk2', u'usable_for_lustre': True, u'resource_uri': u'/api/volume/5/', u'id': 5, u'size': u'10737418240'}
{u'status': u'configured-ha', u'kind': u'ZfsPool', u'storage_resource': u'/api/storage_resource/83/', u'volume_nodes': [{u'use': True, u'volume_id': 6, u'primary': False, u'host': u'/api/host/2/', u'path': u'zfs_pool_scsi0QEMU_QEMU_HARDDISK_disk5', u'host_id': 2, u'resource_uri': u'/api/volume_node/9/', u'id': 9, u'host_label': u'lotus-9vm6.lotus.hpdd.lab.intel.com'}, {u'use': False, u'volume_id': 6, u'primary': False, u'host': u'/api/host/3/', u'path': u'zfs_pool_scsi0QEMU_QEMU_HARDDISK_disk5', u'host_id': 3, u'resource_uri': u'/api/volume_node/14/', u'id': 14, u'host_label': u'lotus-9vm7.lotus.hpdd.lab.intel.com'}, {u'use': True, u'volume_id': 6, u'primary': True, u'host': u'/api/host/1/', u'path': u'zfs_pool_scsi0QEMU_QEMU_HARDDISK_disk5', u'host_id': 1, u'resource_uri': u'/api/volume_node/19/', u'id': 19, u'host_label': u'lotus-9vm5.lotus.hpdd.lab.intel.com'}], u'filesystem_type': None, u'label': u'zfs_pool_scsi0QEMU_QEMU_HARDDISK_disk5', u'usable_for_lustre': True, u'resource_uri': u'/api/volume/6/', u'id': 6, u'size': u'10672993730'}
COMMAND 20: FAILED
-----------------------------------------------------------
{u'jobs': [u'/api/job/60/', u'/api/job/61/', u'/api/job/62/', u'/api/job/63/', u'/api/job/64/', u'/api/job/65/', u'/api/job/66/', u'/api/job/67/', u'/api/job/68/', u'/api/job/69/', u'/api/job/70/', u'/api/job/71/', u'/api/job/72/', u'/api/job/73/', u'/api/job/74/', u'/api/job/75/', u'/api/job/76/', u'/api/job/81/'], u'complete': True, u'created_at': u'2017-08-12T22:57:49.529124', u'message': u'Creating filesystem testfs', u'cancelled': True, u'errored': True, u'resource_uri': u'/api/command/20/', u'id': 20, u'logs': u'\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n\n'}

Job 70 Errored (Register testfs-OST0001):
{u'commands': [u'/api/command/20/'], u'write_locks': [{u'locked_item_content_type_id': 91, u'locked_item_id': 4, u'locked_item_uri': u'/api/target/4/', u'resource_uri': u''}], u'description': u'Register testfs-OST0001', u'read_locks': [{u'locked_item_content_type_id': 10, u'locked_item_id': 4, u'locked_item_uri': u'/api/lnet_configuration/4/', u'resource_uri': u''}, {u'locked_item_content_type_id': 93, u'locked_item_id': 1, u'locked_item_uri': u'/api/target/1/', u'resource_uri': u''}, {u'locked_item_content_type_id': 92, u'locked_item_id': 2, u'locked_item_uri': u'/api/target/2/', u'resource_uri': u''}, {u'locked_item_content_type_id': 109, u'locked_item_id': 1, u'locked_item_uri': u'/api/filesystem/1/', u'resource_uri': u''}, {u'locked_item_content_type_id': 10, u'locked_item_id': 4, u'locked_item_uri': u'/api/lnet_configuration/4/', u'resource_uri': u''}, {u'locked_item_content_type_id': 49, u'locked_item_id': 4, u'locked_item_uri': u'/api/pacemaker_configuration/4/', u'resource_uri': u''}, {u'locked_item_content_type_id': 10, u'locked_item_id': 3, u'locked_item_uri': u'/api/lnet_configuration/3/', u'resource_uri': u''}, {u'locked_item_content_type_id': 49, u'locked_item_id': 3, u'locked_item_uri': u'/api/pacemaker_configuration/3/', u'resource_uri': u''}, {u'locked_item_content_type_id': 109, u'locked_item_id': 1, u'locked_item_uri': u'/api/filesystem/1/', u'resource_uri': u''}, {u'locked_item_content_type_id': 10, u'locked_item_id': 4, u'locked_item_uri': u'/api/lnet_configuration/4/', u'resource_uri': u''}, {u'locked_item_content_type_id': 49, u'locked_item_id': 4, u'locked_item_uri': u'/api/pacemaker_configuration/4/', u'resource_uri': u''}, {u'locked_item_content_type_id': 10, u'locked_item_id': 3, u'locked_item_uri': u'/api/lnet_configuration/3/', u'resource_uri': u''}, {u'locked_item_content_type_id': 49, u'locked_item_id': 3, u'locked_item_uri': u'/api/pacemaker_configuration/3/', u'resource_uri': u''}], u'class_name': u'RegisterTargetJob', 
u'step_results': {u'/api/step/125/': None}, u'created_at': u'2017-08-12T22:57:49.828407', u'modified_at': u'2017-08-12T22:57:49.828379', u'available_transitions': [], u'state': u'complete', u'steps': [u'/api/step/125/'], u'cancelled': False, u'errored': True, u'wait_for': [u'/api/job/66/', u'/api/job/69/', u'/api/job/62/'], u'id': 70, u'resource_uri': u'/api/job/70/'}

Step 125 failed:
step_count: 2
console: modprobe osd_ldiskfs: 0


mount -t lustre /dev/disk/by-id/scsi-0QEMU_QEMU_HARDDISK_disk2 /mnt/testfs-OST0001: 108

mount.lustre: increased /sys/block/sde/queue/max_sectors_kb from 512 to 16384
mount.lustre: mount /dev/sde at /mnt/testfs-OST0001 failed: Cannot send after transport endpoint shutdown


description: RegisterTargetStep: {'device_path': u'/dev/disk/by-id/scsi-0QEMU_QEMU_HARDDISK_disk2', 'primary_host': <ManagedHost: lotus-9vm8.lotus.hpdd.lab.intel.com>, 'mount_point': u'/mnt/testfs-OST0001', 'backfstype': 'ldiskfs', 'target': <ManagedTarget: testfs-OST0001>}
class_name: RegisterTargetStep
backtrace: Traceback (most recent call last):

  File "/usr/lib/python2.7/site-packages/chroma_agent/device_plugins/action_runner.py", line 164, in run

  File "/usr/lib/python2.7/site-packages/chroma_agent/plugin_manager.py", line 305, in run

  File "/usr/lib/python2.7/site-packages/chroma_agent/action_plugins/manage_targets.py", line 282, in register_target

  File "/usr/lib/python2.7/site-packages/iml_common/filesystems/filesystem_ldiskfs.py", line 61, in mount
    raise RuntimeError("Error (%s) mounting '%s': '%s' '%s'" % (result.rc, mount_point, result.stdout, result.stderr))

RuntimeError: Error (108) mounting '/mnt/testfs-OST0001': '' 'mount.lustre: increased /sys/block/sde/queue/max_sectors_kb from 512 to 16384
mount.lustre: mount /dev/sde at /mnt/testfs-OST0001 failed: Cannot send after transport endpoint shutdown
'

created_at: 2017-08-12T23:01:08.492998
args: {u'device_path': u'/dev/disk/by-id/scsi-0QEMU_QEMU_HARDDISK_disk2', u'primary_host': u'lotus-9vm8.lotus.hpdd.lab.intel.com', u'mount_point': u'/mnt/testfs-OST0001', u'backfstype': u'ldiskfs', u'target': u'testfs-OST0001'}
modified_at: 2017-08-12T23:01:09.735407
step_index: 0
state: failed
result: 
resource_uri: /api/step/125/
id: 125
log: 


--------------------- >> end captured stdout << ----------------------
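
For triage, the return code in the `RuntimeError` above can be mapped back to an errno name. On Linux, errno 108 is `ESHUTDOWN`, whose message text matches the "Cannot send after transport endpoint shutdown" string printed by `mount.lustre`. The following is a minimal sketch, not part of chroma-agent: the helper name `describe_mount_failure` and the regex are hypothetical, and it assumes the error text keeps the `Error (<rc>) mounting '<mount_point>'` shape seen in the backtrace and that the return code is an errno value.

```python
import errno
import os
import re

# Pattern modelled on the RuntimeError text in the backtrace above, e.g.
#   Error (108) mounting '/mnt/testfs-OST0001': '' 'mount.lustre: ...'
# The format is taken from this one log; other versions may differ.
MOUNT_ERROR_RE = re.compile(
    r"Error \((?P<rc>\d+)\) mounting '(?P<mount_point>[^']+)'"
)


def describe_mount_failure(message):
    """Return (rc, mount_point, errno_name) parsed from the text, or None."""
    match = MOUNT_ERROR_RE.search(message)
    if match is None:
        return None
    rc = int(match.group("rc"))
    # errno.errorcode maps numeric codes to symbolic names for the current
    # platform; codes it does not know yield None.
    return rc, match.group("mount_point"), errno.errorcode.get(rc)


if __name__ == "__main__":
    sample = "Error (108) mounting '/mnt/testfs-OST0001': '' 'mount.lustre: ...'"
    rc, mount_point, name = describe_mount_failure(sample)
    # The symbolic name and strerror text are platform-dependent.
    print("rc=%d mount_point=%s errno=%s (%s)" % (rc, mount_point, name, os.strerror(rc)))
```

The numeric-to-name mapping is platform-specific, so this is only meaningful when run on the same OS family as the storage servers.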

@jgrund
Member

jgrund commented Aug 17, 2017

Seeing this in 20 of the last 20 runs of SSI.

@brianjmurrell
Contributor

Another

@brianjmurrell
Contributor

Another

@brianjmurrell
Contributor

Another

@tanabarr tanabarr closed this Aug 22, 2017
@tanabarr tanabarr deleted the debug-mountorimportstep branch August 22, 2017 12:25
@jgrund
Member

jgrund commented Aug 22, 2017

Moved to #230

utopiabound pushed a commit that referenced this pull request Apr 5, 2021
* Bump Copyright to 2021

Signed-off-by: Nick Linker <nlinker@gmail.com>

* Typo fixed

Signed-off-by: Nick Linker <nlinker@gmail.com>