[DEV] rhels7.4/rhels7.2 diskless hang on Power S822LC #274

Closed
immarvin opened this issue Oct 19, 2015 · 8 comments

Comments

@immarvin
Contributor

immarvin commented Oct 19, 2015

I tried the following task on an IBM Power S822LC with RHEL-7.2-20150820.0-Server-ppc64le-dvd1.iso. Status:

diskless provision: FAIL
description:

hang on reboot:

[   95.331916] dracut: Switching root
[   95.338809] ip_tables: (C) 2000-2006 Netfilter Core Team
[   95.339023] systemd[1]: Inserted module 'ip_tables'
[   95.343307] systemd[1]: systemd 219 running in system mode. (+PAM +AUDIT +SELINUX +IMA -APPARMOR +SMACK +SYSVINIT +UTMP +LIBCRYPTSETUP +GCRYPT +GNUTLS +ACL +XZ -LZ4 -SECCOMP +BLKID +ELFUTILS +KMOD +IDN)
[   95.343917] systemd[1]: Detected architecture ppc64-le.

Welcome to Red Hat Enterprise Linux Server 7.2 Beta (Maipo)!

[   95.344189] systemd[1]: Initializing machine ID from random generator.
[   95.363919] systemd[1]: Cannot add dependency job for unit display-manager.service, ignoring: Unit display-manager.service failed to load: No such file or directory.
[   95.364846] systemd[1]: Started Dispatch Password Requests to Console Directory Watch.
[   95.365043] systemd[1]: Starting Dispatch Password Requests to Console Directory Watch.
[  OK  ] Set up automount Arbitrary Executab...ats File System Automount Point.
[   95.365561] systemd[1]: Set up automount Arbitrary Executable File Formats File System Automount Point.
[   95.365646] systemd[1]: Starting Arbitrary Executable File Formats File System Automount Point.
[  OK  ] Reached target Encrypted Volumes.
[   95.365997] systemd[1]: Reached target Encrypted Volumes.
[   95.366048] systemd[1]: Starting Encrypted Volumes.
[  OK  ] Reached target Swap.
[   95.366212] systemd[1]: Reached target Swap.
[   95.366263] systemd[1]: Starting Swap.
[   95.366340] systemd[1]: Started Forward Password Requests to Wall Directory Watch.
[   95.366412] systemd[1]: Starting Forward Password Requests to Wall Directory Watch.
[  OK  ] Created slice Root Slice.
[   95.366916] systemd[1]: Created slice Root Slice.
[   95.366968] systemd[1]: Starting Root Slice.
[  OK  ] Created slice System Slice.
[   95.367221] systemd[1]: Created slice System Slice.
[   95.367278] systemd[1]: Starting System Slice.
[  OK  ] Created slice system-serial\x2dgetty.slice.
[   95.367558] systemd[1]: Created slice system-serial\x2dgetty.slice.
[   95.367773] systemd[1]: Starting system-serial\x2dgetty.slice.
[   95.367839] systemd[1]: Started Replay Read-Ahead Data.
[  OK  ] Created slice system-getty.slice.
[   95.368100] systemd[1]: Created slice system-getty.slice.
[   95.368151] systemd[1]: Starting system-getty.slice.
[  OK  ] Listening on /dev/initctl Compatibility Named Pipe.
[   95.368382] systemd[1]: Listening on /dev/initctl Compatibility Named Pipe.
[   95.368442] systemd[1]: Starting /dev/initctl Compatibility Named Pipe.
[  OK  ] Listening on Delayed Shutdown Socket.
[   95.368846] systemd[1]: Listening on Delayed Shutdown Socket.
[   95.368908] systemd[1]: Starting Delayed Shutdown Socket.
[  OK  ] Listening on udev Kernel Socket.
[   95.369088] systemd[1]: Listening on udev Kernel Socket.
[   95.369140] systemd[1]: Starting udev Kernel Socket.
[  OK  ] Listening on Journal Socket.
[   95.369353] systemd[1]: Listening on Journal Socket.
[   95.369567] systemd[1]: Starting Journal Socket.
[   95.370599] systemd[1]: Mounting Huge Pages File System...
         Mounting Huge Pages File System...
[   95.371258] systemd[1]: Mounting POSIX Message Queue File System...
         Mounting POSIX Message Queue File System...
[   95.372016] systemd[1]: Starting Journal Service...
         Starting Journal Service...
[   95.373090] systemd[1]: Starting Create list of required static device nodes for the current kernel...
         Starting Create list of required st... nodes for the current kernel...
[   95.373823] systemd[1]: Mounting NFSD configuration filesystem...
         Mounting NFSD configuration filesystem...
[  OK  ] Created slice User and Session Slice.
[   95.374282] systemd[1]: Created slice User and Session Slice.
[   95.374339] systemd[1]: Starting User and Session Slice.
[  OK  ] Reached target Slices.
[   95.374502] systemd[1]: Reached target Slices.
[   95.374549] systemd[1]: Starting Slices.
[  OK  ] Listening on udev Control Socket.
[   95.374715] systemd[1]: Listening on udev Control Socket.
[   95.374761] systemd[1]: Starting udev Control Socket.
[   95.375315] systemd[1]: Mounting Debug File System...
         Mounting Debug File System...
[   95.375462] systemd[1]: Started Kernel Module supporting RPCSEC_GSS.
[   95.375936] systemd[1]: Starting Collect Read-Ahead Data...
         Starting Collect Read-Ahead Data...
[  OK  ] Mounted Huge Pages File System.
[   95.376940] systemd[1]: Mounted Huge Pages File System.
[  OK  ] Mounted POSIX Message Queue File System.
[   95.377100] systemd[1]: Mounted POSIX Message Queue File System.
[  OK  ] Mounted Debug File System.
[   95.377484] systemd[1]: Mounted Debug File System.
[  OK  ] Started Create list of required sta...ce nodes for the current kernel.
[   95.378019] systemd[1]: Started Create list of required static device nodes for the current kernel.
[  OK  ] Started Collect Read-Ahead Data.
[   95.378363] systemd[1]: Started Collect Read-Ahead Data.
[   95.384051] systemd[1]: Starting Setup Virtual Console...
         Starting Setup Virtual Console...
[   95.384196] systemd[1]: Started Set Up Additional Binary Formats.
[   95.384272] systemd[1]: Started Load legacy module configuration.
[   95.384415] systemd[1]: Started Load Kernel Modules.
[   95.384821] systemd[1]: Starting Apply Kernel Variables...
         Starting Apply Kernel Variables...
[   95.385328] systemd[1]: Mounting Configuration File System...
         Mounting Configuration File System...
[   95.385477] systemd[1]: Mounted FUSE Control File System.
[   95.385889] systemd[1]: Starting Remount Root and Kernel File Systems...
[   95.400636] Installing knfsd (copyright (C) 1996 okir@monad.swb.de).
         Starting Remount Root and Kernel File Systems...
[   95.423448] systemd[1]: Starting Create Static Device Nodes in /dev...
         Starting Create Static Device Nodes in /dev...
[  OK  ] Mounted NFSD configuration filesystem.
[   95.424114] systemd[1]: Mounted NFSD configuration filesystem.
[  OK  ] Mounted Configuration File System.
[   95.424272] systemd[1]: Mounted Configuration File System.
[  OK  ] Started Journal Service.
[   95.424456] systemd[1]: Started Journal Service.
[  OK  ] Started Setup Virtual Console.
[  OK  ] Started Apply Kernel Variables.
[  OK  ] Started Remount Root and Kernel File Systems.
[  OK  ] Started Create Static Device Nodes in /dev.
         Starting udev Kernel Device Manager...
         Starting Rebuild Dynamic Linker Cache...
         Starting Rebuild Hardware Database...
         Starting Configure read-only root support...
         Starting Load/Save Random Seed...
[  OK  ] Reached target Local File Systems (Pre).
         Starting Flush Journal to Persistent Storage...
[  OK  ] Started Load/Save Random Seed.
[  OK  ] Started udev Kernel Device Manager.
[   95.501416] systemd-journald[2322]: Received request to flush runtime journal from PID 1
[  OK  ] Started Flush Journal to Persistent Storage.
[  OK  ] Started Rebuild Dynamic Linker Cache.
[  OK  ] Started Configure read-only root support.
[  OK  ] Reached target Local File Systems.
         Starting Mark the need to relabel after reboot...
         Starting Rebuild Journal Catalog...
         Starting Preprocess NFS configuration...
         Starting Create Volatile Files and Directories...
[  OK  ] Started Mark the need to relabel after reboot.
[  OK  ] Started Rebuild Journal Catalog.
[  OK  ] Started Preprocess NFS configuration.
[  OK  ] Started Create Volatile Files and Directories.
         Starting Update UTMP about System Boot/Shutdown...
[  OK  ] Started Rebuild Hardware Database.
         Starting udev Coldplug all Devices...
         Starting Update is Completed...
[  OK  ] Started Update is Completed.
[  OK  ] Started udev Coldplug all Devices.
[  OK  ] Started Update UTMP about System Boot/Shutdown.
[  OK  ] Reached target System Initialization.
[  OK  ] Listening on D-Bus System Message Bus Socket.
[  OK  ] Listening on RPCbind Server Activation Socket.
[  OK  ] Reached target Sockets.
[  OK  ] Reached target Timers.
[  OK  ] Reached target Paths.
[  OK  ] Reached target Basic System.
         Starting LSB: Bring up/down networking...
[  OK  ] Started irqbalance daemon.
         Starting irqbalance daemon...
         Starting OpenSSH Server Key Generation...
         Starting Dump dmesg to /var/log/dmesg...
         Starting GSSAPI Proxy Daemon...
         Starting System Logging Service...
         Starting Login Service...
[  OK  ] Started D-Bus System Message Bus.
         Starting D-Bus System Message Bus...
[  OK  ] Started System Logging Service.
[  OK  ] Started Dump dmesg to /var/log/dmesg.
[  OK  ] Started GSSAPI Proxy Daemon.
[  OK  ] Reached target NFS client services.
[  OK  ] Reached target Remote File Systems (Pre).
[  OK  ] Reached target Remote File Systems.
         Starting Permit User Sessions...
[  OK  ] Started Permit User Sessions.
[  OK  ] Started Getty on tty1.
         Starting Getty on tty1...
[  OK  ] Started Login Service.
[  OK  ] Started OpenSSH Server Key Generation.
[   96.381899] shpchp: Standard Hot Plug PCI Controller Driver version: 0.4
[   96.390252] nvidiafb 0000:03:00.0: enabling device (0140 -> 0142)
[   96.390332] nvidiafb: Device ID: 10de102d 
[   96.390403] nvidiafb: HW is currently programmed for CRT
[   96.390444] nvidiafb: Using CRT on CRTC 0

A workaround for the rhels7.2 ppc64le diskless hang is to disable (blacklist) the "nvidiafb" driver with the following command and then run "nodeset":

# chdef c910f05c33 -p addkcmdline="modprobe.blacklist=nvidiafb"
@whowutwut whowutwut changed the title [DEV]rhels7.2 diskless hang on firestone [DEV] rhels7.2 diskless hang on Power S822LC Oct 19, 2015
@whowutwut
Member

@immarvin I updated the title to use IBM's official name for the server, Power S822LC. This will help users searching our issues to locate the problem.

@immarvin
Contributor Author

Some feedback from LTC:
"
This was for general info. Nvidia released its drivers for Linux pretty recently.
Until then, everybody used the nouveau driver (the open source nvidia driver) for nvidia devices.
"

So the solution is "modprobe.blacklist=nvidiafb".

@whowutwut
Member

Is this ok to close then?

@immarvin
Contributor Author

Hi @whowutwut, do we need to hard-code "modprobe.blacklist=nvidiafb" in the xCAT code, or just mention this as a known issue on the release page?

@whowutwut whowutwut modified the milestones: 2.11.1, 2.12 Apr 21, 2016
@whowutwut
Member

A bug has been opened against RH bugzilla: https://bugzilla.redhat.com/show_bug.cgi?id=1320002

For now, xCAT will document this as a known issue and provide the following workaround:

# chdef <noderange> -p addkcmdline="modprobe.blacklist=nvidiafb"

Please re-open if there is a request for xCAT to add a hard-coded workaround.
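
The workaround above, followed by the "nodeset" step mentioned earlier in this issue, could be scripted roughly as follows. This is only a sketch under assumptions: the `chdef` line comes from this issue, while the `lsdef -i` check, the `osimage=` argument to `nodeset`, the `rpower ... boot` step, and the node and image names are illustrative and should be checked against your own xCAT setup. The `run` and `apply_nvidiafb_workaround` helpers are hypothetical names, and the script only prints the commands unless `DRY_RUN=0` is set.

```shell
#!/bin/sh
# Sketch only: apply the nvidiafb blacklist workaround and re-provision.
# With DRY_RUN=1 (the default) the commands are printed, not executed.

run() {
    if [ "${DRY_RUN:-1}" = "1" ]; then
        echo "+ $*"
    else
        "$@"
    fi
}

# $1 = noderange, $2 = osimage name (both placeholders, not from this issue)
apply_nvidiafb_workaround() {
    noderange=$1
    osimage=$2
    # Append the blacklist parameter to the extra kernel command line
    # (this is the command documented in this issue).
    run chdef "$noderange" -p addkcmdline="modprobe.blacklist=nvidiafb"
    # Assumption: display the attribute to confirm it was recorded.
    run lsdef "$noderange" -i addkcmdline
    # Regenerate the network boot configuration for the diskless image.
    run nodeset "$noderange" osimage="$osimage"
    # Assumption: reboot the node(s) so they pick up the new command line.
    run rpower "$noderange" boot
}
```

For example, `apply_nvidiafb_workaround c910f05c33 <osimage>` prints the four commands for review; setting `DRY_RUN=0` executes them on the management node.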

@immarvin
Contributor Author

This issue is also encountered on rhels7.4 diskless.

@immarvin
Contributor Author

The workaround works well
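
One way to confirm that the workaround is in effect on a booted node is to check whether its kernel command line carries the blacklist entry. The helper below is a minimal sketch, not part of xCAT: `cmdline_blacklists` is a hypothetical function name, and it assumes the parameter may hold a comma-separated module list.

```shell
#!/bin/sh
# Sketch only: succeed (exit status 0) if the kernel command line in $2
# blacklists the module named in $1 via modprobe.blacklist=...
# The body runs in a subshell so that "set -f" (which disables globbing
# while the command line is split into words) does not leak to the caller.
cmdline_blacklists() (
    module=$1
    cmdline=$2
    set -f
    for arg in $cmdline; do
        case "$arg" in
            modprobe.blacklist=*)
                list=${arg#modprobe.blacklist=}
                # Wrap in commas so only whole module names match.
                case ",$list," in
                    *",$module,"*) exit 0 ;;
                esac
                ;;
        esac
    done
    exit 1
)
```

On the compute node this could be used as `cmdline_blacklists nvidiafb "$(cat /proc/cmdline)" && echo "nvidiafb is blacklisted"`.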

@zet809 zet809 changed the title [DEV] rhels7.2 diskless hang on Power S822LC [DEV] rhels7.4/rhels7.2 diskless hang on Power S822LC Sep 20, 2017
@immarvin immarvin reopened this Nov 2, 2017
@immarvin immarvin closed this as completed May 8, 2018