New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
phosphor-hwmon: remove sensors that fail with EAGAIN from bus #2327
Comments
A couple ideas to consider in supporting this would be:
|
It seems from the service file setup that adding an additional command line parameter is not feasible with how each hwmon instance is started. Going to propose option 2 as the best solution to the mailing list. |
This is a temporary fix until the following issues are completed: openbmc/openbmc#2327 openbmc/openbmc#2329 When an EAGAIN or an EREMOTEIO return code is received by hwmon from the OCC driver in the 4.13 kernel, they should be translated to an unavailable sensor(0x00) and failed sensor(0xFF) scaled values respectively. This will keep the OCC hwmon instance running and allow applications to continue using these sensors as they were reported under the mainline openbmc/linux 4.10 kernel. Tested: Verified return codes are caught and sensor value modified Change-Id: Ie61859863e7d88878caa942e5f5b062acabe67aa Signed-off-by: Matthew Barth <msbarth@us.ibm.com>
|
https://gerrit.openbmc-project.xyz/9825 Re-add removed sensors during monitoring |
This is a temporary fix until the following issues are completed: openbmc/openbmc#2327 openbmc/openbmc#2329 When an EAGAIN or an EREMOTEIO return code is received by hwmon from the OCC driver in the 4.13 kernel, they should be translated to an unavailable sensor(0x00) and failed sensor(0xFF) scaled values respectively. This will keep the OCC hwmon instance running and allow applications to continue using these sensors as they were reported under the mainline openbmc/linux 4.10 kernel. Tested: Verified return codes are caught and sensor value modified Change-Id: Ie61859863e7d88878caa942e5f5b062acabe67aa Signed-off-by: Matthew Barth <msbarth@us.ibm.com> Signed-off-by: Doyle Huang <doyle.sy.huang@mail.foxconn.com>
In the design call today, it was decided to treat sensors that fail with EAGAIN differently than other failures.
If a sensor read fails with EAGAIN past the retry threshold, it should be removed from D-Bus. Any clients that are monitoring that sensor can register for the InterfacesRemoved signal on it to know when it goes away and act accordingly. No hardware callout will be made.
The text was updated successfully, but these errors were encountered: