@@ -164,46 +164,56 @@ Devlink health reporters
164164
165165NPA Reporters
166166-------------
167- The NPA reporters are responsible for reporting and recovering the following group of errors
167+ The NPA reporters are responsible for reporting and recovering the following group of errors:
168+
1681691. GENERAL events
170+
169171 - Error due to operation of unmapped PF.
170172 - Error due to disabled alloc/free for other HW blocks (NIX, SSO, TIM, DPI and AURA).
173+
1711742. ERROR events
175+
172176 - Fault due to NPA_AQ_INST_S read or NPA_AQ_RES_S write.
173177 - AQ Doorbell Error.
178+
1741793. RAS events
180+
175181 - RAS Error Reporting for NPA_AQ_INST_S/NPA_AQ_RES_S.
182+
1761834. RVU events
184+
177185 - Error due to unmapped slot.
178186
179- Sample Output
180- -------------
181- ~# devlink health
182- pci/0002:01:00.0:
183- reporter hw_npa_intr
184- state healthy error 2872 recover 2872 last_dump_date 2020-12-10 last_dump_time 09:39:09 grace_period 0 auto_recover true auto_dump true
185- reporter hw_npa_gen
186- state healthy error 2872 recover 2872 last_dump_date 2020-12-11 last_dump_time 04:43:04 grace_period 0 auto_recover true auto_dump true
187- reporter hw_npa_err
188- state healthy error 2871 recover 2871 last_dump_date 2020-12-10 last_dump_time 09:39:17 grace_period 0 auto_recover true auto_dump true
189- reporter hw_npa_ras
190- state healthy error 0 recover 0 last_dump_date 2020-12-10 last_dump_time 09:32:40 grace_period 0 auto_recover true auto_dump true
187+ Sample Output::
188+
189+ ~# devlink health
190+ pci/0002:01:00.0:
191+ reporter hw_npa_intr
192+ state healthy error 2872 recover 2872 last_dump_date 2020-12-10 last_dump_time 09:39:09 grace_period 0 auto_recover true auto_dump true
193+ reporter hw_npa_gen
194+ state healthy error 2872 recover 2872 last_dump_date 2020-12-11 last_dump_time 04:43:04 grace_period 0 auto_recover true auto_dump true
195+ reporter hw_npa_err
196+ state healthy error 2871 recover 2871 last_dump_date 2020-12-10 last_dump_time 09:39:17 grace_period 0 auto_recover true auto_dump true
197+ reporter hw_npa_ras
198+ state healthy error 0 recover 0 last_dump_date 2020-12-10 last_dump_time 09:32:40 grace_period 0 auto_recover true auto_dump true
191199
192200Each reporter dumps the
201+
193202 - Error Type
194203 - Error Register value
195204 - Reason in words
196205
197- For eg:
198- ~# devlink health dump show pci/0002:01:00.0 reporter hw_npa_gen
199- NPA_AF_GENERAL:
200- NPA General Interrupt Reg : 1
201- NIX0: free disabled RX
202- ~# devlink health dump show pci/0002:01:00.0 reporter hw_npa_intr
203- NPA_AF_RVU:
204- NPA RVU Interrupt Reg : 1
205- Unmap Slot Error
206- ~# devlink health dump show pci/0002:01:00.0 reporter hw_npa_err
207- NPA_AF_ERR:
208- NPA Error Interrupt Reg : 4096
209- AQ Doorbell Error
206+ For example::
207+
208+ ~# devlink health dump show pci/0002:01:00.0 reporter hw_npa_gen
209+ NPA_AF_GENERAL:
210+ NPA General Interrupt Reg : 1
211+ NIX0: free disabled RX
212+ ~# devlink health dump show pci/0002:01:00.0 reporter hw_npa_intr
213+ NPA_AF_RVU:
214+ NPA RVU Interrupt Reg : 1
215+ Unmap Slot Error
216+ ~# devlink health dump show pci/0002:01:00.0 reporter hw_npa_err
217+ NPA_AF_ERR:
218+ NPA Error Interrupt Reg : 4096
219+ AQ Doorbell Error
0 commit comments