Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

rasdaemon: ras-mc-ctl --error-count random sorts output #145

Open
walterav1984 opened this issue Feb 21, 2024 · 0 comments
Open

rasdaemon: ras-mc-ctl --error-count random sorts output #145

walterav1984 opened this issue Feb 21, 2024 · 0 comments

Comments

@walterav1984
Copy link
Contributor

walterav1984 commented Feb 21, 2024

Noticed this random sorting behavior of dimm numbers and channel/riser locations for a while but also verified git master version f9cb13b of 2024-02-05 which shows the same behavior. Not sure if its bug or a feature, but this happens on all tested machines (DQ57TM and MacPro 1,1 2,1 3,1) running repository versions 0.68 of debian 12 /ubuntu 23.10 and f9cb13b.

Its independent of the fact labels are used and or registered.

$ sudo ras-mc-ctl --error-count #1
Label   	CE	UE
DIMM2_RA	0	0
DIMM2_RB	0	0
DIMM1_RB	0	0
DIMM1_RA	0	0
DIMM4_RA	0	0
DIMM4_RB	0	0
DIMM3_RB	0	0
DIMM3_RA	0	0
$ sudo ras-mc-ctl --error-count #2
Label   	CE	UE
DIMM1_RB	0	0
DIMM4_RA	0	0
DIMM1_RA	0	0
DIMM4_RB	0	0
DIMM3_RA	0	0
DIMM2_RB	0	0
DIMM3_RB	0	0
DIMM2_RA	0	0
$ sudo ras-mc-ctl --error-count #3
Label   	CE	UE
DIMM3_RA	0	0
DIMM1_RB	0	0
DIMM2_RB	0	0
DIMM4_RA	0	0
DIMM1_RA	0	0
DIMM3_RB	0	0
DIMM4_RB	0	0
DIMM2_RA	0	0
$ sudo ras-mc-ctl --error-count #4
Label   	CE	UE
DIMM2_RA	0	0
DIMM4_RB	0	0
DIMM3_RA	0	0
DIMM1_RB	0	0
DIMM3_RB	0	0
DIMM1_RA	0	0
DIMM2_RB	0	0
DIMM4_RA	0	0

$ sudo ras-mc-ctl --error-count | sort
DIMM1_RA	0	0
DIMM1_RB	0	0
DIMM2_RA	0	0
DIMM2_RB	0	0
DIMM3_RA	0	0
DIMM3_RB	0	0
DIMM4_RA	0	0
DIMM4_RB	0	0
Label   	CE	UE

$ sudo ras-mc-ctl --error-count #1
Label                      	CE	UE
mc#0branch#0channel#1slot#0	0	0
mc#0branch#1channel#0slot#0	0	0
mc#0branch#0channel#0slot#1	0	0
mc#0branch#1channel#1slot#1	0	0
mc#0branch#1channel#1slot#0	0	0
mc#0branch#0channel#0slot#0	0	0
mc#0branch#1channel#0slot#1	0	0
mc#0branch#0channel#1slot#1	0	0

$ sudo ras-mc-ctl --error-count #2
Label                      	CE	UE
mc#0branch#1channel#0slot#1	0	0
mc#0branch#0channel#0slot#0	0	0
mc#0branch#0channel#0slot#1	0	0
mc#0branch#1channel#0slot#0	0	0
mc#0branch#1channel#1slot#1	0	0
mc#0branch#0channel#1slot#0	0	0
mc#0branch#0channel#1slot#1	0	0
mc#0branch#1channel#1slot#0	0	0

$ sudo ras-mc-ctl --error-count #3
Label                      	CE	UE
mc#0branch#0channel#1slot#1	0	0
mc#0branch#1channel#0slot#0	0	0
mc#0branch#0channel#0slot#1	0	0
mc#0branch#1channel#1slot#0	0	0
mc#0branch#1channel#1slot#1	0	0
mc#0branch#0channel#0slot#0	0	0
mc#0branch#1channel#0slot#1	0	0
mc#0branch#0channel#1slot#0	0	0

$ sudo ras-mc-ctl --error-count #4
Label                      	CE	UE
mc#0branch#0channel#0slot#0	0	0
mc#0branch#1channel#0slot#0	0	0
mc#0branch#1channel#1slot#0	0	0
mc#0branch#0channel#1slot#0	0	0
mc#0branch#1channel#0slot#1	0	0
mc#0branch#0channel#0slot#1	0	0
mc#0branch#0channel#1slot#1	0	0
mc#0branch#1channel#1slot#1	0	0

Compared to --guess-labels and --print-labels which use their own unique but fixed pattern.

$ sudo ras-mc-ctl --guess-labels
memory stick 'DIMM 1' is located at 'DIMM Riser A'
memory stick 'DIMM 2' is located at 'DIMM Riser A'
memory stick 'DIMM 1' is located at 'DIMM Riser B'
memory stick 'DIMM 2' is located at 'DIMM Riser B'
memory stick 'DIMM 3' is located at 'DIMM Riser A'
memory stick 'DIMM 4' is located at 'DIMM Riser A'
memory stick 'DIMM 3' is located at 'DIMM Riser B'
memory stick 'DIMM 4' is located at 'DIMM Riser B'

$ sudo ras-mc-ctl --print-labels #edited labels but not registered yet
LOCATION                            CONFIGURED LABEL     SYSFS CONTENTS      
mc0 branch 0 channel 0 slot 0       DIMM1_RA             mc#0branch#0channel#0slot#0
mc0 branch 0 channel 0 slot 1       DIMM3_RA             mc#0branch#0channel#0slot#1
mc0 branch 0 channel 1 slot 0       DIMM2_RA             mc#0branch#0channel#1slot#0
mc0 branch 0 channel 1 slot 1       DIMM4_RA             mc#0branch#0channel#1slot#1
mc0 branch 1 channel 0 slot 0       DIMM1_RB             mc#0branch#1channel#0slot#0
mc0 branch 1 channel 0 slot 1       DIMM3_RB             mc#0branch#1channel#0slot#1
mc0 branch 1 channel 1 slot 0       DIMM2_RB             mc#0branch#1channel#1slot#0
mc0 branch 1 channel 1 slot 1       DIMM4_RB             mc#0branch#1channel#1slot#1

$ sudo ras-mc-ctl --register-labels
$ sudo ras-mc-ctl --print-labels
LOCATION                            CONFIGURED LABEL     SYSFS CONTENTS      
mc0 branch 0 channel 0 slot 0       DIMM1_RA             DIMM1_RA            
mc0 branch 0 channel 0 slot 1       DIMM3_RA             DIMM3_RA            
mc0 branch 0 channel 1 slot 0       DIMM2_RA             DIMM2_RA            
mc0 branch 0 channel 1 slot 1       DIMM4_RA             DIMM4_RA            
mc0 branch 1 channel 0 slot 0       DIMM1_RB             DIMM1_RB            
mc0 branch 1 channel 0 slot 1       DIMM3_RB             DIMM3_RB            
mc0 branch 1 channel 1 slot 0       DIMM2_RB             DIMM2_RB            
mc0 branch 1 channel 1 slot 1       DIMM4_RB             DIMM4_RB
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant