Feature Request: NVMe Temperature #256

Open
brando56894 opened this Issue Dec 28, 2018 · 2 comments

brando56894 commented Dec 28, 2018

I have two Samsung NVMe drives in my server, and neither shows its temperature under SMART -> Devices, even though both report temperature statistics.

 [bran@mediavault ~]$ sudo smartctl -i /dev/nvme0n1
smartctl 6.6 2016-05-31 r4324 [x86_64-linux-4.18.0-0.bpo.1-amd64] (local build)

Model Number:                       SAMSUNG MZVLW512HMJP-000H1
Firmware Version:                   CXY70H1Q

[bran@mediavault ~]$ sudo smartctl -a /dev/nvme0n1|grep -i temp
Warning  Comp. Temp. Threshold:     77 Celsius
Critical Comp. Temp. Threshold:     80 Celsius
Temperature:                        33 Celsius
Warning  Comp. Temperature Time:    0
Critical Comp. Temperature Time:    0
Temperature Sensor 1:               33 Celsius

 [bran@mediavault ~]$ sudo smartctl -i /dev/nvme1n1
smartctl 6.6 2016-05-31 r4324 [x86_64-linux-4.18.0-0.bpo.1-amd64] (local build)

Model Number:                       Samsung SSD 960 PRO 512GB
Firmware Version:                   2B6QCXP7

 [bran@mediavault ~]$ sudo smartctl -a /dev/nvme1n1|grep -i temp
Warning  Comp. Temp. Threshold:     73 Celsius
Critical Comp. Temp. Threshold:     76 Celsius
Temperature:                        30 Celsius
Warning  Comp. Temperature Time:    0
Critical Comp. Temperature Time:    0
Temperature Sensor 1:               30 Celsius
Temperature Sensor 2:               35 Celsius

votdev (Collaborator) commented Dec 28, 2018

Can you please provide the output of the following command: smartctl -x /dev/nvme0n1?

Currently, attribute IDs 190, 194 and 231 are processed. If none of them exists, the output is searched for Current Drive Temperature or Current Temperature. See https://github.com/openmediavault/openmediavault/blob/master/deb/openmediavault/usr/share/php/openmediavault/system/storage/smartinformation.inc#L376.

To get the temperature for these devices, the code could be enhanced by adding Temperature as a search pattern. However, it would be better to use the SMART attributes instead of parsing raw text.
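
The text fallback described above, with Temperature added as a third label, could look roughly like the sketch below. It is written in Python for illustration only and is not the OMV code (which is PHP); the function name `find_temperature` and the label order are assumptions.

```python
import re

# Labels tried in order. The first two are the ones mentioned in the comment
# above; "Temperature" is the proposed addition for NVMe output.
LABELS = ("Current Drive Temperature", "Current Temperature", "Temperature")


def find_temperature(output):
    """Return the temperature in Celsius found in smartctl text output, or None.

    Hypothetical helper: each label must start a line and be followed directly
    by a colon, so lines such as "Temperature Sensor 1:" or
    "Warning  Comp. Temp. Threshold:" are not picked up.
    """
    for label in LABELS:
        pattern = re.compile(
            r"^" + re.escape(label) + r":\s+(\d+)\s+C", re.MULTILINE
        )
        match = pattern.search(output)
        if match:
            return int(match.group(1))
    return None
```

Fed the `smartctl -a /dev/nvme0n1` lines quoted in the first comment, this matches only the `Temperature:` line and ignores the threshold and sensor lines.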

votdev (Collaborator) commented Dec 31, 2018

According to https://www.percona.com/blog/2017/02/09/using-nvme-command-line-tools-to-check-nvme-flash-health/, attribute 194, which OMV already processes, should contain the temperature. Please provide the command output requested in the previous comment.

root@blinky:/var/lib/mysql# smartctl -A /dev/sda
smartctl 6.5 2016-01-24 r4214 [x86_64-linux-4.4.0-62-generic] (local build)
Copyright (C) 2002-16, Bruce Allen, Christian Franke, www.smartmontools.org
=== START OF READ SMART DATA SECTION ===
SMART Attributes Data Structure revision number: 16
Vendor Specific SMART Attributes with Thresholds:
ID# ATTRIBUTE_NAME          FLAG     VALUE WORST THRESH TYPE      UPDATED  WHEN_FAILED RAW_VALUE
 1 Raw_Read_Error_Rate     0x002f   100   100   000    Pre-fail  Always       -       0
 5 Reallocated_Sector_Ct   0x0032   100   100   010    Old_age   Always       -       0
 9 Power_On_Hours          0x0032   100   100   000    Old_age   Always       -       41
12 Power_Cycle_Count       0x0032   100   100   000    Old_age   Always       -       2
171 Unknown_Attribute       0x0032   100   100   000    Old_age   Always       -       0
172 Unknown_Attribute       0x0032   100   100   000    Old_age   Always       -       0
173 Unknown_Attribute       0x0032   100   100   000    Old_age   Always       -       1
174 Unknown_Attribute       0x0032   100   100   000    Old_age   Always       -       0
183 Runtime_Bad_Block       0x0032   100   100   000    Old_age   Always       -       0
184 End-to-End_Error        0x0032   100   100   000    Old_age   Always       -       0
187 Reported_Uncorrect      0x0032   100   100   000    Old_age   Always       -       0
194 Temperature_Celsius     0x0022   065   059   000    Old_age   Always       -       35 (Min/Max 21/41)
196 Reallocated_Event_Count 0x0032   100   100   000    Old_age   Always       -       0
197 Current_Pending_Sector  0x0032   100   100   000    Old_age   Always       -       0
198 Offline_Uncorrectable   0x0030   100   100   000    Old_age   Offline      -       0
199 UDMA_CRC_Error_Count    0x0032   100   100   000    Old_age   Always       -       0
202 Unknown_SSD_Attribute   0x0030   100   100   001    Old_age   Offline      -       0
206 Unknown_SSD_Attribute   0x000e   100   100   000    Old_age   Always       -       0
246 Unknown_Attribute       0x0032   100   100   000    Old_age   Always       -       145599393
247 Unknown_Attribute       0x0032   100   100   000    Old_age   Always       -       4550280
248 Unknown_Attribute       0x0032   100   100   000    Old_age   Always       -       582524
180 Unused_Rsvd_Blk_Cnt_Tot 0x0033   000   000   000    Pre-fail  Always       -       1260
210 Unknown_Attribute       0x0032   100   100   000    Old_age   Always       -       0
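
For a table like the one above, reading the temperature from attribute IDs 190, 194 and 231 could be sketched as follows. This is a Python illustration, not the OMV implementation; the function name `temperature_from_attributes` and the priority order of the IDs are assumptions.

```python
# Attribute IDs the thread says are processed; the order is an assumption.
TEMPERATURE_IDS = (190, 194, 231)


def temperature_from_attributes(output):
    """Return the raw value of the first temperature attribute found, or None.

    Hypothetical helper: expects the ten-column table printed by
    `smartctl -A` (ID#, name, flag, value, worst, thresh, type, updated,
    when_failed, raw value).
    """
    found = {}
    for line in output.splitlines():
        # Split into at most ten fields so the raw value keeps any
        # trailing text such as "(Min/Max 21/41)".
        fields = line.split(None, 9)
        if len(fields) < 10 or not fields[0].isdigit():
            continue  # header, banner, or blank line
        attr_id = int(fields[0])
        if attr_id in TEMPERATURE_IDS:
            raw = fields[9].split()
            if raw and raw[0].isdigit():
                found[attr_id] = int(raw[0])
    for attr_id in TEMPERATURE_IDS:
        if attr_id in found:
            return found[attr_id]
    return None
```

Only the leading integer of the raw value is used, so the `35 (Min/Max 21/41)` entry of attribute 194 is read as 35.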

votdev added a commit to votdev/openmediavault that referenced this issue Dec 31, 2018

Issue openmediavault#256: Get S.M.A.R.T. drive temperature from Samsung NVMe devices.

Signed-off-by: Volker Theile <votdev@gmx.de>

votdev added a commit that referenced this issue Jan 3, 2019

Merge pull request #257 from votdev/issue_256
Issue #256: Get S.M.A.R.T. drive temperature from Samsung NVMe devices.

votdev added a commit that referenced this issue Jan 3, 2019

Issue #256: Get S.M.A.R.T. drive temperature from Samsung NVMe devices.
(cherry picked from commit ee3e5d0)
Signed-off-by: Volker Theile <votdev@gmx.de>