Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

pp_dpm_set_power_profile_state was not implemented #63

Closed
nabar opened this issue Dec 31, 2016 · 6 comments
Closed

pp_dpm_set_power_profile_state was not implemented #63

nabar opened this issue Dec 31, 2016 · 6 comments

Comments

@nabar
Copy link

nabar commented Dec 31, 2016

Hello,

Recently, I installed ROCm1.4 and tried to run my codes which could run on ROCm1.3.0.
I can compile my C++ code with HCC, and could run it in the first trial. At the same time I got the following message on the dmesg.
pp_dpm_set_power_profile_state was not implemented

From the second trial to run my executable, I got the segmentation error.

So, I am suspecting that the above message can be related to this segmentation error.

Does anyone know about it?

In my code, I am just using matrix calculation together with If statements in the parallel_for loop.

Best regards,
Nandinbaatar Tsog (Nabar)

@scchan
Copy link
Collaborator

scchan commented Jan 1, 2017

Could you try with the vector_copy example in /opt/rocm/hsa/sample and see if you get the same error message?

@nabar
Copy link
Author

nabar commented Jan 1, 2017

vector_copy passes all the tests, however, it returns the same error message as well.

@jedwards-AMD
Copy link
Contributor

Could you send us the output of 'uname -a' to insure the correct kernel is installed? Also, is there any dmesg output that indicates this error?

@nabar
Copy link
Author

nabar commented Jan 22, 2017

Here are the logs:

uname -a:
Linux mdh-hsa 4.6.0-kfd-compute-rocm-rel-1.4-16 #1 SMP Tue Dec 13 13:14:21 EST 2016 x86_64 x86_64 GNU/Linux

dmesg after the first execution of the code:
[17255.677008] pp_dpm_set_power_profile_state was not implemented.

dmesg for all of the executions after the second trial:
[17365.961484] pp_dpm_set_power_profile_state was not implemented.
[17366.420969] kfd kfd: Invalid PPR device 0:1.0 pasid 1 address 0x7F8700000000 flags 0x144
[17366.420988] kfd kfd: Sending SIGSEGV to HSA Process with PID 5542
[17366.421000] kfd kfd: HSA Process (PID 5542) got unhandled exception
[17366.421059] kfd kfd: Invalid PPR device 0:1.0 pasid 1 address 0x7F8700000000 flags 0x144
[17366.421065] kfd kfd: Sending SIGSEGV to HSA Process with PID 5542
[17366.421069] kfd kfd: HSA Process (PID 5542) got unhandled exception
[17366.421095] kfd kfd: Invalid PPR device 0:1.0 pasid 1 address 0x7F8700000000 flags 0x144
[17366.421100] kfd kfd: Sending SIGSEGV to HSA Process with PID 5542
[17366.421103] kfd kfd: HSA Process (PID 5542) got unhandled exception
[17366.421131] kfd kfd: Invalid PPR device 0:1.0 pasid 1 address 0x7F8700000000 flags 0x144
[17366.421136] kfd kfd: Sending SIGSEGV to HSA Process with PID 5542
[17366.421139] kfd kfd: HSA Process (PID 5542) got unhandled exception
[17366.421164] kfd kfd: Invalid PPR device 0:1.0 pasid 1 address 0x7F8700000000 flags 0x144
[17366.421168] kfd kfd: Sending SIGSEGV to HSA Process with PID 5542
[17366.421172] kfd kfd: HSA Process (PID 5542) got unhandled exception
[17366.421190] kfd kfd: Invalid PPR device 0:1.0 pasid 1 address 0x7F8700000000 flags 0x144
[17366.421194] kfd kfd: Sending SIGSEGV to HSA Process with PID 5542
[17366.421198] kfd kfd: HSA Process (PID 5542) got unhandled exception
[17366.421214] kfd kfd: Invalid PPR device 0:1.0 pasid 1 address 0x7F8700000000 flags 0x144
[17366.421219] kfd kfd: Sending SIGSEGV to HSA Process with PID 5542
[17366.421222] kfd kfd: HSA Process (PID 5542) got unhandled exception

@jedwards-AMD
Copy link
Contributor

The Invalid PPR device indicates that the kernel is accessing unknown memory pages. What is your hardware configuration? Are you using a discrete GPU, and if so, what is the version? If this is an APU, please send me the output of 'sudo dmidecode'.

@nabar
Copy link
Author

nabar commented Jan 23, 2017

My APU is A10-8700P (Carrizo).

Here is sudo dmidecode:

dmidecode 3.0

Getting SMBIOS data from sysfs.
SMBIOS 2.8 present.
48 structures occupying 2063 bytes.
Table at 0x000E4E00.

Handle 0x0000, DMI type 0, 24 bytes
BIOS Information
Vendor: Insyde Corp.
Version: V1.03
Release Date: 06/23/2015
Address: 0xE0000
Runtime Size: 128 kB
ROM Size: 4608 kB
Characteristics:
PCI is supported
BIOS is upgradeable
BIOS shadowing is allowed
Boot from CD is supported
Selectable boot is supported
EDD is supported
Japanese floppy for NEC 9800 1.2 MB is supported (int 13h)
Japanese floppy for Toshiba 1.2 MB is supported (int 13h)
5.25"/360 kB floppy services are supported (int 13h)
5.25"/1.2 MB floppy services are supported (int 13h)
3.5"/720 kB floppy services are supported (int 13h)
3.5"/2.88 MB floppy services are supported (int 13h)
8042 keyboard services are supported (int 9h)
CGA/mono video services are supported (int 10h)
ACPI is supported
USB legacy is supported
BIOS boot specification is supported
Targeted content distribution is supported
UEFI is supported
BIOS Revision: 0.0
Firmware Revision: 2.80

Handle 0x0001, DMI type 1, 27 bytes
System Information
Manufacturer: Acer
Product Name: Aspire E5-552
Version: V3.72
Serial Number: NXMWAED0105270F1AF7600
UUID: DA02BD99-6E64-214A-9F5D-2C600C9C206A
Wake-up Type: Power Switch
SKU Number: Aspire E5-552_095E_1_03
Family: CZ

Handle 0x0002, DMI type 2, 17 bytes
Base Board Information
Manufacturer: Acer
Product Name: Nami_CZ
Version: Type2 - A01 Board Version
Serial Number: NBMW9110025270F1AF7600
Asset Tag: Type2 - Board Asset Tag
Features:
Board is a hosting board
Board is replaceable
Location In Chassis: Type2 - Board Chassis Location
Chassis Handle: 0x0003
Type: Motherboard
Contained Object Handles: 0

Handle 0x0003, DMI type 3, 24 bytes
Chassis Information
Manufacturer: Chassis Manufacturer
Type: Notebook
Lock: Not Present
Version: Chassis Version
Serial Number: Chassis Serial Number
Asset Tag:
Boot-up State: Safe
Power Supply State: Safe
Thermal State: Safe
Security Status: None
OEM Information: 0x00000000
Height: Unspecified
Number Of Power Cords: 1
Contained Elements: 0
SKU Number:

Handle 0x0004, DMI type 4, 42 bytes
Processor Information
Socket Designation: Socket FP4
Type: Central Processor
Family: A-Series
Manufacturer: Advanced Micro Devices, Inc.
ID: 01 0F 66 00 FF FB 8B 17
Signature: Family 21, Model 96, Stepping 1
Flags:
FPU (Floating-point unit on-chip)
VME (Virtual mode extension)
DE (Debugging extension)
PSE (Page size extension)
TSC (Time stamp counter)
MSR (Model specific registers)
PAE (Physical address extension)
MCE (Machine check exception)
CX8 (CMPXCHG8 instruction supported)
APIC (On-chip APIC hardware supported)
SEP (Fast system call)
MTRR (Memory type range registers)
PGE (Page global enable)
MCA (Machine check architecture)
CMOV (Conditional move instruction supported)
PAT (Page attribute table)
PSE-36 (36-bit page size extension)
CLFSH (CLFLUSH instruction supported)
MMX (MMX technology supported)
FXSR (FXSAVE and FXSTOR instructions supported)
SSE (Streaming SIMD extensions)
SSE2 (Streaming SIMD extensions 2)
HTT (Multi-threading)
Version: AMD A10-8700P Radeon R6, 10 Compute Cores 4C+6G
Voltage: 0.9 V
External Clock: 100 MHz
Max Speed: 1800 MHz
Current Speed: 1800 MHz
Status: Populated, Enabled
Upgrade: None
L1 Cache Handle: 0x0007
L2 Cache Handle: 0x0008
L3 Cache Handle: Not Provided
Serial Number: Unknown
Asset Tag: Unknown
Part Number: Unknown
Core Count: 4
Core Enabled: 4
Thread Count: 4
Characteristics:
64-bit capable

Handle 0x0005, DMI type 5, 19 bytes
Memory Controller Information
Error Detecting Method: None
Error Correcting Capabilities:
None
Supported Interleave: One-way Interleave
Current Interleave: One-way Interleave
Maximum Memory Module Size: 8192 MB
Maximum Total Memory Size: 16384 MB
Supported Speeds:
Other
Supported Memory Types:
Other
Memory Module Voltage: 3.3 V
Associated Memory Slots: 2
0x0006
0xFFFF

Handle 0x0006, DMI type 6, 12 bytes
Memory Module Information
Socket Designation: DIMM 0
Bank Connections: None
Current Speed: 1 ns
Type: DIMM
Installed Size: 8192 MB (Single-bank Connection)
Enabled Size: 8192 MB (Single-bank Connection)
Error Status: OK

Handle 0x0007, DMI type 7, 19 bytes
Cache Information
Socket Designation: L1-Cache
Configuration: Enabled, Not Socketed, Level 1
Operational Mode: Write Back
Location: Internal
Installed Size: 320 kB
Maximum Size: 320 kB
Supported SRAM Types:
Pipeline Burst
Installed SRAM Type: Pipeline Burst
Speed: 1 ns
Error Correction Type: Multi-bit ECC
System Type: Unified
Associativity: 2-way Set-associative

Handle 0x0008, DMI type 7, 19 bytes
Cache Information
Socket Designation: L2-Cache
Configuration: Enabled, Not Socketed, Level 2
Operational Mode: Write Back
Location: Internal
Installed Size: 2048 kB
Maximum Size: 2048 kB
Supported SRAM Types:
Pipeline Burst
Installed SRAM Type: Pipeline Burst
Speed: 1 ns
Error Correction Type: Multi-bit ECC
System Type: Unified
Associativity: 16-way Set-associative

Handle 0x0009, DMI type 8, 9 bytes
Port Connector Information
Internal Reference Designator: J2604/J2606
Internal Connector Type: None
External Reference Designator: Keyboard
External Connector Type: PS/2
Port Type: Keyboard Port

Handle 0x000A, DMI type 8, 9 bytes
Port Connector Information
Internal Reference Designator: J2605
Internal Connector Type: None
External Reference Designator: Touch pad
External Connector Type: PS/2
Port Type: Mouse Port

Handle 0x000B, DMI type 8, 9 bytes
Port Connector Information
Internal Reference Designator: J1500
Internal Connector Type: None
External Reference Designator: USB
External Connector Type: Access Bus (USB)
Port Type: USB

Handle 0x000C, DMI type 8, 9 bytes
Port Connector Information
Internal Reference Designator: J1501
Internal Connector Type: None
External Reference Designator: USB
External Connector Type: Access Bus (USB)
Port Type: USB

Handle 0x000D, DMI type 8, 9 bytes
Port Connector Information
Internal Reference Designator: J1502
Internal Connector Type: None
External Reference Designator: USB
External Connector Type: Access Bus (USB)
Port Type: USB

Handle 0x000E, DMI type 8, 9 bytes
Port Connector Information
Internal Reference Designator: J1301
Internal Connector Type: None
External Reference Designator: USB
External Connector Type: Access Bus (USB)
Port Type: USB

Handle 0x000F, DMI type 8, 9 bytes
Port Connector Information
Internal Reference Designator: J1300
Internal Connector Type: None
External Reference Designator: Network
External Connector Type: RJ-45
Port Type: Network Port

Handle 0x0010, DMI type 8, 9 bytes
Port Connector Information
Internal Reference Designator: J1707
Internal Connector Type: SAS/SATA Plug Receptacle
External Reference Designator: Sata HDD
External Connector Type: None
Port Type: SATA

Handle 0x0011, DMI type 8, 9 bytes
Port Connector Information
Internal Reference Designator: J1705
Internal Connector Type: SAS/SATA Plug Receptacle
External Reference Designator: Sata ODD
External Connector Type: None
Port Type: SATA

Handle 0x0012, DMI type 8, 9 bytes
Port Connector Information
Internal Reference Designator: J1100
Internal Connector Type: None
External Reference Designator: DP0
External Connector Type: None
Port Type: Video Port

Handle 0x0013, DMI type 8, 9 bytes
Port Connector Information
Internal Reference Designator: J1102
Internal Connector Type: None
External Reference Designator: DP2
External Connector Type: None
Port Type: Video Port

Handle 0x0014, DMI type 8, 9 bytes
Port Connector Information
Internal Reference Designator: J2106
Internal Connector Type: None
External Reference Designator: Microphone In
External Connector Type: Mini Jack (headphones)
Port Type: Audio Port

Handle 0x0015, DMI type 8, 9 bytes
Port Connector Information
Internal Reference Designator: J2105
Internal Connector Type: None
External Reference Designator: Head Phone
External Connector Type: Mini Jack (headphones)
Port Type: Audio Port

Handle 0x0016, DMI type 9, 17 bytes
System Slot Information
Designation: J3600
Type: x1 PCI Express x1
Current Usage: Available
Length: Short
ID: 2
Characteristics:
3.3 V is provided
PME signal is supported
Hot-plug devices are supported
Bus Address: 0200:00:02.2

Handle 0x0017, DMI type 9, 17 bytes
System Slot Information
Designation: J3700
Type: x1 PCI Express x1
Current Usage: Available
Length: Short
ID: 3
Characteristics:
3.3 V is provided
PME signal is supported
Hot-plug devices are supported
Bus Address: 0200:00:02.2

Handle 0x0018, DMI type 9, 17 bytes
System Slot Information
Designation: J3605
Type: x8 PCI Express x8
Current Usage: Available
Length: Short
ID: 3
Characteristics:
3.3 V is provided
PME signal is supported
Hot-plug devices are supported
Bus Address: 0200:00:02.2

Handle 0x0019, DMI type 9, 17 bytes
System Slot Information
Designation: J3702
Type: x1 Other
Current Usage: Available
Length: Short
Characteristics:
3.3 V is provided
PME signal is supported
Hot-plug devices are supported
Bus Address: 0200:00:02.4

Handle 0x001A, DMI type 9, 17 bytes
System Slot Information
Designation: J3703
Type: x1 Other
Current Usage: Available
Length: Short
Characteristics:
3.3 V is provided
PME signal is supported
Hot-plug devices are supported
Bus Address: 0200:00:02.5

Handle 0x001B, DMI type 9, 17 bytes
System Slot Information
Designation: J4000
Type: x1 PCI Express x1
Current Usage: Available
Length: Short
ID: 4
Characteristics:
3.3 V is provided
PME signal is supported
Hot-plug devices are supported
Bus Address: 0200:00:02.6

Handle 0x001C, DMI type 10, 6 bytes
On Board Device Information
Type: Video
Status: Enabled
Description: Video Graphics Controller

Handle 0x001D, DMI type 11, 5 bytes
OEM Strings
String 1: OemString1
String 2: OemString2
String 3: OemString3
String 4: OemString4
String 5: OemString5
String 6: OemString6

Handle 0x001E, DMI type 12, 5 bytes
System Configuration Options
Option 1: ConfigOptions1
Option 2: ConfigOptions2
Option 3: ConfigOptions3
Option 4: ConfigOptions4
Option 5: ConfigOptions5
Option 6: ConfigOptions6

Handle 0x001F, DMI type 13, 22 bytes
BIOS Language Information
Language Description Format: Long
Installable Languages: 4
en|US|iso8859-1
fr|CA|iso8859-1
ja|JP|unicode
zh|TW|unicode
Currently Installed Language: en|US|iso8859-1

Handle 0x0020, DMI type 16, 23 bytes
Physical Memory Array
Location: System Board Or Motherboard
Use: System Memory
Error Correction Type: None
Maximum Capacity: 64 GB
Error Information Handle: No Error
Number Of Devices: 2

Handle 0x0021, DMI type 17, 40 bytes
Memory Device
Array Handle: 0x0020
Error Information Handle: 0x0023
Total Width: 64 bits
Data Width: 64 bits
Size: 8192 MB
Form Factor: SODIMM
Set: None
Locator: DIMM 0
Bank Locator: CHANNEL B
Type: DDR3
Type Detail: Synchronous Unbuffered (Unregistered)
Speed: 800 MHz
Manufacturer: Samsung
Serial Number: 96210A20
Asset Tag: Not Specified
Part Number: M471B1G73EB0-YK0
Rank: 2
Configured Clock Speed: 800 MHz
Minimum Voltage: 1.35 V
Maximum Voltage: 1.5 V
Configured Voltage: 1.35 V

Handle 0x0022, DMI type 18, 23 bytes
32-bit Memory Error Information
Type: OK
Granularity: Unknown
Operation: Unknown
Vendor Syndrome: Unknown
Memory Array Address: Unknown
Device Address: Unknown
Resolution: Unknown

Handle 0x0023, DMI type 18, 23 bytes
32-bit Memory Error Information
Type: OK
Granularity: Unknown
Operation: Unknown
Vendor Syndrome: Unknown
Memory Array Address: Unknown
Device Address: Unknown
Resolution: Unknown

Handle 0x0024, DMI type 19, 31 bytes
Memory Array Mapped Address
Starting Address: 0x00000000000
Ending Address: 0x001FFFFFFFF
Range Size: 8 GB
Physical Array Handle: 0x0020
Partition Width: 255

Handle 0x0025, DMI type 20, 35 bytes
Memory Device Mapped Address
Starting Address: 0x00000000000
Ending Address: 0x001FFFFFFFF
Range Size: 8 GB
Physical Device Handle: 0xFFFF
Memory Array Mapped Address Handle: 0x0024
Partition Row Position: Unknown
Interleave Position: Unknown
Interleaved Data Depth: Unknown

Handle 0x0026, DMI type 21, 7 bytes
Built-in Pointing Device
Type: Touch Pad
Interface: PS/2
Buttons: 4

Handle 0x0027, DMI type 26, 22 bytes
Voltage Probe
Description: Voltage Probe Description
Location: Unknown
Status: Unknown
Maximum Value: Unknown
Minimum Value: Unknown
Resolution: Unknown
Tolerance: Unknown
Accuracy: Unknown
OEM-specific Information: 0x00000000
Nominal Value: Unknown

Handle 0x0028, DMI type 32, 20 bytes
System Boot Information
Status: No errors detected

Handle 0x0029, DMI type 40, 18 bytes
Additional Information 1
Referenced Handle: 0x0019
Referenced Offset: 0x05
String: PCIExpressx16
Value: 0xaa
Additional Information 2
Referenced Handle: 0x0000
Referenced Offset: 0x05
String: Compiler Version: VC 9.0
Value: 0x05dc

Handle 0x002A, DMI type 41, 11 bytes
Onboard Device
Reference Designation: Realtek RTL8153
Type: Ethernet
Status: Enabled
Type Instance: 1
Bus Address: 0000:00:00.1

Handle 0x002B, DMI type 41, 11 bytes
Onboard Device
Reference Designation: Realtek ALC288
Type: Sound
Status: Enabled
Type Instance: 1
Bus Address: 0000:00:02.4

Handle 0x002C, DMI type 170, 78 bytes
Acer Hotkey Function
Function bitmap for Communication Button: 0x0801
WiFi: Yes
3G: No
WiMAX: No
Bluetooth: Yes
Function bitmap for Application Button: 0x0000
Function bitmap for Media Button: 0x007f
Function bitmap for Display Button: 0x000f
Function bitmap for Others Button: 0x100e
Communication Function Key Number: 1

Handle 0x002D, DMI type 171, 39 bytes
OEM-specific Type
Header and Data:
AB 27 2D 00 03 DA 0B 29 01 04 CF 1B 81 2C 08 89
04 9C E0 01 02 10 74 98 02 EC 10 68 81 07 8C 16
42 00 0D 00 00 00 00

Handle 0x002E, DMI type 172, 27 bytes
OEM-specific Type
Header and Data:
AC 1B 2E 00 02 21 01 FF 00 02 01 00 03 FF 00 04
01 00 05 0F 00 06 FF 00 07 FF 00

Handle 0xFEFF, DMI type 127, 4 bytes
End Of Table

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

4 participants