RPL: on '210319_075847' from '/home/yoann/rocprofiler' in '/home/yoann' RPL: profiling '"python3" "./test.py"' RPL: input file '' RPL: output dir 'traces/rpl_data_210319_075847_861461' RPL: result dir 'traces/rpl_data_210319_075847_861461/input_results_210319_075847' 2021-03-19 07:58:49.133161: I tensorflow/compiler/jit/xla_cpu_device.cc:41] Not creating XLA devices, tf_xla_enable_xla_devices not set 2021-03-19 07:58:49.133303: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libamdhip64.so ROCProfiler: input from "traces/rpl_data_210319_075847_861461/input.xml" 0 metrics ROCTracer (pid=861483): KFD-trace() HSA-trace() HSA-activity-trace() HIP-trace() 2021-03-19 07:58:49.178590: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1738] Found device 0 with properties: pciBusID: 0000:04:00.0 name: Vega 10 XL/XT [Radeon RX Vega 56/64] ROCm AMD GPU ISA: gfx900 coreClock: 1.59GHz coreCount: 56 deviceMemorySize: 7.98GiB deviceMemoryBandwidth: 381.47GiB/s 2021-03-19 07:58:49.180902: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library librocblas.so 2021-03-19 07:58:49.182370: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libMIOpen.so 2021-03-19 07:58:49.194901: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library librocfft.so 2021-03-19 07:58:49.195209: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library librocrand.so 2021-03-19 07:58:49.195776: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1862] Adding visible gpu devices: 0 2021-03-19 07:58:49.196037: I tensorflow/core/platform/cpu_feature_guard.cc:142] This TensorFlow binary is optimized with oneAPI Deep Neural Network Library (oneDNN) to use the following CPU instructions in performance-critical operations: SSE3 SSE4.1 SSE4.2 AVX AVX2 FMA To enable them in other operations, rebuild TensorFlow with the appropriate compiler flags. 2021-03-19 07:58:49.196444: I tensorflow/compiler/jit/xla_gpu_device.cc:99] Not creating XLA devices, tf_xla_enable_xla_devices not set 2021-03-19 07:58:49.196774: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1738] Found device 0 with properties: pciBusID: 0000:04:00.0 name: Vega 10 XL/XT [Radeon RX Vega 56/64] ROCm AMD GPU ISA: gfx900 coreClock: 1.59GHz coreCount: 56 deviceMemorySize: 7.98GiB deviceMemoryBandwidth: 381.47GiB/s 2021-03-19 07:58:49.196792: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library librocblas.so 2021-03-19 07:58:49.196800: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libMIOpen.so 2021-03-19 07:58:49.196818: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library librocfft.so 2021-03-19 07:58:49.196825: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library librocrand.so 2021-03-19 07:58:49.197310: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1862] Adding visible gpu devices: 0 2021-03-19 07:58:49.197461: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1261] Device interconnect StreamExecutor with strength 1 edge matrix: 2021-03-19 07:58:49.197470: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1267] 0 2021-03-19 07:58:49.197487: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1280] 0: N 2021-03-19 07:58:49.198294: I tensorflow/core/common_runtime/gpu/gpu_device.cc:1406] Created TensorFlow device (/job:localhost/replica:0/task:0/device:GPU:0 with 7685 MB memory) -> physical GPU (device: 0, name: Vega 10 XL/XT [Radeon RX Vega 56/64], pci bus id: 0000:04:00.0) 2021-03-19 07:58:49.837876: I tensorflow/compiler/mlir/mlir_graph_optimization_pass.cc:116] None of the MLIR optimization passes are enabled (registered 2) 2021-03-19 07:58:49.838318: I tensorflow/core/platform/profile_utils/cpu_utils.cc:112] CPU Frequency: 3397895000 Hz 2021-03-19 07:58:50.109339: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library librocblas.so 2021-03-19 07:58:50.139677: I tensorflow/stream_executor/platform/default/dso_loader.cc:49] Successfully opened dynamic library libMIOpen.so 2.4.0 x_train shape: (60000, 28, 28, 1) 60000 train samples 10000 test samples Model: "sequential" _________________________________________________________________ Layer (type) Output Shape Param # ================================================================= conv2d (Conv2D) (None, 26, 26, 32) 320 _________________________________________________________________ max_pooling2d (MaxPooling2D) (None, 13, 13, 32) 0 _________________________________________________________________ conv2d_1 (Conv2D) (None, 11, 11, 64) 18496 _________________________________________________________________ max_pooling2d_1 (MaxPooling2 (None, 5, 5, 64) 0 _________________________________________________________________ flatten (Flatten) (None, 1600) 0 _________________________________________________________________ dropout (Dropout) (None, 1600) 0 _________________________________________________________________ dense (Dense) (None, 10) 16010 ================================================================= Total params: 34,826 Trainable params: 34,826 Non-trainable params: 0 _________________________________________________________________ Epoch 1/2 422/422 [==============================] - 9s 12ms/step - loss: 0.7736 - accuracy: 0.7592 - val_loss: 0.0783 - val_accuracy: 0.9805 Epoch 2/2 422/422 [==============================] - 5s 12ms/step - loss: 0.1250 - accuracy: 0.9620 - val_loss: 0.0586 - val_accuracy: 0.9852 Test loss: 0.0593130849301815 Test accuracy: 0.9812999963760376 free(): corrupted unsorted chunks ./rocprofiler/bin/rocprof: line 271: 861483 Aborted (core dumped) "python3" "./test.py"