Performance fixes for LRP rules #106

Merged
merged 3 commits into from
Nov 11, 2022

Conversation

adrhill (Member) commented Nov 11, 2022

No description provided.

@adrhill added the "run benchmark" label (Run benchmark CI action) Nov 11, 2022
@github-actions

Benchmark result

Judge result

Benchmark Report for /home/runner/work/ExplainableAI.jl/ExplainableAI.jl

Job Properties

  • Time of benchmarks:
    • Target: 11 Nov 2022 - 16:38
    • Baseline: 11 Nov 2022 - 16:43
  • Package commits:
    • Target: 599c38
    • Baseline: 2c9c6a
  • Julia commits:
    • Target: 36034a
    • Baseline: 36034a
  • Julia command flags:
    • Target: None
    • Baseline: None
  • Environment variables:
    • Target: None
    • Baseline: None

Results

A ratio greater than 1.0 denotes a possible regression (marked with ❌), while a ratio less
than 1.0 denotes a possible improvement (marked with ✅). Only significant results - results
that indicate possible regressions or improvements - are shown below (thus, an empty table means that all
benchmark results remained invariant between builds).

ID                                                    time ratio    memory ratio
["Layer", "Conv", "EpsilonRule", "modify layer"]      1.13 (5%) ❌   1.00 (1%)
["Layer", "Conv", "ZBoxRule", "modify layer"]         0.88 (5%) ✅   1.00 (1%)
["Layer", "Conv", "ZPlusRule", "apply rule"]          1.00 (5%)     0.95 (1%) ✅
["Layer", "Conv", "ZPlusRule", "modify layer"]        0.85 (5%) ✅   1.00 (1%)
["Layer", "Dense", "AlphaBetaRule", "modify layer"]   1.13 (5%) ❌   1.00 (1%)
["Layer", "Dense", "FlatRule", "apply rule"]          0.82 (5%) ✅   1.00 (1%)
["Layer", "Dense", "GammaRule", "apply rule"]         0.95 (5%) ✅   1.00 (1%)
["Layer", "Dense", "ZBoxRule", "modify layer"]        1.10 (5%) ❌   1.00 (1%)
["Layer", "Dense", "ZPlusRule", "apply rule"]         0.99 (5%)     0.99 (1%) ✅
["Layer", "Dense", "ZPlusRule", "modify layer"]       1.13 (5%) ❌   1.00 (1%)
["Layer", "Dense", "ZeroRule", "apply rule"]          0.94 (5%) ✅   1.00 (1%)
["VGG", "Gradient", "construct analyzer"]             0.95 (5%) ✅   1.00 (1%)
["VGG", "InputTimesGradient", "analyze"]              0.09 (5%) ✅   0.68 (1%) ✅
["VGG", "InputTimesGradient", "construct analyzer"]   1.06 (5%) ❌   1.00 (1%)
["VGG", "LREpsilonPlusFlat", "analyze"]               0.48 (5%) ✅   0.85 (1%) ✅
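
As a rough illustration of how these ratios relate to the raw measurements, the sketch below recomputes the ["VGG", "InputTimesGradient", "analyze"] row from the values in the Target result and Baseline result tables of this report (1.390 s vs. 15.628 s, 1.10 GiB vs. 1.62 GiB). `classify` is a hypothetical helper written for this example, not a BenchmarkTools.jl or PkgBenchmark.jl function, and the comparison against 1 ± tolerance is an assumption about how the significance threshold is applied.

```julia
# Hypothetical helper: ratio = target / baseline, judged against a tolerance.
# Assumption: a result counts as significant only when the ratio leaves
# the band [1 - tolerance, 1 + tolerance].
function classify(target, baseline; tolerance)
    r = target / baseline
    if r > 1 + tolerance
        return r, :regression
    elseif r < 1 - tolerance
        return r, :improvement
    else
        return r, :invariant
    end
end

# ["VGG", "InputTimesGradient", "analyze"]: time in seconds, memory in GiB
time_ratio, time_verdict = classify(1.390, 15.628; tolerance=0.05)
mem_ratio, mem_verdict = classify(1.10, 1.62; tolerance=0.01)

println(round(time_ratio; digits=2), " ", time_verdict)
println(round(mem_ratio; digits=2), " ", mem_verdict)
```

Rounded to two digits, the recomputed ratios agree with the 0.09 and 0.68 reported in the table.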

Benchmark Group List

Here's a list of all the benchmark groups executed by this job:

  • ["Layer", "Conv", "AlphaBetaRule"]
  • ["Layer", "Conv", "EpsilonRule"]
  • ["Layer", "Conv", "FlatRule"]
  • ["Layer", "Conv", "GammaRule"]
  • ["Layer", "Conv", "WSquareRule"]
  • ["Layer", "Conv", "ZBoxRule"]
  • ["Layer", "Conv", "ZPlusRule"]
  • ["Layer", "Conv", "ZeroRule"]
  • ["Layer", "Dense", "AlphaBetaRule"]
  • ["Layer", "Dense", "EpsilonRule"]
  • ["Layer", "Dense", "FlatRule"]
  • ["Layer", "Dense", "GammaRule"]
  • ["Layer", "Dense", "WSquareRule"]
  • ["Layer", "Dense", "ZBoxRule"]
  • ["Layer", "Dense", "ZPlusRule"]
  • ["Layer", "Dense", "ZeroRule"]
  • ["VGG", "Gradient"]
  • ["VGG", "InputTimesGradient"]
  • ["VGG", "IntegratedGradients"]
  • ["VGG", "LREpsilonPlusFlat"]
  • ["VGG", "LRP"]
  • ["VGG", "SmoothGrad"]

Julia versioninfo

Target

Julia Version 1.8.2
Commit 36034abf260 (2022-09-29 15:21 UTC)
Platform Info:
  OS: Linux (x86_64-linux-gnu)
      Ubuntu 20.04.5 LTS
  uname: Linux 5.15.0-1022-azure #27~20.04.1-Ubuntu SMP Mon Oct 17 02:03:50 UTC 2022 x86_64 x86_64
  CPU: Intel(R) Xeon(R) Platinum 8370C CPU @ 2.80GHz: 
              speed         user         nice          sys         idle          irq
       #1  2793 MHz       7867 s          2 s        553 s       2353 s          0 s
       #2  2793 MHz       1690 s          1 s        237 s       8850 s          0 s
  Memory: 6.78125 GB (2293.49609375 MB free)
  Uptime: 1084.61 sec
  Load Avg:  1.04  1.07  0.83
  WORD_SIZE: 64
  LIBM: libopenlibm
  LLVM: libLLVM-13.0.1 (ORCJIT, icelake-server)
  Threads: 1 on 2 virtual cores

Baseline

Julia Version 1.8.2
Commit 36034abf260 (2022-09-29 15:21 UTC)
Platform Info:
  OS: Linux (x86_64-linux-gnu)
      Ubuntu 20.04.5 LTS
  uname: Linux 5.15.0-1022-azure #27~20.04.1-Ubuntu SMP Mon Oct 17 02:03:50 UTC 2022 x86_64 x86_64
  CPU: Intel(R) Xeon(R) Platinum 8370C CPU @ 2.80GHz: 
              speed         user         nice          sys         idle          irq
       #1  2793 MHz      11115 s          2 s        656 s       2400 s          0 s
       #2  2793 MHz       1747 s          1 s        249 s      12170 s          0 s
  Memory: 6.78125 GB (2956.44140625 MB free)
  Uptime: 1424.56 sec
  Load Avg:  1.02  1.03  0.91
  WORD_SIZE: 64
  LIBM: libopenlibm
  LLVM: libLLVM-13.0.1 (ORCJIT, icelake-server)
  Threads: 1 on 2 virtual cores

Target result

Benchmark Report for /home/runner/work/ExplainableAI.jl/ExplainableAI.jl

Job Properties

  • Time of benchmark: 11 Nov 2022 - 16:38
  • Package commit: 599c38
  • Julia commit: 36034a
  • Julia command flags: None
  • Environment variables: None

Results

Below is a table of this job's results, obtained by running the benchmarks.
The values listed in the ID column have the structure [parent_group, child_group, ..., key], and can be used to
index into the BaseBenchmarks suite to retrieve the corresponding benchmarks.
The percentages accompanying time and memory values in the below table are noise tolerances. The "true"
time/memory value for a given benchmark is expected to fall within this percentage of the reported value.
An empty cell means that the value was zero.

ID time GC time memory allocations
["Layer", "Conv", "AlphaBetaRule", "apply rule"] 1.875 ms (5%) 5.58 MiB (1%) 496
["Layer", "Conv", "AlphaBetaRule", "modify layer"] 2.656 μs (5%) 5.53 KiB (1%) 46
["Layer", "Conv", "EpsilonRule", "apply rule"] 443.897 μs (5%) 1.36 MiB (1%) 120
["Layer", "Conv", "EpsilonRule", "modify layer"] 3.599 ns (5%)
["Layer", "Conv", "FlatRule", "apply rule"] 448.498 μs (5%) 1.41 MiB (1%) 123
["Layer", "Conv", "FlatRule", "modify layer"] 111.026 ns (5%) 400 bytes (1%) 3
["Layer", "Conv", "GammaRule", "apply rule"] 444.397 μs (5%) 1.36 MiB (1%) 120
["Layer", "Conv", "GammaRule", "modify layer"] 257.223 ns (5%) 400 bytes (1%) 3
["Layer", "Conv", "WSquareRule", "apply rule"] 449.697 μs (5%) 1.41 MiB (1%) 123
["Layer", "Conv", "WSquareRule", "modify layer"] 186.586 ns (5%) 400 bytes (1%) 3
["Layer", "Conv", "ZBoxRule", "apply rule"] 1.353 ms (5%) 4.12 MiB (1%) 360
["Layer", "Conv", "ZBoxRule", "modify layer"] 1.550 μs (5%) 3.25 KiB (1%) 28
["Layer", "Conv", "ZPlusRule", "apply rule"] 941.594 μs (5%) 2.89 MiB (1%) 259
["Layer", "Conv", "ZPlusRule", "modify layer"] 1.320 μs (5%) 2.77 KiB (1%) 23
["Layer", "Conv", "ZeroRule", "apply rule"] 444.197 μs (5%) 1.36 MiB (1%) 120
["Layer", "Conv", "ZeroRule", "modify layer"] 3.599 ns (5%)
["Layer", "Dense", "AlphaBetaRule", "apply rule"] 155.999 μs (5%) 814.69 KiB (1%) 56
["Layer", "Dense", "AlphaBetaRule", "modify layer"] 339.998 μs (5%) 1.57 MiB (1%) 58
["Layer", "Dense", "EpsilonRule", "apply rule"] 19.000 μs (5%) 1.45 KiB (1%) 3
["Layer", "Dense", "EpsilonRule", "modify layer"] 4.799 ns (5%)
["Layer", "Dense", "FlatRule", "apply rule"] 74.922 ns (5%)
["Layer", "Dense", "FlatRule", "modify layer"] 15.600 μs (5%) 195.84 KiB (1%) 3
["Layer", "Dense", "GammaRule", "apply rule"] 19.000 μs (5%) 1.45 KiB (1%) 3
["Layer", "Dense", "GammaRule", "modify layer"] 17.100 μs (5%) 195.84 KiB (1%) 3
["Layer", "Dense", "WSquareRule", "apply rule"] 18.400 μs (5%) 576 bytes (1%) 5
["Layer", "Dense", "WSquareRule", "modify layer"] 16.600 μs (5%) 195.84 KiB (1%) 3
["Layer", "Dense", "ZBoxRule", "apply rule"] 109.799 μs (5%) 602.48 KiB (1%) 32
["Layer", "Dense", "ZBoxRule", "modify layer"] 170.099 μs (5%) 805.25 KiB (1%) 34
["Layer", "Dense", "ZPlusRule", "apply rule"] 72.900 μs (5%) 416.27 KiB (1%) 37
["Layer", "Dense", "ZPlusRule", "modify layer"] 169.499 μs (5%) 804.75 KiB (1%) 29
["Layer", "Dense", "ZeroRule", "apply rule"] 18.899 μs (5%) 1.45 KiB (1%) 3
["Layer", "Dense", "ZeroRule", "modify layer"] 4.499 ns (5%)
["VGG", "Gradient", "analyze"] 1.381 s (5%) 10.264 ms 1.10 GiB (1%) 1887
["VGG", "Gradient", "construct analyzer"] 9.400 μs (5%) 11.45 KiB (1%) 200
["VGG", "InputTimesGradient", "analyze"] 1.390 s (5%) 13.407 ms 1.10 GiB (1%) 1890
["VGG", "InputTimesGradient", "construct analyzer"] 9.400 μs (5%) 11.45 KiB (1%) 200
["VGG", "IntegratedGradients", "analyze"] 6.832 s (5%) 37.696 ms 5.30 GiB (1%) 8273
["VGG", "IntegratedGradients", "construct analyzer"] 9.067 μs (5%) 9.48 KiB (1%) 198
["VGG", "LREpsilonPlusFlat", "analyze"] 1.857 s (5%) 11.131 ms 1.10 GiB (1%) 2670
["VGG", "LREpsilonPlusFlat", "construct analyzer"] 72.458 ms (5%) 143.04 MiB (1%) 520
["VGG", "LRP", "analyze"] 1.287 s (5%) 4.638 ms 704.88 MiB (1%) 1681
["VGG", "LRP", "construct analyzer"] 20.800 μs (5%) 73.23 KiB (1%) 171
["VGG", "SmoothGrad", "analyze"] 7.104 s (5%) 46.346 ms 5.30 GiB (1%) 8228
["VGG", "SmoothGrad", "construct analyzer"] 8.867 μs (5%) 9.48 KiB (1%) 198
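
The ID structure and noise tolerance described above the table can be sketched with plain nested dictionaries. This is only an illustration: `suite`, `lookup`, and `band` are hypothetical names, the nested `Dict` stands in for the real benchmark suite, and the single entry is copied from the ["Layer", "Conv", "ZPlusRule", "apply rule"] row.

```julia
# Stand-in for the benchmark suite: [parent_group, child_group, ..., key].
# Values are copied from one row of the table; field names are assumptions.
suite = Dict(
    "Layer" => Dict(
        "Conv" => Dict(
            "ZPlusRule" => Dict(
                "apply rule" => (time_us = 941.594, memory_mib = 2.89, allocs = 259),
            ),
        ),
    ),
)

# Walk the nested groups one key at a time, starting from the root.
lookup(tree, id) = foldl((node, key) -> node[key], id; init=tree)

# Interval in which the "true" value is expected to fall for a given tolerance.
band(value, tol) = (value * (1 - tol), value * (1 + tol))

entry = lookup(suite, ["Layer", "Conv", "ZPlusRule", "apply rule"])
lo, hi = band(entry.time_us, 0.05)
println("time within [", lo, ", ", hi, "] μs; allocations: ", entry.allocs)
```

With the 5% time tolerance, the reported 941.594 μs corresponds to a band of roughly 894.5 μs to 988.7 μs.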

Baseline result

Benchmark Report for /home/runner/work/ExplainableAI.jl/ExplainableAI.jl

Job Properties

  • Time of benchmark: 11 Nov 2022 - 16:43
  • Package commit: 2c9c6a
  • Julia commit: 36034a
  • Julia command flags: None
  • Environment variables: None

Results

Below is a table of this job's results, obtained by running the benchmarks.
The values listed in the ID column have the structure [parent_group, child_group, ..., key], and can be used to
index into the BaseBenchmarks suite to retrieve the corresponding benchmarks.
The percentages accompanying time and memory values in the below table are noise tolerances. The "true"
time/memory value for a given benchmark is expected to fall within this percentage of the reported value.
An empty cell means that the value was zero.

ID time GC time memory allocations
["Layer", "Conv", "AlphaBetaRule", "apply rule"] 1.891 ms (5%) 5.58 MiB (1%) 496
["Layer", "Conv", "AlphaBetaRule", "modify layer"] 2.700 μs (5%) 5.53 KiB (1%) 46
["Layer", "Conv", "EpsilonRule", "apply rule"] 442.499 μs (5%) 1.36 MiB (1%) 120
["Layer", "Conv", "EpsilonRule", "modify layer"] 3.199 ns (5%)
["Layer", "Conv", "FlatRule", "apply rule"] 449.997 μs (5%) 1.41 MiB (1%) 123
["Layer", "Conv", "FlatRule", "modify layer"] 108.972 ns (5%) 400 bytes (1%) 3
["Layer", "Conv", "GammaRule", "apply rule"] 447.497 μs (5%) 1.36 MiB (1%) 120
["Layer", "Conv", "GammaRule", "modify layer"] 260.113 ns (5%) 400 bytes (1%) 3
["Layer", "Conv", "WSquareRule", "apply rule"] 452.697 μs (5%) 1.41 MiB (1%) 123
["Layer", "Conv", "WSquareRule", "modify layer"] 181.967 ns (5%) 400 bytes (1%) 3
["Layer", "Conv", "ZBoxRule", "apply rule"] 1.363 ms (5%) 4.12 MiB (1%) 360
["Layer", "Conv", "ZBoxRule", "modify layer"] 1.770 μs (5%) 3.25 KiB (1%) 28
["Layer", "Conv", "ZPlusRule", "apply rule"] 942.697 μs (5%) 3.03 MiB (1%) 268
["Layer", "Conv", "ZPlusRule", "modify layer"] 1.550 μs (5%) 2.77 KiB (1%) 23
["Layer", "Conv", "ZeroRule", "apply rule"] 443.798 μs (5%) 1.36 MiB (1%) 120
["Layer", "Conv", "ZeroRule", "modify layer"] 3.599 ns (5%)
["Layer", "Dense", "AlphaBetaRule", "apply rule"] 154.099 μs (5%) 814.62 KiB (1%) 56
["Layer", "Dense", "AlphaBetaRule", "modify layer"] 301.798 μs (5%) 1.57 MiB (1%) 58
["Layer", "Dense", "EpsilonRule", "apply rule"] 20.000 μs (5%) 1.45 KiB (1%) 3
["Layer", "Dense", "EpsilonRule", "modify layer"] 4.799 ns (5%)
["Layer", "Dense", "FlatRule", "apply rule"] 91.777 ns (5%)
["Layer", "Dense", "FlatRule", "modify layer"] 15.099 μs (5%) 195.84 KiB (1%) 3
["Layer", "Dense", "GammaRule", "apply rule"] 20.099 μs (5%) 1.45 KiB (1%) 3
["Layer", "Dense", "GammaRule", "modify layer"] 16.600 μs (5%) 195.84 KiB (1%) 3
["Layer", "Dense", "WSquareRule", "apply rule"] 18.499 μs (5%) 576 bytes (1%) 5
["Layer", "Dense", "WSquareRule", "modify layer"] 16.800 μs (5%) 195.84 KiB (1%) 3
["Layer", "Dense", "ZBoxRule", "apply rule"] 113.000 μs (5%) 602.48 KiB (1%) 32
["Layer", "Dense", "ZBoxRule", "modify layer"] 154.599 μs (5%) 805.25 KiB (1%) 34
["Layer", "Dense", "ZPlusRule", "apply rule"] 73.799 μs (5%) 422.39 KiB (1%) 40
["Layer", "Dense", "ZPlusRule", "modify layer"] 149.699 μs (5%) 804.75 KiB (1%) 29
["Layer", "Dense", "ZeroRule", "apply rule"] 19.999 μs (5%) 1.45 KiB (1%) 3
["Layer", "Dense", "ZeroRule", "modify layer"] 4.599 ns (5%)
["VGG", "Gradient", "analyze"] 1.372 s (5%) 8.617 ms 1.10 GiB (1%) 1887
["VGG", "Gradient", "construct analyzer"] 9.900 μs (5%) 11.45 KiB (1%) 200
["VGG", "InputTimesGradient", "analyze"] 15.628 s (5%) 292.292 ms 1.62 GiB (1%) 10263538
["VGG", "InputTimesGradient", "construct analyzer"] 8.900 μs (5%) 11.45 KiB (1%) 200
["VGG", "IntegratedGradients", "analyze"] 6.677 s (5%) 70.885 ms 5.30 GiB (1%) 8273
["VGG", "IntegratedGradients", "construct analyzer"] 9.033 μs (5%) 9.48 KiB (1%) 198
["VGG", "LREpsilonPlusFlat", "analyze"] 3.847 s (5%) 82.582 ms 1.29 GiB (1%) 3465156
["VGG", "LREpsilonPlusFlat", "construct analyzer"] 73.499 ms (5%) 143.04 MiB (1%) 520
["VGG", "LRP", "analyze"] 1.283 s (5%) 5.087 ms 704.88 MiB (1%) 1681
["VGG", "LRP", "construct analyzer"] 20.300 μs (5%) 73.23 KiB (1%) 171
["VGG", "SmoothGrad", "analyze"] 6.968 s (5%) 97.400 ms 5.30 GiB (1%) 107776
["VGG", "SmoothGrad", "construct analyzer"] 8.833 μs (5%) 9.48 KiB (1%) 198

Runtime information

Runtime Info
BLAS #threads     1
BLAS.vendor()     openblas64
Sys.CPU_THREADS   2

lscpu output:

Architecture:                    x86_64
CPU op-mode(s):                  32-bit, 64-bit
Byte Order:                      Little Endian
Address sizes:                   46 bits physical, 48 bits virtual
CPU(s):                          2
On-line CPU(s) list:             0,1
Thread(s) per core:              1
Core(s) per socket:              2
Socket(s):                       1
NUMA node(s):                    1
Vendor ID:                       GenuineIntel
CPU family:                      6
Model:                           106
Model name:                      Intel(R) Xeon(R) Platinum 8370C CPU @ 2.80GHz
Stepping:                        6
CPU MHz:                         2793.438
BogoMIPS:                        5586.87
Hypervisor vendor:               Microsoft
Virtualization type:             full
L1d cache:                       96 KiB
L1i cache:                       64 KiB
L2 cache:                        2.5 MiB
L3 cache:                        48 MiB
NUMA node0 CPU(s):               0,1
Vulnerability Itlb multihit:     KVM: Mitigation: VMX unsupported
Vulnerability L1tf:              Mitigation; PTE Inversion
Vulnerability Mds:               Mitigation; Clear CPU buffers; SMT Host state unknown
Vulnerability Meltdown:          Mitigation; PTI
Vulnerability Mmio stale data:   Vulnerable: Clear CPU buffers attempted, no microcode; SMT Host state unknown
Vulnerability Retbleed:          Not affected
Vulnerability Spec store bypass: Vulnerable
Vulnerability Spectre v1:        Mitigation; usercopy/swapgs barriers and __user pointer sanitization
Vulnerability Spectre v2:        Mitigation; Retpolines, STIBP disabled, RSB filling, PBRSB-eIBRS Not affected
Vulnerability Srbds:             Not affected
Vulnerability Tsx async abort:   Mitigation; Clear CPU buffers; SMT Host state unknown
Flags:                           fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc rep_good nopl xtopology cpuid pni pclmulqdq ssse3 fma cx16 pcid sse4_1 sse4_2 movbe popcnt aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase bmi1 hle avx2 smep bmi2 erms invpcid rtm avx512f avx512dq rdseed adx smap clflushopt avx512cd avx512bw avx512vl xsaveopt xsavec xsaves md_clear
Cpu Property        Value
Brand               Intel(R) Xeon(R) Platinum 8370C CPU @ 2.80GHz
Vendor              :Intel
Architecture        :UnknownIntel
Model               Family: 0x06, Model: 0x6a, Stepping: 0x06, Type: 0x00
Cores               2 physical cores, 2 logical cores (on executing CPU)
                    No Hyperthreading hardware capability detected
Clock Frequencies   Not supported by CPU
Data Cache          Level 1:3 : (48, 1280, 49152) kbytes
                    64 byte cache line size
Address Size        48 bits virtual, 46 bits physical
SIMD                512 bit = 64 byte max. SIMD vector size
Time Stamp Counter  TSC is accessible via rdtsc
                    TSC increased at every clock cycle (non-invariant TSC)
Perf. Monitoring    Performance Monitoring Counters (PMC) are not supported
Hypervisor          Yes, Microsoft

@github-actions

Benchmark result

Judge result

Benchmark Report for /home/runner/work/ExplainableAI.jl/ExplainableAI.jl

Job Properties

  • Time of benchmarks:
    • Target: 11 Nov 2022 - 16:38
    • Baseline: 11 Nov 2022 - 16:43
  • Package commits:
    • Target: 599c38
    • Baseline: 2c9c6a
  • Julia commits:
    • Target: 36034a
    • Baseline: 36034a
  • Julia command flags:
    • Target: None
    • Baseline: None
  • Environment variables:
    • Target: None
    • Baseline: None

Results

A ratio greater than 1.0 denotes a possible regression (marked with ❌), while a ratio less
than 1.0 denotes a possible improvement (marked with ✅). Only significant results - results
that indicate possible regressions or improvements - are shown below (thus, an empty table means that all
benchmark results remained invariant between builds).

ID                                                    time ratio    memory ratio
["Layer", "Conv", "ZBoxRule", "modify layer"]         0.82 (5%) ✅   1.00 (1%)
["Layer", "Conv", "ZPlusRule", "apply rule"]          1.00 (5%)     0.95 (1%) ✅
["Layer", "Conv", "ZPlusRule", "modify layer"]        0.80 (5%) ✅   0.99 (1%)
["Layer", "Dense", "FlatRule", "apply rule"]          0.95 (5%) ✅   1.00 (1%)
["Layer", "Dense", "GammaRule", "modify layer"]       1.10 (5%) ❌   1.00 (1%)
["Layer", "Dense", "ZPlusRule", "apply rule"]         1.01 (5%)     0.99 (1%) ✅
["Layer", "Dense", "ZeroRule", "modify layer"]        0.85 (5%) ✅   1.00 (1%)
["VGG", "Gradient", "construct analyzer"]             0.91 (5%) ✅   1.00 (1%)
["VGG", "InputTimesGradient", "analyze"]              0.09 (5%) ✅   0.68 (1%) ✅
["VGG", "InputTimesGradient", "construct analyzer"]   1.06 (5%) ❌   1.00 (1%)
["VGG", "LREpsilonPlusFlat", "analyze"]               1.00 (5%)     0.97 (1%) ✅

Benchmark Group List

Here's a list of all the benchmark groups executed by this job:

  • ["Layer", "Conv", "AlphaBetaRule"]
  • ["Layer", "Conv", "EpsilonRule"]
  • ["Layer", "Conv", "FlatRule"]
  • ["Layer", "Conv", "GammaRule"]
  • ["Layer", "Conv", "WSquareRule"]
  • ["Layer", "Conv", "ZBoxRule"]
  • ["Layer", "Conv", "ZPlusRule"]
  • ["Layer", "Conv", "ZeroRule"]
  • ["Layer", "Dense", "AlphaBetaRule"]
  • ["Layer", "Dense", "EpsilonRule"]
  • ["Layer", "Dense", "FlatRule"]
  • ["Layer", "Dense", "GammaRule"]
  • ["Layer", "Dense", "WSquareRule"]
  • ["Layer", "Dense", "ZBoxRule"]
  • ["Layer", "Dense", "ZPlusRule"]
  • ["Layer", "Dense", "ZeroRule"]
  • ["VGG", "Gradient"]
  • ["VGG", "InputTimesGradient"]
  • ["VGG", "IntegratedGradients"]
  • ["VGG", "LREpsilonPlusFlat"]
  • ["VGG", "LRP"]
  • ["VGG", "SmoothGrad"]

Julia versioninfo

Target

Julia Version 1.8.2
Commit 36034abf260 (2022-09-29 15:21 UTC)
Platform Info:
  OS: Linux (x86_64-linux-gnu)
      Ubuntu 20.04.5 LTS
  uname: Linux 5.15.0-1022-azure #27~20.04.1-Ubuntu SMP Mon Oct 17 02:03:50 UTC 2022 x86_64 x86_64
  CPU: Intel(R) Xeon(R) Platinum 8370C CPU @ 2.80GHz: 
              speed         user         nice          sys         idle          irq
       #1  2793 MHz       6509 s          1 s        511 s       2024 s          0 s
       #2  2793 MHz       3118 s          1 s        280 s       5643 s          0 s
  Memory: 6.78125 GB (1744.57421875 MB free)
  Uptime: 911.41 sec
  Load Avg:  1.04  1.13  0.89
  WORD_SIZE: 64
  LIBM: libopenlibm
  LLVM: libLLVM-13.0.1 (ORCJIT, icelake-server)
  Threads: 1 on 2 virtual cores

Baseline

Julia Version 1.8.2
Commit 36034abf260 (2022-09-29 15:21 UTC)
Platform Info:
  OS: Linux (x86_64-linux-gnu)
      Ubuntu 20.04.5 LTS
  uname: Linux 5.15.0-1022-azure #27~20.04.1-Ubuntu SMP Mon Oct 17 02:03:50 UTC 2022 x86_64 x86_64
  CPU: Intel(R) Xeon(R) Platinum 8370C CPU @ 2.80GHz: 
              speed         user         nice          sys         idle          irq
       #1  2793 MHz       6576 s          1 s        524 s       5361 s          0 s
       #2  2793 MHz       6386 s          1 s        382 s       5700 s          0 s
  Memory: 6.78125 GB (2900.125 MB free)
  Uptime: 1254.2 sec
  Load Avg:  1.0  1.03  0.93
  WORD_SIZE: 64
  LIBM: libopenlibm
  LLVM: libLLVM-13.0.1 (ORCJIT, icelake-server)
  Threads: 1 on 2 virtual cores

Target result

Benchmark Report for /home/runner/work/ExplainableAI.jl/ExplainableAI.jl

Job Properties

  • Time of benchmark: 11 Nov 2022 - 16:38
  • Package commit: 599c38
  • Julia commit: 36034a
  • Julia command flags: None
  • Environment variables: None

Results

Below is a table of this job's results, obtained by running the benchmarks.
The values listed in the ID column have the structure [parent_group, child_group, ..., key], and can be used to
index into the BaseBenchmarks suite to retrieve the corresponding benchmarks.
The percentages accompanying time and memory values in the below table are noise tolerances. The "true"
time/memory value for a given benchmark is expected to fall within this percentage of the reported value.
An empty cell means that the value was zero.

ID time GC time memory allocations
["Layer", "Conv", "AlphaBetaRule", "apply rule"] 1.886 ms (5%) 5.58 MiB (1%) 496
["Layer", "Conv", "AlphaBetaRule", "modify layer"] 2.656 μs (5%) 5.53 KiB (1%) 46
["Layer", "Conv", "EpsilonRule", "apply rule"] 443.899 μs (5%) 1.36 MiB (1%) 120
["Layer", "Conv", "EpsilonRule", "modify layer"] 2.799 ns (5%)
["Layer", "Conv", "FlatRule", "apply rule"] 448.698 μs (5%) 1.41 MiB (1%) 123
["Layer", "Conv", "FlatRule", "modify layer"] 107.527 ns (5%) 400 bytes (1%) 3
["Layer", "Conv", "GammaRule", "apply rule"] 444.898 μs (5%) 1.36 MiB (1%) 120
["Layer", "Conv", "GammaRule", "modify layer"] 261.677 ns (5%) 400 bytes (1%) 3
["Layer", "Conv", "WSquareRule", "apply rule"] 451.198 μs (5%) 1.41 MiB (1%) 123
["Layer", "Conv", "WSquareRule", "modify layer"] 186.276 ns (5%) 400 bytes (1%) 3
["Layer", "Conv", "ZBoxRule", "apply rule"] 1.361 ms (5%) 4.12 MiB (1%) 360
["Layer", "Conv", "ZBoxRule", "modify layer"] 1.590 μs (5%) 3.25 KiB (1%) 28
["Layer", "Conv", "ZPlusRule", "apply rule"] 940.297 μs (5%) 2.89 MiB (1%) 259
["Layer", "Conv", "ZPlusRule", "modify layer"] 1.340 μs (5%) 2.77 KiB (1%) 23
["Layer", "Conv", "ZeroRule", "apply rule"] 442.598 μs (5%) 1.36 MiB (1%) 120
["Layer", "Conv", "ZeroRule", "modify layer"] 3.199 ns (5%)
["Layer", "Dense", "AlphaBetaRule", "apply rule"] 156.499 μs (5%) 814.67 KiB (1%) 56
["Layer", "Dense", "AlphaBetaRule", "modify layer"] 308.899 μs (5%) 1.57 MiB (1%) 58
["Layer", "Dense", "EpsilonRule", "apply rule"] 19.500 μs (5%) 1.45 KiB (1%) 3
["Layer", "Dense", "EpsilonRule", "modify layer"] 4.799 ns (5%)
["Layer", "Dense", "FlatRule", "apply rule"] 79.813 ns (5%)
["Layer", "Dense", "FlatRule", "modify layer"] 16.100 μs (5%) 195.84 KiB (1%) 3
["Layer", "Dense", "GammaRule", "apply rule"] 19.599 μs (5%) 1.45 KiB (1%) 3
["Layer", "Dense", "GammaRule", "modify layer"] 17.900 μs (5%) 195.84 KiB (1%) 3
["Layer", "Dense", "WSquareRule", "apply rule"] 18.700 μs (5%) 576 bytes (1%) 5
["Layer", "Dense", "WSquareRule", "modify layer"] 17.100 μs (5%) 195.84 KiB (1%) 3
["Layer", "Dense", "ZBoxRule", "apply rule"] 107.900 μs (5%) 602.48 KiB (1%) 32
["Layer", "Dense", "ZBoxRule", "modify layer"] 154.400 μs (5%) 805.25 KiB (1%) 34
["Layer", "Dense", "ZPlusRule", "apply rule"] 72.799 μs (5%) 416.25 KiB (1%) 37
["Layer", "Dense", "ZPlusRule", "modify layer"] 153.300 μs (5%) 804.75 KiB (1%) 29
["Layer", "Dense", "ZeroRule", "apply rule"] 19.499 μs (5%) 1.45 KiB (1%) 3
["Layer", "Dense", "ZeroRule", "modify layer"] 4.599 ns (5%)
["VGG", "Gradient", "analyze"] 1.381 s (5%) 10.214 ms 1.10 GiB (1%) 1887
["VGG", "Gradient", "construct analyzer"] 8.466 μs (5%) 11.45 KiB (1%) 200
["VGG", "InputTimesGradient", "analyze"] 1.391 s (5%) 18.254 ms 1.10 GiB (1%) 1890
["VGG", "InputTimesGradient", "construct analyzer"] 8.667 μs (5%) 11.45 KiB (1%) 200
["VGG", "IntegratedGradients", "analyze"] 6.803 s (5%) 39.723 ms 5.30 GiB (1%) 8273
["VGG", "IntegratedGradients", "construct analyzer"] 8.067 μs (5%) 9.48 KiB (1%) 198
["VGG", "LREpsilonPlusFlat", "analyze"] 1.850 s (5%) 10.939 ms 1.10 GiB (1%) 2670
["VGG", "LREpsilonPlusFlat", "construct analyzer"] 73.974 ms (5%) 143.04 MiB (1%) 520
["VGG", "LRP", "analyze"] 1.285 s (5%) 10.805 ms 704.88 MiB (1%) 1681
["VGG", "LRP", "construct analyzer"] 20.600 μs (5%) 73.23 KiB (1%) 171
["VGG", "SmoothGrad", "analyze"] 6.931 s (5%) 40.135 ms 5.30 GiB (1%) 8228
["VGG", "SmoothGrad", "construct analyzer"] 8.050 μs (5%) 9.48 KiB (1%) 198

Baseline result

Benchmark Report for /home/runner/work/ExplainableAI.jl/ExplainableAI.jl

Job Properties

  • Time of benchmark: 11 Nov 2022 - 16:43
  • Package commit: 2c9c6a
  • Julia commit: 36034a
  • Julia command flags: None
  • Environment variables: None

Results

Below is a table of this job's results, obtained by running the benchmarks.
The values listed in the ID column have the structure [parent_group, child_group, ..., key], and can be used to
index into the BaseBenchmarks suite to retrieve the corresponding benchmarks.
The percentages accompanying time and memory values in the below table are noise tolerances. The "true"
time/memory value for a given benchmark is expected to fall within this percentage of the reported value.
An empty cell means that the value was zero.

| ID | time | GC time | memory | allocations |
|:---|---:|---:|---:|---:|
| ["Layer", "Conv", "AlphaBetaRule", "apply rule"] | 1.860 ms (5%) | | 5.58 MiB (1%) | 496 |
| ["Layer", "Conv", "AlphaBetaRule", "modify layer"] | 2.722 μs (5%) | | 5.56 KiB (1%) | 46 |
| ["Layer", "Conv", "EpsilonRule", "apply rule"] | 442.297 μs (5%) | | 1.36 MiB (1%) | 120 |
| ["Layer", "Conv", "EpsilonRule", "modify layer"] | 2.799 ns (5%) | | | |
| ["Layer", "Conv", "FlatRule", "apply rule"] | 445.998 μs (5%) | | 1.41 MiB (1%) | 123 |
| ["Layer", "Conv", "FlatRule", "modify layer"] | 111.934 ns (5%) | | 400 bytes (1%) | 3 |
| ["Layer", "Conv", "GammaRule", "apply rule"] | 442.899 μs (5%) | | 1.36 MiB (1%) | 120 |
| ["Layer", "Conv", "GammaRule", "modify layer"] | 269.458 ns (5%) | | 400 bytes (1%) | 3 |
| ["Layer", "Conv", "WSquareRule", "apply rule"] | 447.600 μs (5%) | | 1.41 MiB (1%) | 123 |
| ["Layer", "Conv", "WSquareRule", "modify layer"] | 184.234 ns (5%) | | 400 bytes (1%) | 3 |
| ["Layer", "Conv", "ZBoxRule", "apply rule"] | 1.359 ms (5%) | | 4.12 MiB (1%) | 360 |
| ["Layer", "Conv", "ZBoxRule", "modify layer"] | 1.950 μs (5%) | | 3.27 KiB (1%) | 28 |
| ["Layer", "Conv", "ZPlusRule", "apply rule"] | 943.894 μs (5%) | | 3.03 MiB (1%) | 268 |
| ["Layer", "Conv", "ZPlusRule", "modify layer"] | 1.680 μs (5%) | | 2.78 KiB (1%) | 23 |
| ["Layer", "Conv", "ZeroRule", "apply rule"] | 443.598 μs (5%) | | 1.36 MiB (1%) | 120 |
| ["Layer", "Conv", "ZeroRule", "modify layer"] | 3.199 ns (5%) | | | |
| ["Layer", "Dense", "AlphaBetaRule", "apply rule"] | 151.699 μs (5%) | | 814.64 KiB (1%) | 56 |
| ["Layer", "Dense", "AlphaBetaRule", "modify layer"] | 301.999 μs (5%) | | 1.57 MiB (1%) | 58 |
| ["Layer", "Dense", "EpsilonRule", "apply rule"] | 18.899 μs (5%) | | 1.45 KiB (1%) | 3 |
| ["Layer", "Dense", "EpsilonRule", "modify layer"] | 4.799 ns (5%) | | | |
| ["Layer", "Dense", "FlatRule", "apply rule"] | 84.160 ns (5%) | | | |
| ["Layer", "Dense", "FlatRule", "modify layer"] | 15.400 μs (5%) | | 195.84 KiB (1%) | 3 |
| ["Layer", "Dense", "GammaRule", "apply rule"] | 18.899 μs (5%) | | 1.45 KiB (1%) | 3 |
| ["Layer", "Dense", "GammaRule", "modify layer"] | 16.299 μs (5%) | | 195.84 KiB (1%) | 3 |
| ["Layer", "Dense", "WSquareRule", "apply rule"] | 18.499 μs (5%) | | 576 bytes (1%) | 5 |
| ["Layer", "Dense", "WSquareRule", "modify layer"] | 16.600 μs (5%) | | 195.84 KiB (1%) | 3 |
| ["Layer", "Dense", "ZBoxRule", "apply rule"] | 105.200 μs (5%) | | 602.48 KiB (1%) | 32 |
| ["Layer", "Dense", "ZBoxRule", "modify layer"] | 150.899 μs (5%) | | 805.25 KiB (1%) | 34 |
| ["Layer", "Dense", "ZPlusRule", "apply rule"] | 72.100 μs (5%) | | 422.41 KiB (1%) | 40 |
| ["Layer", "Dense", "ZPlusRule", "modify layer"] | 148.799 μs (5%) | | 804.75 KiB (1%) | 29 |
| ["Layer", "Dense", "ZeroRule", "apply rule"] | 18.899 μs (5%) | | 1.45 KiB (1%) | 3 |
| ["Layer", "Dense", "ZeroRule", "modify layer"] | 5.400 ns (5%) | | | |
| ["VGG", "Gradient", "analyze"] | 1.391 s (5%) | 9.878 ms | 1.10 GiB (1%) | 1887 |
| ["VGG", "Gradient", "construct analyzer"] | 9.300 μs (5%) | | 11.45 KiB (1%) | 200 |
| ["VGG", "InputTimesGradient", "analyze"] | 15.943 s (5%) | 305.062 ms | 1.62 GiB (1%) | 10276394 |
| ["VGG", "InputTimesGradient", "construct analyzer"] | 8.167 μs (5%) | | 11.45 KiB (1%) | 200 |
| ["VGG", "IntegratedGradients", "analyze"] | 6.749 s (5%) | 72.006 ms | 5.30 GiB (1%) | 8273 |
| ["VGG", "IntegratedGradients", "construct analyzer"] | 8.167 μs (5%) | | 9.48 KiB (1%) | 198 |
| ["VGG", "LREpsilonPlusFlat", "analyze"] | 1.848 s (5%) | 20.053 ms | 1.13 GiB (1%) | 2733 |
| ["VGG", "LREpsilonPlusFlat", "construct analyzer"] | 73.262 ms (5%) | | 143.04 MiB (1%) | 520 |
| ["VGG", "LRP", "analyze"] | 1.300 s (5%) | 21.662 ms | 704.88 MiB (1%) | 1681 |
| ["VGG", "LRP", "construct analyzer"] | 21.000 μs (5%) | | 73.23 KiB (1%) | 171 |
| ["VGG", "SmoothGrad", "analyze"] | 7.121 s (5%) | 101.540 ms | 5.30 GiB (1%) | 107812 |
| ["VGG", "SmoothGrad", "construct analyzer"] | 7.925 μs (5%) | | 9.48 KiB (1%) | 198 |
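
As a worked reading of the noise tolerances in the table above, the sketch below turns a reported value and its percentage into the interval in which the "true" value is expected to fall. The helper name `tolerance_band` is an assumption made for illustration; the two inputs are taken from the ["VGG", "LRP", "analyze"] row.

```julia
# Interval implied by a reported value and its noise tolerance (given as a fraction).
# `tolerance_band` is a hypothetical helper, not part of any benchmarking package.
tolerance_band(value, tol) = (value * (1 - tol), value * (1 + tol))

# ["VGG", "LRP", "analyze"]: time 1.300 s (5%), memory 704.88 MiB (1%)
time_lower, time_upper = tolerance_band(1.300, 0.05)
mem_lower, mem_upper = tolerance_band(704.88, 0.01)

println("time:   ", round(time_lower; digits=3), " s to ", round(time_upper; digits=3), " s")
println("memory: ", round(mem_lower; digits=2), " MiB to ", round(mem_upper; digits=2), " MiB")
```

Read this way, two runs whose time intervals overlap cannot be told apart at the 5% tolerance.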

Benchmark Group List

Here's a list of all the benchmark groups executed by this job:

  • ["Layer", "Conv", "AlphaBetaRule"]
  • ["Layer", "Conv", "EpsilonRule"]
  • ["Layer", "Conv", "FlatRule"]
  • ["Layer", "Conv", "GammaRule"]
  • ["Layer", "Conv", "WSquareRule"]
  • ["Layer", "Conv", "ZBoxRule"]
  • ["Layer", "Conv", "ZPlusRule"]
  • ["Layer", "Conv", "ZeroRule"]
  • ["Layer", "Dense", "AlphaBetaRule"]
  • ["Layer", "Dense", "EpsilonRule"]
  • ["Layer", "Dense", "FlatRule"]
  • ["Layer", "Dense", "GammaRule"]
  • ["Layer", "Dense", "WSquareRule"]
  • ["Layer", "Dense", "ZBoxRule"]
  • ["Layer", "Dense", "ZPlusRule"]
  • ["Layer", "Dense", "ZeroRule"]
  • ["VGG", "Gradient"]
  • ["VGG", "InputTimesGradient"]
  • ["VGG", "IntegratedGradients"]
  • ["VGG", "LREpsilonPlusFlat"]
  • ["VGG", "LRP"]
  • ["VGG", "SmoothGrad"]

Julia versioninfo

Julia Version 1.8.2
Commit 36034abf260 (2022-09-29 15:21 UTC)
Platform Info:
  OS: Linux (x86_64-linux-gnu)
      Ubuntu 20.04.5 LTS
  uname: Linux 5.15.0-1022-azure #27~20.04.1-Ubuntu SMP Mon Oct 17 02:03:50 UTC 2022 x86_64 x86_64
  CPU: Intel(R) Xeon(R) Platinum 8370C CPU @ 2.80GHz: 
              speed         user         nice          sys         idle          irq
       #1  2793 MHz       6576 s          1 s        524 s       5361 s          0 s
       #2  2793 MHz       6386 s          1 s        382 s       5700 s          0 s
  Memory: 6.78125 GB (2900.125 MB free)
  Uptime: 1254.2 sec
  Load Avg:  1.0  1.03  0.93
  WORD_SIZE: 64
  LIBM: libopenlibm
  LLVM: libLLVM-13.0.1 (ORCJIT, icelake-server)
  Threads: 1 on 2 virtual cores

Runtime information

| Runtime Info | |
|:---|:---|
| BLAS #threads | 1 |
| BLAS.vendor() | openblas64 |
| Sys.CPU_THREADS | 2 |
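
To compare a local machine against the runtime values above, the thread counts can be queried from the standard library. This is a minimal sketch, assuming Julia 1.6 or later (for `BLAS.get_num_threads`); it is not the code the benchmark job itself runs.

```julia
using LinearAlgebra  # provides the BLAS submodule

blas_threads = BLAS.get_num_threads()  # threads used by the BLAS backend
cpu_threads = Sys.CPU_THREADS          # logical CPU threads visible to Julia

println("BLAS #threads   ", blas_threads)
println("Sys.CPU_THREADS ", cpu_threads)
```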

lscpu output:

Architecture:                    x86_64
CPU op-mode(s):                  32-bit, 64-bit
Byte Order:                      Little Endian
Address sizes:                   46 bits physical, 48 bits virtual
CPU(s):                          2
On-line CPU(s) list:             0,1
Thread(s) per core:              1
Core(s) per socket:              2
Socket(s):                       1
NUMA node(s):                    1
Vendor ID:                       GenuineIntel
CPU family:                      6
Model:                           106
Model name:                      Intel(R) Xeon(R) Platinum 8370C CPU @ 2.80GHz
Stepping:                        6
CPU MHz:                         2793.435
BogoMIPS:                        5586.87
Hypervisor vendor:               Microsoft
Virtualization type:             full
L1d cache:                       96 KiB
L1i cache:                       64 KiB
L2 cache:                        2.5 MiB
L3 cache:                        48 MiB
NUMA node0 CPU(s):               0,1
Vulnerability Itlb multihit:     KVM: Mitigation: VMX unsupported
Vulnerability L1tf:              Mitigation; PTE Inversion
Vulnerability Mds:               Mitigation; Clear CPU buffers; SMT Host state unknown
Vulnerability Meltdown:          Mitigation; PTI
Vulnerability Mmio stale data:   Vulnerable: Clear CPU buffers attempted, no microcode; SMT Host state unknown
Vulnerability Retbleed:          Not affected
Vulnerability Spec store bypass: Vulnerable
Vulnerability Spectre v1:        Mitigation; usercopy/swapgs barriers and __user pointer sanitization
Vulnerability Spectre v2:        Mitigation; Retpolines, STIBP disabled, RSB filling, PBRSB-eIBRS Not affected
Vulnerability Srbds:             Not affected
Vulnerability Tsx async abort:   Mitigation; Clear CPU buffers; SMT Host state unknown
Flags:                           fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc rep_good nopl xtopology cpuid pni pclmulqdq ssse3 fma cx16 pcid sse4_1 sse4_2 movbe popcnt aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase bmi1 hle avx2 smep bmi2 erms invpcid rtm avx512f avx512dq rdseed adx smap clflushopt avx512cd avx512bw avx512vl xsaveopt xsavec xsaves md_clear
| Cpu Property | Value |
|:---|:---|
| Brand | Intel(R) Xeon(R) Platinum 8370C CPU @ 2.80GHz |
| Vendor | :Intel |
| Architecture | :UnknownIntel |
| Model | Family: 0x06, Model: 0x6a, Stepping: 0x06, Type: 0x00 |
| Cores | 2 physical cores, 2 logical cores (on executing CPU) |
| | No Hyperthreading hardware capability detected |
| Clock Frequencies | Not supported by CPU |
| Data Cache | Level 1:3 : (48, 1280, 49152) kbytes |
| | 64 byte cache line size |
| Address Size | 48 bits virtual, 46 bits physical |
| SIMD | 512 bit = 64 byte max. SIMD vector size |
| Time Stamp Counter | TSC is accessible via rdtsc |
| | TSC increased at every clock cycle (non-invariant TSC) |
| Perf. Monitoring | Performance Monitoring Counters (PMC) are not supported |
| Hypervisor | Yes, Microsoft |

@adrhill adrhill removed the run benchmark Run benchmark CI action label Nov 11, 2022
@adrhill adrhill merged commit 148b5b6 into master Nov 11, 2022
@adrhill adrhill deleted the ah/lrp-perf branch November 11, 2022 16:50