Make parallel reduce work with Dict #268

Merged (5 commits) on May 11, 2020
@tkf (Member) commented May 11, 2020

needs JuliaFolds/SplittablesBase.jl#22

Commit Message

Make parallel reduce work with Dict (#268)

Use SplittablesBase.amount instead of length for implementing
parallel reduce. Dict still does not work with dreduce because
it uses index-based partitioning.
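
The difference between splitting by `length` and splitting by an `amount`-style size can be illustrated with a small, language-neutral sketch. Everything below is hypothetical: `SlotTable`, `amount`, `halve`, `split`, and `parallel_reduce` are toy stand-ins written for this note, not the Transducers.jl or SplittablesBase.jl implementation. The assumption is a hash table stored as a slot array with gaps, so elements cannot be addressed by position, but the slot range can still be cut in half.

```python
from concurrent.futures import ThreadPoolExecutor
from functools import reduce
from operator import add


class SlotTable:
    """Toy hash-table-like container: a list of slots, some empty (None).

    Hypothetical stand-in for a Dict; not the real memory layout.
    """

    def __init__(self, slots, lo=0, hi=None):
        self.slots = slots
        self.lo = lo
        self.hi = len(slots) if hi is None else hi

    def __iter__(self):
        # Yield only the occupied slots in this range, as (key, value) pairs.
        for i in range(self.lo, self.hi):
            if self.slots[i] is not None:
                yield self.slots[i]


def amount(table):
    """Upper bound on the element count: the number of slots in the range."""
    return table.hi - table.lo


def halve(table):
    """Split the slot range in two without indexing individual elements."""
    mid = (table.lo + table.hi) // 2
    return SlotTable(table.slots, table.lo, mid), SlotTable(table.slots, mid, table.hi)


def split(table, basesize):
    """Recursively halve until each chunk has at most `basesize` slots (basesize >= 1)."""
    if amount(table) <= basesize:
        return [table]
    left, right = halve(table)
    return split(left, basesize) + split(right, basesize)


def parallel_reduce(op, f, table, init, basesize=2):
    """Reduce each chunk in a thread pool, then combine the partial results in order.

    `init` is assumed to be an identity element for `op`.
    """
    chunks = split(table, basesize)
    with ThreadPoolExecutor() as pool:
        partials = list(pool.map(lambda chunk: reduce(op, map(f, chunk), init), chunks))
    return reduce(op, partials, init)


slots = [None, ("a", 1), None, ("b", 2), ("c", 3), None, None, ("d", 4)]
table = SlotTable(slots)
total = parallel_reduce(add, lambda kv: kv[1], table, 0)
print(total)
```

In this sketch the splitting logic only ever asks for `amount` and `halve`, so it never needs to know how many occupied slots a chunk holds or where the n-th element lives. A partitioning scheme that addresses elements by index would have no equivalent operation on such a container.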

@github-actions (Contributor)

Benchmark result

Judge result

Benchmark Report for /home/runner/work/Transducers.jl/Transducers.jl

Job Properties

  • Time of benchmarks:
    • Target: 11 May 2020 - 05:16
    • Baseline: 11 May 2020 - 05:20
  • Package commits:
    • Target: 22d6ce
    • Baseline: 38c606
  • Julia commits:
    • Target: 381693
    • Baseline: 381693
  • Julia command flags:
    • Target: None
    • Baseline: None
  • Environment variables:
    • Target: OMP_NUM_THREADS => 1 JULIA_NUM_THREADS => 1
    • Baseline: OMP_NUM_THREADS => 1 JULIA_NUM_THREADS => 1

Results

A ratio greater than 1.0 denotes a possible regression (marked with ❌), while a ratio less
than 1.0 denotes a possible improvement (marked with ✅). Only significant results - results
that indicate possible regressions or improvements - are shown below (thus, an empty table means that all
benchmark results remained invariant between builds).
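
A minimal sketch of how such a verdict could be derived, assuming the ratio is target time divided by baseline time and that a result counts as significant only when it differs from 1.0 by more than the tolerance shown in parentheses. This rule is an assumption that happens to agree with the rows in this report; the function name `judge` and the labels are illustrative only. The sample times are taken from the target and baseline result tables of this comment.

```python
def judge(ratio, tolerance):
    """Classify a target/baseline ratio against a noise tolerance (assumed rule)."""
    if ratio - tolerance > 1.0:
        return "regression"
    if ratio + tolerance < 1.0:
        return "improvement"
    return "invariant"


# (target time, baseline time) in the same unit, copied from the result tables.
samples = {
    ("cat", "base"): (2.078, 1.922),             # microseconds
    ("filter_map_map!", "xf"): (57.801, 70.901),  # microseconds
    ("dot", "blas"): (1.360, 1.360),              # microseconds
}

verdicts = {}
for bench_id, (target, baseline) in samples.items():
    ratio = target / baseline
    verdicts[bench_id] = (round(ratio, 2), judge(ratio, 0.05))
    print(bench_id, verdicts[bench_id])
```

Under this rule a benchmark whose ratio stays within the 5% band is treated as invariant and is left out of the judge table.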

| ID | time ratio | memory ratio |
|---|---|---|
| ["cat", "base"] | 1.08 (5%) ❌ | 1.00 (1%) |
| ["filter_map_map!", "xf"] | 0.82 (5%) ✅ | 1.00 (1%) |
| ["findall", "xf-iter"] | 1.05 (5%) ❌ | 1.00 (1%) |
| ["gemm", "fusedmul", "xf", "16"] | 1.11 (5%) ❌ | 1.00 (1%) |
| ["gemm", "fusedmul", "xf", "32"] | 1.05 (5%) ❌ | 1.00 (1%) |
| ["gemm", "mul", "linalg", "32"] | 0.94 (5%) ✅ | 1.00 (1%) |
| ["gemm", "mul", "linalg", "8"] | 0.95 (5%) ✅ | 1.00 (1%) |
| ["gemm", "mul", "man", "false", "256"] | 0.90 (5%) ✅ | 1.00 (1%) |
| ["gemm", "mul", "man", "false", "32"] | 0.92 (5%) ✅ | 1.00 (1%) |
| ["gemm", "mul", "man", "false", "8"] | 0.85 (5%) ✅ | 1.00 (1%) |
| ["gemm", "mul", "man", "ivdep", "256"] | 0.94 (5%) ✅ | 1.00 (1%) |
| ["gemm", "mul", "man", "ivdep", "8"] | 0.85 (5%) ✅ | 1.00 (1%) |
| ["gemm", "mul", "man", "true", "256"] | 0.91 (5%) ✅ | 1.00 (1%) |
| ["gemm", "mul", "man", "true", "8"] | 1.21 (5%) ❌ | 1.00 (1%) |
| ["gemm", "mul", "xf", "false", "256"] | 0.90 (5%) ✅ | 1.00 (1%) |
| ["gemm", "mul", "xf", "false", "32"] | 0.91 (5%) ✅ | 1.00 (1%) |
| ["gemm", "mul", "xf", "ivdep", "8"] | 1.17 (5%) ❌ | 1.00 (1%) |
| ["gemm", "mul", "xf", "true", "256"] | 0.90 (5%) ✅ | 1.00 (1%) |
| ["gemm", "mul", "xf", "true", "32"] | 0.88 (5%) ✅ | 1.00 (1%) |
| ["gemm", "mul", "xf", "true", "8"] | 0.84 (5%) ✅ | 1.00 (1%) |
| ["missing_dot", "man"] | 0.91 (5%) ✅ | 1.00 (1%) |
| ["missing_dot", "xf"] | 1.07 (5%) ❌ | 1.01 (1%) |

Benchmark Group List

Here's a list of all the benchmark groups executed by this job:

  • ["cat"]
  • ["collect"]
  • ["dot"]
  • ["filter_map_map!"]
  • ["filter_map_reduce"]
  • ["findall"]
  • ["gemm", "fusedmul", "blas"]
  • ["gemm", "fusedmul", "xf"]
  • ["gemm", "mul", "linalg"]
  • ["gemm", "mul", "man", "false"]
  • ["gemm", "mul", "man", "ivdep"]
  • ["gemm", "mul", "man", "true"]
  • ["gemm", "mul", "xf", "false"]
  • ["gemm", "mul", "xf", "ivdep"]
  • ["gemm", "mul", "xf", "true"]
  • ["missing_argmax"]
  • ["missing_dot"]
  • ["partition_by"]

Julia versioninfo

Target

Julia Version 1.4.1
Commit 381693d3df* (2020-04-14 17:20 UTC)
Platform Info:
  OS: Linux (x86_64-pc-linux-gnu)
      Ubuntu 18.04.4 LTS
  uname: Linux 5.0.0-1035-azure #37-Ubuntu SMP Wed Mar 18 11:21:35 UTC 2020 x86_64 x86_64
  CPU: Intel(R) Xeon(R) Platinum 8171M CPU @ 2.60GHz: 
              speed         user         nice          sys         idle          irq
       #1  2095 MHz      46049 s          0 s       1285 s       7636 s          0 s
       #2  2095 MHz       6804 s          0 s       1503 s      47235 s          0 s
       
  Memory: 6.7827301025390625 GB (3199.9296875 MB free)
  Uptime: 570.0 sec
  Load Avg:  1.0615234375  1.0244140625  0.62548828125
  WORD_SIZE: 64
  LIBM: libopenlibm
  LLVM: libLLVM-8.0.1 (ORCJIT, skylake)

Baseline

Julia Version 1.4.1
Commit 381693d3df* (2020-04-14 17:20 UTC)
Platform Info:
  OS: Linux (x86_64-pc-linux-gnu)
      Ubuntu 18.04.4 LTS
  uname: Linux 5.0.0-1035-azure #37-Ubuntu SMP Wed Mar 18 11:21:35 UTC 2020 x86_64 x86_64
  CPU: Intel(R) Xeon(R) Platinum 8171M CPU @ 2.60GHz: 
              speed         user         nice          sys         idle          irq
       #1  2095 MHz      61798 s          0 s       1369 s      14718 s          0 s
       #2  2095 MHz      13995 s          0 s       1677 s      62834 s          0 s
       
  Memory: 6.7827301025390625 GB (3160.69921875 MB free)
  Uptime: 801.0 sec
  Load Avg:  1.0009765625  1.01416015625  0.72314453125
  WORD_SIZE: 64
  LIBM: libopenlibm
  LLVM: libLLVM-8.0.1 (ORCJIT, skylake)

Target result

Benchmark Report for /home/runner/work/Transducers.jl/Transducers.jl

Job Properties

  • Time of benchmark: 11 May 2020 - 5:16
  • Package commit: 22d6ce
  • Julia commit: 381693
  • Julia command flags: None
  • Environment variables: OMP_NUM_THREADS => 1 JULIA_NUM_THREADS => 1

Results

Below is a table of this job's results, obtained by running the benchmarks.
The values listed in the ID column have the structure [parent_group, child_group, ..., key], and can be used to
index into the BaseBenchmarks suite to retrieve the corresponding benchmarks.
The percentages accompanying time and memory values in the below table are noise tolerances. The "true"
time/memory value for a given benchmark is expected to fall within this percentage of the reported value.
An empty cell means that the value was zero.
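
As a rough illustration of what "indexing by ID" means, the sketch below models a benchmark suite as nested dictionaries and walks an ID list from the outermost group to the final key. The `suite` layout and the `lookup` helper are hypothetical stand-ins for the real benchmark group objects; the two sample values are times reported in this job's results table.

```python
from functools import reduce

# Hypothetical nested layout: each group maps a key to a subgroup or a result.
suite = {
    "cat": {"base": "2.078 μs", "xf": "1.810 μs"},
    "gemm": {"mul": {"linalg": {"8": "284.536 ns"}}},
}


def lookup(groups, id_path):
    """Follow [parent_group, child_group, ..., key] down the nested groups."""
    return reduce(lambda group, key: group[key], id_path, groups)


cat_base = lookup(suite, ["cat", "base"])
gemm_small = lookup(suite, ["gemm", "mul", "linalg", "8"])
print(cat_base, gemm_small)
```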

ID time GC time memory allocations
["cat", "base"] 2.078 μs (5%)
["cat", "xf"] 1.810 μs (5%)
["collect", "filter-missing"] 103.001 μs (5%) 33.03 KiB (1%) 19
["collect", "identity-float"] 75.401 μs (5%) 256.89 KiB (1%) 19
["collect", "identity-union"] 372.204 μs (5%) 285.75 KiB (1%) 6664
["dot", "blas"] 1.360 μs (5%)
["dot", "man"] 1.320 μs (5%)
["dot", "rf"] 2.389 μs (5%)
["dot", "xf"] 2.400 μs (5%)
["filter_map_map!", "man"] 61.100 μs (5%)
["filter_map_map!", "xf"] 57.801 μs (5%) 144 bytes (1%) 8
["filter_map_reduce", "man"] 255.703 μs (5%)
["filter_map_reduce", "xf"] 255.700 μs (5%)
["findall", "base"] 899.802 μs (5%) 2.00 MiB (1%) 21
["findall", "xf-array"] 693.401 μs (5%) 3.05 MiB (1%) 100014
["findall", "xf-iter"] 944.903 μs (5%) 2.00 MiB (1%) 28
["gemm", "fusedmul", "blas", "16"] 4.701 ms (5%)
["gemm", "fusedmul", "blas", "2"] 3.520 ms (5%)
["gemm", "fusedmul", "blas", "32"] 6.755 ms (5%)
["gemm", "fusedmul", "blas", "8"] 3.728 ms (5%)
["gemm", "fusedmul", "xf", "16"] 4.860 ms (5%) 160 bytes (1%) 6
["gemm", "fusedmul", "xf", "2"] 544.405 μs (5%) 160 bytes (1%) 6
["gemm", "fusedmul", "xf", "32"] 8.869 ms (5%) 160 bytes (1%) 6
["gemm", "fusedmul", "xf", "8"] 2.521 ms (5%) 160 bytes (1%) 6
["gemm", "mul", "linalg", "256"] 993.712 μs (5%)
["gemm", "mul", "linalg", "32"] 3.300 μs (5%)
["gemm", "mul", "linalg", "8"] 284.536 ns (5%)
["gemm", "mul", "man", "false", "256"] 1.707 ms (5%)
["gemm", "mul", "man", "false", "32"] 4.771 μs (5%)
["gemm", "mul", "man", "false", "8"] 422.613 ns (5%)
["gemm", "mul", "man", "ivdep", "256"] 1.748 ms (5%)
["gemm", "mul", "man", "ivdep", "32"] 4.229 μs (5%)
["gemm", "mul", "man", "ivdep", "8"] 423.121 ns (5%)
["gemm", "mul", "man", "true", "256"] 1.722 ms (5%)
["gemm", "mul", "man", "true", "32"] 4.886 μs (5%)
["gemm", "mul", "man", "true", "8"] 483.082 ns (5%)
["gemm", "mul", "xf", "false", "256"] 1.710 ms (5%) 48 bytes (1%) 2
["gemm", "mul", "xf", "false", "32"] 4.757 μs (5%) 48 bytes (1%) 2
["gemm", "mul", "xf", "false", "8"] 418.598 ns (5%) 48 bytes (1%) 2
["gemm", "mul", "xf", "ivdep", "256"] 1.771 ms (5%) 48 bytes (1%) 2
["gemm", "mul", "xf", "ivdep", "32"] 4.086 μs (5%) 48 bytes (1%) 2
["gemm", "mul", "xf", "ivdep", "8"] 467.857 ns (5%) 48 bytes (1%) 2
["gemm", "mul", "xf", "true", "256"] 1.702 ms (5%) 48 bytes (1%) 2
["gemm", "mul", "xf", "true", "32"] 5.000 μs (5%) 48 bytes (1%) 2
["gemm", "mul", "xf", "true", "8"] 419.603 ns (5%) 48 bytes (1%) 2
["missing_argmax", "man"] 2.978 μs (5%) 32 bytes (1%) 1
["missing_argmax", "rf"] 2.778 μs (5%) 32 bytes (1%) 1
["missing_argmax", "xf"] 2.789 μs (5%) 32 bytes (1%) 1
["missing_dot", "equiv"] 1.610 μs (5%) 16 bytes (1%) 1
["missing_dot", "man"] 1.370 μs (5%) 16 bytes (1%) 1
["missing_dot", "naive"] 5.250 μs (5%) 16 bytes (1%) 1
["missing_dot", "rf"] 1.520 μs (5%) 16 bytes (1%) 1
["missing_dot", "rf_nota"] 1.590 μs (5%) 16 bytes (1%) 1
["missing_dot", "xf"] 230.803 μs (5%) 72.17 KiB (1%) 3744
["missing_dot", "xf_nota"] 229.604 μs (5%) 72.02 KiB (1%) 3736
["partition_by", "man"] 2.202 ms (5%) 352 bytes (1%) 4
["partition_by", "xf"] 2.079 ms (5%) 576 bytes (1%) 7

Benchmark Group List

Here's a list of all the benchmark groups executed by this job:

  • ["cat"]
  • ["collect"]
  • ["dot"]
  • ["filter_map_map!"]
  • ["filter_map_reduce"]
  • ["findall"]
  • ["gemm", "fusedmul", "blas"]
  • ["gemm", "fusedmul", "xf"]
  • ["gemm", "mul", "linalg"]
  • ["gemm", "mul", "man", "false"]
  • ["gemm", "mul", "man", "ivdep"]
  • ["gemm", "mul", "man", "true"]
  • ["gemm", "mul", "xf", "false"]
  • ["gemm", "mul", "xf", "ivdep"]
  • ["gemm", "mul", "xf", "true"]
  • ["missing_argmax"]
  • ["missing_dot"]
  • ["partition_by"]

Julia versioninfo

Julia Version 1.4.1
Commit 381693d3df* (2020-04-14 17:20 UTC)
Platform Info:
  OS: Linux (x86_64-pc-linux-gnu)
      Ubuntu 18.04.4 LTS
  uname: Linux 5.0.0-1035-azure #37-Ubuntu SMP Wed Mar 18 11:21:35 UTC 2020 x86_64 x86_64
  CPU: Intel(R) Xeon(R) Platinum 8171M CPU @ 2.60GHz: 
              speed         user         nice          sys         idle          irq
       #1  2095 MHz      46049 s          0 s       1285 s       7636 s          0 s
       #2  2095 MHz       6804 s          0 s       1503 s      47235 s          0 s
       
  Memory: 6.7827301025390625 GB (3199.9296875 MB free)
  Uptime: 570.0 sec
  Load Avg:  1.0615234375  1.0244140625  0.62548828125
  WORD_SIZE: 64
  LIBM: libopenlibm
  LLVM: libLLVM-8.0.1 (ORCJIT, skylake)

Baseline result

Benchmark Report for /home/runner/work/Transducers.jl/Transducers.jl

Job Properties

  • Time of benchmark: 11 May 2020 - 5:20
  • Package commit: 38c606
  • Julia commit: 381693
  • Julia command flags: None
  • Environment variables: OMP_NUM_THREADS => 1 JULIA_NUM_THREADS => 1

Results

Below is a table of this job's results, obtained by running the benchmarks.
The values listed in the ID column have the structure [parent_group, child_group, ..., key], and can be used to
index into the BaseBenchmarks suite to retrieve the corresponding benchmarks.
The percentages accompanying time and memory values in the below table are noise tolerances. The "true"
time/memory value for a given benchmark is expected to fall within this percentage of the reported value.
An empty cell means that the value was zero.

ID time GC time memory allocations
["cat", "base"] 1.922 μs (5%)
["cat", "xf"] 1.820 μs (5%)
["collect", "filter-missing"] 105.600 μs (5%) 33.03 KiB (1%) 19
["collect", "identity-float"] 76.400 μs (5%) 256.89 KiB (1%) 19
["collect", "identity-union"] 372.802 μs (5%) 285.10 KiB (1%) 6687
["dot", "blas"] 1.360 μs (5%)
["dot", "man"] 1.310 μs (5%)
["dot", "rf"] 2.389 μs (5%)
["dot", "xf"] 2.400 μs (5%)
["filter_map_map!", "man"] 62.801 μs (5%)
["filter_map_map!", "xf"] 70.901 μs (5%) 144 bytes (1%) 8
["filter_map_reduce", "man"] 255.702 μs (5%)
["filter_map_reduce", "xf"] 255.801 μs (5%)
["findall", "base"] 884.205 μs (5%) 2.00 MiB (1%) 21
["findall", "xf-array"] 701.404 μs (5%) 3.05 MiB (1%) 100014
["findall", "xf-iter"] 897.605 μs (5%) 2.00 MiB (1%) 28
["gemm", "fusedmul", "blas", "16"] 4.778 ms (5%)
["gemm", "fusedmul", "blas", "2"] 3.575 ms (5%)
["gemm", "fusedmul", "blas", "32"] 6.520 ms (5%)
["gemm", "fusedmul", "blas", "8"] 3.811 ms (5%)
["gemm", "fusedmul", "xf", "16"] 4.396 ms (5%) 160 bytes (1%) 6
["gemm", "fusedmul", "xf", "2"] 540.003 μs (5%) 160 bytes (1%) 6
["gemm", "fusedmul", "xf", "32"] 8.446 ms (5%) 160 bytes (1%) 6
["gemm", "fusedmul", "xf", "8"] 2.489 ms (5%) 160 bytes (1%) 6
["gemm", "mul", "linalg", "256"] 993.305 μs (5%)
["gemm", "mul", "linalg", "32"] 3.500 μs (5%)
["gemm", "mul", "linalg", "8"] 300.000 ns (5%)
["gemm", "mul", "man", "false", "256"] 1.906 ms (5%)
["gemm", "mul", "man", "false", "32"] 5.200 μs (5%)
["gemm", "mul", "man", "false", "8"] 500.000 ns (5%)
["gemm", "mul", "man", "ivdep", "256"] 1.866 ms (5%)
["gemm", "mul", "man", "ivdep", "32"] 4.300 μs (5%)
["gemm", "mul", "man", "ivdep", "8"] 500.000 ns (5%)
["gemm", "mul", "man", "true", "256"] 1.889 ms (5%)
["gemm", "mul", "man", "true", "32"] 5.100 μs (5%)
["gemm", "mul", "man", "true", "8"] 400.000 ns (5%)
["gemm", "mul", "xf", "false", "256"] 1.904 ms (5%) 48 bytes (1%) 2
["gemm", "mul", "xf", "false", "32"] 5.200 μs (5%) 48 bytes (1%) 2
["gemm", "mul", "xf", "false", "8"] 400.000 ns (5%) 48 bytes (1%) 2
["gemm", "mul", "xf", "ivdep", "256"] 1.856 ms (5%) 48 bytes (1%) 2
["gemm", "mul", "xf", "ivdep", "32"] 4.200 μs (5%) 48 bytes (1%) 2
["gemm", "mul", "xf", "ivdep", "8"] 400.000 ns (5%) 48 bytes (1%) 2
["gemm", "mul", "xf", "true", "256"] 1.899 ms (5%) 48 bytes (1%) 2
["gemm", "mul", "xf", "true", "32"] 5.700 μs (5%) 48 bytes (1%) 2
["gemm", "mul", "xf", "true", "8"] 500.000 ns (5%) 48 bytes (1%) 2
["missing_argmax", "man"] 3.000 μs (5%) 32 bytes (1%) 1
["missing_argmax", "rf"] 2.733 μs (5%) 32 bytes (1%) 1
["missing_argmax", "xf"] 2.778 μs (5%) 32 bytes (1%) 1
["missing_dot", "equiv"] 1.620 μs (5%) 16 bytes (1%) 1
["missing_dot", "man"] 1.500 μs (5%) 16 bytes (1%) 1
["missing_dot", "naive"] 5.267 μs (5%) 16 bytes (1%) 1
["missing_dot", "rf"] 1.510 μs (5%) 16 bytes (1%) 1
["missing_dot", "rf_nota"] 1.630 μs (5%) 16 bytes (1%) 1
["missing_dot", "xf"] 216.601 μs (5%) 71.81 KiB (1%) 3726
["missing_dot", "xf_nota"] 219.801 μs (5%) 71.83 KiB (1%) 3728
["partition_by", "man"] 2.208 ms (5%) 352 bytes (1%) 4
["partition_by", "xf"] 2.089 ms (5%) 576 bytes (1%) 7

Benchmark Group List

Here's a list of all the benchmark groups executed by this job:

  • ["cat"]
  • ["collect"]
  • ["dot"]
  • ["filter_map_map!"]
  • ["filter_map_reduce"]
  • ["findall"]
  • ["gemm", "fusedmul", "blas"]
  • ["gemm", "fusedmul", "xf"]
  • ["gemm", "mul", "linalg"]
  • ["gemm", "mul", "man", "false"]
  • ["gemm", "mul", "man", "ivdep"]
  • ["gemm", "mul", "man", "true"]
  • ["gemm", "mul", "xf", "false"]
  • ["gemm", "mul", "xf", "ivdep"]
  • ["gemm", "mul", "xf", "true"]
  • ["missing_argmax"]
  • ["missing_dot"]
  • ["partition_by"]

Julia versioninfo

Julia Version 1.4.1
Commit 381693d3df* (2020-04-14 17:20 UTC)
Platform Info:
  OS: Linux (x86_64-pc-linux-gnu)
      Ubuntu 18.04.4 LTS
  uname: Linux 5.0.0-1035-azure #37-Ubuntu SMP Wed Mar 18 11:21:35 UTC 2020 x86_64 x86_64
  CPU: Intel(R) Xeon(R) Platinum 8171M CPU @ 2.60GHz: 
              speed         user         nice          sys         idle          irq
       #1  2095 MHz      61798 s          0 s       1369 s      14718 s          0 s
       #2  2095 MHz      13995 s          0 s       1677 s      62834 s          0 s
       
  Memory: 6.7827301025390625 GB (3160.69921875 MB free)
  Uptime: 801.0 sec
  Load Avg:  1.0009765625  1.01416015625  0.72314453125
  WORD_SIZE: 64
  LIBM: libopenlibm
  LLVM: libLLVM-8.0.1 (ORCJIT, skylake)

Runtime information

Runtime Info
BLAS #threads 2
BLAS.vendor() openblas64
Sys.CPU_THREADS 2

lscpu output:

Architecture:        x86_64
CPU op-mode(s):      32-bit, 64-bit
Byte Order:          Little Endian
CPU(s):              2
On-line CPU(s) list: 0,1
Thread(s) per core:  1
Core(s) per socket:  2
Socket(s):           1
NUMA node(s):        1
Vendor ID:           GenuineIntel
CPU family:          6
Model:               85
Model name:          Intel(R) Xeon(R) Platinum 8171M CPU @ 2.60GHz
Stepping:            4
CPU MHz:             2095.077
BogoMIPS:            4190.15
Hypervisor vendor:   Microsoft
Virtualization type: full
L1d cache:           32K
L1i cache:           32K
L2 cache:            1024K
L3 cache:            36608K
NUMA node0 CPU(s):   0,1
Flags:               fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc rep_good nopl xtopology cpuid pni pclmulqdq ssse3 fma cx16 pcid sse4_1 sse4_2 movbe popcnt aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase bmi1 hle avx2 smep bmi2 erms invpcid rtm mpx avx512f avx512dq rdseed adx smap clflushopt avx512cd avx512bw avx512vl xsaveopt xsavec xsaves
Cpu Property Value
Brand Intel(R) Xeon(R) Platinum 8171M CPU @ 2.60GHz
Vendor :Intel
Architecture :Skylake
Model Family: 0x06, Model: 0x55, Stepping: 0x04, Type: 0x00
Cores 2 physical cores, 2 logical cores (on executing CPU)
No Hyperthreading detected
Clock Frequencies Not supported by CPU
Data Cache Level 1:3 : (32, 1024, 36608) kbytes
64 byte cache line size
Address Size 48 bits virtual, 44 bits physical
SIMD 512 bit = 64 byte max. SIMD vector size
Time Stamp Counter TSC is accessible via rdtsc
TSC increased at every clock cycle (non-invariant TSC)
Perf. Monitoring Performance Monitoring Counters (PMC) are not supported
Hypervisor Yes, Microsoft

@github-actions (Contributor)

Multi-thread benchmark result

Judge result

Benchmark Report for /home/runner/work/Transducers.jl/Transducers.jl

Job Properties

  • Time of benchmarks:
    • Target: 11 May 2020 - 05:18
    • Baseline: 11 May 2020 - 05:23
  • Package commits:
    • Target: 22d6ce
    • Baseline: 38c606
  • Julia commits:
    • Target: 381693
    • Baseline: 381693
  • Julia command flags:
    • Target: None
    • Baseline: None
  • Environment variables:
    • Target: JULIA_NUM_THREADS => 2
    • Baseline: JULIA_NUM_THREADS => 2

Results

A ratio greater than 1.0 denotes a possible regression (marked with ❌), while a ratio less
than 1.0 denotes a possible improvement (marked with ✅). Only significant results - results
that indicate possible regressions or improvements - are shown below (thus, an empty table means that all
benchmark results remained invariant between builds).

| ID | time ratio | memory ratio |
|---|---|---|
| ["collect", "unordered", "basesize=1024"] | 0.97 (5%) | 0.98 (1%) ✅ |
| ["parallel_histogram", "comm", "basesize=4096"] | 1.08 (5%) ❌ | 1.01 (1%) |
| ["parallel_histogram", "comm", "basesize=8192"] | 1.08 (5%) ❌ | 1.00 (1%) |
| ["words", "nthreads=2"] | 0.86 (5%) ✅ | 0.99 (1%) |
| ["words", "nthreads=4"] | 0.84 (5%) ✅ | 1.00 (1%) |

Benchmark Group List

Here's a list of all the benchmark groups executed by this job:

  • ["collect", "assoc"]
  • ["collect"]
  • ["collect", "unordered"]
  • ["findfirst", "n=1000"]
  • ["findfirst", "n=1000", "reduce"]
  • ["findfirst", "n=400"]
  • ["findfirst", "n=400", "reduce"]
  • ["findfirst", "n=500"]
  • ["findfirst", "n=500", "reduce"]
  • ["overhead"]
  • ["parallel_histogram", "assoc"]
  • ["parallel_histogram", "comm"]
  • ["parallel_histogram"]
  • ["sum", "random"]
  • ["sum", "random", "reduce"]
  • ["sum", "uniform"]
  • ["sum", "uniform", "reduce"]
  • ["sum", "valley"]
  • ["sum", "valley", "reduce"]
  • ["words"]

Julia versioninfo

Target

Julia Version 1.4.1
Commit 381693d3df* (2020-04-14 17:20 UTC)
Platform Info:
  OS: Linux (x86_64-pc-linux-gnu)
      Ubuntu 18.04.4 LTS
  uname: Linux 5.0.0-1035-azure #37-Ubuntu SMP Wed Mar 18 11:21:35 UTC 2020 x86_64 x86_64
  CPU: Intel(R) Xeon(R) CPU E5-2673 v3 @ 2.40GHz: 
              speed         user         nice          sys         idle          irq
       #1  2394 MHz      47415 s          0 s       2038 s      17312 s          0 s
       #2  2394 MHz      51440 s          0 s       2119 s      13038 s          0 s
       
  Memory: 6.782737731933594 GB (3235.6328125 MB free)
  Uptime: 683.0 sec
  Load Avg:  1.787109375  1.54052734375  0.9013671875
  WORD_SIZE: 64
  LIBM: libopenlibm
  LLVM: libLLVM-8.0.1 (ORCJIT, haswell)

Baseline

Julia Version 1.4.1
Commit 381693d3df* (2020-04-14 17:20 UTC)
Platform Info:
  OS: Linux (x86_64-pc-linux-gnu)
      Ubuntu 18.04.4 LTS
  uname: Linux 5.0.0-1035-azure #37-Ubuntu SMP Wed Mar 18 11:21:35 UTC 2020 x86_64 x86_64
  CPU: Intel(R) Xeon(R) CPU E5-2673 v3 @ 2.40GHz: 
              speed         user         nice          sys         idle          irq
       #1  2394 MHz      71829 s          0 s       2594 s      22083 s          0 s
       #2  2394 MHz      72611 s          0 s       2691 s      21003 s          0 s
       
  Memory: 6.782737731933594 GB (3203.4765625 MB free)
  Uptime: 981.0 sec
  Load Avg:  1.72216796875  1.60205078125  1.103515625
  WORD_SIZE: 64
  LIBM: libopenlibm
  LLVM: libLLVM-8.0.1 (ORCJIT, haswell)

Target result

Benchmark Report for /home/runner/work/Transducers.jl/Transducers.jl

Job Properties

  • Time of benchmark: 11 May 2020 - 5:18
  • Package commit: 22d6ce
  • Julia commit: 381693
  • Julia command flags: None
  • Environment variables: JULIA_NUM_THREADS => 2

Results

Below is a table of this job's results, obtained by running the benchmarks.
The values listed in the ID column have the structure [parent_group, child_group, ..., key], and can be used to
index into the BaseBenchmarks suite to retrieve the corresponding benchmarks.
The percentages accompanying time and memory values in the below table are noise tolerances. The "true"
time/memory value for a given benchmark is expected to fall within this percentage of the reported value.
An empty cell means that the value was zero.

ID time GC time memory allocations
["collect", "assoc", "basesize=1"] 386.063 ms (5%) 12.260 ms 87.05 MiB (1%) 1558035
["collect", "assoc", "basesize=1024"] 245.913 ms (5%) 1.84 MiB (1%) 1780
["collect", "assoc", "basesize=32"] 251.457 ms (5%) 5.63 MiB (1%) 52993
["collect", "seq"] 489.638 ms (5%) 512.98 KiB (1%) 22
["collect", "unordered", "basesize=1"] 486.272 ms (5%) 29.16 MiB (1%) 403087
["collect", "unordered", "basesize=1024"] 282.225 ms (5%) 813.72 KiB (1%) 5170
["collect", "unordered", "basesize=32"] 274.646 ms (5%) 1.50 MiB (1%) 18494
["findfirst", "n=1000", "foldl"] 713.679 ms (5%)
["findfirst", "n=1000", "reduce", "basesize=128"] 360.412 ms (5%) 563.98 KiB (1%) 10226
["findfirst", "n=1000", "reduce", "basesize=256"] 360.026 ms (5%) 287.03 KiB (1%) 5212
["findfirst", "n=1000", "reduce", "basesize=512"] 365.179 ms (5%) 149.13 KiB (1%) 2713
["findfirst", "n=400", "foldl"] 534.853 ms (5%)
["findfirst", "n=400", "reduce", "basesize=128"] 271.391 ms (5%) 1.02 MiB (1%) 18965
["findfirst", "n=400", "reduce", "basesize=256"] 270.483 ms (5%) 525.98 KiB (1%) 9563
["findfirst", "n=400", "reduce", "basesize=512"] 271.309 ms (5%) 267.02 KiB (1%) 4866
["findfirst", "n=500", "foldl"] 92.454 ms (5%)
["findfirst", "n=500", "reduce", "basesize=128"] 47.440 ms (5%) 157.20 KiB (1%) 2842
["findfirst", "n=500", "reduce", "basesize=256"] 47.160 ms (5%) 84.33 KiB (1%) 1523
["findfirst", "n=500", "reduce", "basesize=512"] 50.186 ms (5%) 48.08 KiB (1%) 868
["overhead", "default"] 202.401 μs (5%) 146.14 KiB (1%) 2628
["overhead", "stoppable=false"] 204.902 μs (5%) 146.16 KiB (1%) 2629
["overhead", "stoppable=true"] 361.902 μs (5%) 146.42 KiB (1%) 2646
["parallel_histogram", "assoc", "basesize=16384"] 5.265 ms (5%) 732.06 KiB (1%) 103
["parallel_histogram", "assoc", "basesize=4096"] 6.012 ms (5%) 1.80 MiB (1%) 497
["parallel_histogram", "assoc", "basesize=8192"] 5.643 ms (5%) 1.43 MiB (1%) 242
["parallel_histogram", "comm", "basesize=16384"] 12.209 ms (5%) 1.22 MiB (1%) 156
["parallel_histogram", "comm", "basesize=4096"] 18.814 ms (5%) 1.04 MiB (1%) 514
["parallel_histogram", "comm", "basesize=8192"] 14.937 ms (5%) 1.22 MiB (1%) 423
["parallel_histogram", "seq"] 9.644 ms (5%) 364.63 KiB (1%) 25
["sum", "random", "foldl"] 18.471 ms (5%)
["sum", "random", "reduce", "basesize=128"] 9.713 ms (5%) 313.34 KiB (1%) 6068
["sum", "random", "reduce", "basesize=256"] 9.520 ms (5%) 155.08 KiB (1%) 3010
["sum", "random", "reduce", "basesize=512"] 9.399 ms (5%) 76.25 KiB (1%) 1484
["sum", "uniform", "foldl"] 18.066 ms (5%)
["sum", "uniform", "reduce", "basesize=128"] 9.521 ms (5%) 313.39 KiB (1%) 6071
["sum", "uniform", "reduce", "basesize=256"] 9.328 ms (5%) 155.09 KiB (1%) 3011
["sum", "uniform", "reduce", "basesize=512"] 9.186 ms (5%) 76.27 KiB (1%) 1485
["sum", "valley", "foldl"] 18.538 ms (5%)
["sum", "valley", "reduce", "basesize=128"] 9.735 ms (5%) 313.28 KiB (1%) 6064
["sum", "valley", "reduce", "basesize=256"] 9.579 ms (5%) 155.08 KiB (1%) 3010
["sum", "valley", "reduce", "basesize=512"] 9.425 ms (5%) 76.25 KiB (1%) 1484
["words", "nthreads=1"] 43.439 ms (5%) 8.143 ms 64.65 MiB (1%) 2091783
["words", "nthreads=2"] 21.442 ms (5%) 65.01 MiB (1%) 2091859
["words", "nthreads=4"] 22.134 ms (5%) 65.72 MiB (1%) 2092013

Benchmark Group List

Here's a list of all the benchmark groups executed by this job:

  • ["collect", "assoc"]
  • ["collect"]
  • ["collect", "unordered"]
  • ["findfirst", "n=1000"]
  • ["findfirst", "n=1000", "reduce"]
  • ["findfirst", "n=400"]
  • ["findfirst", "n=400", "reduce"]
  • ["findfirst", "n=500"]
  • ["findfirst", "n=500", "reduce"]
  • ["overhead"]
  • ["parallel_histogram", "assoc"]
  • ["parallel_histogram", "comm"]
  • ["parallel_histogram"]
  • ["sum", "random"]
  • ["sum", "random", "reduce"]
  • ["sum", "uniform"]
  • ["sum", "uniform", "reduce"]
  • ["sum", "valley"]
  • ["sum", "valley", "reduce"]
  • ["words"]

Julia versioninfo

Julia Version 1.4.1
Commit 381693d3df* (2020-04-14 17:20 UTC)
Platform Info:
  OS: Linux (x86_64-pc-linux-gnu)
      Ubuntu 18.04.4 LTS
  uname: Linux 5.0.0-1035-azure #37-Ubuntu SMP Wed Mar 18 11:21:35 UTC 2020 x86_64 x86_64
  CPU: Intel(R) Xeon(R) CPU E5-2673 v3 @ 2.40GHz: 
              speed         user         nice          sys         idle          irq
       #1  2394 MHz      47415 s          0 s       2038 s      17312 s          0 s
       #2  2394 MHz      51440 s          0 s       2119 s      13038 s          0 s
       
  Memory: 6.782737731933594 GB (3235.6328125 MB free)
  Uptime: 683.0 sec
  Load Avg:  1.787109375  1.54052734375  0.9013671875
  WORD_SIZE: 64
  LIBM: libopenlibm
  LLVM: libLLVM-8.0.1 (ORCJIT, haswell)

Baseline result

Benchmark Report for /home/runner/work/Transducers.jl/Transducers.jl

Job Properties

  • Time of benchmark: 11 May 2020 - 5:23
  • Package commit: 38c606
  • Julia commit: 381693
  • Julia command flags: None
  • Environment variables: JULIA_NUM_THREADS => 2

Results

Below is a table of this job's results, obtained by running the benchmarks.
The values listed in the ID column have the structure [parent_group, child_group, ..., key], and can be used to
index into the BaseBenchmarks suite to retrieve the corresponding benchmarks.
The percentages accompanying time and memory values in the below table are noise tolerances. The "true"
time/memory value for a given benchmark is expected to fall within this percentage of the reported value.
An empty cell means that the value was zero.

ID time GC time memory allocations
["collect", "assoc", "basesize=1"] 386.435 ms (5%) 10.424 ms 87.05 MiB (1%) 1557954
["collect", "assoc", "basesize=1024"] 245.901 ms (5%) 1.84 MiB (1%) 1780
["collect", "assoc", "basesize=32"] 250.410 ms (5%) 5.63 MiB (1%) 52991
["collect", "seq"] 490.028 ms (5%) 512.98 KiB (1%) 22
["collect", "unordered", "basesize=1"] 493.846 ms (5%) 29.16 MiB (1%) 403183
["collect", "unordered", "basesize=1024"] 290.865 ms (5%) 830.80 KiB (1%) 6218
["collect", "unordered", "basesize=32"] 273.187 ms (5%) 1.50 MiB (1%) 18500
["findfirst", "n=1000", "foldl"] 718.223 ms (5%)
["findfirst", "n=1000", "reduce", "basesize=128"] 364.299 ms (5%) 563.86 KiB (1%) 10218
["findfirst", "n=1000", "reduce", "basesize=256"] 362.455 ms (5%) 287.08 KiB (1%) 5215
["findfirst", "n=1000", "reduce", "basesize=512"] 365.160 ms (5%) 149.22 KiB (1%) 2719
["findfirst", "n=400", "foldl"] 534.079 ms (5%)
["findfirst", "n=400", "reduce", "basesize=128"] 273.490 ms (5%) 1.02 MiB (1%) 18952
["findfirst", "n=400", "reduce", "basesize=256"] 269.672 ms (5%) 526.02 KiB (1%) 9565
["findfirst", "n=400", "reduce", "basesize=512"] 272.264 ms (5%) 267.11 KiB (1%) 4872
["findfirst", "n=500", "foldl"] 92.283 ms (5%)
["findfirst", "n=500", "reduce", "basesize=128"] 47.385 ms (5%) 157.22 KiB (1%) 2843
["findfirst", "n=500", "reduce", "basesize=256"] 47.162 ms (5%) 84.36 KiB (1%) 1525
["findfirst", "n=500", "reduce", "basesize=512"] 50.158 ms (5%) 48.09 KiB (1%) 869
["overhead", "default"] 199.501 μs (5%) 146.16 KiB (1%) 2629
["overhead", "stoppable=false"] 203.501 μs (5%) 146.13 KiB (1%) 2627
["overhead", "stoppable=true"] 363.401 μs (5%) 146.42 KiB (1%) 2646
["parallel_histogram", "assoc", "basesize=16384"] 5.253 ms (5%) 732.06 KiB (1%) 103
["parallel_histogram", "assoc", "basesize=4096"] 6.023 ms (5%) 1.80 MiB (1%) 497
["parallel_histogram", "assoc", "basesize=8192"] 5.637 ms (5%) 1.43 MiB (1%) 243
["parallel_histogram", "comm", "basesize=16384"] 12.598 ms (5%) 1.22 MiB (1%) 156
["parallel_histogram", "comm", "basesize=4096"] 17.340 ms (5%) 1.04 MiB (1%) 1407
["parallel_histogram", "comm", "basesize=8192"] 13.781 ms (5%) 1.23 MiB (1%) 458
["parallel_histogram", "seq"] 9.654 ms (5%) 364.63 KiB (1%) 25
["sum", "random", "foldl"] 18.418 ms (5%)
["sum", "random", "reduce", "basesize=128"] 9.671 ms (5%) 313.38 KiB (1%) 6070
["sum", "random", "reduce", "basesize=256"] 9.453 ms (5%) 155.09 KiB (1%) 3011
["sum", "random", "reduce", "basesize=512"] 9.382 ms (5%) 76.25 KiB (1%) 1484
["sum", "uniform", "foldl"] 18.054 ms (5%)
["sum", "uniform", "reduce", "basesize=128"] 9.503 ms (5%) 313.34 KiB (1%) 6068
["sum", "uniform", "reduce", "basesize=256"] 9.316 ms (5%) 155.11 KiB (1%) 3012
["sum", "uniform", "reduce", "basesize=512"] 9.180 ms (5%) 76.25 KiB (1%) 1484
["sum", "valley", "foldl"] 18.539 ms (5%)
["sum", "valley", "reduce", "basesize=128"] 9.729 ms (5%) 313.27 KiB (1%) 6063
["sum", "valley", "reduce", "basesize=256"] 9.562 ms (5%) 155.08 KiB (1%) 3010
["sum", "valley", "reduce", "basesize=512"] 9.421 ms (5%) 76.27 KiB (1%) 1485
["words", "nthreads=1"] 43.475 ms (5%) 7.794 ms 64.62 MiB (1%) 2090970
["words", "nthreads=2"] 25.008 ms (5%) 65.34 MiB (1%) 2091124
["words", "nthreads=4"] 26.270 ms (5%) 65.97 MiB (1%) 2091415

Benchmark Group List

Here's a list of all the benchmark groups executed by this job:

  • ["collect", "assoc"]
  • ["collect"]
  • ["collect", "unordered"]
  • ["findfirst", "n=1000"]
  • ["findfirst", "n=1000", "reduce"]
  • ["findfirst", "n=400"]
  • ["findfirst", "n=400", "reduce"]
  • ["findfirst", "n=500"]
  • ["findfirst", "n=500", "reduce"]
  • ["overhead"]
  • ["parallel_histogram", "assoc"]
  • ["parallel_histogram", "comm"]
  • ["parallel_histogram"]
  • ["sum", "random"]
  • ["sum", "random", "reduce"]
  • ["sum", "uniform"]
  • ["sum", "uniform", "reduce"]
  • ["sum", "valley"]
  • ["sum", "valley", "reduce"]
  • ["words"]

Julia versioninfo

Julia Version 1.4.1
Commit 381693d3df* (2020-04-14 17:20 UTC)
Platform Info:
  OS: Linux (x86_64-pc-linux-gnu)
      Ubuntu 18.04.4 LTS
  uname: Linux 5.0.0-1035-azure #37-Ubuntu SMP Wed Mar 18 11:21:35 UTC 2020 x86_64 x86_64
  CPU: Intel(R) Xeon(R) CPU E5-2673 v3 @ 2.40GHz: 
              speed         user         nice          sys         idle          irq
       #1  2394 MHz      71829 s          0 s       2594 s      22083 s          0 s
       #2  2394 MHz      72611 s          0 s       2691 s      21003 s          0 s
       
  Memory: 6.782737731933594 GB (3203.4765625 MB free)
  Uptime: 981.0 sec
  Load Avg:  1.72216796875  1.60205078125  1.103515625
  WORD_SIZE: 64
  LIBM: libopenlibm
  LLVM: libLLVM-8.0.1 (ORCJIT, haswell)

Runtime information

Runtime Info
BLAS #threads 2
BLAS.vendor() openblas64
Sys.CPU_THREADS 2

lscpu output:

Architecture:        x86_64
CPU op-mode(s):      32-bit, 64-bit
Byte Order:          Little Endian
CPU(s):              2
On-line CPU(s) list: 0,1
Thread(s) per core:  1
Core(s) per socket:  2
Socket(s):           1
NUMA node(s):        1
Vendor ID:           GenuineIntel
CPU family:          6
Model:               63
Model name:          Intel(R) Xeon(R) CPU E5-2673 v3 @ 2.40GHz
Stepping:            2
CPU MHz:             2394.454
BogoMIPS:            4788.90
Hypervisor vendor:   Microsoft
Virtualization type: full
L1d cache:           32K
L1i cache:           32K
L2 cache:            256K
L3 cache:            30720K
NUMA node0 CPU(s):   0,1
Flags:               fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc rep_good nopl xtopology cpuid pni pclmulqdq ssse3 fma cx16 pcid sse4_1 sse4_2 movbe popcnt aes xsave avx f16c rdrand hypervisor lahf_lm abm invpcid_single pti fsgsbase bmi1 avx2 smep bmi2 erms invpcid xsaveopt md_clear
Cpu Property Value
Brand Intel(R) Xeon(R) CPU E5-2673 v3 @ 2.40GHz
Vendor :Intel
Architecture :Haswell
Model Family: 0x06, Model: 0x3f, Stepping: 0x02, Type: 0x00
Cores 2 physical cores, 2 logical cores (on executing CPU)
No Hyperthreading detected
Clock Frequencies Not supported by CPU
Data Cache Level 1:3 : (32, 256, 30720) kbytes
64 byte cache line size
Address Size 48 bits virtual, 44 bits physical
SIMD 256 bit = 32 byte max. SIMD vector size
Time Stamp Counter TSC is accessible via rdtsc
TSC increased at every clock cycle (non-invariant TSC)
Perf. Monitoring Performance Monitoring Counters (PMC) are not supported
Hypervisor Yes, Microsoft

@github-actions
Contributor

Benchmark result

Judge result

Benchmark Report for /home/runner/work/Transducers.jl/Transducers.jl

Job Properties

  • Time of benchmarks:
    • Target: 11 May 2020 - 06:10
    • Baseline: 11 May 2020 - 06:13
  • Package commits:
    • Target: 035dcb
    • Baseline: 62ad9f
  • Julia commits:
    • Target: 381693
    • Baseline: 381693
  • Julia command flags:
    • Target: None
    • Baseline: None
  • Environment variables:
    • Target: OMP_NUM_THREADS => 1 JULIA_NUM_THREADS => 1
    • Baseline: OMP_NUM_THREADS => 1 JULIA_NUM_THREADS => 1

Results

A ratio greater than 1.0 denotes a possible regression (marked with ❌), while a ratio less
than 1.0 denotes a possible improvement (marked with ✅). Only significant results - results
that indicate possible regressions or improvements - are shown below (thus, an empty table means that all
benchmark results remained invariant between builds).

| ID | time ratio | memory ratio |
|---|---|---|
| ["cat", "base"] | 1.08 (5%) ❌ | 1.00 (1%) |
| ["filter_map_map!", "xf"] | 0.92 (5%) ✅ | 1.00 (1%) |
| ["gemm", "fusedmul", "xf", "32"] | 1.18 (5%) ❌ | 1.00 (1%) |
| ["gemm", "fusedmul", "xf", "8"] | 0.94 (5%) ✅ | 1.00 (1%) |
| ["gemm", "mul", "man", "true", "8"] | 1.09 (5%) ❌ | 1.00 (1%) |
| ["gemm", "mul", "xf", "false", "8"] | 0.94 (5%) ✅ | 1.00 (1%) |
| ["gemm", "mul", "xf", "ivdep", "8"] | 1.32 (5%) ❌ | 1.00 (1%) |
| ["gemm", "mul", "xf", "true", "8"] | 0.95 (5%) ✅ | 1.00 (1%) |
| ["missing_dot", "xf"] | 1.11 (5%) ❌ | 1.00 (1%) |
| ["missing_dot", "xf_nota"] | 1.08 (5%) ❌ | 1.00 (1%) |
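
The ratios in the table above can be reproduced by hand from the target and baseline times listed in this report. The sketch below is an illustration in Python, not the code the benchmark bot runs: the function name `judge` and the exact comparison rule (a symmetric band of `1.0 ± tolerance` around the target/baseline ratio) are assumptions that happen to agree with the numbers shown here.

```python
def judge(target, baseline, tolerance):
    """Classify target/baseline against a symmetric noise tolerance.

    Assumed rule (not taken from the report itself): a ratio above
    1 + tolerance is a possible regression, a ratio below 1 - tolerance
    is a possible improvement, anything in between is invariant.
    """
    ratio = target / baseline
    if ratio > 1.0 + tolerance:
        verdict = "regression"
    elif ratio < 1.0 - tolerance:
        verdict = "improvement"
    else:
        verdict = "invariant"
    return round(ratio, 2), verdict


# (target, baseline) times copied from the result tables of this report.
times = {
    ("gemm", "fusedmul", "xf", "32"): (10.982, 9.318),  # ms
    ("filter_map_map!", "xf"): (59.405, 64.904),        # μs
    ("dot", "blas"): (1.510, 1.480),                    # μs
}

for key, (target, baseline) in times.items():
    print(list(key), judge(target, baseline, tolerance=0.05))
```

The first two entries land outside the 5% band and so appear in the judge table with ratios 1.18 and 0.92; `["dot", "blas"]` stays inside the band, which would explain why it is not listed.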

Benchmark Group List

Here's a list of all the benchmark groups executed by this job:

  • ["cat"]
  • ["collect"]
  • ["dot"]
  • ["filter_map_map!"]
  • ["filter_map_reduce"]
  • ["findall"]
  • ["gemm", "fusedmul", "blas"]
  • ["gemm", "fusedmul", "xf"]
  • ["gemm", "mul", "linalg"]
  • ["gemm", "mul", "man", "false"]
  • ["gemm", "mul", "man", "ivdep"]
  • ["gemm", "mul", "man", "true"]
  • ["gemm", "mul", "xf", "false"]
  • ["gemm", "mul", "xf", "ivdep"]
  • ["gemm", "mul", "xf", "true"]
  • ["missing_argmax"]
  • ["missing_dot"]
  • ["partition_by"]

Julia versioninfo

Target

Julia Version 1.4.1
Commit 381693d3df* (2020-04-14 17:20 UTC)
Platform Info:
  OS: Linux (x86_64-pc-linux-gnu)
      Ubuntu 18.04.4 LTS
  uname: Linux 5.0.0-1035-azure #37-Ubuntu SMP Wed Mar 18 11:21:35 UTC 2020 x86_64 x86_64
  CPU: Intel(R) Xeon(R) Platinum 8171M CPU @ 2.60GHz: 
              speed         user         nice          sys         idle          irq
       #1  2095 MHz      44482 s          0 s       1613 s      60265 s          0 s
       #2  2095 MHz      10588 s          0 s       1502 s      94430 s          0 s
       
  Memory: 6.782737731933594 GB (3177.40234375 MB free)
  Uptime: 1118.0 sec
  Load Avg:  1.0927734375  0.943359375  0.59375
  WORD_SIZE: 64
  LIBM: libopenlibm
  LLVM: libLLVM-8.0.1 (ORCJIT, skylake)

Baseline

Julia Version 1.4.1
Commit 381693d3df* (2020-04-14 17:20 UTC)
Platform Info:
  OS: Linux (x86_64-pc-linux-gnu)
      Ubuntu 18.04.4 LTS
  uname: Linux 5.0.0-1035-azure #37-Ubuntu SMP Wed Mar 18 11:21:35 UTC 2020 x86_64 x86_64
  CPU: Intel(R) Xeon(R) Platinum 8171M CPU @ 2.60GHz: 
              speed         user         nice          sys         idle          irq
       #1  2095 MHz      66199 s          0 s       1860 s      61637 s          0 s
       #2  2095 MHz      12141 s          0 s       1644 s     116045 s          0 s
       
  Memory: 6.782737731933594 GB (3047.70703125 MB free)
  Uptime: 1352.0 sec
  Load Avg:  1.09716796875  1.01611328125  0.71044921875
  WORD_SIZE: 64
  LIBM: libopenlibm
  LLVM: libLLVM-8.0.1 (ORCJIT, skylake)

Target result

Benchmark Report for /home/runner/work/Transducers.jl/Transducers.jl

Job Properties

  • Time of benchmark: 11 May 2020 - 06:10
  • Package commit: 035dcb
  • Julia commit: 381693
  • Julia command flags: None
  • Environment variables: OMP_NUM_THREADS => 1 JULIA_NUM_THREADS => 1

Results

Below is a table of this job's results, obtained by running the benchmarks.
The values listed in the ID column have the structure [parent_group, child_group, ..., key], and can be used to
index into the BaseBenchmarks suite to retrieve the corresponding benchmarks.
The percentages accompanying time and memory values in the below table are noise tolerances. The "true"
time/memory value for a given benchmark is expected to fall within this percentage of the reported value.
An empty cell means that the value was zero.

| ID | time | GC time | memory | allocations |
|---|---|---|---|---|
| ["cat", "base"] | 2.345 μs (5%) | | | |
| ["cat", "xf"] | 2.023 μs (5%) | | | |
| ["collect", "filter-missing"] | 114.814 μs (5%) | | 33.03 KiB (1%) | 19 |
| ["collect", "identity-float"] | 85.710 μs (5%) | | 256.89 KiB (1%) | 19 |
| ["collect", "identity-union"] | 431.751 μs (5%) | | 285.75 KiB (1%) | 6671 |
| ["dot", "blas"] | 1.510 μs (5%) | | | |
| ["dot", "man"] | 1.480 μs (5%) | | | |
| ["dot", "rf"] | 2.678 μs (5%) | | | |
| ["dot", "xf"] | 2.689 μs (5%) | | | |
| ["filter_map_map!", "man"] | 57.305 μs (5%) | | | |
| ["filter_map_map!", "xf"] | 59.405 μs (5%) | | 144 bytes (1%) | 8 |
| ["filter_map_reduce", "man"] | 287.636 μs (5%) | | | |
| ["filter_map_reduce", "xf"] | 287.637 μs (5%) | | | |
| ["findall", "base"] | 1.004 ms (5%) | | 2.00 MiB (1%) | 21 |
| ["findall", "xf-array"] | 782.429 μs (5%) | | 3.05 MiB (1%) | 100014 |
| ["findall", "xf-iter"] | 1.014 ms (5%) | | 2.00 MiB (1%) | 28 |
| ["gemm", "fusedmul", "blas", "16"] | 5.209 ms (5%) | | | |
| ["gemm", "fusedmul", "blas", "2"] | 3.876 ms (5%) | | | |
| ["gemm", "fusedmul", "blas", "32"] | 7.163 ms (5%) | | | |
| ["gemm", "fusedmul", "blas", "8"] | 4.081 ms (5%) | | | |
| ["gemm", "fusedmul", "xf", "16"] | 5.137 ms (5%) | | 160 bytes (1%) | 6 |
| ["gemm", "fusedmul", "xf", "2"] | 622.759 μs (5%) | | 160 bytes (1%) | 6 |
| ["gemm", "fusedmul", "xf", "32"] | 10.982 ms (5%) | | 160 bytes (1%) | 6 |
| ["gemm", "fusedmul", "xf", "8"] | 2.309 ms (5%) | | 160 bytes (1%) | 6 |
| ["gemm", "mul", "linalg", "256"] | 1.110 ms (5%) | | | |
| ["gemm", "mul", "linalg", "32"] | 3.738 μs (5%) | | | |
| ["gemm", "mul", "linalg", "8"] | 303.206 ns (5%) | | | |
| ["gemm", "mul", "man", "false", "256"] | 2.032 ms (5%) | | | |
| ["gemm", "mul", "man", "false", "32"] | 5.767 μs (5%) | | | |
| ["gemm", "mul", "man", "false", "8"] | 476.066 ns (5%) | | | |
| ["gemm", "mul", "man", "ivdep", "256"] | 2.002 ms (5%) | | | |
| ["gemm", "mul", "man", "ivdep", "32"] | 4.743 μs (5%) | | | |
| ["gemm", "mul", "man", "ivdep", "8"] | 476.456 ns (5%) | | | |
| ["gemm", "mul", "man", "true", "256"] | 2.011 ms (5%) | | | |
| ["gemm", "mul", "man", "true", "32"] | 5.734 μs (5%) | | | |
| ["gemm", "mul", "man", "true", "8"] | 543.670 ns (5%) | | | |
| ["gemm", "mul", "xf", "false", "256"] | 2.017 ms (5%) | | 48 bytes (1%) | 2 |
| ["gemm", "mul", "xf", "false", "32"] | 5.767 μs (5%) | | 48 bytes (1%) | 2 |
| ["gemm", "mul", "xf", "false", "8"] | 472.495 ns (5%) | | 48 bytes (1%) | 2 |
| ["gemm", "mul", "xf", "ivdep", "256"] | 2.002 ms (5%) | | 48 bytes (1%) | 2 |
| ["gemm", "mul", "xf", "ivdep", "32"] | 4.572 μs (5%) | | 48 bytes (1%) | 2 |
| ["gemm", "mul", "xf", "ivdep", "8"] | 528.330 ns (5%) | | 48 bytes (1%) | 2 |
| ["gemm", "mul", "xf", "true", "256"] | 2.031 ms (5%) | | 48 bytes (1%) | 2 |
| ["gemm", "mul", "xf", "true", "32"] | 5.867 μs (5%) | | 48 bytes (1%) | 2 |
| ["gemm", "mul", "xf", "true", "8"] | 474.541 ns (5%) | | 48 bytes (1%) | 2 |
| ["missing_argmax", "man"] | 3.350 μs (5%) | | 32 bytes (1%) | 1 |
| ["missing_argmax", "rf"] | 3.125 μs (5%) | | 32 bytes (1%) | 1 |
| ["missing_argmax", "xf"] | 3.125 μs (5%) | | 32 bytes (1%) | 1 |
| ["missing_dot", "equiv"] | 1.770 μs (5%) | | 16 bytes (1%) | 1 |
| ["missing_dot", "man"] | 1.720 μs (5%) | | 16 bytes (1%) | 1 |
| ["missing_dot", "naive"] | 5.867 μs (5%) | | 16 bytes (1%) | 1 |
| ["missing_dot", "rf"] | 1.690 μs (5%) | | 16 bytes (1%) | 1 |
| ["missing_dot", "rf_nota"] | 1.780 μs (5%) | | 16 bytes (1%) | 1 |
| ["missing_dot", "xf"] | 262.132 μs (5%) | | 72.00 KiB (1%) | 3734 |
| ["missing_dot", "xf_nota"] | 255.432 μs (5%) | | 71.77 KiB (1%) | 3728 |
| ["partition_by", "man"] | 2.465 ms (5%) | | 352 bytes (1%) | 4 |
| ["partition_by", "xf"] | 2.339 ms (5%) | | 576 bytes (1%) | 7 |
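
Two conventions stated before this table lend themselves to a small worked example: an ID is a path of keys through nested benchmark groups, and the percentage next to a value is a noise tolerance around it. The Python sketch below only illustrates those two ideas. The nested `suite` dictionary and the helper names `lookup` and `tolerance_band` are hypothetical stand-ins, not the BenchmarkTools API, and the three times are copied from the table above.

```python
def lookup(suite, benchmark_id):
    """Walk a nested mapping along [parent_group, child_group, ..., key]."""
    node = suite
    for key in benchmark_id:
        node = node[key]
    return node


def tolerance_band(value, tolerance):
    """Range in which the 'true' value is expected to fall."""
    return value * (1.0 - tolerance), value * (1.0 + tolerance)


# Hypothetical nested suite holding a few times from this report, in ns.
suite = {
    "gemm": {"mul": {"linalg": {"8": 303.206, "32": 3738.0}}},
    "dot": {"blas": 1510.0},
}

time_ns = lookup(suite, ["gemm", "mul", "linalg", "8"])
low, high = tolerance_band(time_ns, 0.05)
print(f"{time_ns} ns, expected within [{low:.3f}, {high:.3f}] ns")
```

Read this way, a reported `303.206 ns (5%)` stands for a time of roughly 288 ns to 318 ns.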

Benchmark Group List

Here's a list of all the benchmark groups executed by this job:

  • ["cat"]
  • ["collect"]
  • ["dot"]
  • ["filter_map_map!"]
  • ["filter_map_reduce"]
  • ["findall"]
  • ["gemm", "fusedmul", "blas"]
  • ["gemm", "fusedmul", "xf"]
  • ["gemm", "mul", "linalg"]
  • ["gemm", "mul", "man", "false"]
  • ["gemm", "mul", "man", "ivdep"]
  • ["gemm", "mul", "man", "true"]
  • ["gemm", "mul", "xf", "false"]
  • ["gemm", "mul", "xf", "ivdep"]
  • ["gemm", "mul", "xf", "true"]
  • ["missing_argmax"]
  • ["missing_dot"]
  • ["partition_by"]

Julia versioninfo

Julia Version 1.4.1
Commit 381693d3df* (2020-04-14 17:20 UTC)
Platform Info:
  OS: Linux (x86_64-pc-linux-gnu)
      Ubuntu 18.04.4 LTS
  uname: Linux 5.0.0-1035-azure #37-Ubuntu SMP Wed Mar 18 11:21:35 UTC 2020 x86_64 x86_64
  CPU: Intel(R) Xeon(R) Platinum 8171M CPU @ 2.60GHz: 
              speed         user         nice          sys         idle          irq
       #1  2095 MHz      44482 s          0 s       1613 s      60265 s          0 s
       #2  2095 MHz      10588 s          0 s       1502 s      94430 s          0 s
       
  Memory: 6.782737731933594 GB (3177.40234375 MB free)
  Uptime: 1118.0 sec
  Load Avg:  1.0927734375  0.943359375  0.59375
  WORD_SIZE: 64
  LIBM: libopenlibm
  LLVM: libLLVM-8.0.1 (ORCJIT, skylake)

Baseline result

Benchmark Report for /home/runner/work/Transducers.jl/Transducers.jl

Job Properties

  • Time of benchmark: 11 May 2020 - 06:13
  • Package commit: 62ad9f
  • Julia commit: 381693
  • Julia command flags: None
  • Environment variables: OMP_NUM_THREADS => 1 JULIA_NUM_THREADS => 1

Results

Below is a table of this job's results, obtained by running the benchmarks.
The values listed in the ID column have the structure [parent_group, child_group, ..., key], and can be used to
index into the BaseBenchmarks suite to retrieve the corresponding benchmarks.
The percentages accompanying time and memory values in the below table are noise tolerances. The "true"
time/memory value for a given benchmark is expected to fall within this percentage of the reported value.
An empty cell means that the value was zero.

| ID | time | GC time | memory | allocations |
|---|---|---|---|---|
| ["cat", "base"] | 2.167 μs (5%) | | | |
| ["cat", "xf"] | 2.045 μs (5%) | | | |
| ["collect", "filter-missing"] | 117.708 μs (5%) | | 33.03 KiB (1%) | 19 |
| ["collect", "identity-float"] | 84.606 μs (5%) | | 256.89 KiB (1%) | 19 |
| ["collect", "identity-union"] | 412.931 μs (5%) | | 285.67 KiB (1%) | 6675 |
| ["dot", "blas"] | 1.480 μs (5%) | | | |
| ["dot", "man"] | 1.470 μs (5%) | | | |
| ["dot", "rf"] | 2.667 μs (5%) | | | |
| ["dot", "xf"] | 2.678 μs (5%) | | | |
| ["filter_map_map!", "man"] | 58.904 μs (5%) | | | |
| ["filter_map_map!", "xf"] | 64.904 μs (5%) | | 144 bytes (1%) | 8 |
| ["filter_map_reduce", "man"] | 287.622 μs (5%) | | | |
| ["filter_map_reduce", "xf"] | 287.522 μs (5%) | | | |
| ["findall", "base"] | 997.981 μs (5%) | | 2.00 MiB (1%) | 21 |
| ["findall", "xf-array"] | 784.463 μs (5%) | | 3.05 MiB (1%) | 100014 |
| ["findall", "xf-iter"] | 1.009 ms (5%) | | 2.00 MiB (1%) | 28 |
| ["gemm", "fusedmul", "blas", "16"] | 5.287 ms (5%) | | | |
| ["gemm", "fusedmul", "blas", "2"] | 3.853 ms (5%) | | | |
| ["gemm", "fusedmul", "blas", "32"] | 7.209 ms (5%) | | | |
| ["gemm", "fusedmul", "blas", "8"] | 4.066 ms (5%) | | | |
| ["gemm", "fusedmul", "xf", "16"] | 5.119 ms (5%) | | 160 bytes (1%) | 6 |
| ["gemm", "fusedmul", "xf", "2"] | 616.040 μs (5%) | | 160 bytes (1%) | 6 |
| ["gemm", "fusedmul", "xf", "32"] | 9.318 ms (5%) | | 160 bytes (1%) | 6 |
| ["gemm", "fusedmul", "xf", "8"] | 2.452 ms (5%) | | 160 bytes (1%) | 6 |
| ["gemm", "mul", "linalg", "256"] | 1.109 ms (5%) | | | |
| ["gemm", "mul", "linalg", "32"] | 3.800 μs (5%) | | | |
| ["gemm", "mul", "linalg", "8"] | 300.000 ns (5%) | | | |
| ["gemm", "mul", "man", "false", "256"] | 2.027 ms (5%) | | | |
| ["gemm", "mul", "man", "false", "32"] | 5.800 μs (5%) | | | |
| ["gemm", "mul", "man", "false", "8"] | 500.000 ns (5%) | | | |
| ["gemm", "mul", "man", "ivdep", "256"] | 2.019 ms (5%) | | | |
| ["gemm", "mul", "man", "ivdep", "32"] | 4.800 μs (5%) | | | |
| ["gemm", "mul", "man", "ivdep", "8"] | 500.000 ns (5%) | | | |
| ["gemm", "mul", "man", "true", "256"] | 2.017 ms (5%) | | | |
| ["gemm", "mul", "man", "true", "32"] | 5.800 μs (5%) | | | |
| ["gemm", "mul", "man", "true", "8"] | 500.000 ns (5%) | | | |
| ["gemm", "mul", "xf", "false", "256"] | 2.029 ms (5%) | | 48 bytes (1%) | 2 |
| ["gemm", "mul", "xf", "false", "32"] | 5.800 μs (5%) | | 48 bytes (1%) | 2 |
| ["gemm", "mul", "xf", "false", "8"] | 500.000 ns (5%) | | 48 bytes (1%) | 2 |
| ["gemm", "mul", "xf", "ivdep", "256"] | 2.027 ms (5%) | | 48 bytes (1%) | 2 |
| ["gemm", "mul", "xf", "ivdep", "32"] | 4.600 μs (5%) | | 48 bytes (1%) | 2 |
| ["gemm", "mul", "xf", "ivdep", "8"] | 400.000 ns (5%) | | 48 bytes (1%) | 2 |
| ["gemm", "mul", "xf", "true", "256"] | 2.039 ms (5%) | | 48 bytes (1%) | 2 |
| ["gemm", "mul", "xf", "true", "32"] | 5.900 μs (5%) | | 48 bytes (1%) | 2 |
| ["gemm", "mul", "xf", "true", "8"] | 500.000 ns (5%) | | 48 bytes (1%) | 2 |
| ["missing_argmax", "man"] | 3.350 μs (5%) | | 32 bytes (1%) | 1 |
| ["missing_argmax", "rf"] | 3.100 μs (5%) | | 32 bytes (1%) | 1 |
| ["missing_argmax", "xf"] | 3.113 μs (5%) | | 32 bytes (1%) | 1 |
| ["missing_dot", "equiv"] | 1.770 μs (5%) | | 16 bytes (1%) | 1 |
| ["missing_dot", "man"] | 1.690 μs (5%) | | 16 bytes (1%) | 1 |
| ["missing_dot", "naive"] | 5.917 μs (5%) | | 16 bytes (1%) | 1 |
| ["missing_dot", "rf"] | 1.670 μs (5%) | | 16 bytes (1%) | 1 |
| ["missing_dot", "rf_nota"] | 1.790 μs (5%) | | 16 bytes (1%) | 1 |
| ["missing_dot", "xf"] | 236.717 μs (5%) | | 72.23 KiB (1%) | 3746 |
| ["missing_dot", "xf_nota"] | 236.518 μs (5%) | | 72.08 KiB (1%) | 3740 |
| ["partition_by", "man"] | 2.478 ms (5%) | | 352 bytes (1%) | 4 |
| ["partition_by", "xf"] | 2.347 ms (5%) | | 576 bytes (1%) | 7 |

Benchmark Group List

Here's a list of all the benchmark groups executed by this job:

  • ["cat"]
  • ["collect"]
  • ["dot"]
  • ["filter_map_map!"]
  • ["filter_map_reduce"]
  • ["findall"]
  • ["gemm", "fusedmul", "blas"]
  • ["gemm", "fusedmul", "xf"]
  • ["gemm", "mul", "linalg"]
  • ["gemm", "mul", "man", "false"]
  • ["gemm", "mul", "man", "ivdep"]
  • ["gemm", "mul", "man", "true"]
  • ["gemm", "mul", "xf", "false"]
  • ["gemm", "mul", "xf", "ivdep"]
  • ["gemm", "mul", "xf", "true"]
  • ["missing_argmax"]
  • ["missing_dot"]
  • ["partition_by"]

Julia versioninfo

Julia Version 1.4.1
Commit 381693d3df* (2020-04-14 17:20 UTC)
Platform Info:
  OS: Linux (x86_64-pc-linux-gnu)
      Ubuntu 18.04.4 LTS
  uname: Linux 5.0.0-1035-azure #37-Ubuntu SMP Wed Mar 18 11:21:35 UTC 2020 x86_64 x86_64
  CPU: Intel(R) Xeon(R) Platinum 8171M CPU @ 2.60GHz: 
              speed         user         nice          sys         idle          irq
       #1  2095 MHz      66199 s          0 s       1860 s      61637 s          0 s
       #2  2095 MHz      12141 s          0 s       1644 s     116045 s          0 s
       
  Memory: 6.782737731933594 GB (3047.70703125 MB free)
  Uptime: 1352.0 sec
  Load Avg:  1.09716796875  1.01611328125  0.71044921875
  WORD_SIZE: 64
  LIBM: libopenlibm
  LLVM: libLLVM-8.0.1 (ORCJIT, skylake)

Runtime information

Runtime Info
BLAS #threads 2
BLAS.vendor() openblas64
Sys.CPU_THREADS 2

lscpu output:

Architecture:        x86_64
CPU op-mode(s):      32-bit, 64-bit
Byte Order:          Little Endian
CPU(s):              2
On-line CPU(s) list: 0,1
Thread(s) per core:  1
Core(s) per socket:  2
Socket(s):           1
NUMA node(s):        1
Vendor ID:           GenuineIntel
CPU family:          6
Model:               85
Model name:          Intel(R) Xeon(R) Platinum 8171M CPU @ 2.60GHz
Stepping:            4
CPU MHz:             2095.197
BogoMIPS:            4190.39
Hypervisor vendor:   Microsoft
Virtualization type: full
L1d cache:           32K
L1i cache:           32K
L2 cache:            1024K
L3 cache:            36608K
NUMA node0 CPU(s):   0,1
Flags:               fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc rep_good nopl xtopology cpuid pni pclmulqdq ssse3 fma cx16 pcid sse4_1 sse4_2 movbe popcnt aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase bmi1 hle avx2 smep bmi2 erms invpcid rtm mpx avx512f avx512dq rdseed adx smap clflushopt avx512cd avx512bw avx512vl xsaveopt xsavec xsaves
Cpu Property Value
Brand Intel(R) Xeon(R) Platinum 8171M CPU @ 2.60GHz
Vendor :Intel
Architecture :Skylake
Model Family: 0x06, Model: 0x55, Stepping: 0x04, Type: 0x00
Cores 2 physical cores, 2 logical cores (on executing CPU)
No Hyperthreading detected
Clock Frequencies Not supported by CPU
Data Cache Level 1:3 : (32, 1024, 36608) kbytes
64 byte cache line size
Address Size 48 bits virtual, 44 bits physical
SIMD 512 bit = 64 byte max. SIMD vector size
Time Stamp Counter TSC is accessible via rdtsc
TSC increased at every clock cycle (non-invariant TSC)
Perf. Monitoring Performance Monitoring Counters (PMC) are not supported
Hypervisor Yes, Microsoft

@github-actions
Contributor

Multi-thread benchmark result

Judge result

Benchmark Report for /home/runner/work/Transducers.jl/Transducers.jl

Job Properties

  • Time of benchmarks:
    • Target: 11 May 2020 - 06:11
    • Baseline: 11 May 2020 - 06:16
  • Package commits:
    • Target: 035dcb
    • Baseline: 62ad9f
  • Julia commits:
    • Target: 381693
    • Baseline: 381693
  • Julia command flags:
    • Target: None
    • Baseline: None
  • Environment variables:
    • Target: JULIA_NUM_THREADS => 2
    • Baseline: JULIA_NUM_THREADS => 2

Results

A ratio greater than 1.0 denotes a possible regression (marked with ❌), while a ratio less
than 1.0 denotes a possible improvement (marked with ✅). Only significant results - results
that indicate possible regressions or improvements - are shown below (thus, an empty table means that all
benchmark results remained invariant between builds).

| ID | time ratio | memory ratio |
|---|---|---|
| ["collect", "assoc", "basesize=1024"] | 1.10 (5%) ❌ | 1.00 (1%) |
| ["collect", "unordered", "basesize=1"] | 1.10 (5%) ❌ | 1.00 (1%) |
| ["collect", "unordered", "basesize=1024"] | 0.85 (5%) ✅ | 0.92 (1%) ✅ |
| ["collect", "unordered", "basesize=32"] | 1.07 (5%) ❌ | 1.00 (1%) |
| ["findfirst", "n=400", "foldl"] | 1.05 (5%) ❌ | 1.00 (1%) |
| ["overhead", "stoppable=true"] | 1.15 (5%) ❌ | 1.00 (1%) |
| ["parallel_histogram", "assoc", "basesize=4096"] | 1.04 (5%) | 1.15 (1%) ❌ |
| ["parallel_histogram", "comm", "basesize=16384"] | 0.90 (5%) ✅ | 1.00 (1%) |
| ["parallel_histogram", "comm", "basesize=4096"] | 1.01 (5%) | 0.99 (1%) ✅ |
| ["parallel_histogram", "comm", "basesize=8192"] | 0.93 (5%) ✅ | 1.01 (1%) |
| ["words", "nthreads=1"] | 0.91 (5%) ✅ | 1.00 (1%) |
| ["words", "nthreads=4"] | 1.08 (5%) ❌ | 1.00 (1%) |

Benchmark Group List

Here's a list of all the benchmark groups executed by this job:

  • ["collect", "assoc"]
  • ["collect"]
  • ["collect", "unordered"]
  • ["findfirst", "n=1000"]
  • ["findfirst", "n=1000", "reduce"]
  • ["findfirst", "n=400"]
  • ["findfirst", "n=400", "reduce"]
  • ["findfirst", "n=500"]
  • ["findfirst", "n=500", "reduce"]
  • ["overhead"]
  • ["parallel_histogram", "assoc"]
  • ["parallel_histogram", "comm"]
  • ["parallel_histogram"]
  • ["sum", "random"]
  • ["sum", "random", "reduce"]
  • ["sum", "uniform"]
  • ["sum", "uniform", "reduce"]
  • ["sum", "valley"]
  • ["sum", "valley", "reduce"]
  • ["words"]

Julia versioninfo

Target

Julia Version 1.4.1
Commit 381693d3df* (2020-04-14 17:20 UTC)
Platform Info:
  OS: Linux (x86_64-pc-linux-gnu)
      Ubuntu 18.04.4 LTS
  uname: Linux 5.0.0-1035-azure #37-Ubuntu SMP Wed Mar 18 11:21:35 UTC 2020 x86_64 x86_64
  CPU: Intel(R) Xeon(R) CPU E5-2673 v4 @ 2.30GHz: 
              speed         user         nice          sys         idle          irq
       #1  2294 MHz      46148 s          0 s       2432 s      17483 s          0 s
       #2  2294 MHz      51683 s          0 s       2462 s      12769 s          0 s
       
  Memory: 6.7827301025390625 GB (3288.38671875 MB free)
  Uptime: 684.0 sec
  Load Avg:  1.7080078125  1.52099609375  0.9013671875
  WORD_SIZE: 64
  LIBM: libopenlibm
  LLVM: libLLVM-8.0.1 (ORCJIT, broadwell)

Baseline

Julia Version 1.4.1
Commit 381693d3df* (2020-04-14 17:20 UTC)
Platform Info:
  OS: Linux (x86_64-pc-linux-gnu)
      Ubuntu 18.04.4 LTS
  uname: Linux 5.0.0-1035-azure #37-Ubuntu SMP Wed Mar 18 11:21:35 UTC 2020 x86_64 x86_64
  CPU: Intel(R) Xeon(R) CPU E5-2673 v4 @ 2.30GHz: 
              speed         user         nice          sys         idle          irq
       #1  2294 MHz      67960 s          0 s       3024 s      24472 s          0 s
       #2  2294 MHz      75200 s          0 s       3125 s      18008 s          0 s
       
  Memory: 6.7827301025390625 GB (3210.23046875 MB free)
  Uptime: 980.0 sec
  Load Avg:  1.720703125  1.6044921875  1.107421875
  WORD_SIZE: 64
  LIBM: libopenlibm
  LLVM: libLLVM-8.0.1 (ORCJIT, broadwell)

Target result

Benchmark Report for /home/runner/work/Transducers.jl/Transducers.jl

Job Properties

  • Time of benchmark: 11 May 2020 - 06:11
  • Package commit: 035dcb
  • Julia commit: 381693
  • Julia command flags: None
  • Environment variables: JULIA_NUM_THREADS => 2

Results

Below is a table of this job's results, obtained by running the benchmarks.
The values listed in the ID column have the structure [parent_group, child_group, ..., key], and can be used to
index into the BaseBenchmarks suite to retrieve the corresponding benchmarks.
The percentages accompanying time and memory values in the below table are noise tolerances. The "true"
time/memory value for a given benchmark is expected to fall within this percentage of the reported value.
An empty cell means that the value was zero.

| ID | time | GC time | memory | allocations |
|---|---|---|---|---|
| ["collect", "assoc", "basesize=1"] | 318.008 ms (5%) | 13.947 ms | 87.05 MiB (1%) | 1557915 |
| ["collect", "assoc", "basesize=1024"] | 181.229 ms (5%) | | 1.84 MiB (1%) | 1779 |
| ["collect", "assoc", "basesize=32"] | 168.001 ms (5%) | | 5.63 MiB (1%) | 52989 |
| ["collect", "seq"] | 314.552 ms (5%) | | 512.98 KiB (1%) | 22 |
| ["collect", "unordered", "basesize=1"] | 446.733 ms (5%) | | 29.19 MiB (1%) | 405117 |
| ["collect", "unordered", "basesize=1024"] | 230.342 ms (5%) | | 857.45 KiB (1%) | 7969 |
| ["collect", "unordered", "basesize=32"] | 202.990 ms (5%) | | 1.50 MiB (1%) | 18706 |
| ["findfirst", "n=1000", "foldl"] | 533.154 ms (5%) | | | |
| ["findfirst", "n=1000", "reduce", "basesize=128"] | 277.970 ms (5%) | | 563.86 KiB (1%) | 10218 |
| ["findfirst", "n=1000", "reduce", "basesize=256"] | 272.126 ms (5%) | | 287.05 KiB (1%) | 5213 |
| ["findfirst", "n=1000", "reduce", "basesize=512"] | 275.641 ms (5%) | | 149.13 KiB (1%) | 2713 |
| ["findfirst", "n=400", "foldl"] | 402.072 ms (5%) | | | |
| ["findfirst", "n=400", "reduce", "basesize=128"] | 209.912 ms (5%) | | 1.02 MiB (1%) | 18964 |
| ["findfirst", "n=400", "reduce", "basesize=256"] | 209.756 ms (5%) | | 526.02 KiB (1%) | 9565 |
| ["findfirst", "n=400", "reduce", "basesize=512"] | 214.920 ms (5%) | | 267.05 KiB (1%) | 4868 |
| ["findfirst", "n=500", "foldl"] | 65.019 ms (5%) | | | |
| ["findfirst", "n=500", "reduce", "basesize=128"] | 33.471 ms (5%) | | 157.16 KiB (1%) | 2839 |
| ["findfirst", "n=500", "reduce", "basesize=256"] | 34.130 ms (5%) | | 84.33 KiB (1%) | 1523 |
| ["findfirst", "n=500", "reduce", "basesize=512"] | 36.115 ms (5%) | | 48.09 KiB (1%) | 869 |
| ["overhead", "default"] | 168.701 μs (5%) | | 146.16 KiB (1%) | 2629 |
| ["overhead", "stoppable=false"] | 167.801 μs (5%) | | 146.16 KiB (1%) | 2629 |
| ["overhead", "stoppable=true"] | 322.201 μs (5%) | | 146.42 KiB (1%) | 2646 |
| ["parallel_histogram", "assoc", "basesize=16384"] | 3.444 ms (5%) | | 732.06 KiB (1%) | 103 |
| ["parallel_histogram", "assoc", "basesize=4096"] | 4.237 ms (5%) | | 2.07 MiB (1%) | 503 |
| ["parallel_histogram", "assoc", "basesize=8192"] | 3.923 ms (5%) | | 1.43 MiB (1%) | 242 |
| ["parallel_histogram", "comm", "basesize=16384"] | 11.559 ms (5%) | | 1.22 MiB (1%) | 276 |
| ["parallel_histogram", "comm", "basesize=4096"] | 20.012 ms (5%) | | 1.03 MiB (1%) | 4058 |
| ["parallel_histogram", "comm", "basesize=8192"] | 15.732 ms (5%) | | 1.23 MiB (1%) | 1070 |
| ["parallel_histogram", "seq"] | 6.155 ms (5%) | | 364.63 KiB (1%) | 25 |
| ["sum", "random", "foldl"] | 11.196 ms (5%) | | | |
| ["sum", "random", "reduce", "basesize=128"] | 6.647 ms (5%) | | 313.31 KiB (1%) | 6066 |
| ["sum", "random", "reduce", "basesize=256"] | 6.330 ms (5%) | | 155.08 KiB (1%) | 3010 |
| ["sum", "random", "reduce", "basesize=512"] | 6.071 ms (5%) | | 76.25 KiB (1%) | 1484 |
| ["sum", "uniform", "foldl"] | 11.949 ms (5%) | | | |
| ["sum", "uniform", "reduce", "basesize=128"] | 6.437 ms (5%) | | 313.34 KiB (1%) | 6068 |
| ["sum", "uniform", "reduce", "basesize=256"] | 6.169 ms (5%) | | 155.11 KiB (1%) | 3012 |
| ["sum", "uniform", "reduce", "basesize=512"] | 6.262 ms (5%) | | 76.25 KiB (1%) | 1484 |
| ["sum", "valley", "foldl"] | 12.086 ms (5%) | | | |
| ["sum", "valley", "reduce", "basesize=128"] | 6.554 ms (5%) | | 313.22 KiB (1%) | 6060 |
| ["sum", "valley", "reduce", "basesize=256"] | 6.236 ms (5%) | | 155.08 KiB (1%) | 3010 |
| ["sum", "valley", "reduce", "basesize=512"] | 6.413 ms (5%) | | 76.25 KiB (1%) | 1484 |
| ["words", "nthreads=1"] | 34.346 ms (5%) | 6.032 ms | 64.75 MiB (1%) | 2095126 |
| ["words", "nthreads=2"] | 19.657 ms (5%) | | 65.47 MiB (1%) | 2095279 |
| ["words", "nthreads=4"] | 22.094 ms (5%) | | 66.11 MiB (1%) | 2095569 |

Benchmark Group List

Here's a list of all the benchmark groups executed by this job:

  • ["collect", "assoc"]
  • ["collect"]
  • ["collect", "unordered"]
  • ["findfirst", "n=1000"]
  • ["findfirst", "n=1000", "reduce"]
  • ["findfirst", "n=400"]
  • ["findfirst", "n=400", "reduce"]
  • ["findfirst", "n=500"]
  • ["findfirst", "n=500", "reduce"]
  • ["overhead"]
  • ["parallel_histogram", "assoc"]
  • ["parallel_histogram", "comm"]
  • ["parallel_histogram"]
  • ["sum", "random"]
  • ["sum", "random", "reduce"]
  • ["sum", "uniform"]
  • ["sum", "uniform", "reduce"]
  • ["sum", "valley"]
  • ["sum", "valley", "reduce"]
  • ["words"]

Julia versioninfo

Julia Version 1.4.1
Commit 381693d3df* (2020-04-14 17:20 UTC)
Platform Info:
  OS: Linux (x86_64-pc-linux-gnu)
      Ubuntu 18.04.4 LTS
  uname: Linux 5.0.0-1035-azure #37-Ubuntu SMP Wed Mar 18 11:21:35 UTC 2020 x86_64 x86_64
  CPU: Intel(R) Xeon(R) CPU E5-2673 v4 @ 2.30GHz: 
              speed         user         nice          sys         idle          irq
       #1  2294 MHz      46148 s          0 s       2432 s      17483 s          0 s
       #2  2294 MHz      51683 s          0 s       2462 s      12769 s          0 s
       
  Memory: 6.7827301025390625 GB (3288.38671875 MB free)
  Uptime: 684.0 sec
  Load Avg:  1.7080078125  1.52099609375  0.9013671875
  WORD_SIZE: 64
  LIBM: libopenlibm
  LLVM: libLLVM-8.0.1 (ORCJIT, broadwell)

Baseline result

Benchmark Report for /home/runner/work/Transducers.jl/Transducers.jl

Job Properties

  • Time of benchmark: 11 May 2020 - 06:16
  • Package commit: 62ad9f
  • Julia commit: 381693
  • Julia command flags: None
  • Environment variables: JULIA_NUM_THREADS => 2

Results

Below is a table of this job's results, obtained by running the benchmarks.
The values listed in the ID column have the structure [parent_group, child_group, ..., key], and can be used to
index into the BaseBenchmarks suite to retrieve the corresponding benchmarks.
The percentages accompanying time and memory values in the below table are noise tolerances. The "true"
time/memory value for a given benchmark is expected to fall within this percentage of the reported value.
An empty cell means that the value was zero.

| ID | time | GC time | memory | allocations |
|---|---|---|---|---|
| ["collect", "assoc", "basesize=1"] | 311.246 ms (5%) | 12.737 ms | 87.05 MiB (1%) | 1557929 |
| ["collect", "assoc", "basesize=1024"] | 164.144 ms (5%) | | 1.84 MiB (1%) | 1780 |
| ["collect", "assoc", "basesize=32"] | 170.201 ms (5%) | | 5.63 MiB (1%) | 52986 |
| ["collect", "seq"] | 312.988 ms (5%) | | 512.98 KiB (1%) | 22 |
| ["collect", "unordered", "basesize=1"] | 407.410 ms (5%) | 7.729 ms | 29.19 MiB (1%) | 405453 |
| ["collect", "unordered", "basesize=1024"] | 270.411 ms (5%) | | 933.95 KiB (1%) | 12865 |
| ["collect", "unordered", "basesize=32"] | 189.048 ms (5%) | | 1.50 MiB (1%) | 18684 |
| ["findfirst", "n=1000", "foldl"] | 551.590 ms (5%) | | | |
| ["findfirst", "n=1000", "reduce", "basesize=128"] | 284.415 ms (5%) | | 563.94 KiB (1%) | 10223 |
| ["findfirst", "n=1000", "reduce", "basesize=256"] | 278.929 ms (5%) | | 287.14 KiB (1%) | 5219 |
| ["findfirst", "n=1000", "reduce", "basesize=512"] | 283.124 ms (5%) | | 149.17 KiB (1%) | 2716 |
| ["findfirst", "n=400", "foldl"] | 382.715 ms (5%) | | | |
| ["findfirst", "n=400", "reduce", "basesize=128"] | 214.273 ms (5%) | | 1.02 MiB (1%) | 18969 |
| ["findfirst", "n=400", "reduce", "basesize=256"] | 208.191 ms (5%) | | 525.88 KiB (1%) | 9556 |
| ["findfirst", "n=400", "reduce", "basesize=512"] | 206.684 ms (5%) | | 267.09 KiB (1%) | 4871 |
| ["findfirst", "n=500", "foldl"] | 66.381 ms (5%) | | | |
| ["findfirst", "n=500", "reduce", "basesize=128"] | 33.238 ms (5%) | | 157.17 KiB (1%) | 2840 |
| ["findfirst", "n=500", "reduce", "basesize=256"] | 33.019 ms (5%) | | 84.34 KiB (1%) | 1524 |
| ["findfirst", "n=500", "reduce", "basesize=512"] | 37.112 ms (5%) | | 48.08 KiB (1%) | 868 |
| ["overhead", "default"] | 170.501 μs (5%) | | 146.14 KiB (1%) | 2628 |
| ["overhead", "stoppable=false"] | 165.500 μs (5%) | | 146.16 KiB (1%) | 2629 |
| ["overhead", "stoppable=true"] | 279.402 μs (5%) | | 146.41 KiB (1%) | 2645 |
| ["parallel_histogram", "assoc", "basesize=16384"] | 3.480 ms (5%) | | 732.06 KiB (1%) | 103 |
| ["parallel_histogram", "assoc", "basesize=4096"] | 4.074 ms (5%) | | 1.80 MiB (1%) | 497 |
| ["parallel_histogram", "assoc", "basesize=8192"] | 3.810 ms (5%) | | 1.43 MiB (1%) | 242 |
| ["parallel_histogram", "comm", "basesize=16384"] | 12.841 ms (5%) | | 1.22 MiB (1%) | 156 |
| ["parallel_histogram", "comm", "basesize=4096"] | 19.886 ms (5%) | | 1.05 MiB (1%) | 4212 |
| ["parallel_histogram", "comm", "basesize=8192"] | 16.839 ms (5%) | | 1.23 MiB (1%) | 614 |
| ["parallel_histogram", "seq"] | 6.219 ms (5%) | | 364.63 KiB (1%) | 25 |
| ["sum", "random", "foldl"] | 11.422 ms (5%) | | | |
| ["sum", "random", "reduce", "basesize=128"] | 6.577 ms (5%) | | 313.31 KiB (1%) | 6066 |
| ["sum", "random", "reduce", "basesize=256"] | 6.428 ms (5%) | | 155.06 KiB (1%) | 3009 |
| ["sum", "random", "reduce", "basesize=512"] | 6.027 ms (5%) | | 76.25 KiB (1%) | 1484 |
| ["sum", "uniform", "foldl"] | 11.422 ms (5%) | | | |
| ["sum", "uniform", "reduce", "basesize=128"] | 6.160 ms (5%) | | 313.30 KiB (1%) | 6065 |
| ["sum", "uniform", "reduce", "basesize=256"] | 6.160 ms (5%) | | 155.08 KiB (1%) | 3010 |
| ["sum", "uniform", "reduce", "basesize=512"] | 5.979 ms (5%) | | 76.25 KiB (1%) | 1484 |
| ["sum", "valley", "foldl"] | 11.629 ms (5%) | | | |
| ["sum", "valley", "reduce", "basesize=128"] | 6.277 ms (5%) | | 313.25 KiB (1%) | 6062 |
| ["sum", "valley", "reduce", "basesize=256"] | 6.381 ms (5%) | | 155.06 KiB (1%) | 3009 |
| ["sum", "valley", "reduce", "basesize=512"] | 6.187 ms (5%) | | 76.28 KiB (1%) | 1486 |
| ["words", "nthreads=1"] | 37.747 ms (5%) | 7.354 ms | 64.83 MiB (1%) | 2098226 |
| ["words", "nthreads=2"] | 20.462 ms (5%) | | 65.55 MiB (1%) | 2098379 |
| ["words", "nthreads=4"] | 20.421 ms (5%) | | 66.00 MiB (1%) | 2098528 |

Benchmark Group List

Here's a list of all the benchmark groups executed by this job:

  • ["collect", "assoc"]
  • ["collect"]
  • ["collect", "unordered"]
  • ["findfirst", "n=1000"]
  • ["findfirst", "n=1000", "reduce"]
  • ["findfirst", "n=400"]
  • ["findfirst", "n=400", "reduce"]
  • ["findfirst", "n=500"]
  • ["findfirst", "n=500", "reduce"]
  • ["overhead"]
  • ["parallel_histogram", "assoc"]
  • ["parallel_histogram", "comm"]
  • ["parallel_histogram"]
  • ["sum", "random"]
  • ["sum", "random", "reduce"]
  • ["sum", "uniform"]
  • ["sum", "uniform", "reduce"]
  • ["sum", "valley"]
  • ["sum", "valley", "reduce"]
  • ["words"]

Julia versioninfo

Julia Version 1.4.1
Commit 381693d3df* (2020-04-14 17:20 UTC)
Platform Info:
  OS: Linux (x86_64-pc-linux-gnu)
      Ubuntu 18.04.4 LTS
  uname: Linux 5.0.0-1035-azure #37-Ubuntu SMP Wed Mar 18 11:21:35 UTC 2020 x86_64 x86_64
  CPU: Intel(R) Xeon(R) CPU E5-2673 v4 @ 2.30GHz: 
              speed         user         nice          sys         idle          irq
       #1  2294 MHz      67960 s          0 s       3024 s      24472 s          0 s
       #2  2294 MHz      75200 s          0 s       3125 s      18008 s          0 s
       
  Memory: 6.7827301025390625 GB (3210.23046875 MB free)
  Uptime: 980.0 sec
  Load Avg:  1.720703125  1.6044921875  1.107421875
  WORD_SIZE: 64
  LIBM: libopenlibm
  LLVM: libLLVM-8.0.1 (ORCJIT, broadwell)

Runtime information

Runtime Info
BLAS #threads 2
BLAS.vendor() openblas64
Sys.CPU_THREADS 2

lscpu output:

Architecture:        x86_64
CPU op-mode(s):      32-bit, 64-bit
Byte Order:          Little Endian
CPU(s):              2
On-line CPU(s) list: 0,1
Thread(s) per core:  1
Core(s) per socket:  2
Socket(s):           1
NUMA node(s):        1
Vendor ID:           GenuineIntel
CPU family:          6
Model:               79
Model name:          Intel(R) Xeon(R) CPU E5-2673 v4 @ 2.30GHz
Stepping:            1
CPU MHz:             2294.689
BogoMIPS:            4589.37
Hypervisor vendor:   Microsoft
Virtualization type: full
L1d cache:           32K
L1i cache:           32K
L2 cache:            256K
L3 cache:            51200K
NUMA node0 CPU(s):   0,1
Flags:               fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc rep_good nopl xtopology cpuid pni pclmulqdq ssse3 fma cx16 pcid sse4_1 sse4_2 movbe popcnt aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase bmi1 hle avx2 smep bmi2 erms invpcid rtm rdseed adx smap xsaveopt md_clear
| Cpu Property | Value |
|---|---|
| Brand | Intel(R) Xeon(R) CPU E5-2673 v4 @ 2.30GHz |
| Vendor | :Intel |
| Architecture | :Broadwell |
| Model | Family: 0x06, Model: 0x4f, Stepping: 0x01, Type: 0x00 |
| Cores | 2 physical cores, 2 logical cores (on executing CPU) |
| | No Hyperthreading detected |
| Clock Frequencies | Not supported by CPU |
| Data Cache | Level 1:3 : (32, 256, 51200) kbytes |
| | 64 byte cache line size |
| Address Size | 48 bits virtual, 44 bits physical |
| SIMD | 256 bit = 32 byte max. SIMD vector size |
| Time Stamp Counter | TSC is accessible via rdtsc |
| | TSC increased at every clock cycle (non-invariant TSC) |
| Perf. Monitoring | Performance Monitoring Counters (PMC) are not supported |
| Hypervisor | Yes, Microsoft |

Using commit:
Mention `amount` in README
JuliaFolds/SplittablesBase.jl@27d67f2
@github-actions
Contributor

Benchmark result

Judge result

Benchmark Report for /home/runner/work/Transducers.jl/Transducers.jl

Job Properties

  • Time of benchmarks:
    • Target: 11 May 2020 - 06:35
    • Baseline: 11 May 2020 - 06:39
  • Package commits:
    • Target: 723b90
    • Baseline: 62ad9f
  • Julia commits:
    • Target: 381693
    • Baseline: 381693
  • Julia command flags:
    • Target: None
    • Baseline: None
  • Environment variables:
    • Target: OMP_NUM_THREADS => 1 JULIA_NUM_THREADS => 1
    • Baseline: OMP_NUM_THREADS => 1 JULIA_NUM_THREADS => 1

Results

A ratio greater than 1.0 denotes a possible regression (marked with ❌), while a ratio less
than 1.0 denotes a possible improvement (marked with ✅). Only significant results - results
that indicate possible regressions or improvements - are shown below (thus, an empty table means that all
benchmark results remained invariant between builds).
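
The classification rule described above can be sketched in a few lines of Python. This is only an illustration, not the benchmarking tool's code: the function name `judge_ratio` is hypothetical, and it assumes the tolerance is applied symmetrically around 1.0, which is a reading of the report rather than something it states. The sample numbers are the `["cat", "xf"]` timings from the target and baseline result tables further down.

```python
def judge_ratio(target, baseline, tolerance):
    """Return (ratio, verdict) for one target/baseline measurement pair.

    Assumption: a result counts as significant only when the ratio
    leaves the band [1 - tolerance, 1 + tolerance].
    """
    ratio = target / baseline
    if ratio > 1.0 + tolerance:
        verdict = "regression"    # shown with a cross mark in the report
    elif ratio < 1.0 - tolerance:
        verdict = "improvement"   # shown with a check mark in the report
    else:
        verdict = "invariant"     # row is omitted from the judge table
    return ratio, verdict


# ["cat", "xf"]: 1.710 μs on the target, 1.460 μs on the baseline, 5% time tolerance
ratio, verdict = judge_ratio(1.710, 1.460, 0.05)
print(f"{ratio:.2f} {verdict}")
```

Under the same assumption, a row such as `["collect", "unordered", "basesize=1024"]` in the multi-thread report can carry a mark on the memory ratio only: its time ratio stays inside the 5% band while its memory ratio leaves the 1% band.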

| ID | time ratio | memory ratio |
|---|---|---|
| ["cat", "base"] | 1.10 (5%) ❌ | 1.00 (1%) |
| ["cat", "xf"] | 1.17 (5%) ❌ | 1.00 (1%) |
| ["collect", "filter-missing"] | 0.91 (5%) ✅ | 1.00 (1%) |
| ["collect", "identity-union"] | 0.92 (5%) ✅ | 1.00 (1%) |
| ["dot", "blas"] | 0.92 (5%) ✅ | 1.00 (1%) |
| ["dot", "man"] | 0.95 (5%) ✅ | 1.00 (1%) |
| ["dot", "rf"] | 0.89 (5%) ✅ | 1.00 (1%) |
| ["dot", "xf"] | 0.92 (5%) ✅ | 1.00 (1%) |
| ["filter_map_reduce", "man"] | 0.94 (5%) ✅ | 1.00 (1%) |
| ["filter_map_reduce", "xf"] | 0.92 (5%) ✅ | 1.00 (1%) |
| ["findall", "xf-array"] | 0.94 (5%) ✅ | 1.00 (1%) |
| ["gemm", "mul", "linalg", "256"] | 0.94 (5%) ✅ | 1.00 (1%) |
| ["gemm", "mul", "linalg", "32"] | 0.91 (5%) ✅ | 1.00 (1%) |
| ["gemm", "mul", "man", "false", "8"] | 0.92 (5%) ✅ | 1.00 (1%) |
| ["gemm", "mul", "man", "true", "32"] | 0.84 (5%) ✅ | 1.00 (1%) |
| ["gemm", "mul", "xf", "false", "8"] | 1.09 (5%) ❌ | 1.00 (1%) |
| ["gemm", "mul", "xf", "ivdep", "8"] | 0.95 (5%) ✅ | 1.00 (1%) |
| ["gemm", "mul", "xf", "true", "32"] | 0.86 (5%) ✅ | 1.00 (1%) |
| ["gemm", "mul", "xf", "true", "8"] | 0.89 (5%) ✅ | 1.00 (1%) |
| ["missing_dot", "equiv"] | 0.93 (5%) ✅ | 1.00 (1%) |
| ["missing_dot", "naive"] | 0.91 (5%) ✅ | 1.00 (1%) |
| ["missing_dot", "xf_nota"] | 0.92 (5%) ✅ | 1.00 (1%) |

Benchmark Group List

Here's a list of all the benchmark groups executed by this job:

  • ["cat"]
  • ["collect"]
  • ["dot"]
  • ["filter_map_map!"]
  • ["filter_map_reduce"]
  • ["findall"]
  • ["gemm", "fusedmul", "blas"]
  • ["gemm", "fusedmul", "xf"]
  • ["gemm", "mul", "linalg"]
  • ["gemm", "mul", "man", "false"]
  • ["gemm", "mul", "man", "ivdep"]
  • ["gemm", "mul", "man", "true"]
  • ["gemm", "mul", "xf", "false"]
  • ["gemm", "mul", "xf", "ivdep"]
  • ["gemm", "mul", "xf", "true"]
  • ["missing_argmax"]
  • ["missing_dot"]
  • ["partition_by"]

Julia versioninfo

Target

Julia Version 1.4.1
Commit 381693d3df* (2020-04-14 17:20 UTC)
Platform Info:
  OS: Linux (x86_64-pc-linux-gnu)
      Ubuntu 18.04.4 LTS
  uname: Linux 5.0.0-1035-azure #37-Ubuntu SMP Wed Mar 18 11:21:35 UTC 2020 x86_64 x86_64
  CPU: Intel(R) Xeon(R) CPU E5-2673 v4 @ 2.30GHz: 
              speed         user         nice          sys         idle          irq
       #1  2294 MHz       8353 s          0 s       1432 s      50902 s          0 s
       #2  2294 MHz      47779 s          0 s       1277 s      11321 s          0 s
       
  Memory: 6.782737731933594 GB (3154.328125 MB free)
  Uptime: 623.0 sec
  Load Avg:  1.06640625  1.0224609375  0.630859375
  WORD_SIZE: 64
  LIBM: libopenlibm
  LLVM: libLLVM-8.0.1 (ORCJIT, broadwell)

Baseline

Julia Version 1.4.1
Commit 381693d3df* (2020-04-14 17:20 UTC)
Platform Info:
  OS: Linux (x86_64-pc-linux-gnu)
      Ubuntu 18.04.4 LTS
  uname: Linux 5.0.0-1035-azure #37-Ubuntu SMP Wed Mar 18 11:21:35 UTC 2020 x86_64 x86_64
  CPU: Intel(R) Xeon(R) CPU E5-2673 v4 @ 2.30GHz: 
              speed         user         nice          sys         idle          irq
       #1  2294 MHz      26052 s          0 s       1640 s      57350 s          0 s
       #2  2294 MHz      54351 s          0 s       1485 s      28885 s          0 s
       
  Memory: 6.782737731933594 GB (3121.234375 MB free)
  Uptime: 867.0 sec
  Load Avg:  1.02978515625  1.0458984375  0.744140625
  WORD_SIZE: 64
  LIBM: libopenlibm
  LLVM: libLLVM-8.0.1 (ORCJIT, broadwell)

Target result

Benchmark Report for /home/runner/work/Transducers.jl/Transducers.jl

Job Properties

  • Time of benchmark: 11 May 2020 - 06:35
  • Package commit: 723b90
  • Julia commit: 381693
  • Julia command flags: None
  • Environment variables: OMP_NUM_THREADS => 1 JULIA_NUM_THREADS => 1

Results

Below is a table of this job's results, obtained by running the benchmarks.
The values listed in the ID column have the structure [parent_group, child_group, ..., key], and can be used to
index into the BaseBenchmarks suite to retrieve the corresponding benchmarks.
The percentages accompanying time and memory values in the below table are noise tolerances. The "true"
time/memory value for a given benchmark is expected to fall within this percentage of the reported value.
An empty cell means that the value was zero.
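
As a rough illustration of the two points above, the Python sketch below walks an ID path through a nested mapping and computes the band implied by a noise tolerance. The `suite` dictionary, `lookup`, and `tolerance_band` are hypothetical stand-ins rather than the benchmark suite's actual API. The timings are copied from the `["cat", ...]` and `["dot", "blas"]` rows of the table that follows, and the band assumes "within this percentage" means value × (1 ± tolerance).

```python
def lookup(suite, id_path):
    """Follow [parent_group, child_group, ..., key] through nested dicts."""
    node = suite
    for key in id_path:
        node = node[key]
    return node


def tolerance_band(value, tolerance):
    """Return the (low, high) range implied by a relative noise tolerance."""
    return value * (1.0 - tolerance), value * (1.0 + tolerance)


# Hypothetical miniature suite; times are in microseconds.
suite = {
    "cat": {"base": 1.730, "xf": 1.710},
    "dot": {"blas": 2.267},
}

time_us = lookup(suite, ["cat", "base"])
low, high = tolerance_band(time_us, 0.05)  # 5% time tolerance
print(f"{time_us} μs, expected between {low:.4f} and {high:.4f} μs")
```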

ID time GC time memory allocations
["cat", "base"] 1.730 μs (5%)
["cat", "xf"] 1.710 μs (5%)
["collect", "filter-missing"] 80.800 μs (5%) 33.03 KiB (1%) 19
["collect", "identity-float"] 71.101 μs (5%) 256.89 KiB (1%) 19
["collect", "identity-union"] 316.601 μs (5%) 285.47 KiB (1%) 6666
["dot", "blas"] 2.267 μs (5%)
["dot", "man"] 2.300 μs (5%)
["dot", "rf"] 2.656 μs (5%)
["dot", "xf"] 2.656 μs (5%)
["filter_map_map!", "man"] 71.400 μs (5%)
["filter_map_map!", "xf"] 73.500 μs (5%) 144 bytes (1%) 8
["filter_map_reduce", "man"] 194.900 μs (5%)
["filter_map_reduce", "xf"] 194.900 μs (5%)
["findall", "base"] 857.201 μs (5%) 2.00 MiB (1%) 21
["findall", "xf-array"] 571.600 μs (5%) 3.05 MiB (1%) 100014
["findall", "xf-iter"] 944.201 μs (5%) 2.00 MiB (1%) 28
["gemm", "fusedmul", "blas", "16"] 5.613 ms (5%)
["gemm", "fusedmul", "blas", "2"] 3.959 ms (5%)
["gemm", "fusedmul", "blas", "32"] 8.185 ms (5%)
["gemm", "fusedmul", "blas", "8"] 4.352 ms (5%)
["gemm", "fusedmul", "xf", "16"] 5.438 ms (5%) 160 bytes (1%) 6
["gemm", "fusedmul", "xf", "2"] 653.401 μs (5%) 160 bytes (1%) 6
["gemm", "fusedmul", "xf", "32"] 10.946 ms (5%) 160 bytes (1%) 6
["gemm", "fusedmul", "xf", "8"] 2.710 ms (5%) 160 bytes (1%) 6
["gemm", "mul", "linalg", "256"] 1.304 ms (5%)
["gemm", "mul", "linalg", "32"] 3.714 μs (5%)
["gemm", "mul", "linalg", "8"] 287.823 ns (5%)
["gemm", "mul", "man", "false", "256"] 4.776 ms (5%)
["gemm", "mul", "man", "false", "32"] 7.520 μs (5%)
["gemm", "mul", "man", "false", "8"] 366.332 ns (5%)
["gemm", "mul", "man", "ivdep", "256"] 4.769 ms (5%)
["gemm", "mul", "man", "ivdep", "32"] 7.020 μs (5%)
["gemm", "mul", "man", "ivdep", "8"] 415.075 ns (5%)
["gemm", "mul", "man", "true", "256"] 4.748 ms (5%)
["gemm", "mul", "man", "true", "32"] 7.025 μs (5%)
["gemm", "mul", "man", "true", "8"] 402.347 ns (5%)
["gemm", "mul", "xf", "false", "256"] 4.814 ms (5%) 48 bytes (1%) 2
["gemm", "mul", "xf", "false", "32"] 7.425 μs (5%) 48 bytes (1%) 2
["gemm", "mul", "xf", "false", "8"] 434.343 ns (5%) 48 bytes (1%) 2
["gemm", "mul", "xf", "ivdep", "256"] 4.722 ms (5%) 48 bytes (1%) 2
["gemm", "mul", "xf", "ivdep", "32"] 6.500 μs (5%) 48 bytes (1%) 2
["gemm", "mul", "xf", "ivdep", "8"] 379.310 ns (5%) 48 bytes (1%) 2
["gemm", "mul", "xf", "true", "256"] 4.750 ms (5%) 48 bytes (1%) 2
["gemm", "mul", "xf", "true", "32"] 6.680 μs (5%) 48 bytes (1%) 2
["gemm", "mul", "xf", "true", "8"] 356.286 ns (5%) 48 bytes (1%) 2
["missing_argmax", "man"] 990.000 ns (5%) 32 bytes (1%) 1
["missing_argmax", "rf"] 2.233 μs (5%) 32 bytes (1%) 1
["missing_argmax", "xf"] 2.333 μs (5%) 32 bytes (1%) 1
["missing_dot", "equiv"] 1.056 μs (5%) 16 bytes (1%) 1
["missing_dot", "man"] 1.167 μs (5%) 16 bytes (1%) 1
["missing_dot", "naive"] 4.057 μs (5%) 16 bytes (1%) 1
["missing_dot", "rf"] 1.200 μs (5%) 16 bytes (1%) 1
["missing_dot", "rf_nota"] 1.016 μs (5%) 16 bytes (1%) 1
["missing_dot", "xf"] 187.600 μs (5%) 71.97 KiB (1%) 3736
["missing_dot", "xf_nota"] 184.800 μs (5%) 72.19 KiB (1%) 3744
["partition_by", "man"] 1.897 ms (5%) 352 bytes (1%) 4
["partition_by", "xf"] 1.857 ms (5%) 576 bytes (1%) 7

Benchmark Group List

Here's a list of all the benchmark groups executed by this job:

  • ["cat"]
  • ["collect"]
  • ["dot"]
  • ["filter_map_map!"]
  • ["filter_map_reduce"]
  • ["findall"]
  • ["gemm", "fusedmul", "blas"]
  • ["gemm", "fusedmul", "xf"]
  • ["gemm", "mul", "linalg"]
  • ["gemm", "mul", "man", "false"]
  • ["gemm", "mul", "man", "ivdep"]
  • ["gemm", "mul", "man", "true"]
  • ["gemm", "mul", "xf", "false"]
  • ["gemm", "mul", "xf", "ivdep"]
  • ["gemm", "mul", "xf", "true"]
  • ["missing_argmax"]
  • ["missing_dot"]
  • ["partition_by"]

Julia versioninfo

Julia Version 1.4.1
Commit 381693d3df* (2020-04-14 17:20 UTC)
Platform Info:
  OS: Linux (x86_64-pc-linux-gnu)
      Ubuntu 18.04.4 LTS
  uname: Linux 5.0.0-1035-azure #37-Ubuntu SMP Wed Mar 18 11:21:35 UTC 2020 x86_64 x86_64
  CPU: Intel(R) Xeon(R) CPU E5-2673 v4 @ 2.30GHz: 
              speed         user         nice          sys         idle          irq
       #1  2294 MHz       8353 s          0 s       1432 s      50902 s          0 s
       #2  2294 MHz      47779 s          0 s       1277 s      11321 s          0 s
       
  Memory: 6.782737731933594 GB (3154.328125 MB free)
  Uptime: 623.0 sec
  Load Avg:  1.06640625  1.0224609375  0.630859375
  WORD_SIZE: 64
  LIBM: libopenlibm
  LLVM: libLLVM-8.0.1 (ORCJIT, broadwell)

Baseline result

Benchmark Report for /home/runner/work/Transducers.jl/Transducers.jl

Job Properties

  • Time of benchmark: 11 May 2020 - 06:39
  • Package commit: 62ad9f
  • Julia commit: 381693
  • Julia command flags: None
  • Environment variables: OMP_NUM_THREADS => 1 JULIA_NUM_THREADS => 1

Results

Below is a table of this job's results, obtained by running the benchmarks.
The values listed in the ID column have the structure [parent_group, child_group, ..., key], and can be used to
index into the BaseBenchmarks suite to retrieve the corresponding benchmarks.
The percentages accompanying time and memory values in the below table are noise tolerances. The "true"
time/memory value for a given benchmark is expected to fall within this percentage of the reported value.
An empty cell means that the value was zero.

ID time GC time memory allocations
["cat", "base"] 1.570 μs (5%)
["cat", "xf"] 1.460 μs (5%)
["collect", "filter-missing"] 89.000 μs (5%) 33.03 KiB (1%) 19
["collect", "identity-float"] 74.600 μs (5%) 256.89 KiB (1%) 19
["collect", "identity-union"] 345.800 μs (5%) 285.28 KiB (1%) 6684
["dot", "blas"] 2.456 μs (5%)
["dot", "man"] 2.422 μs (5%)
["dot", "rf"] 3.000 μs (5%)
["dot", "xf"] 2.889 μs (5%)
["filter_map_map!", "man"] 71.000 μs (5%)
["filter_map_map!", "xf"] 73.900 μs (5%) 144 bytes (1%) 8
["filter_map_reduce", "man"] 206.400 μs (5%)
["filter_map_reduce", "xf"] 212.700 μs (5%)
["findall", "base"] 875.202 μs (5%) 2.00 MiB (1%) 21
["findall", "xf-array"] 611.001 μs (5%) 3.05 MiB (1%) 100014
["findall", "xf-iter"] 902.002 μs (5%) 2.00 MiB (1%) 28
["gemm", "fusedmul", "blas", "16"] 5.769 ms (5%)
["gemm", "fusedmul", "blas", "2"] 4.008 ms (5%)
["gemm", "fusedmul", "blas", "32"] 8.262 ms (5%)
["gemm", "fusedmul", "blas", "8"] 4.333 ms (5%)
["gemm", "fusedmul", "xf", "16"] 5.449 ms (5%) 160 bytes (1%) 6
["gemm", "fusedmul", "xf", "2"] 647.101 μs (5%) 160 bytes (1%) 6
["gemm", "fusedmul", "xf", "32"] 10.954 ms (5%) 160 bytes (1%) 6
["gemm", "fusedmul", "xf", "8"] 2.715 ms (5%) 160 bytes (1%) 6
["gemm", "mul", "linalg", "256"] 1.382 ms (5%)
["gemm", "mul", "linalg", "32"] 4.100 μs (5%)
["gemm", "mul", "linalg", "8"] 300.000 ns (5%)
["gemm", "mul", "man", "false", "256"] 4.877 ms (5%)
["gemm", "mul", "man", "false", "32"] 7.700 μs (5%)
["gemm", "mul", "man", "false", "8"] 400.000 ns (5%)
["gemm", "mul", "man", "ivdep", "256"] 4.829 ms (5%)
["gemm", "mul", "man", "ivdep", "32"] 7.100 μs (5%)
["gemm", "mul", "man", "ivdep", "8"] 400.000 ns (5%)
["gemm", "mul", "man", "true", "256"] 4.862 ms (5%)
["gemm", "mul", "man", "true", "32"] 8.400 μs (5%)
["gemm", "mul", "man", "true", "8"] 400.000 ns (5%)
["gemm", "mul", "xf", "false", "256"] 4.941 ms (5%) 48 bytes (1%) 2
["gemm", "mul", "xf", "false", "32"] 7.600 μs (5%) 48 bytes (1%) 2
["gemm", "mul", "xf", "false", "8"] 400.000 ns (5%) 48 bytes (1%) 2
["gemm", "mul", "xf", "ivdep", "256"] 4.828 ms (5%) 48 bytes (1%) 2
["gemm", "mul", "xf", "ivdep", "32"] 6.200 μs (5%) 48 bytes (1%) 2
["gemm", "mul", "xf", "ivdep", "8"] 400.000 ns (5%) 48 bytes (1%) 2
["gemm", "mul", "xf", "true", "256"] 4.881 ms (5%) 48 bytes (1%) 2
["gemm", "mul", "xf", "true", "32"] 7.800 μs (5%) 48 bytes (1%) 2
["gemm", "mul", "xf", "true", "8"] 400.000 ns (5%) 48 bytes (1%) 2
["missing_argmax", "man"] 990.000 ns (5%) 32 bytes (1%) 1
["missing_argmax", "rf"] 2.244 μs (5%) 32 bytes (1%) 1
["missing_argmax", "xf"] 2.256 μs (5%) 32 bytes (1%) 1
["missing_dot", "equiv"] 1.132 μs (5%) 16 bytes (1%) 1
["missing_dot", "man"] 1.158 μs (5%) 16 bytes (1%) 1
["missing_dot", "naive"] 4.457 μs (5%) 16 bytes (1%) 1
["missing_dot", "rf"] 1.250 μs (5%) 16 bytes (1%) 1
["missing_dot", "rf_nota"] 1.009 μs (5%) 16 bytes (1%) 1
["missing_dot", "xf"] 193.401 μs (5%) 71.89 KiB (1%) 3732
["missing_dot", "xf_nota"] 201.701 μs (5%) 72.22 KiB (1%) 3746
["partition_by", "man"] 1.915 ms (5%) 352 bytes (1%) 4
["partition_by", "xf"] 1.848 ms (5%) 576 bytes (1%) 7

Benchmark Group List

Here's a list of all the benchmark groups executed by this job:

  • ["cat"]
  • ["collect"]
  • ["dot"]
  • ["filter_map_map!"]
  • ["filter_map_reduce"]
  • ["findall"]
  • ["gemm", "fusedmul", "blas"]
  • ["gemm", "fusedmul", "xf"]
  • ["gemm", "mul", "linalg"]
  • ["gemm", "mul", "man", "false"]
  • ["gemm", "mul", "man", "ivdep"]
  • ["gemm", "mul", "man", "true"]
  • ["gemm", "mul", "xf", "false"]
  • ["gemm", "mul", "xf", "ivdep"]
  • ["gemm", "mul", "xf", "true"]
  • ["missing_argmax"]
  • ["missing_dot"]
  • ["partition_by"]

Julia versioninfo

Julia Version 1.4.1
Commit 381693d3df* (2020-04-14 17:20 UTC)
Platform Info:
  OS: Linux (x86_64-pc-linux-gnu)
      Ubuntu 18.04.4 LTS
  uname: Linux 5.0.0-1035-azure #37-Ubuntu SMP Wed Mar 18 11:21:35 UTC 2020 x86_64 x86_64
  CPU: Intel(R) Xeon(R) CPU E5-2673 v4 @ 2.30GHz: 
              speed         user         nice          sys         idle          irq
       #1  2294 MHz      26052 s          0 s       1640 s      57350 s          0 s
       #2  2294 MHz      54351 s          0 s       1485 s      28885 s          0 s
       
  Memory: 6.782737731933594 GB (3121.234375 MB free)
  Uptime: 867.0 sec
  Load Avg:  1.02978515625  1.0458984375  0.744140625
  WORD_SIZE: 64
  LIBM: libopenlibm
  LLVM: libLLVM-8.0.1 (ORCJIT, broadwell)

Runtime information

Runtime Info
BLAS #threads 2
BLAS.vendor() openblas64
Sys.CPU_THREADS 2

lscpu output:

Architecture:        x86_64
CPU op-mode(s):      32-bit, 64-bit
Byte Order:          Little Endian
CPU(s):              2
On-line CPU(s) list: 0,1
Thread(s) per core:  1
Core(s) per socket:  2
Socket(s):           1
NUMA node(s):        1
Vendor ID:           GenuineIntel
CPU family:          6
Model:               79
Model name:          Intel(R) Xeon(R) CPU E5-2673 v4 @ 2.30GHz
Stepping:            1
CPU MHz:             2294.687
BogoMIPS:            4589.37
Hypervisor vendor:   Microsoft
Virtualization type: full
L1d cache:           32K
L1i cache:           32K
L2 cache:            256K
L3 cache:            51200K
NUMA node0 CPU(s):   0,1
Flags:               fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc rep_good nopl xtopology cpuid pni pclmulqdq ssse3 fma cx16 pcid sse4_1 sse4_2 movbe popcnt aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase bmi1 hle avx2 smep bmi2 erms invpcid rtm rdseed adx smap xsaveopt md_clear
| Cpu Property | Value |
|---|---|
| Brand | Intel(R) Xeon(R) CPU E5-2673 v4 @ 2.30GHz |
| Vendor | :Intel |
| Architecture | :Broadwell |
| Model | Family: 0x06, Model: 0x4f, Stepping: 0x01, Type: 0x00 |
| Cores | 2 physical cores, 2 logical cores (on executing CPU) |
| | No Hyperthreading detected |
| Clock Frequencies | Not supported by CPU |
| Data Cache | Level 1:3 : (32, 256, 51200) kbytes |
| | 64 byte cache line size |
| Address Size | 48 bits virtual, 44 bits physical |
| SIMD | 256 bit = 32 byte max. SIMD vector size |
| Time Stamp Counter | TSC is accessible via rdtsc |
| | TSC increased at every clock cycle (non-invariant TSC) |
| Perf. Monitoring | Performance Monitoring Counters (PMC) are not supported |
| Hypervisor | Yes, Microsoft |

@github-actions
Contributor

Multi-thread benchmark result

Judge result

Benchmark Report for /home/runner/work/Transducers.jl/Transducers.jl

Job Properties

  • Time of benchmarks:
    • Target: 11 May 2020 - 06:37
    • Baseline: 11 May 2020 - 06:42
  • Package commits:
    • Target: 723b90
    • Baseline: 62ad9f
  • Julia commits:
    • Target: 381693
    • Baseline: 381693
  • Julia command flags:
    • Target: None
    • Baseline: None
  • Environment variables:
    • Target: JULIA_NUM_THREADS => 2
    • Baseline: JULIA_NUM_THREADS => 2

Results

A ratio greater than 1.0 denotes a possible regression (marked with ❌), while a ratio less
than 1.0 denotes a possible improvement (marked with ✅). Only significant results - results
that indicate possible regressions or improvements - are shown below (thus, an empty table means that all
benchmark results remained invariant between builds).

| ID | time ratio | memory ratio |
|---|---|---|
| ["collect", "assoc", "basesize=1"] | 0.92 (5%) ✅ | 1.00 (1%) |
| ["collect", "seq"] | 1.08 (5%) ❌ | 1.00 (1%) |
| ["collect", "unordered", "basesize=1"] | 0.88 (5%) ✅ | 1.00 (1%) |
| ["collect", "unordered", "basesize=1024"] | 1.04 (5%) | 1.02 (1%) ❌ |
| ["findfirst", "n=1000", "foldl"] | 1.13 (5%) ❌ | 1.00 (1%) |
| ["findfirst", "n=1000", "reduce", "basesize=256"] | 0.94 (5%) ✅ | 1.00 (1%) |
| ["findfirst", "n=400", "foldl"] | 0.94 (5%) ✅ | 1.00 (1%) |
| ["findfirst", "n=400", "reduce", "basesize=128"] | 1.19 (5%) ❌ | 1.00 (1%) |
| ["findfirst", "n=400", "reduce", "basesize=256"] | 1.08 (5%) ❌ | 1.00 (1%) |
| ["findfirst", "n=400", "reduce", "basesize=512"] | 0.95 (5%) ✅ | 1.00 (1%) |
| ["findfirst", "n=500", "foldl"] | 1.11 (5%) ❌ | 1.00 (1%) |
| ["findfirst", "n=500", "reduce", "basesize=128"] | 0.87 (5%) ✅ | 1.00 (1%) |
| ["findfirst", "n=500", "reduce", "basesize=256"] | 0.92 (5%) ✅ | 1.00 (1%) |
| ["overhead", "default"] | 1.07 (5%) ❌ | 1.00 (1%) |
| ["parallel_histogram", "assoc", "basesize=4096"] | 1.13 (5%) ❌ | 1.00 (1%) |
| ["parallel_histogram", "comm", "basesize=4096"] | 0.84 (5%) ✅ | 0.96 (1%) ✅ |
| ["parallel_histogram", "comm", "basesize=8192"] | 0.90 (5%) ✅ | 1.00 (1%) |
| ["sum", "random", "foldl"] | 1.09 (5%) ❌ | 1.00 (1%) |
| ["sum", "random", "reduce", "basesize=128"] | 0.95 (5%) ✅ | 1.00 (1%) |
| ["sum", "valley", "foldl"] | 1.06 (5%) ❌ | 1.00 (1%) |
| ["sum", "valley", "reduce", "basesize=256"] | 1.10 (5%) ❌ | 1.00 (1%) |
| ["words", "nthreads=1"] | 0.94 (5%) ✅ | 1.00 (1%) |
| ["words", "nthreads=2"] | 1.07 (5%) ❌ | 1.00 (1%) |
| ["words", "nthreads=4"] | 1.23 (5%) ❌ | 1.00 (1%) |

Benchmark Group List

Here's a list of all the benchmark groups executed by this job:

  • ["collect", "assoc"]
  • ["collect"]
  • ["collect", "unordered"]
  • ["findfirst", "n=1000"]
  • ["findfirst", "n=1000", "reduce"]
  • ["findfirst", "n=400"]
  • ["findfirst", "n=400", "reduce"]
  • ["findfirst", "n=500"]
  • ["findfirst", "n=500", "reduce"]
  • ["overhead"]
  • ["parallel_histogram", "assoc"]
  • ["parallel_histogram", "comm"]
  • ["parallel_histogram"]
  • ["sum", "random"]
  • ["sum", "random", "reduce"]
  • ["sum", "uniform"]
  • ["sum", "uniform", "reduce"]
  • ["sum", "valley"]
  • ["sum", "valley", "reduce"]
  • ["words"]

Julia versioninfo

Target

Julia Version 1.4.1
Commit 381693d3df* (2020-04-14 17:20 UTC)
Platform Info:
  OS: Linux (x86_64-pc-linux-gnu)
      Ubuntu 18.04.4 LTS
  uname: Linux 5.0.0-1035-azure #37-Ubuntu SMP Wed Mar 18 11:21:35 UTC 2020 x86_64 x86_64
  CPU: Intel(R) Xeon(R) CPU E5-2673 v4 @ 2.30GHz: 
              speed         user         nice          sys         idle          irq
       #1  2294 MHz      49682 s          0 s       2619 s      15628 s          0 s
       #2  2294 MHz      49244 s          0 s       2466 s      17204 s          0 s
       
  Memory: 6.782924652099609 GB (3260.45703125 MB free)
  Uptime: 712.0 sec
  Load Avg:  1.58935546875  1.4501953125  0.8681640625
  WORD_SIZE: 64
  LIBM: libopenlibm
  LLVM: libLLVM-8.0.1 (ORCJIT, broadwell)

Baseline

Julia Version 1.4.1
Commit 381693d3df* (2020-04-14 17:20 UTC)
Platform Info:
  OS: Linux (x86_64-pc-linux-gnu)
      Ubuntu 18.04.4 LTS
  uname: Linux 5.0.0-1035-azure #37-Ubuntu SMP Wed Mar 18 11:21:35 UTC 2020 x86_64 x86_64
  CPU: Intel(R) Xeon(R) CPU E5-2673 v4 @ 2.30GHz: 
              speed         user         nice          sys         idle          irq
       #1  2294 MHz      70911 s          0 s       3280 s      23286 s          0 s
       #2  2294 MHz      73248 s          0 s       3074 s      22162 s          0 s
       
  Memory: 6.782924652099609 GB (3253.99609375 MB free)
  Uptime: 1011.0 sec
  Load Avg:  1.74755859375  1.57177734375  1.08056640625
  WORD_SIZE: 64
  LIBM: libopenlibm
  LLVM: libLLVM-8.0.1 (ORCJIT, broadwell)

Target result

Benchmark Report for /home/runner/work/Transducers.jl/Transducers.jl

Job Properties

  • Time of benchmark: 11 May 2020 - 06:37
  • Package commit: 723b90
  • Julia commit: 381693
  • Julia command flags: None
  • Environment variables: JULIA_NUM_THREADS => 2

Results

Below is a table of this job's results, obtained by running the benchmarks.
The values listed in the ID column have the structure [parent_group, child_group, ..., key], and can be used to
index into the BaseBenchmarks suite to retrieve the corresponding benchmarks.
The percentages accompanying time and memory values in the below table are noise tolerances. The "true"
time/memory value for a given benchmark is expected to fall within this percentage of the reported value.
An empty cell means that the value was zero.

ID time GC time memory allocations
["collect", "assoc", "basesize=1"] 295.815 ms (5%) 10.966 ms 87.05 MiB (1%) 1557939
["collect", "assoc", "basesize=1024"] 151.967 ms (5%) 1.84 MiB (1%) 1778
["collect", "assoc", "basesize=32"] 162.041 ms (5%) 5.63 MiB (1%) 52990
["collect", "seq"] 316.758 ms (5%) 512.98 KiB (1%) 22
["collect", "unordered", "basesize=1"] 403.171 ms (5%) 29.20 MiB (1%) 405629
["collect", "unordered", "basesize=1024"] 261.250 ms (5%) 904.56 KiB (1%) 10984
["collect", "unordered", "basesize=32"] 187.841 ms (5%) 1.50 MiB (1%) 18349
["findfirst", "n=1000", "foldl"] 564.891 ms (5%)
["findfirst", "n=1000", "reduce", "basesize=128"] 268.216 ms (5%) 563.83 KiB (1%) 10216
["findfirst", "n=1000", "reduce", "basesize=256"] 261.094 ms (5%) 287.08 KiB (1%) 5215
["findfirst", "n=1000", "reduce", "basesize=512"] 269.845 ms (5%) 149.19 KiB (1%) 2717
["findfirst", "n=400", "foldl"] 374.858 ms (5%)
["findfirst", "n=400", "reduce", "basesize=128"] 239.379 ms (5%) 1.02 MiB (1%) 18966
["findfirst", "n=400", "reduce", "basesize=256"] 221.704 ms (5%) 525.95 KiB (1%) 9561
["findfirst", "n=400", "reduce", "basesize=512"] 197.678 ms (5%) 267.03 KiB (1%) 4867
["findfirst", "n=500", "foldl"] 70.187 ms (5%)
["findfirst", "n=500", "reduce", "basesize=128"] 30.822 ms (5%) 157.19 KiB (1%) 2841
["findfirst", "n=500", "reduce", "basesize=256"] 32.309 ms (5%) 84.33 KiB (1%) 1523
["findfirst", "n=500", "reduce", "basesize=512"] 36.966 ms (5%) 48.09 KiB (1%) 869
["overhead", "default"] 186.901 μs (5%) 146.16 KiB (1%) 2629
["overhead", "stoppable=false"] 173.601 μs (5%) 146.16 KiB (1%) 2629
["overhead", "stoppable=true"] 314.102 μs (5%) 146.42 KiB (1%) 2646
["parallel_histogram", "assoc", "basesize=16384"] 3.461 ms (5%) 732.06 KiB (1%) 103
["parallel_histogram", "assoc", "basesize=4096"] 4.608 ms (5%) 2.07 MiB (1%) 503
["parallel_histogram", "assoc", "basesize=8192"] 4.003 ms (5%) 1.43 MiB (1%) 242
["parallel_histogram", "comm", "basesize=16384"] 11.867 ms (5%) 1.22 MiB (1%) 264
["parallel_histogram", "comm", "basesize=4096"] 17.114 ms (5%) 1002.16 KiB (1%) 505
["parallel_histogram", "comm", "basesize=8192"] 15.005 ms (5%) 1.24 MiB (1%) 1117
["parallel_histogram", "seq"] 6.172 ms (5%) 364.63 KiB (1%) 25
["sum", "random", "foldl"] 13.358 ms (5%)
["sum", "random", "reduce", "basesize=128"] 6.227 ms (5%) 313.31 KiB (1%) 6066
["sum", "random", "reduce", "basesize=256"] 5.986 ms (5%) 155.08 KiB (1%) 3010
["sum", "random", "reduce", "basesize=512"] 5.944 ms (5%) 76.27 KiB (1%) 1485
["sum", "uniform", "foldl"] 11.623 ms (5%)
["sum", "uniform", "reduce", "basesize=128"] 6.037 ms (5%) 313.38 KiB (1%) 6070
["sum", "uniform", "reduce", "basesize=256"] 5.975 ms (5%) 155.08 KiB (1%) 3010
["sum", "uniform", "reduce", "basesize=512"] 5.678 ms (5%) 76.25 KiB (1%) 1484
["sum", "valley", "foldl"] 12.332 ms (5%)
["sum", "valley", "reduce", "basesize=128"] 6.523 ms (5%) 313.20 KiB (1%) 6059
["sum", "valley", "reduce", "basesize=256"] 6.496 ms (5%) 155.08 KiB (1%) 3010
["sum", "valley", "reduce", "basesize=512"] 5.796 ms (5%) 76.25 KiB (1%) 1484
["words", "nthreads=1"] 38.676 ms (5%) 6.746 ms 64.68 MiB (1%) 2093296
["words", "nthreads=2"] 22.111 ms (5%) 65.40 MiB (1%) 2093451
["words", "nthreads=4"] 24.557 ms (5%) 65.85 MiB (1%) 2093599

Benchmark Group List

Here's a list of all the benchmark groups executed by this job:

  • ["collect", "assoc"]
  • ["collect"]
  • ["collect", "unordered"]
  • ["findfirst", "n=1000"]
  • ["findfirst", "n=1000", "reduce"]
  • ["findfirst", "n=400"]
  • ["findfirst", "n=400", "reduce"]
  • ["findfirst", "n=500"]
  • ["findfirst", "n=500", "reduce"]
  • ["overhead"]
  • ["parallel_histogram", "assoc"]
  • ["parallel_histogram", "comm"]
  • ["parallel_histogram"]
  • ["sum", "random"]
  • ["sum", "random", "reduce"]
  • ["sum", "uniform"]
  • ["sum", "uniform", "reduce"]
  • ["sum", "valley"]
  • ["sum", "valley", "reduce"]
  • ["words"]

Julia versioninfo

Julia Version 1.4.1
Commit 381693d3df* (2020-04-14 17:20 UTC)
Platform Info:
  OS: Linux (x86_64-pc-linux-gnu)
      Ubuntu 18.04.4 LTS
  uname: Linux 5.0.0-1035-azure #37-Ubuntu SMP Wed Mar 18 11:21:35 UTC 2020 x86_64 x86_64
  CPU: Intel(R) Xeon(R) CPU E5-2673 v4 @ 2.30GHz: 
              speed         user         nice          sys         idle          irq
       #1  2294 MHz      49682 s          0 s       2619 s      15628 s          0 s
       #2  2294 MHz      49244 s          0 s       2466 s      17204 s          0 s
       
  Memory: 6.782924652099609 GB (3260.45703125 MB free)
  Uptime: 712.0 sec
  Load Avg:  1.58935546875  1.4501953125  0.8681640625
  WORD_SIZE: 64
  LIBM: libopenlibm
  LLVM: libLLVM-8.0.1 (ORCJIT, broadwell)

Baseline result

Benchmark Report for /home/runner/work/Transducers.jl/Transducers.jl

Job Properties

  • Time of benchmark: 11 May 2020 - 06:42
  • Package commit: 62ad9f
  • Julia commit: 381693
  • Julia command flags: None
  • Environment variables: JULIA_NUM_THREADS => 2

Results

Below is a table of this job's results, obtained by running the benchmarks.
The values listed in the ID column have the structure [parent_group, child_group, ..., key], and can be used to
index into the BaseBenchmarks suite to retrieve the corresponding benchmarks.
The percentages accompanying time and memory values in the below table are noise tolerances. The "true"
time/memory value for a given benchmark is expected to fall within this percentage of the reported value.
An empty cell means that the value was zero.

ID time GC time memory allocations
["collect", "assoc", "basesize=1"] 320.261 ms (5%) 11.040 ms 87.05 MiB (1%) 1557946
["collect", "assoc", "basesize=1024"] 149.555 ms (5%) 1.84 MiB (1%) 1776
["collect", "assoc", "basesize=32"] 167.123 ms (5%) 5.63 MiB (1%) 52990
["collect", "seq"] 292.808 ms (5%) 512.98 KiB (1%) 22
["collect", "unordered", "basesize=1"] 456.260 ms (5%) 29.19 MiB (1%) 405508
["collect", "unordered", "basesize=1024"] 250.204 ms (5%) 888.23 KiB (1%) 9939
["collect", "unordered", "basesize=32"] 193.593 ms (5%) 1.49 MiB (1%) 18226
["findfirst", "n=1000", "foldl"] 501.690 ms (5%)
["findfirst", "n=1000", "reduce", "basesize=128"] 280.594 ms (5%) 564.14 KiB (1%) 10236
["findfirst", "n=1000", "reduce", "basesize=256"] 277.715 ms (5%) 287.19 KiB (1%) 5222
["findfirst", "n=1000", "reduce", "basesize=512"] 262.955 ms (5%) 149.23 KiB (1%) 2720
["findfirst", "n=400", "foldl"] 400.145 ms (5%)
["findfirst", "n=400", "reduce", "basesize=128"] 200.806 ms (5%) 1.02 MiB (1%) 18992
["findfirst", "n=400", "reduce", "basesize=256"] 205.911 ms (5%) 526.27 KiB (1%) 9581
["findfirst", "n=400", "reduce", "basesize=512"] 208.776 ms (5%) 267.20 KiB (1%) 4878
["findfirst", "n=500", "foldl"] 63.059 ms (5%)
["findfirst", "n=500", "reduce", "basesize=128"] 35.487 ms (5%) 157.30 KiB (1%) 2848
["findfirst", "n=500", "reduce", "basesize=256"] 35.224 ms (5%) 84.38 KiB (1%) 1526
["findfirst", "n=500", "reduce", "basesize=512"] 36.668 ms (5%) 48.13 KiB (1%) 871
["overhead", "default"] 175.001 μs (5%) 146.16 KiB (1%) 2629
["overhead", "stoppable=false"] 171.401 μs (5%) 146.16 KiB (1%) 2629
["overhead", "stoppable=true"] 316.802 μs (5%) 146.41 KiB (1%) 2645
["parallel_histogram", "assoc", "basesize=16384"] 3.437 ms (5%) 732.06 KiB (1%) 103
["parallel_histogram", "assoc", "basesize=4096"] 4.072 ms (5%) 2.07 MiB (1%) 503
["parallel_histogram", "assoc", "basesize=8192"] 3.899 ms (5%) 1.43 MiB (1%) 242
["parallel_histogram", "comm", "basesize=16384"] 11.875 ms (5%) 1.22 MiB (1%) 155
["parallel_histogram", "comm", "basesize=4096"] 20.405 ms (5%) 1.02 MiB (1%) 3474
["parallel_histogram", "comm", "basesize=8192"] 16.580 ms (5%) 1.23 MiB (1%) 837
["parallel_histogram", "seq"] 6.361 ms (5%) 364.63 KiB (1%) 25
["sum", "random", "foldl"] 12.305 ms (5%)
["sum", "random", "reduce", "basesize=128"] 6.569 ms (5%) 313.31 KiB (1%) 6066
["sum", "random", "reduce", "basesize=256"] 6.224 ms (5%) 155.11 KiB (1%) 3012
["sum", "random", "reduce", "basesize=512"] 6.136 ms (5%) 76.25 KiB (1%) 1484
["sum", "uniform", "foldl"] 11.206 ms (5%)
["sum", "uniform", "reduce", "basesize=128"] 6.321 ms (5%) 313.36 KiB (1%) 6069
["sum", "uniform", "reduce", "basesize=256"] 6.060 ms (5%) 155.08 KiB (1%) 3010
["sum", "uniform", "reduce", "basesize=512"] 5.969 ms (5%) 76.25 KiB (1%) 1484
["sum", "valley", "foldl"] 11.625 ms (5%)
["sum", "valley", "reduce", "basesize=128"] 6.248 ms (5%) 313.23 KiB (1%) 6061
["sum", "valley", "reduce", "basesize=256"] 5.930 ms (5%) 155.08 KiB (1%) 3010
["sum", "valley", "reduce", "basesize=512"] 5.868 ms (5%) 76.27 KiB (1%) 1485
["words", "nthreads=1"] 40.947 ms (5%) 7.248 ms 64.75 MiB (1%) 2095946
["words", "nthreads=2"] 20.728 ms (5%) 65.47 MiB (1%) 2096099
["words", "nthreads=4"] 19.911 ms (5%) 65.92 MiB (1%) 2096246

Benchmark Group List

Here's a list of all the benchmark groups executed by this job:

  • ["collect", "assoc"]
  • ["collect"]
  • ["collect", "unordered"]
  • ["findfirst", "n=1000"]
  • ["findfirst", "n=1000", "reduce"]
  • ["findfirst", "n=400"]
  • ["findfirst", "n=400", "reduce"]
  • ["findfirst", "n=500"]
  • ["findfirst", "n=500", "reduce"]
  • ["overhead"]
  • ["parallel_histogram", "assoc"]
  • ["parallel_histogram", "comm"]
  • ["parallel_histogram"]
  • ["sum", "random"]
  • ["sum", "random", "reduce"]
  • ["sum", "uniform"]
  • ["sum", "uniform", "reduce"]
  • ["sum", "valley"]
  • ["sum", "valley", "reduce"]
  • ["words"]

Julia versioninfo

Julia Version 1.4.1
Commit 381693d3df* (2020-04-14 17:20 UTC)
Platform Info:
  OS: Linux (x86_64-pc-linux-gnu)
      Ubuntu 18.04.4 LTS
  uname: Linux 5.0.0-1035-azure #37-Ubuntu SMP Wed Mar 18 11:21:35 UTC 2020 x86_64 x86_64
  CPU: Intel(R) Xeon(R) CPU E5-2673 v4 @ 2.30GHz: 
              speed         user         nice          sys         idle          irq
       #1  2294 MHz      70911 s          0 s       3280 s      23286 s          0 s
       #2  2294 MHz      73248 s          0 s       3074 s      22162 s          0 s
       
  Memory: 6.782924652099609 GB (3253.99609375 MB free)
  Uptime: 1011.0 sec
  Load Avg:  1.74755859375  1.57177734375  1.08056640625
  WORD_SIZE: 64
  LIBM: libopenlibm
  LLVM: libLLVM-8.0.1 (ORCJIT, broadwell)

Runtime information

Runtime Info
BLAS #threads 2
BLAS.vendor() openblas64
Sys.CPU_THREADS 2

lscpu output:

Architecture:        x86_64
CPU op-mode(s):      32-bit, 64-bit
Byte Order:          Little Endian
CPU(s):              2
On-line CPU(s) list: 0,1
Thread(s) per core:  1
Core(s) per socket:  2
Socket(s):           1
NUMA node(s):        1
Vendor ID:           GenuineIntel
CPU family:          6
Model:               79
Model name:          Intel(R) Xeon(R) CPU E5-2673 v4 @ 2.30GHz
Stepping:            1
CPU MHz:             2294.682
BogoMIPS:            4589.36
Hypervisor vendor:   Microsoft
Virtualization type: full
L1d cache:           32K
L1i cache:           32K
L2 cache:            256K
L3 cache:            51200K
NUMA node0 CPU(s):   0,1
Flags:               fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc rep_good nopl xtopology cpuid pni pclmulqdq ssse3 fma cx16 pcid sse4_1 sse4_2 movbe popcnt aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase bmi1 hle avx2 smep bmi2 erms invpcid rtm rdseed adx smap xsaveopt md_clear
Cpu Property Value
Brand Intel(R) Xeon(R) CPU E5-2673 v4 @ 2.30GHz
Vendor :Intel
Architecture :Broadwell
Model Family: 0x06, Model: 0x4f, Stepping: 0x01, Type: 0x00
Cores 2 physical cores, 2 logical cores (on executing CPU)
No Hyperthreading detected
Clock Frequencies Not supported by CPU
Data Cache Level 1:3 : (32, 256, 51200) kbytes
64 byte cache line size
Address Size 48 bits virtual, 44 bits physical
SIMD 256 bit = 32 byte max. SIMD vector size
Time Stamp Counter TSC is accessible via rdtsc
TSC increased at every clock cycle (non-invariant TSC)
Perf. Monitoring Performance Monitoring Counters (PMC) are not supported
Hypervisor Yes, Microsoft

@github-actions
Contributor

Benchmark result

Judge result

Benchmark Report for /home/runner/work/Transducers.jl/Transducers.jl

Job Properties

  • Time of benchmarks:
    • Target: 11 May 2020 - 07:14
    • Baseline: 11 May 2020 - 07:17
  • Package commits:
    • Target: 789c6a
    • Baseline: 62ad9f
  • Julia commits:
    • Target: 381693
    • Baseline: 381693
  • Julia command flags:
    • Target: None
    • Baseline: None
  • Environment variables:
    • Target: OMP_NUM_THREADS => 1 JULIA_NUM_THREADS => 1
    • Baseline: OMP_NUM_THREADS => 1 JULIA_NUM_THREADS => 1

Results

A ratio greater than 1.0 denotes a possible regression (marked with ❌), while a ratio less
than 1.0 denotes a possible improvement (marked with ✅). Only significant results - results
that indicate possible regressions or improvements - are shown below (thus, an empty table means that all
benchmark results remained invariant between builds).

| ID | time ratio | memory ratio |
|----|------------|--------------|
| ["cat", "base"] | 1.09 (5%) ❌ | 1.00 (1%) |
| ["cat", "xf"] | 1.15 (5%) ❌ | 1.00 (1%) |
| ["collect", "filter-missing"] | 1.11 (5%) ❌ | 1.00 (1%) |
| ["collect", "identity-float"] | 1.18 (5%) ❌ | 1.00 (1%) |
| ["collect", "identity-union"] | 0.88 (5%) ✅ | 1.00 (1%) |
| ["dot", "blas"] | 1.07 (5%) ❌ | 1.00 (1%) |
| ["dot", "rf"] | 1.11 (5%) ❌ | 1.00 (1%) |
| ["filter_map_map!", "man"] | 0.88 (5%) ✅ | 1.00 (1%) |
| ["filter_map_reduce", "man"] | 1.23 (5%) ❌ | 1.00 (1%) |
| ["filter_map_reduce", "xf"] | 1.15 (5%) ❌ | 1.00 (1%) |
| ["findall", "base"] | 1.21 (5%) ❌ | 1.00 (1%) |
| ["findall", "xf-array"] | 1.29 (5%) ❌ | 1.00 (1%) |
| ["findall", "xf-iter"] | 1.22 (5%) ❌ | 1.00 (1%) |
| ["gemm", "fusedmul", "blas", "32"] | 1.18 (5%) ❌ | 1.00 (1%) |
| ["gemm", "fusedmul", "blas", "8"] | 1.09 (5%) ❌ | 1.00 (1%) |
| ["gemm", "fusedmul", "xf", "16"] | 1.11 (5%) ❌ | 1.00 (1%) |
| ["gemm", "fusedmul", "xf", "32"] | 1.09 (5%) ❌ | 1.00 (1%) |
| ["gemm", "fusedmul", "xf", "8"] | 1.22 (5%) ❌ | 1.00 (1%) |
| ["gemm", "mul", "linalg", "256"] | 1.20 (5%) ❌ | 1.00 (1%) |
| ["gemm", "mul", "linalg", "32"] | 0.91 (5%) ✅ | 1.00 (1%) |
| ["gemm", "mul", "linalg", "8"] | 0.94 (5%) ✅ | 1.00 (1%) |
| ["gemm", "mul", "man", "false", "256"] | 1.08 (5%) ❌ | 1.00 (1%) |
| ["gemm", "mul", "man", "false", "32"] | 1.17 (5%) ❌ | 1.00 (1%) |
| ["gemm", "mul", "man", "false", "8"] | 0.93 (5%) ✅ | 1.00 (1%) |
| ["gemm", "mul", "man", "ivdep", "256"] | 1.11 (5%) ❌ | 1.00 (1%) |
| ["gemm", "mul", "man", "ivdep", "32"] | 1.09 (5%) ❌ | 1.00 (1%) |
| ["gemm", "mul", "man", "ivdep", "8"] | 1.07 (5%) ❌ | 1.00 (1%) |
| ["gemm", "mul", "man", "true", "32"] | 1.13 (5%) ❌ | 1.00 (1%) |
| ["gemm", "mul", "xf", "false", "256"] | 1.21 (5%) ❌ | 1.00 (1%) |
| ["gemm", "mul", "xf", "false", "8"] | 1.15 (5%) ❌ | 1.00 (1%) |
| ["gemm", "mul", "xf", "ivdep", "8"] | 1.19 (5%) ❌ | 1.00 (1%) |
| ["gemm", "mul", "xf", "true", "8"] | 0.86 (5%) ✅ | 1.00 (1%) |
| ["missing_argmax", "man"] | 1.08 (5%) ❌ | 1.00 (1%) |
| ["missing_argmax", "rf"] | 1.07 (5%) ❌ | 1.00 (1%) |
| ["missing_argmax", "xf"] | 1.08 (5%) ❌ | 1.00 (1%) |
| ["missing_dot", "naive"] | 1.15 (5%) ❌ | 1.00 (1%) |
| ["missing_dot", "rf"] | 1.23 (5%) ❌ | 1.00 (1%) |
| ["missing_dot", "rf_nota"] | 1.10 (5%) ❌ | 1.00 (1%) |
| ["missing_dot", "xf"] | 1.06 (5%) ❌ | 1.00 (1%) |
| ["missing_dot", "xf_nota"] | 1.19 (5%) ❌ | 0.99 (1%) |
| ["partition_by", "xf"] | 1.23 (5%) ❌ | 1.00 (1%) |

Benchmark Group List

Here's a list of all the benchmark groups executed by this job:

  • ["cat"]
  • ["collect"]
  • ["dot"]
  • ["filter_map_map!"]
  • ["filter_map_reduce"]
  • ["findall"]
  • ["gemm", "fusedmul", "blas"]
  • ["gemm", "fusedmul", "xf"]
  • ["gemm", "mul", "linalg"]
  • ["gemm", "mul", "man", "false"]
  • ["gemm", "mul", "man", "ivdep"]
  • ["gemm", "mul", "man", "true"]
  • ["gemm", "mul", "xf", "false"]
  • ["gemm", "mul", "xf", "ivdep"]
  • ["gemm", "mul", "xf", "true"]
  • ["missing_argmax"]
  • ["missing_dot"]
  • ["partition_by"]

Julia versioninfo

Target

Julia Version 1.4.1
Commit 381693d3df* (2020-04-14 17:20 UTC)
Platform Info:
  OS: Linux (x86_64-pc-linux-gnu)
      Ubuntu 18.04.4 LTS
  uname: Linux 5.0.0-1035-azure #37-Ubuntu SMP Wed Mar 18 11:21:35 UTC 2020 x86_64 x86_64
  CPU: Intel(R) Xeon(R) Platinum 8171M CPU @ 2.60GHz: 
              speed         user         nice          sys         idle          irq
       #1  2095 MHz       8855 s          0 s       1366 s      53244 s          0 s
       #2  2095 MHz      45101 s          0 s       1551 s      17677 s          0 s
       
  Memory: 6.782737731933594 GB (3204.9453125 MB free)
  Uptime: 655.0 sec
  Load Avg:  1.078125  1.0  0.61669921875
  WORD_SIZE: 64
  LIBM: libopenlibm
  LLVM: libLLVM-8.0.1 (ORCJIT, skylake)

Baseline

Julia Version 1.4.1
Commit 381693d3df* (2020-04-14 17:20 UTC)
Platform Info:
  OS: Linux (x86_64-pc-linux-gnu)
      Ubuntu 18.04.4 LTS
  uname: Linux 5.0.0-1035-azure #37-Ubuntu SMP Wed Mar 18 11:21:35 UTC 2020 x86_64 x86_64
  CPU: Intel(R) Xeon(R) Platinum 8171M CPU @ 2.60GHz: 
              speed         user         nice          sys         idle          irq
       #1  2095 MHz      17082 s          0 s       1466 s      67816 s          0 s
       #2  2095 MHz      59666 s          0 s       1826 s      25722 s          0 s
       
  Memory: 6.782737731933594 GB (3129.375 MB free)
  Uptime: 885.0 sec
  Load Avg:  1.08349609375  1.044921875  0.7353515625
  WORD_SIZE: 64
  LIBM: libopenlibm
  LLVM: libLLVM-8.0.1 (ORCJIT, skylake)

Target result

Benchmark Report for /home/runner/work/Transducers.jl/Transducers.jl

Job Properties

  • Time of benchmark: 11 May 2020 - 7:14
  • Package commit: 789c6a
  • Julia commit: 381693
  • Julia command flags: None
  • Environment variables: OMP_NUM_THREADS => 1 JULIA_NUM_THREADS => 1

Results

Below is a table of this job's results, obtained by running the benchmarks.
The values listed in the ID column have the structure [parent_group, child_group, ..., key], and can be used to
index into the BaseBenchmarks suite to retrieve the corresponding benchmarks.
The percentages accompanying time and memory values in the below table are noise tolerances. The "true"
time/memory value for a given benchmark is expected to fall within this percentage of the reported value.
An empty cell means that the value was zero.

ID time GC time memory allocations
["cat", "base"] 2.089 μs (5%)
["cat", "xf"] 1.820 μs (5%)
["collect", "filter-missing"] 104.006 μs (5%) 33.03 KiB (1%) 19
["collect", "identity-float"] 81.505 μs (5%) 256.89 KiB (1%) 19
["collect", "identity-union"] 326.420 μs (5%) 285.58 KiB (1%) 6678
["dot", "blas"] 1.440 μs (5%)
["dot", "man"] 1.310 μs (5%)
["dot", "rf"] 2.411 μs (5%)
["dot", "xf"] 2.400 μs (5%)
["filter_map_map!", "man"] 60.303 μs (5%)
["filter_map_map!", "xf"] 69.103 μs (5%) 144 bytes (1%) 8
["filter_map_reduce", "man"] 276.019 μs (5%)
["filter_map_reduce", "xf"] 255.919 μs (5%)
["findall", "base"] 938.379 μs (5%) 2.00 MiB (1%) 21
["findall", "xf-array"] 784.764 μs (5%) 3.05 MiB (1%) 100014
["findall", "xf-iter"] 986.176 μs (5%) 2.00 MiB (1%) 28
["gemm", "fusedmul", "blas", "16"] 5.034 ms (5%)
["gemm", "fusedmul", "blas", "2"] 3.624 ms (5%)
["gemm", "fusedmul", "blas", "32"] 7.173 ms (5%)
["gemm", "fusedmul", "blas", "8"] 4.034 ms (5%)
["gemm", "fusedmul", "xf", "16"] 4.882 ms (5%) 160 bytes (1%) 6
["gemm", "fusedmul", "xf", "2"] 563.323 μs (5%) 160 bytes (1%) 6
["gemm", "fusedmul", "xf", "32"] 8.731 ms (5%) 160 bytes (1%) 6
["gemm", "fusedmul", "xf", "8"] 2.390 ms (5%) 160 bytes (1%) 6
["gemm", "mul", "linalg", "256"] 1.002 ms (5%)
["gemm", "mul", "linalg", "32"] 3.363 μs (5%)
["gemm", "mul", "linalg", "8"] 283.053 ns (5%)
["gemm", "mul", "man", "false", "256"] 1.924 ms (5%)
["gemm", "mul", "man", "false", "32"] 5.614 μs (5%)
["gemm", "mul", "man", "false", "8"] 463.980 ns (5%)
["gemm", "mul", "man", "ivdep", "256"] 1.970 ms (5%)
["gemm", "mul", "man", "ivdep", "32"] 4.563 μs (5%)
["gemm", "mul", "man", "ivdep", "8"] 429.166 ns (5%)
["gemm", "mul", "man", "true", "256"] 1.831 ms (5%)
["gemm", "mul", "man", "true", "32"] 5.515 μs (5%)
["gemm", "mul", "man", "true", "8"] 487.965 ns (5%)
["gemm", "mul", "xf", "false", "256"] 1.910 ms (5%) 48 bytes (1%) 2
["gemm", "mul", "xf", "false", "32"] 5.150 μs (5%) 48 bytes (1%) 2
["gemm", "mul", "xf", "false", "8"] 461.327 ns (5%) 48 bytes (1%) 2
["gemm", "mul", "xf", "ivdep", "256"] 1.837 ms (5%) 48 bytes (1%) 2
["gemm", "mul", "xf", "ivdep", "32"] 4.086 μs (5%) 48 bytes (1%) 2
["gemm", "mul", "xf", "ivdep", "8"] 476.046 ns (5%) 48 bytes (1%) 2
["gemm", "mul", "xf", "true", "256"] 1.893 ms (5%) 48 bytes (1%) 2
["gemm", "mul", "xf", "true", "32"] 5.229 μs (5%) 48 bytes (1%) 2
["gemm", "mul", "xf", "true", "8"] 429.171 ns (5%) 48 bytes (1%) 2
["missing_argmax", "man"] 3.225 μs (5%) 32 bytes (1%) 1
["missing_argmax", "rf"] 2.978 μs (5%) 32 bytes (1%) 1
["missing_argmax", "xf"] 2.978 μs (5%) 32 bytes (1%) 1
["missing_dot", "equiv"] 1.620 μs (5%) 16 bytes (1%) 1
["missing_dot", "man"] 1.570 μs (5%) 16 bytes (1%) 1
["missing_dot", "naive"] 5.258 μs (5%) 16 bytes (1%) 1
["missing_dot", "rf"] 1.620 μs (5%) 16 bytes (1%) 1
["missing_dot", "rf_nota"] 1.590 μs (5%) 16 bytes (1%) 1
["missing_dot", "xf"] 206.114 μs (5%) 71.86 KiB (1%) 3731
["missing_dot", "xf_nota"] 234.916 μs (5%) 71.72 KiB (1%) 3724
["partition_by", "man"] 2.087 ms (5%) 352 bytes (1%) 4
["partition_by", "xf"] 2.245 ms (5%) 576 bytes (1%) 7
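
As a rough illustration of how an ID from the table above addresses a single benchmark, the sketch below models the suite as nested dictionaries and walks the ID one key at a time. The nested `Dict` and the `lookup` helper are stand-ins invented for this example, not the benchmark suite's actual data structure; the stored strings are times copied from the table.

```julia
# Hypothetical stand-in for a nested benchmark suite: each element of an ID
# is one dictionary key, and the leaf holds the reported time.
suite = Dict{String,Any}(
    "cat" => Dict{String,Any}(
        "base" => "2.089 μs",
        "xf" => "1.820 μs",
    ),
    "findall" => Dict{String,Any}(
        "xf-array" => "784.764 μs",
    ),
)

# Walk the ID [parent_group, child_group, ..., key] from the outermost
# group inward, indexing one level per element.
lookup(suite, id) = foldl(getindex, id; init = suite)

println(lookup(suite, ["cat", "base"]))
println(lookup(suite, ["findall", "xf-array"]))
```

An ID that names only a group, such as `["cat"]`, returns the whole inner dictionary in this sketch rather than a single result.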

Benchmark Group List

Here's a list of all the benchmark groups executed by this job:

  • ["cat"]
  • ["collect"]
  • ["dot"]
  • ["filter_map_map!"]
  • ["filter_map_reduce"]
  • ["findall"]
  • ["gemm", "fusedmul", "blas"]
  • ["gemm", "fusedmul", "xf"]
  • ["gemm", "mul", "linalg"]
  • ["gemm", "mul", "man", "false"]
  • ["gemm", "mul", "man", "ivdep"]
  • ["gemm", "mul", "man", "true"]
  • ["gemm", "mul", "xf", "false"]
  • ["gemm", "mul", "xf", "ivdep"]
  • ["gemm", "mul", "xf", "true"]
  • ["missing_argmax"]
  • ["missing_dot"]
  • ["partition_by"]

Julia versioninfo

Julia Version 1.4.1
Commit 381693d3df* (2020-04-14 17:20 UTC)
Platform Info:
  OS: Linux (x86_64-pc-linux-gnu)
      Ubuntu 18.04.4 LTS
  uname: Linux 5.0.0-1035-azure #37-Ubuntu SMP Wed Mar 18 11:21:35 UTC 2020 x86_64 x86_64
  CPU: Intel(R) Xeon(R) Platinum 8171M CPU @ 2.60GHz: 
              speed         user         nice          sys         idle          irq
       #1  2095 MHz       8855 s          0 s       1366 s      53244 s          0 s
       #2  2095 MHz      45101 s          0 s       1551 s      17677 s          0 s
       
  Memory: 6.782737731933594 GB (3204.9453125 MB free)
  Uptime: 655.0 sec
  Load Avg:  1.078125  1.0  0.61669921875
  WORD_SIZE: 64
  LIBM: libopenlibm
  LLVM: libLLVM-8.0.1 (ORCJIT, skylake)

Baseline result

Benchmark Report for /home/runner/work/Transducers.jl/Transducers.jl

Job Properties

  • Time of benchmark: 11 May 2020 - 7:17
  • Package commit: 62ad9f
  • Julia commit: 381693
  • Julia command flags: None
  • Environment variables: OMP_NUM_THREADS => 1 JULIA_NUM_THREADS => 1

Results

Below is a table of this job's results, obtained by running the benchmarks.
The values listed in the ID column have the structure [parent_group, child_group, ..., key], and can be used to
index into the BaseBenchmarks suite to retrieve the corresponding benchmarks.
The percentages accompanying time and memory values in the below table are noise tolerances. The "true"
time/memory value for a given benchmark is expected to fall within this percentage of the reported value.
An empty cell means that the value was zero.

ID time GC time memory allocations
["cat", "base"] 1.922 μs (5%)
["cat", "xf"] 1.580 μs (5%)
["collect", "filter-missing"] 93.403 μs (5%) 33.03 KiB (1%) 19
["collect", "identity-float"] 69.102 μs (5%) 256.89 KiB (1%) 19
["collect", "identity-union"] 371.911 μs (5%) 285.23 KiB (1%) 6675
["dot", "blas"] 1.350 μs (5%)
["dot", "man"] 1.300 μs (5%)
["dot", "rf"] 2.167 μs (5%)
["dot", "xf"] 2.389 μs (5%)
["filter_map_map!", "man"] 68.502 μs (5%)
["filter_map_map!", "xf"] 71.802 μs (5%) 144 bytes (1%) 8
["filter_map_reduce", "man"] 224.007 μs (5%)
["filter_map_reduce", "xf"] 223.407 μs (5%)
["findall", "base"] 776.126 μs (5%) 2.00 MiB (1%) 21
["findall", "xf-array"] 606.921 μs (5%) 3.05 MiB (1%) 100014
["findall", "xf-iter"] 805.827 μs (5%) 2.00 MiB (1%) 28
["gemm", "fusedmul", "blas", "16"] 4.822 ms (5%)
["gemm", "fusedmul", "blas", "2"] 3.602 ms (5%)
["gemm", "fusedmul", "blas", "32"] 6.103 ms (5%)
["gemm", "fusedmul", "blas", "8"] 3.693 ms (5%)
["gemm", "fusedmul", "xf", "16"] 4.386 ms (5%) 160 bytes (1%) 6
["gemm", "fusedmul", "xf", "2"] 576.116 μs (5%) 160 bytes (1%) 6
["gemm", "fusedmul", "xf", "32"] 8.037 ms (5%) 160 bytes (1%) 6
["gemm", "fusedmul", "xf", "8"] 1.958 ms (5%) 160 bytes (1%) 6
["gemm", "mul", "linalg", "256"] 835.326 μs (5%)
["gemm", "mul", "linalg", "32"] 3.700 μs (5%)
["gemm", "mul", "linalg", "8"] 300.000 ns (5%)
["gemm", "mul", "man", "false", "256"] 1.782 ms (5%)
["gemm", "mul", "man", "false", "32"] 4.800 μs (5%)
["gemm", "mul", "man", "false", "8"] 500.000 ns (5%)
["gemm", "mul", "man", "ivdep", "256"] 1.773 ms (5%)
["gemm", "mul", "man", "ivdep", "32"] 4.200 μs (5%)
["gemm", "mul", "man", "ivdep", "8"] 400.000 ns (5%)
["gemm", "mul", "man", "true", "256"] 1.780 ms (5%)
["gemm", "mul", "man", "true", "32"] 4.900 μs (5%)
["gemm", "mul", "man", "true", "8"] 500.000 ns (5%)
["gemm", "mul", "xf", "false", "256"] 1.574 ms (5%) 48 bytes (1%) 2
["gemm", "mul", "xf", "false", "32"] 5.200 μs (5%) 48 bytes (1%) 2
["gemm", "mul", "xf", "false", "8"] 400.000 ns (5%) 48 bytes (1%) 2
["gemm", "mul", "xf", "ivdep", "256"] 1.785 ms (5%) 48 bytes (1%) 2
["gemm", "mul", "xf", "ivdep", "32"] 4.100 μs (5%) 48 bytes (1%) 2
["gemm", "mul", "xf", "ivdep", "8"] 400.000 ns (5%) 48 bytes (1%) 2
["gemm", "mul", "xf", "true", "256"] 1.832 ms (5%) 48 bytes (1%) 2
["gemm", "mul", "xf", "true", "32"] 5.100 μs (5%) 48 bytes (1%) 2
["gemm", "mul", "xf", "true", "8"] 500.000 ns (5%) 48 bytes (1%) 2
["missing_argmax", "man"] 2.975 μs (5%) 32 bytes (1%) 1
["missing_argmax", "rf"] 2.778 μs (5%) 32 bytes (1%) 1
["missing_argmax", "xf"] 2.756 μs (5%) 32 bytes (1%) 1
["missing_dot", "equiv"] 1.590 μs (5%) 16 bytes (1%) 1
["missing_dot", "man"] 1.510 μs (5%) 16 bytes (1%) 1
["missing_dot", "naive"] 4.586 μs (5%) 16 bytes (1%) 1
["missing_dot", "rf"] 1.320 μs (5%) 16 bytes (1%) 1
["missing_dot", "rf_nota"] 1.440 μs (5%) 16 bytes (1%) 1
["missing_dot", "xf"] 194.806 μs (5%) 72.17 KiB (1%) 3744
["missing_dot", "xf_nota"] 196.906 μs (5%) 72.33 KiB (1%) 3750
["partition_by", "man"] 2.148 ms (5%) 352 bytes (1%) 4
["partition_by", "xf"] 1.821 ms (5%) 576 bytes (1%) 7
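
The time ratios in the judge report for this job can be cross-checked against the target and baseline tables. The sketch below assumes the ratio is simply the target time divided by the baseline time, rounded to two decimals; the `target`, `baseline`, and `time_ratio` names are made up for this example, and the numbers are copied from the two result tables (all in μs).

```julia
# Times in μs, copied from the target (789c6a) and baseline (62ad9f) tables.
target = Dict(
    ["cat", "base"] => 2.089,
    ["findall", "xf-array"] => 784.764,
    ["collect", "identity-union"] => 326.420,
)
baseline = Dict(
    ["cat", "base"] => 1.922,
    ["findall", "xf-array"] => 606.921,
    ["collect", "identity-union"] => 371.911,
)

# Assumption: time ratio = target time / baseline time, shown to two decimals.
time_ratio(id) = round(target[id] / baseline[id]; digits = 2)

for id in (["cat", "base"], ["findall", "xf-array"], ["collect", "identity-union"])
    println(id, " => ", time_ratio(id))
end
```

Under that assumption the three values come out as 1.09, 1.29, and 0.88, which agree with the corresponding rows of the judge table.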

Benchmark Group List

Here's a list of all the benchmark groups executed by this job:

  • ["cat"]
  • ["collect"]
  • ["dot"]
  • ["filter_map_map!"]
  • ["filter_map_reduce"]
  • ["findall"]
  • ["gemm", "fusedmul", "blas"]
  • ["gemm", "fusedmul", "xf"]
  • ["gemm", "mul", "linalg"]
  • ["gemm", "mul", "man", "false"]
  • ["gemm", "mul", "man", "ivdep"]
  • ["gemm", "mul", "man", "true"]
  • ["gemm", "mul", "xf", "false"]
  • ["gemm", "mul", "xf", "ivdep"]
  • ["gemm", "mul", "xf", "true"]
  • ["missing_argmax"]
  • ["missing_dot"]
  • ["partition_by"]

Julia versioninfo

Julia Version 1.4.1
Commit 381693d3df* (2020-04-14 17:20 UTC)
Platform Info:
  OS: Linux (x86_64-pc-linux-gnu)
      Ubuntu 18.04.4 LTS
  uname: Linux 5.0.0-1035-azure #37-Ubuntu SMP Wed Mar 18 11:21:35 UTC 2020 x86_64 x86_64
  CPU: Intel(R) Xeon(R) Platinum 8171M CPU @ 2.60GHz: 
              speed         user         nice          sys         idle          irq
       #1  2095 MHz      17082 s          0 s       1466 s      67816 s          0 s
       #2  2095 MHz      59666 s          0 s       1826 s      25722 s          0 s
       
  Memory: 6.782737731933594 GB (3129.375 MB free)
  Uptime: 885.0 sec
  Load Avg:  1.08349609375  1.044921875  0.7353515625
  WORD_SIZE: 64
  LIBM: libopenlibm
  LLVM: libLLVM-8.0.1 (ORCJIT, skylake)

Runtime information

Runtime Info
BLAS #threads 2
BLAS.vendor() openblas64
Sys.CPU_THREADS 2

lscpu output:

Architecture:        x86_64
CPU op-mode(s):      32-bit, 64-bit
Byte Order:          Little Endian
CPU(s):              2
On-line CPU(s) list: 0,1
Thread(s) per core:  1
Core(s) per socket:  2
Socket(s):           1
NUMA node(s):        1
Vendor ID:           GenuineIntel
CPU family:          6
Model:               85
Model name:          Intel(R) Xeon(R) Platinum 8171M CPU @ 2.60GHz
Stepping:            4
CPU MHz:             2095.192
BogoMIPS:            4190.38
Hypervisor vendor:   Microsoft
Virtualization type: full
L1d cache:           32K
L1i cache:           32K
L2 cache:            1024K
L3 cache:            36608K
NUMA node0 CPU(s):   0,1
Flags:               fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc rep_good nopl xtopology cpuid pni pclmulqdq ssse3 fma cx16 pcid sse4_1 sse4_2 movbe popcnt aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase bmi1 hle avx2 smep bmi2 erms invpcid rtm mpx avx512f avx512dq rdseed adx smap clflushopt avx512cd avx512bw avx512vl xsaveopt xsavec xsaves
Cpu Property Value
Brand Intel(R) Xeon(R) Platinum 8171M CPU @ 2.60GHz
Vendor :Intel
Architecture :Skylake
Model Family: 0x06, Model: 0x55, Stepping: 0x04, Type: 0x00
Cores 2 physical cores, 2 logical cores (on executing CPU)
No Hyperthreading detected
Clock Frequencies Not supported by CPU
Data Cache Level 1:3 : (32, 1024, 36608) kbytes
64 byte cache line size
Address Size 48 bits virtual, 44 bits physical
SIMD 512 bit = 64 byte max. SIMD vector size
Time Stamp Counter TSC is accessible via rdtsc
TSC increased at every clock cycle (non-invariant TSC)
Perf. Monitoring Performance Monitoring Counters (PMC) are not supported
Hypervisor Yes, Microsoft

@github-actions
Contributor

Multi-thread benchmark result

Judge result

Benchmark Report for /home/runner/work/Transducers.jl/Transducers.jl

Job Properties

  • Time of benchmarks:
    • Target: 11 May 2020 - 07:16
    • Baseline: 11 May 2020 - 07:21
  • Package commits:
    • Target: 789c6a
    • Baseline: 62ad9f
  • Julia commits:
    • Target: 381693
    • Baseline: 381693
  • Julia command flags:
    • Target: None
    • Baseline: None
  • Environment variables:
    • Target: JULIA_NUM_THREADS => 2
    • Baseline: JULIA_NUM_THREADS => 2

Results

A ratio greater than 1.0 denotes a possible regression (marked with ❌), while a ratio less
than 1.0 denotes a possible improvement (marked with ✅). Only significant results - results
that indicate possible regressions or improvements - are shown below (thus, an empty table means that all
benchmark results remained invariant between builds).

| ID | time ratio | memory ratio |
|----|------------|--------------|
| ["collect", "unordered", "basesize=1024"] | 1.06 (5%) ❌ | 1.07 (1%) ❌ |
| ["findfirst", "n=500", "reduce", "basesize=128"] | 0.95 (5%) ✅ | 1.00 (1%) |
| ["overhead", "stoppable=false"] | 1.07 (5%) ❌ | 1.00 (1%) |
| ["parallel_histogram", "comm", "basesize=16384"] | 1.04 (5%) | 1.28 (1%) ❌ |
| ["parallel_histogram", "comm", "basesize=4096"] | 0.87 (5%) ✅ | 0.94 (1%) ✅ |
| ["parallel_histogram", "comm", "basesize=8192"] | 1.09 (5%) ❌ | 1.00 (1%) |
| ["parallel_histogram", "seq"] | 0.91 (5%) ✅ | 1.00 (1%) |
| ["sum", "random", "foldl"] | 0.89 (5%) ✅ | 1.00 (1%) |
| ["sum", "random", "reduce", "basesize=128"] | 0.86 (5%) ✅ | 1.00 (1%) |
| ["sum", "random", "reduce", "basesize=256"] | 1.09 (5%) ❌ | 1.00 (1%) |
| ["sum", "uniform", "foldl"] | 1.07 (5%) ❌ | 1.00 (1%) |
| ["sum", "uniform", "reduce", "basesize=256"] | 0.92 (5%) ✅ | 1.00 (1%) |
| ["sum", "valley", "foldl"] | 1.10 (5%) ❌ | 1.00 (1%) |
| ["sum", "valley", "reduce", "basesize=256"] | 1.07 (5%) ❌ | 1.00 (1%) |
| ["words", "nthreads=2"] | 1.06 (5%) ❌ | 1.00 (1%) |
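
The marking rule described above can be sketched as a small function. This is a minimal sketch that assumes the percentage in parentheses is applied as an absolute band around 1.0; `classify` is a hypothetical helper written for this example, not the benchmarking tool's API. The inputs are ratios taken from the table, which are displayed rounded, so rows that sit right at the edge of the band may classify differently here than in the report.

```julia
# Sketch of the rule: a ratio is significant only when it differs from 1.0
# by more than the noise tolerance (assumed to be an absolute band).
function classify(ratio::Real, tolerance::Real)
    if ratio - tolerance > 1.0
        return :regression      # shown with ❌ in the report
    elseif ratio + tolerance < 1.0
        return :improvement     # shown with ✅ in the report
    else
        return :invariant       # shown without a mark
    end
end

# ["parallel_histogram", "comm", "basesize=16384"]: time 1.04 (5%), memory 1.28 (1%)
println(classify(1.04, 0.05))
println(classify(1.28, 0.01))

# ["parallel_histogram", "comm", "basesize=4096"]: time 0.87 (5%)
println(classify(0.87, 0.05))
```

This matches how the `basesize=16384` row carries a ❌ only in the memory column: its time ratio stays inside the 5% band while its memory ratio falls well outside the 1% band.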

Benchmark Group List

Here's a list of all the benchmark groups executed by this job:

  • ["collect", "assoc"]
  • ["collect"]
  • ["collect", "unordered"]
  • ["findfirst", "n=1000"]
  • ["findfirst", "n=1000", "reduce"]
  • ["findfirst", "n=400"]
  • ["findfirst", "n=400", "reduce"]
  • ["findfirst", "n=500"]
  • ["findfirst", "n=500", "reduce"]
  • ["overhead"]
  • ["parallel_histogram", "assoc"]
  • ["parallel_histogram", "comm"]
  • ["parallel_histogram"]
  • ["sum", "random"]
  • ["sum", "random", "reduce"]
  • ["sum", "uniform"]
  • ["sum", "uniform", "reduce"]
  • ["sum", "valley"]
  • ["sum", "valley", "reduce"]
  • ["words"]

Julia versioninfo

Target

Julia Version 1.4.1
Commit 381693d3df* (2020-04-14 17:20 UTC)
Platform Info:
  OS: Linux (x86_64-pc-linux-gnu)
      Ubuntu 18.04.4 LTS
  uname: Linux 5.0.0-1035-azure #37-Ubuntu SMP Wed Mar 18 11:21:35 UTC 2020 x86_64 x86_64
  CPU: Intel(R) Xeon(R) CPU E5-2673 v4 @ 2.30GHz: 
              speed         user         nice          sys         idle          irq
       #1  2294 MHz      42282 s          0 s       2147 s      28157 s          0 s
       #2  2294 MHz      57016 s          0 s       2606 s      13406 s          0 s
       
  Memory: 6.782737731933594 GB (3214.56640625 MB free)
  Uptime: 748.0 sec
  Load Avg:  1.6171875  1.46728515625  0.908203125
  WORD_SIZE: 64
  LIBM: libopenlibm
  LLVM: libLLVM-8.0.1 (ORCJIT, broadwell)

Baseline

Julia Version 1.4.1
Commit 381693d3df* (2020-04-14 17:20 UTC)
Platform Info:
  OS: Linux (x86_64-pc-linux-gnu)
      Ubuntu 18.04.4 LTS
  uname: Linux 5.0.0-1035-azure #37-Ubuntu SMP Wed Mar 18 11:21:35 UTC 2020 x86_64 x86_64
  CPU: Intel(R) Xeon(R) CPU E5-2673 v4 @ 2.30GHz: 
              speed         user         nice          sys         idle          irq
       #1  2294 MHz      62648 s          0 s       2796 s      37471 s          0 s
       #2  2294 MHz      82734 s          0 s       3196 s      17416 s          0 s
       
  Memory: 6.782737731933594 GB (3201.85546875 MB free)
  Uptime: 1054.0 sec
  Load Avg:  1.83203125  1.6435546875  1.138671875
  WORD_SIZE: 64
  LIBM: libopenlibm
  LLVM: libLLVM-8.0.1 (ORCJIT, broadwell)

Target result

Benchmark Report for /home/runner/work/Transducers.jl/Transducers.jl

Job Properties

  • Time of benchmark: 11 May 2020 - 7:16
  • Package commit: 789c6a
  • Julia commit: 381693
  • Julia command flags: None
  • Environment variables: JULIA_NUM_THREADS => 2

Results

Below is a table of this job's results, obtained by running the benchmarks.
The values listed in the ID column have the structure [parent_group, child_group, ..., key], and can be used to
index into the BaseBenchmarks suite to retrieve the corresponding benchmarks.
The percentages accompanying time and memory values in the below table are noise tolerances. The "true"
time/memory value for a given benchmark is expected to fall within this percentage of the reported value.
An empty cell means that the value was zero.

ID time GC time memory allocations
["collect", "assoc", "basesize=1"] 347.489 ms (5%) 13.016 ms 87.05 MiB (1%) 1557894
["collect", "assoc", "basesize=1024"] 192.151 ms (5%) 1.84 MiB (1%) 1779
["collect", "assoc", "basesize=32"] 189.832 ms (5%) 5.63 MiB (1%) 52983
["collect", "seq"] 378.603 ms (5%) 512.98 KiB (1%) 22
["collect", "unordered", "basesize=1"] 457.796 ms (5%) 29.19 MiB (1%) 405406
["collect", "unordered", "basesize=1024"] 306.551 ms (5%) 966.55 KiB (1%) 14951
["collect", "unordered", "basesize=32"] 227.548 ms (5%) 1.50 MiB (1%) 18801
["findfirst", "n=1000", "foldl"] 602.722 ms (5%)
["findfirst", "n=1000", "reduce", "basesize=128"] 313.779 ms (5%) 563.94 KiB (1%) 10223
["findfirst", "n=1000", "reduce", "basesize=256"] 301.316 ms (5%) 287.08 KiB (1%) 5215
["findfirst", "n=1000", "reduce", "basesize=512"] 319.100 ms (5%) 149.14 KiB (1%) 2714
["findfirst", "n=400", "foldl"] 450.001 ms (5%)
["findfirst", "n=400", "reduce", "basesize=128"] 235.508 ms (5%) 1.02 MiB (1%) 18971
["findfirst", "n=400", "reduce", "basesize=256"] 234.422 ms (5%) 526.02 KiB (1%) 9565
["findfirst", "n=400", "reduce", "basesize=512"] 231.464 ms (5%) 266.97 KiB (1%) 4863
["findfirst", "n=500", "foldl"] 79.407 ms (5%)
["findfirst", "n=500", "reduce", "basesize=128"] 38.551 ms (5%) 157.17 KiB (1%) 2840
["findfirst", "n=500", "reduce", "basesize=256"] 37.575 ms (5%) 84.31 KiB (1%) 1522
["findfirst", "n=500", "reduce", "basesize=512"] 41.822 ms (5%) 48.11 KiB (1%) 870
["overhead", "default"] 192.702 μs (5%) 146.14 KiB (1%) 2628
["overhead", "stoppable=false"] 202.403 μs (5%) 146.13 KiB (1%) 2627
["overhead", "stoppable=true"] 334.204 μs (5%) 146.42 KiB (1%) 2646
["parallel_histogram", "assoc", "basesize=16384"] 4.163 ms (5%) 732.06 KiB (1%) 103
["parallel_histogram", "assoc", "basesize=4096"] 5.212 ms (5%) 1.80 MiB (1%) 497
["parallel_histogram", "assoc", "basesize=8192"] 4.631 ms (5%) 1.43 MiB (1%) 242
["parallel_histogram", "comm", "basesize=16384"] 13.780 ms (5%) 1.22 MiB (1%) 156
["parallel_histogram", "comm", "basesize=4096"] 18.421 ms (5%) 1005.52 KiB (1%) 675
["parallel_histogram", "comm", "basesize=8192"] 16.330 ms (5%) 1.23 MiB (1%) 518
["parallel_histogram", "seq"] 7.852 ms (5%) 364.63 KiB (1%) 25
["sum", "random", "foldl"] 13.144 ms (5%)
["sum", "random", "reduce", "basesize=128"] 6.480 ms (5%) 313.28 KiB (1%) 6064
["sum", "random", "reduce", "basesize=256"] 7.291 ms (5%) 155.08 KiB (1%) 3010
["sum", "random", "reduce", "basesize=512"] 7.171 ms (5%) 76.28 KiB (1%) 1486
["sum", "uniform", "foldl"] 13.779 ms (5%)
["sum", "uniform", "reduce", "basesize=128"] 7.181 ms (5%) 313.33 KiB (1%) 6067
["sum", "uniform", "reduce", "basesize=256"] 6.445 ms (5%) 155.09 KiB (1%) 3011
["sum", "uniform", "reduce", "basesize=512"] 6.607 ms (5%) 76.25 KiB (1%) 1484
["sum", "valley", "foldl"] 14.215 ms (5%)
["sum", "valley", "reduce", "basesize=128"] 7.645 ms (5%) 313.28 KiB (1%) 6064
["sum", "valley", "reduce", "basesize=256"] 7.874 ms (5%) 155.09 KiB (1%) 3011
["sum", "valley", "reduce", "basesize=512"] 7.478 ms (5%) 76.23 KiB (1%) 1483
["words", "nthreads=1"] 47.973 ms (5%) 8.885 ms 64.61 MiB (1%) 2090858
["words", "nthreads=2"] 24.108 ms (5%) 65.33 MiB (1%) 2091012
["words", "nthreads=4"] 25.414 ms (5%) 65.78 MiB (1%) 2091159

Benchmark Group List

Here's a list of all the benchmark groups executed by this job:

  • ["collect", "assoc"]
  • ["collect"]
  • ["collect", "unordered"]
  • ["findfirst", "n=1000"]
  • ["findfirst", "n=1000", "reduce"]
  • ["findfirst", "n=400"]
  • ["findfirst", "n=400", "reduce"]
  • ["findfirst", "n=500"]
  • ["findfirst", "n=500", "reduce"]
  • ["overhead"]
  • ["parallel_histogram", "assoc"]
  • ["parallel_histogram", "comm"]
  • ["parallel_histogram"]
  • ["sum", "random"]
  • ["sum", "random", "reduce"]
  • ["sum", "uniform"]
  • ["sum", "uniform", "reduce"]
  • ["sum", "valley"]
  • ["sum", "valley", "reduce"]
  • ["words"]

Julia versioninfo

Julia Version 1.4.1
Commit 381693d3df* (2020-04-14 17:20 UTC)
Platform Info:
  OS: Linux (x86_64-pc-linux-gnu)
      Ubuntu 18.04.4 LTS
  uname: Linux 5.0.0-1035-azure #37-Ubuntu SMP Wed Mar 18 11:21:35 UTC 2020 x86_64 x86_64
  CPU: Intel(R) Xeon(R) CPU E5-2673 v4 @ 2.30GHz: 
              speed         user         nice          sys         idle          irq
       #1  2294 MHz      42282 s          0 s       2147 s      28157 s          0 s
       #2  2294 MHz      57016 s          0 s       2606 s      13406 s          0 s
       
  Memory: 6.782737731933594 GB (3214.56640625 MB free)
  Uptime: 748.0 sec
  Load Avg:  1.6171875  1.46728515625  0.908203125
  WORD_SIZE: 64
  LIBM: libopenlibm
  LLVM: libLLVM-8.0.1 (ORCJIT, broadwell)

Baseline result

Benchmark Report for /home/runner/work/Transducers.jl/Transducers.jl

Job Properties

  • Time of benchmark: 11 May 2020 - 7:21
  • Package commit: 62ad9f
  • Julia commit: 381693
  • Julia command flags: None
  • Environment variables: JULIA_NUM_THREADS => 2

Results

Below is a table of this job's results, obtained by running the benchmarks.
The values listed in the ID column have the structure [parent_group, child_group, ..., key], and can be used to
index into the BaseBenchmarks suite to retrieve the corresponding benchmarks.
The percentages accompanying time and memory values in the below table are noise tolerances. The "true"
time/memory value for a given benchmark is expected to fall within this percentage of the reported value.
An empty cell means that the value was zero.

| ID | time | GC time | memory | allocations |
|----|------|---------|--------|-------------|
| ["collect", "assoc", "basesize=1"] | 344.884 ms (5%) | 15.341 ms | 87.05 MiB (1%) | 1557924 |
| ["collect", "assoc", "basesize=1024"] | 200.121 ms (5%) | | 1.84 MiB (1%) | 1780 |
| ["collect", "assoc", "basesize=32"] | 196.385 ms (5%) | | 5.63 MiB (1%) | 52983 |
| ["collect", "seq"] | 369.082 ms (5%) | | 512.98 KiB (1%) | 22 |
| ["collect", "unordered", "basesize=1"] | 476.003 ms (5%) | | 29.19 MiB (1%) | 405332 |
| ["collect", "unordered", "basesize=1024"] | 288.434 ms (5%) | | 899.50 KiB (1%) | 10660 |
| ["collect", "unordered", "basesize=32"] | 221.798 ms (5%) | | 1.50 MiB (1%) | 18581 |
| ["findfirst", "n=1000", "foldl"] | 608.386 ms (5%) | | | |
| ["findfirst", "n=1000", "reduce", "basesize=128"] | 313.891 ms (5%) | | 563.72 KiB (1%) | 10209 |
| ["findfirst", "n=1000", "reduce", "basesize=256"] | 313.501 ms (5%) | | 287.08 KiB (1%) | 5215 |
| ["findfirst", "n=1000", "reduce", "basesize=512"] | 315.580 ms (5%) | | 149.11 KiB (1%) | 2712 |
| ["findfirst", "n=400", "foldl"] | 462.548 ms (5%) | | | |
| ["findfirst", "n=400", "reduce", "basesize=128"] | 234.211 ms (5%) | | 1.02 MiB (1%) | 18965 |
| ["findfirst", "n=400", "reduce", "basesize=256"] | 234.887 ms (5%) | | 525.80 KiB (1%) | 9551 |
| ["findfirst", "n=400", "reduce", "basesize=512"] | 235.591 ms (5%) | | 267.00 KiB (1%) | 4865 |
| ["findfirst", "n=500", "foldl"] | 79.701 ms (5%) | | | |
| ["findfirst", "n=500", "reduce", "basesize=128"] | 40.712 ms (5%) | | 157.19 KiB (1%) | 2841 |
| ["findfirst", "n=500", "reduce", "basesize=256"] | 39.410 ms (5%) | | 84.34 KiB (1%) | 1524 |
| ["findfirst", "n=500", "reduce", "basesize=512"] | 43.216 ms (5%) | | 48.09 KiB (1%) | 869 |
| ["overhead", "default"] | 185.604 μs (5%) | | 146.17 KiB (1%) | 2630 |
| ["overhead", "stoppable=false"] | 189.404 μs (5%) | | 146.16 KiB (1%) | 2629 |
| ["overhead", "stoppable=true"] | 345.808 μs (5%) | | 146.41 KiB (1%) | 2645 |
| ["parallel_histogram", "assoc", "basesize=16384"] | 4.332 ms (5%) | | 732.06 KiB (1%) | 103 |
| ["parallel_histogram", "assoc", "basesize=4096"] | 5.206 ms (5%) | | 1.80 MiB (1%) | 497 |
| ["parallel_histogram", "assoc", "basesize=8192"] | 4.731 ms (5%) | | 1.43 MiB (1%) | 242 |
| ["parallel_histogram", "comm", "basesize=16384"] | 13.258 ms (5%) | | 978.38 KiB (1%) | 155 |
| ["parallel_histogram", "comm", "basesize=4096"] | 21.229 ms (5%) | | 1.05 MiB (1%) | 698 |
| ["parallel_histogram", "comm", "basesize=8192"] | 15.018 ms (5%) | | 1.23 MiB (1%) | 696 |
| ["parallel_histogram", "seq"] | 8.587 ms (5%) | | 364.63 KiB (1%) | 25 |
| ["sum", "random", "foldl"] | 14.780 ms (5%) | | | |
| ["sum", "random", "reduce", "basesize=128"] | 7.552 ms (5%) | | 313.27 KiB (1%) | 6063 |
| ["sum", "random", "reduce", "basesize=256"] | 6.717 ms (5%) | | 155.08 KiB (1%) | 3010 |
| ["sum", "random", "reduce", "basesize=512"] | 7.220 ms (5%) | | 76.27 KiB (1%) | 1485 |
| ["sum", "uniform", "foldl"] | 12.875 ms (5%) | | | |
| ["sum", "uniform", "reduce", "basesize=128"] | 7.409 ms (5%) | | 313.34 KiB (1%) | 6068 |
| ["sum", "uniform", "reduce", "basesize=256"] | 7.035 ms (5%) | | 155.08 KiB (1%) | 3010 |
| ["sum", "uniform", "reduce", "basesize=512"] | 6.922 ms (5%) | | 76.25 KiB (1%) | 1484 |
| ["sum", "valley", "foldl"] | 12.909 ms (5%) | | | |
| ["sum", "valley", "reduce", "basesize=128"] | 7.723 ms (5%) | | 313.27 KiB (1%) | 6063 |
| ["sum", "valley", "reduce", "basesize=256"] | 7.349 ms (5%) | | 155.06 KiB (1%) | 3009 |
| ["sum", "valley", "reduce", "basesize=512"] | 7.596 ms (5%) | | 76.25 KiB (1%) | 1484 |
| ["words", "nthreads=1"] | 46.339 ms (5%) | 8.558 ms | 64.87 MiB (1%) | 2099319 |
| ["words", "nthreads=2"] | 22.837 ms (5%) | | 65.23 MiB (1%) | 2099396 |
| ["words", "nthreads=4"] | 24.383 ms (5%) | | 66.13 MiB (1%) | 2099694 |

Benchmark Group List

Here's a list of all the benchmark groups executed by this job:

  • ["collect", "assoc"]
  • ["collect"]
  • ["collect", "unordered"]
  • ["findfirst", "n=1000"]
  • ["findfirst", "n=1000", "reduce"]
  • ["findfirst", "n=400"]
  • ["findfirst", "n=400", "reduce"]
  • ["findfirst", "n=500"]
  • ["findfirst", "n=500", "reduce"]
  • ["overhead"]
  • ["parallel_histogram", "assoc"]
  • ["parallel_histogram", "comm"]
  • ["parallel_histogram"]
  • ["sum", "random"]
  • ["sum", "random", "reduce"]
  • ["sum", "uniform"]
  • ["sum", "uniform", "reduce"]
  • ["sum", "valley"]
  • ["sum", "valley", "reduce"]
  • ["words"]

Julia versioninfo

Julia Version 1.4.1
Commit 381693d3df* (2020-04-14 17:20 UTC)
Platform Info:
  OS: Linux (x86_64-pc-linux-gnu)
      Ubuntu 18.04.4 LTS
  uname: Linux 5.0.0-1035-azure #37-Ubuntu SMP Wed Mar 18 11:21:35 UTC 2020 x86_64 x86_64
  CPU: Intel(R) Xeon(R) CPU E5-2673 v4 @ 2.30GHz: 
              speed         user         nice          sys         idle          irq
       #1  2294 MHz      62648 s          0 s       2796 s      37471 s          0 s
       #2  2294 MHz      82734 s          0 s       3196 s      17416 s          0 s
       
  Memory: 6.782737731933594 GB (3201.85546875 MB free)
  Uptime: 1054.0 sec
  Load Avg:  1.83203125  1.6435546875  1.138671875
  WORD_SIZE: 64
  LIBM: libopenlibm
  LLVM: libLLVM-8.0.1 (ORCJIT, broadwell)

Runtime information

| Runtime Info | |
|--------------|---|
| BLAS #threads | 2 |
| BLAS.vendor() | openblas64 |
| Sys.CPU_THREADS | 2 |

lscpu output:

Architecture:        x86_64
CPU op-mode(s):      32-bit, 64-bit
Byte Order:          Little Endian
CPU(s):              2
On-line CPU(s) list: 0,1
Thread(s) per core:  1
Core(s) per socket:  2
Socket(s):           1
NUMA node(s):        1
Vendor ID:           GenuineIntel
CPU family:          6
Model:               79
Model name:          Intel(R) Xeon(R) CPU E5-2673 v4 @ 2.30GHz
Stepping:            1
CPU MHz:             2294.685
BogoMIPS:            4589.37
Hypervisor vendor:   Microsoft
Virtualization type: full
L1d cache:           32K
L1i cache:           32K
L2 cache:            256K
L3 cache:            51200K
NUMA node0 CPU(s):   0,1
Flags:               fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc rep_good nopl xtopology cpuid pni pclmulqdq ssse3 fma cx16 pcid sse4_1 sse4_2 movbe popcnt aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase bmi1 hle avx2 smep bmi2 erms invpcid rtm rdseed adx smap xsaveopt md_clear
| Cpu Property | Value |
|--------------|-------|
| Brand | Intel(R) Xeon(R) CPU E5-2673 v4 @ 2.30GHz |
| Vendor | :Intel |
| Architecture | :Broadwell |
| Model | Family: 0x06, Model: 0x4f, Stepping: 0x01, Type: 0x00 |
| Cores | 2 physical cores, 2 logical cores (on executing CPU) |
| | No Hyperthreading detected |
| Clock Frequencies | Not supported by CPU |
| Data Cache | Level 1:3 : (32, 256, 51200) kbytes |
| | 64 byte cache line size |
| Address Size | 48 bits virtual, 44 bits physical |
| SIMD | 256 bit = 32 byte max. SIMD vector size |
| Time Stamp Counter | TSC is accessible via rdtsc |
| | TSC increased at every clock cycle (non-invariant TSC) |
| Perf. Monitoring | Performance Monitoring Counters (PMC) are not supported |
| Hypervisor | Yes, Microsoft |

@mergify mergify bot merged commit 62f35e6 into master May 11, 2020
@mergify mergify bot deleted the amount branch May 11, 2020 07:22