Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

CPU performance on different CPUs and multi-socket (NUMA) servers #13

Open
szilard opened this issue May 1, 2019 · 5 comments
Open

Comments

@szilard
Copy link
Owner

szilard commented May 1, 2019

c5.9xlarge:

This is a newer/faster CPU vs the one used in the main benchmark

Architecture:          x86_64
CPU op-mode(s):        32-bit, 64-bit
Byte Order:            Little Endian
CPU(s):                36
On-line CPU(s) list:   0-35
Thread(s) per core:    2
Core(s) per socket:    18
Socket(s):             1
NUMA node(s):          1
Vendor ID:             GenuineIntel
CPU family:            6
Model:                 85
Model name:            Intel(R) Xeon(R) Platinum 8124M CPU @ 3.00GHz
Stepping:              4
CPU MHz:               3000.000
BogoMIPS:              6000.00
Hypervisor vendor:     KVM
Virtualization type:   full
L1d cache:             32K
L1i cache:             32K
L2 cache:              1024K
L3 cache:              25344K
NUMA node0 CPU(s):     0-35
Flags:                 fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc aperfmperf eagerfpu pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single kaiser fsgsbase tsc_adjust bmi1 hle avx2 smep bmi2 erms invpcid rtm mpx avx512f rdseed adx smap clflushopt clwb avx512cd xsaveopt xsavec xgetbv1 ida arat

Note: need to change to Dockerfile to use cores 0-17 instead of 0-15.

0.1m:
8.731 0.7022154
3.242 0.7324224
1.958 0.7298355
4.567 0.7225903
1m:
14.123 0.762363
10.831 0.7494959
4.325 0.7636987
33.879 0.7402029
10m:
84.734 0.7763153
59.556 0.7551197
32.721 0.7742033
341.073 0.7439026

2-socket instance c5.18xlarge:

Note: need to change to Dockerfile to use cores 0-35 instead of 0-15.

Architecture:          x86_64
CPU op-mode(s):        32-bit, 64-bit
Byte Order:            Little Endian
CPU(s):                72
On-line CPU(s) list:   0-71
Thread(s) per core:    2
Core(s) per socket:    18
Socket(s):             2
NUMA node(s):          2
Vendor ID:             GenuineIntel
CPU family:            6
Model:                 85
Model name:            Intel(R) Xeon(R) Platinum 8124M CPU @ 3.00GHz
Stepping:              3
CPU MHz:               3000.000
BogoMIPS:              6000.00
Hypervisor vendor:     KVM
Virtualization type:   full
L1d cache:             32K
L1i cache:             32K
L2 cache:              1024K
L3 cache:              25344K
NUMA node0 CPU(s):     0-17,36-53
NUMA node1 CPU(s):     18-35,54-71
Flags:                 fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc aperfmperf eagerfpu pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single kaiser fsgsbase tsc_adjust bmi1 hle avx2 smep bmi2 erms invpcid rtm mpx avx512f rdseed adx smap clflushopt clwb avx512cd xsaveopt xsavec xgetbv1 ida arat
0.1m:
11.705 0.7022184
58.622 0.7324224
5.495 0.7298355
7.589 0.7225903
1m:
17.387 0.7623644
59.489 0.7494959
10.048 0.7636987
37.498 0.7402029
10m:
50.278 0.776314
140.663 0.7551197
63.908 0.7742033
351.112 0.7439026

for comparison r4.8xlarge (main benchmark):

0.1m:
16.206 0.7022242
3.859 0.7324224
2.378 0.7298355
5.502 0.7225903
1m:
20.308 0.7623565
11.433 0.7494959
5.217 0.7636987
49.431 0.7402029
10m:
100.612 0.7763295
78.177 0.7551197
41.929 0.7742033
489.229 0.7439026
@szilard szilard changed the title CPU performance c5.9xlarge Intel(R) Xeon(R) Platinum 8124M CPU @ 3.00GHz CPU performance c5.9xlarge Xeon 8124M 3.00GHz May 1, 2019
@szilard
Copy link
Owner Author

szilard commented May 1, 2019

  c5 r4  
  0.1m: 0.1m: %
h2o 8.731 16.206 86
xgboost 3.242 3.859 19
lightgbm 1.958 2.378 21
catboost 4.567 5.502 20
  1m: 1m:  
h2o 14.123 20.308 44
xgboost 10.831 11.433 6
lightgbm 4.325 5.217 21
catboost 33.879 49.431 46
  10m: 10m:  
h2o 84.734 100.612 19
xgboost 59.556 78.177 31
lightgbm 32.721 41.929 28
catboost 341.073 489.229 43

r4 vs c5 is 20-40% slower, the ranking of libs stay the same for given data size when changing CPU that is lightgbm > xgboost > h2o > catboost (with a >b meaning a better than b) for larger data and lightgbm > xgboost > catboost > h2o for very small data



  c5 c5-2s  
  0.1m: 0.1m: %
h2o 8.731 11.705 34
xgboost 3.242 58.622 1708
lightgbm 1.958 5.495 181
catboost 4.567 7.589 66
  1m: 1m:  
h2o 14.123 17.387 23
xgboost 10.831 59.489 449
lightgbm 4.325 10.048 132
catboost 33.879 37.498 11
  10m: 10m:  
h2o 84.734 50.278 -41
xgboost 59.556 140.663 136
lightgbm 32.721 63.908 95
catboost 341.073 351.112 3

going from 1 to 2-socket CPU is:

  • faster/better only for h2o 10M
  • not much (<10%) worse for catboost (for the larger sizes)
  • 2x worse for lightgbm (for the larger sizes)
  • 3-5x worse for xgboost (for the larger sizes)
  • for smaller data the effect is way worse (relatively less computation and more fetching from memory)

@szilard
Copy link
Owner Author

szilard commented May 1, 2019

RAM usage (for 10M data):

while true; do free -m | grep Mem; sleep 1; done
Mem:          70348        4692       60234           8        5421       65025
Mem:          70348        4437       60489           8        5421       65280
Mem:          70348        5378       59548           8        5421       64339

Note: this is just a very rough (and maybe even wrong) estimate (tools might reduce footprint while running if approaching RAM ceilings by using e.g. free() or gc etc; also it depends not only on the tool but also on R/data format/data reading etc.

Could also limit RAM with:

docker run -m 12g -e TOOL=1-h2o --rm gbmperf_cpu

h2o:

Mem:          70348         533       67960           8        1855       69234
Mem:          70348         647       67813           8        1888       69120
Mem:          70348         946       67514           8        1888       68822
Mem:          70348        2350       66110           8        1888       67417
Mem:          70348        2592       65867           8        1888       67175
Mem:          70348        2589       65870           8        1888       67178
Mem:          70348        2598       65862           8        1888       67169
Mem:          70348        2610       65849           8        1888       67157
Mem:          70348        3103       65356           8        1888       66664
Mem:          70348        3105       65355           8        1888       66662
Mem:          70348        3105       65355           8        1888       66662
Mem:          70348        3595       64864           8        1888       66172
Mem:          70348        3596       64863           8        1888       66171
Mem:          70348        3716       64744           8        1888       66052
Mem:          70348        3841       64619           8        1888       65926
Mem:          70348        3953       64507           8        1888       65814
Mem:          70348        3997       64462           8        1888       65770
Mem:          70348        4472       63988           8        1888       65295
Mem:          70348        4478       63981           8        1888       65289
Mem:          70348        4520       63940           8        1888       65247
Mem:          70348        4624       63835           8        1888       65143
Mem:          70348        5282       63177           8        1888       64485
Mem:          70348        5282       63177           8        1888       64485
Mem:          70348        5316       63143           8        1888       64451
Mem:          70348        5316       63144           8        1888       64451
Mem:          70348        5312       63148           8        1888       64455
Mem:          70348        5807       62653           8        1888       63960
Mem:          70348        5808       62652           8        1888       63960
Mem:          70348        5890       62570           8        1888       63877
Mem:          70348        5890       62570           8        1888       63877
Mem:          70348        6369       62090           8        1888       63398
Mem:          70348        6371       62088           8        1888       63396
Mem:          70348        6372       62088           8        1888       63395
Mem:          70348        6475       61984           8        1888       63292
Mem:          70348        6475       61984           8        1888       63292
Mem:          70348        6476       61984           8        1888       63291
Mem:          70348        6813       61646           8        1888       62954
Mem:          70348        6813       61646           8        1888       62954
Mem:          70348        6813       61647           8        1888       62954
Mem:          70348        6945       61514           8        1888       62822
Mem:          70348        6945       61514           8        1888       62822
Mem:          70348        6946       61514           8        1888       62821
Mem:          70348        7118       61341           8        1888       62649
Mem:          70348        7118       61341           8        1888       62649
Mem:          70348        7118       61342           8        1888       62649
Mem:          70348        7120       61340           8        1888       62648
Mem:          70348        7120       61339           8        1888       62647
Mem:          70348        7120       61340           8        1888       62647
Mem:          70348        7140       61320           8        1888       62627
Mem:          70348        7140       61319           8        1888       62627
Mem:          70348        7140       61320           8        1888       62627
Mem:          70348        7142       61318           8        1888       62626
Mem:          70348        7142       61318           8        1888       62626
Mem:          70348        7142       61318           8        1888       62626
Mem:          70348        7247       61213           8        1888       62521
Mem:          70348        7246       61214           8        1888       62521
Mem:          70348        7248       61212           8        1888       62520
Mem:          70348        7249       61211           8        1888       62518
Mem:          70348        7250       61210           8        1888       62517
Mem:          70348        7249       61210           8        1888       62518
Mem:          70348        7250       61210           8        1888       62517
Mem:          70348        7249       61210           8        1888       62518
Mem:          70348        7287       61172           8        1888       62480
Mem:          70348        7287       61173           8        1888       62480
Mem:          70348        7288       61172           8        1888       62480
Mem:          70348        7288       61172           8        1888       62479
Mem:          70348        7287       61172           8        1888       62480
Mem:          70348        7288       61172           8        1888       62480
Mem:          70348        7300       61160           8        1888       62468
Mem:          70348        7300       61160           8        1888       62468
Mem:          70348        7300       61160           8        1888       62467
Mem:          70348        7333       61127           8        1888       62435
Mem:          70348        7333       61127           8        1888       62435
Mem:          70348        7332       61127           8        1888       62435
Mem:          70348        7383       61077           8        1888       62384
Mem:          70348        7383       61077           8        1888       62384
Mem:          70348        7383       61077           8        1888       62384
Mem:          70348        7383       61077           8        1888       62384
Mem:          70348        7383       61077           8        1888       62384
Mem:          70348        7383       61077           8        1888       62384
Mem:          70348        7383       61077           8        1888       62384
Mem:          70348        7383       61076           8        1888       62384
Mem:          70348        7407       61052           8        1888       62360
Mem:          70348        7407       61053           8        1888       62360
Mem:          70348        7407       61052           8        1888       62360
Mem:          70348        7390       61070           8        1888       62377
Mem:          70348        7390       61069           8        1888       62377
Mem:          70348        7391       61069           8        1888       62377
Mem:          70348        7304       61155           8        1888       62463
Mem:          70348        7304       61155           8        1888       62463
Mem:          70348        7178       61282           8        1888       62589
Mem:          70348        7157       61303           8        1888       62611
Mem:          70348         429       68065           8        1853       69339
Mem:          70348         428       68066           8        1853       69340
Mem:          70348         429       68065           8        1853       69340

xgboost:

Mem:          70348         457       68038           8        1852       69311
Mem:          70348         554       67939           8        1855       69212
Mem:          70348        1199       67293           8        1856       68567
Mem:          70348        1270       67222           8        1856       68496
Mem:          70348        1444       67047           8        1856       68321
Mem:          70348        2139       66353           8        1856       67627
Mem:          70348        2601       65890           8        1856       67164
Mem:          70348        3110       65381           8        1856       66655
Mem:          70348        3581       64910           8        1856       66184
Mem:          70348        3937       64554           8        1856       65828
Mem:          70348        4012       64479           8        1856       65753
Mem:          70348        3373       65118           8        1856       66392
Mem:          70348        3426       65065           8        1856       66339
Mem:          70348        3406       65085           8        1856       66359
Mem:          70348        4060       64431           8        1856       65705
Mem:          70348        2963       65528           8        1856       66802
Mem:          70348        3564       64927           8        1856       66201
Mem:          70348        3677       64814           8        1856       66088
Mem:          70348        4333       64158           8        1856       65432
Mem:          70348        3406       65085           8        1856       66359
Mem:          70348        3795       64696           8        1856       65970
Mem:          70348        4153       64339           8        1856       65613
Mem:          70348        5040       63451           8        1856       64725
Mem:          70348        4281       64210           8        1856       65484
Mem:          70348        4900       63591           8        1856       64865
Mem:          70348        5000       63491           8        1856       64765
Mem:          70348        4807       63684           8        1856       64958
Mem:          70348        5354       63138           8        1856       64412
Mem:          70348        5689       62802           8        1856       64076
Mem:          70348        5589       62903           8        1856       64177
Mem:          70348        6135       62356           8        1856       63630
Mem:          70348        6562       61930           8        1856       63204
Mem:          70348        8712       59778           8        1856       61052
Mem:          70348        8788       59703           8        1857       60977
Mem:          70348        7405       61085           8        1857       62361
Mem:          70348        4434       64057           8        1857       65331
Mem:          70348        4953       63538           8        1857       64812
Mem:          70348        5449       63041           8        1857       64315
Mem:          70348        5534       62956           8        1857       64230


Mem:          70348        5559       62932           8        1857       64206
Mem:          70348        5680       62807           8        1860       64081
Mem:          70348        5686       62799           8        1863       64073
Mem:          70348        5683       62802           8        1863       64076
Mem:          70348        5689       62794           8        1864       64069
Mem:          70348        5684       62802           8        1862       64076
Mem:          70348        5688       62795           8        1864       64069
Mem:          70348        5686       62798           8        1863       64072
Mem:          70348        5695       62789           8        1863       64063
Mem:          70348        5693       62792           8        1863       64065
Mem:          70348        5695       62788           8        1864       64062
Mem:          70348        5697       62788           8        1862       64061
Mem:          70348        5702       62780           8        1865       64054
Mem:          70348        5703       62782           8        1863       64056
Mem:          70348        5698       62786           8        1863       64060
Mem:          70348        5703       62782           8        1862       64056
Mem:          70348        5700       62785           8        1862       64058
Mem:          70348        5701       62784           8        1863       64057
Mem:          70348        5704       62781           8        1863       64055
Mem:          70348        5701       62784           8        1863       64057
Mem:          70348        5702       62782           8        1864       64055
Mem:          70348        5708       62776           8        1863       64050
Mem:          70348        5704       62781           8        1863       64054
Mem:          70348        5703       62782           8        1863       64056
Mem:          70348        5709       62774           8        1865       64048
Mem:          70348        5707       62778           8        1862       64052
Mem:          70348        5707       62778           8        1862       64052
Mem:          70348        5703       62782           8        1863       64056
Mem:          70348        5784       62700           8        1863       63974
Mem:          70348        5782       62701           8        1864       63975
Mem:          70348        5781       62703           8        1863       63977
Mem:          70348        5783       62702           8        1863       63976
Mem:          70348        5780       62704           8        1863       63978
Mem:          70348        5779       62705           8        1863       63979
Mem:          70348        5780       62706           8        1862       63979
Mem:          70348        5784       62701           8        1863       63975
Mem:          70348        5785       62701           8        1862       63975
Mem:          70348        5781       62703           8        1863       63977
Mem:          70348        5783       62702           8        1862       63976
Mem:          70348        5781       62704           8        1863       63977
Mem:          70348        5781       62703           8        1863       63977
Mem:          70348        5783       62701           8        1863       63975
Mem:          70348        5784       62702           8        1862       63975
Mem:          70348        5782       62702           8        1863       63976
Mem:          70348        5783       62703           8        1861       63977
Mem:          70348        5783       62703           8        1861       63977
Mem:          70348        5783       62703           8        1862       63977
Mem:          70348        5786       62699           8        1862       63973
Mem:          70348        5785       62700           8        1862       63974
Mem:          70348        5783       62703           8        1861       63977
Mem:          70348        5785       62700           8        1862       63974
Mem:          70348        5786       62699           8        1862       63973
Mem:          70348        5787       62696           8        1865       63969
Mem:          70348        5786       62700           8        1862       63974
Mem:          70348        5787       62697           8        1863       63971
Mem:          70348        5786       62700           8        1862       63974
Mem:          70348        5785       62699           8        1863       63973
Mem:          70348         447       68038           8        1863       69311
Mem:          70348         431       68064           8        1852       69337
Mem:          70348         431       68064           8        1852       69338

lightgbm:

Mem:          70348         425       68072           8        1850       69345
Mem:          70348         495       67999           8        1854       69272
Mem:          70348         815       67678           8        1854       68952
Mem:          70348        1242       67250           8        1855       68524
Mem:          70348        1284       67208           8        1855       68482
Mem:          70348        1829       66663           8        1855       67937
Mem:          70348        2524       65968           8        1855       67242
Mem:          70348        2950       65543           8        1855       66817
Mem:          70348        3413       65079           8        1855       66353
Mem:          70348        3414       65078           8        1855       66352
Mem:          70348        3979       64514           8        1855       65787
Mem:          70348        3158       65334           8        1855       66608
Mem:          70348        3272       65220           8        1855       66494
Mem:          70348        3465       65028           8        1855       66301
Mem:          70348        3968       64525           8        1855       65799
Mem:          70348        3913       64579           8        1855       65853
Mem:          70348        3501       64992           8        1855       66265
Mem:          70348        3701       64792           8        1855       66065
Mem:          70348        4179       64313           8        1855       65587
Mem:          70348        4127       64365           8        1855       65639
Mem:          70348        3908       64584           8        1855       65858
Mem:          70348        4062       64430           8        1855       65704
Mem:          70348        4880       63613           8        1855       64886
Mem:          70348        4167       64326           8        1855       65600
Mem:          70348        4618       63874           8        1855       65148
Mem:          70348        4271       64222           8        1855       65495
Mem:          70348        4877       63615           8        1855       64889
Mem:          70348        4896       63597           8        1855       64870
Mem:          70348        5147       63345           8        1855       64619
Mem:          70348        6276       62216           8        1855       63490
Mem:          70348        5770       62723           8        1855       63997
Mem:          70348        7180       61313           8        1855       62587
Mem:          70348        6605       61888           8        1855       63162
Mem:          70348        3458       65035           8        1855       66309
Mem:          70348        3597       64895           8        1855       66169
Mem:          70348        4000       64492           8        1855       65766
Mem:          70348        4000       64492           8        1855       65766

## training start around here

Mem:          70348        4000       64492           8        1855       65766
Mem:          70348        4001       64492           8        1855       65765
Mem:          70348        4001       64491           8        1855       65765
Mem:          70348        4001       64491           8        1855       65765
Mem:          70348        4001       64491           8        1855       65765
Mem:          70348        4001       64491           8        1855       65765
Mem:          70348        4001       64491           8        1855       65765
Mem:          70348        4001       64491           8        1855       65765
Mem:          70348        4001       64491           8        1855       65764
Mem:          70348        4002       64490           8        1855       65764
Mem:          70348        4002       64490           8        1855       65764
Mem:          70348        4002       64490           8        1855       65764
Mem:          70348        4002       64490           8        1855       65764
Mem:          70348        4002       64490           8        1855       65764
Mem:          70348        4002       64491           8        1855       65764
Mem:          70348        4001       64491           8        1855       65765
Mem:          70348        4002       64490           8        1855       65764
Mem:          70348        4001       64491           8        1855       65765
Mem:          70348        4001       64491           8        1855       65765
Mem:          70348        4001       64491           8        1855       65765
Mem:          70348        4001       64491           8        1855       65765
Mem:          70348        4001       64491           8        1855       65765
Mem:          70348        4000       64492           8        1855       65765
Mem:          70348        4001       64491           8        1855       65765
Mem:          70348        4001       64491           8        1855       65765
Mem:          70348        4001       64491           8        1855       65765
Mem:          70348        4001       64491           8        1855       65765
Mem:          70348        4001       64491           8        1855       65764
Mem:          70348        4001       64491           8        1855       65765
Mem:          70348         433       68059           8        1856       69332
Mem:          70348         427       68070           8        1851       69343
Mem:          70348         426       68071           8        1851       69344

catboost:

Mem:          70348         389       68104           8        1854       69378
Mem:          70348         389       68104           8        1854       69378
Mem:          70348         389       68104           8        1854       69378
Mem:          70348         733       67760           8        1855       69034
Mem:          70348        1128       67365           8        1855       68639
Mem:          70348        1200       67292           8        1855       68567
Mem:          70348        1383       67109           8        1855       68384
Mem:          70348        1906       66587           8        1855       67861
Mem:          70348        1971       66521           8        1855       67796
Mem:          70348        2326       66166           8        1855       67441
Mem:          70348        2824       65668           8        1855       66943
Mem:          70348        3385       65108           8        1855       66382
Mem:          70348        2171       66321           8        1855       67596
Mem:          70348        2295       66198           8        1855       67472
Mem:          70348        3078       65414           8        1855       66688
Mem:          70348        3184       65308           8        1855       66583
Mem:          70348        4451       64041           8        1855       65316
Mem:          70348        7205       61288           8        1855       62562
Mem:          70348        8180       60312           8        1855       61587
Mem:          70348        9247       59245           8        1855       60520
Mem:          70348       10427       58065           8        1855       59340
Mem:          70348       10430       58062           8        1855       59336
Mem:          70348       10468       58024           8        1855       59299
Mem:          70348       10858       57634           8        1855       58909
Mem:          70348       11661       56831           8        1855       58105
Mem:          70348       12234       56258           8        1855       57532
Mem:          70348       12599       55893           8        1855       57167
Mem:          70348       13554       54938           8        1855       56213
Mem:          70348       13971       54521           8        1855       55796
Mem:          70348       14343       54149           8        1856       55423
Mem:          70348       13606       54886           8        1856       56161
Mem:          70348       13606       54885           8        1856       56160
Mem:          70348       13610       54881           8        1856       56156
Mem:          70348       13610       54882           8        1856       56156
Mem:          70348       14202       54290           8        1856       55565
Mem:          70348       14609       53883           8        1856       55157
Mem:          70348       15466       53026           8        1856       54301
Mem:          70348       15832       52660           8        1856       53935
Mem:          70348       15631       52861           8        1856       54135
Mem:          70348       14218       54274           8        1856       55549
Mem:          70348       14219       54273           8        1856       55547
Mem:          70348       13666       54826           8        1856       56101
Mem:          70348       13667       54825           8        1856       56100
Mem:          70348       13667       54825           8        1856       56099
Mem:          70348       12571       55921           8        1855       57195
Mem:          70348       12572       55920           8        1855       57194
Mem:          70348       12573       55919           8        1855       57194
Mem:          70348       12574       55918           8        1855       57192
Mem:          70348       12582       55910           8        1855       57185
Mem:          70348       12583       55909           8        1855       57184
Mem:          70348       12589       55903           8        1855       57178
Mem:          70348       12593       55899           8        1855       57174
Mem:          70348       13109       55383           8        1855       56657
Mem:          70348       13490       55003           8        1855       56277
Mem:          70348       13009       55483           8        1855       56758
Mem:          70348       13003       55489           8        1855       56764
Mem:          70348       12971       55521           8        1855       56796
Mem:          70348       12974       55518           8        1855       56792
Mem:          70348       12975       55517           8        1855       56792
Mem:          70348       13344       55148           8        1856       56422
Mem:          70348       13039       55453           8        1855       56728
Mem:          70348       13040       55452           8        1855       56726
Mem:          70348       13001       55491           8        1856       56766
Mem:          70348       12993       55499           8        1855       56774
Mem:          70348       13384       55108           8        1856       56383
Mem:          70348       13536       54956           8        1856       56231
Mem:          70348       13903       54589           8        1856       55863
Mem:          70348       14726       53766           8        1856       55041
Mem:          70348       15375       53117           8        1856       54391
Mem:          70348       15949       52543           8        1855       53818
Mem:          70348       12830       55662           8        1855       56937
Mem:          70348       12874       55618           8        1855       56893
Mem:          70348       13480       55012           8        1855       56286
Mem:          70348       12534       55958           8        1855       57233
Mem:          70348       12541       55952           8        1855       57226
Mem:          70348       12541       55951           8        1855       57226
Mem:          70348       12543       55949           8        1855       57224
Mem:          70348       12722       55770           8        1855       57045
Mem:          70348       12868       55625           8        1855       56899
Mem:          70348       13440       55052           8        1855       56327
Mem:          70348       13997       54495           8        1855       55770
Mem:          70348       14133       54359           8        1855       55633
Mem:          70348       14338       54155           8        1855       55429
Mem:          70348       14339       54153           8        1855       55428
Mem:          70348       14506       53987           8        1855       55261
Mem:          70348       15010       53482           8        1855       54757
Mem:          70348       15057       53435           8        1855       54709
Mem:          70348       16193       52299           8        1855       53574
Mem:          70348       16265       52227           8        1855       53501
Mem:          70348       12641       55852           8        1855       57126
Mem:          70348       13881       54611           8        1855       55886
Mem:          70348       14774       53718           8        1855       54993
Mem:          70348       15067       53425           8        1855       54700
Mem:          70348       15726       52767           8        1855       54041
Mem:          70348       16647       51846           8        1855       53120
Mem:          70348       16653       51840           8        1855       53114
Mem:          70348       16809       51683           8        1855       52957
Mem:          70348       17658       50834           8        1855       52109
Mem:          70348       17675       50817           8        1855       52092
Mem:          70348       14641       53851           8        1855       55126
Mem:          70348       14683       53809           8        1855       55083
Mem:          70348       15225       53268           8        1855       54542
Mem:          70348       15368       53124           8        1855       54398
Mem:          70348       16292       52200           8        1855       53474
Mem:          70348       16685       51807           8        1855       53082
Mem:          70348       16714       51778           8        1855       53053
Mem:          70348       14587       53905           8        1855       55180
Mem:          70348       15656       52836           8        1855       54110
Mem:          70348       15929       52563           8        1855       53838
Mem:          70348       13365       55127           8        1855       56401
Mem:          70348       13901       54591           8        1855       55866
Mem:          70348       15022       53469           8        1856       54744
Mem:          70348       11609       56883           8        1855       58158
Mem:          70348       12963       55529           8        1855       56803
Mem:          70348       13932       54560           8        1855       55834
Mem:          70348       14169       54323           8        1855       55598
Mem:          70348       15075       53417           8        1855       54692
Mem:          70348       15764       52728           8        1855       54002
Mem:          70348       15861       52631           8        1855       53906
Mem:          70348       16832       51660           8        1855       52934
Mem:          70348       17429       51063           8        1855       52338
Mem:          70348       17702       50790           8        1855       52064
Mem:          70348       18330       50162           8        1855       51436
Mem:          70348       18962       49530           8        1855       50805
Mem:          70348       19154       49338           8        1855       50613
Mem:          70348       12678       55814           8        1855       57089
Mem:          70348       13404       55088           8        1856       56363
Mem:          70348       14133       54358           8        1856       55633
Mem:          70348       10267       58225           8        1855       59499
Mem:          70348       11103       57389           8        1855       58663
Mem:          70348       12258       56234           8        1855       57509
Mem:          70348       12451       56041           8        1855       57315
Mem:          70348       13449       55042           8        1855       56317
Mem:          70348       14235       54257           8        1855       55531
Mem:          70348       14705       53787           8        1856       55062
Mem:          70348       14871       53621           8        1856       54895
...
...
...
Mem:          70348       15543       52950           8        1855       54224
Mem:          70348       15545       52948           8        1855       54222
Mem:          70348       15611       52882           8        1855       54156
Mem:          70348       16359       52134           8        1855       53408
Mem:          70348       16623       51870           8        1855       53144
Mem:          70348       16855       51637           8        1855       52912
Mem:          70348       17915       50577           8        1855       51852
Mem:          70348       19059       49434           8        1855       50708
Mem:          70348       19315       49177           8        1855       50452
Mem:          70348       16471       52022           8        1855       53297
Mem:          70348       17338       51154           8        1855       52429
Mem:          70348       14318       54174           8        1855       55449
Mem:          70348       15314       53179           8        1855       54453
Mem:          70348       16668       51825           8        1855       53100
Mem:          70348       17315       51178           8        1855       52452
Mem:          70348       17371       51122           8        1855       52396
Mem:          70348       14169       54323           8        1855       55598
Mem:          70348       14785       53707           8        1855       54982
Mem:          70348       15323       53169           8        1855       54444
Mem:          70348       15381       53111           8        1855       54386
Mem:          70348       11502       56990           8        1855       58265
Mem:          70348       13142       55351           8        1855       56625
Mem:          70348       14180       54312           8        1855       55587
Mem:          70348       14470       54023           8        1855       55297
Mem:          70348       14769       53724           8        1855       54998
Mem:          70348       15712       52780           8        1855       54055
Mem:          70348       16255       52237           8        1855       53512
Mem:          70348       16257       52235           8        1855       53510
Mem:          70348       17179       51314           8        1855       52588
Mem:          70348       13937       54556           8        1855       55831
Mem:          70348       15152       53340           8        1855       54615
Mem:          70348       15318       53175           8        1855       54449
Mem:          70348       16378       52114           8        1855       53389
Mem:          70348       16778       51714           8        1855       52989
Mem:          70348       14016       54477           8        1855       55751
Mem:          70348       15392       53100           8        1855       54375
Mem:          70348       15524       52968           8        1855       54243
Mem:          70348       15887       52605           8        1855       53880
Mem:          70348       12844       55648           8        1855       56923
Mem:          70348       13950       54542           8        1855       55817
Mem:          70348       14074       54418           8        1855       55693
Mem:          70348       14546       53946           8        1855       55221
Mem:          70348       14712       53781           8        1855       55055
Mem:          70348       15064       53428           8        1855       54703
Mem:          70348       15148       53344           8        1855       54619
Mem:          70348       15150       53342           8        1855       54617
Mem:          70348       16095       52398           8        1855       53672
Mem:          70348       16096       52396           8        1855       53671
Mem:          70348       16745       51747           8        1855       53022
Mem:          70348       16777       51715           8        1855       52990
Mem:          70348       13582       54910           8        1855       56185
Mem:          70348       14607       53885           8        1855       55160
Mem:          70348       15487       53005           8        1855       54280
Mem:          70348       15802       52690           8        1855       53965
Mem:          70348       12176       56317           8        1855       57593
Mem:          70348       14116       54377           8        1855       55651
Mem:          70348       14489       54003           8        1855       55278
Mem:          70348       14549       53943           8        1855       55218
Mem:          70348         418       68074           8        1855       69349
Mem:          70348         407       68087           8        1854       69361
Mem:          70348         406       68087           8        1854       69362
Mem:          70348         406       68087           8        1854       69362
   GB
h2o 7.4
xgboost 5.7
lightgbm 4
catboost 19.3

so RAM (low) footprint rank same as speed rank: lightgbm > xgboost > h2o > catboost (with a >b meaning a better than b)

@szilard
Copy link
Owner Author

szilard commented May 2, 2019

m5.12xlarge

Architecture:          x86_64
CPU op-mode(s):        32-bit, 64-bit
Byte Order:            Little Endian
CPU(s):                48
On-line CPU(s) list:   0-47
Thread(s) per core:    2
Core(s) per socket:    24
Socket(s):             1
NUMA node(s):          1
Vendor ID:             GenuineIntel
CPU family:            6
Model:                 85
Model name:            Intel(R) Xeon(R) Platinum 8175M CPU @ 2.50GHz
Stepping:              4
CPU MHz:               2500.000
BogoMIPS:              5000.00
Hypervisor vendor:     KVM
Virtualization type:   full
L1d cache:             32K
L1i cache:             32K
L2 cache:              1024K
L3 cache:              33792K
NUMA node0 CPU(s):     0-47
Flags:                 fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc aperfmperf pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single kaiser fsgsbase tsc_adjust bmi1 hle avx2 smep bmi2 erms invpcid rtm mpx avx512f rdseed adx smap clflushopt clwb avx512cd xsaveopt xsavec xgetbv1 ida arat pku

Note: need to change to Dockerfile to use cores 0-23 instead of 0-15.

0.1m:
9.852 0.702207
4.077 0.7324224
2.567 0.7298355
6.163 0.7225903
1m:
13.943 0.7623543
11.991 0.7494959
4.741 0.7636987
31.824 0.7402029
10m:
61.855 0.7763263
64.534 0.7551197
31.426 0.7742033
312.482 0.7439026
c5:
CPU MHz:               3000.000
NUMA node0 CPU(s):     0-35
m5:
CPU MHz:               2500.000
NUMA node0 CPU(s):     0-47
  c5 m5  
  0.1m: 0.1m: %
h2o 8.731 9.852 13
xgboost 3.242 4.077 26
lightgbm 1.958 2.567 31
catboost 4.567 6.163 35
  1m: 1m:  
h2o 14.123 13.943 -1
xgboost 10.831 11.991 11
lightgbm 4.325 4.741 10
catboost 33.879 31.824 -6
  10m: 10m:  
h2o 84.734 61.855 -27
xgboost 59.556 64.534 8
lightgbm 32.721 31.426 -4
catboost 341.073 312.482 -8

for large data more cores (m5) better than high frequency CPU (c5), for small data the other way around

the ranking of libs stay the same for given data size when changing CPU that is lightgbm > xgboost > h2o > catboost (with a >b meaning a better than b) for larger data and lightgbm > xgboost > catboost > h2o for very small data, except for m5 lightgbm > h2o > xgboost > catboost for the largest size (h2o and xgboost swapped places)

@szilard
Copy link
Owner Author

szilard commented May 2, 2019

2 sockets ran with leaving 2 cores free for the VM host:

Note: need to change to Dockerfile to use cores 0-33 instead of 0-15.

0.1m:
11.648 0.7022191
98.647 0.7324224
4.421 0.7298355
7.497 0.7225903
1m:
16.418 0.7623671
73.397 0.7494959
7.224 0.7636987
36.877 0.7402029
10m:
51.171 0.776318
149.687 0.7551197
39.955 0.7742033
351.072 0.7439026
  c5-2s c5-2s-m2c  
  0.1m: 0.1m: %
h2o 11.705 11.648 0
xgboost 58.622 98.647 68
lightgbm 5.495 4.421 -20
catboost 7.589 7.497 -1
  1m: 1m:  
h2o 17.387 16.418 -6
xgboost 59.489 73.397 23
lightgbm 10.048 7.224 -28
catboost 37.498 36.877 -2
  10m: 10m:  
h2o 50.278 51.171 2
xgboost 140.663 149.687 6
lightgbm 63.908 39.955 -37
catboost 351.112 351.072 0
  c5 c5-2s-m2c  
  0.1m: 0.1m: %
h2o 8.731 11.648 33
xgboost 3.242 98.647 2943
lightgbm 1.958 4.421 126
catboost 4.567 7.497 64
  1m: 1m:  
h2o 14.123 16.418 16
xgboost 10.831 73.397 578
lightgbm 4.325 7.224 67
catboost 33.879 36.877 9
  10m: 10m:  
h2o 84.734 51.171 -40
xgboost 59.556 149.687 151
lightgbm 32.721 39.955 22
catboost 341.073 351.072 3

lightgbm improves a bit, but it's still worse on 2 sockets than 1

@szilard szilard changed the title CPU performance c5.9xlarge Xeon 8124M 3.00GHz CPU performance on different CPUs and multi-socket (NUMA) servers May 4, 2019
@szilard
Copy link
Owner Author

szilard commented Sep 9, 2020

2020-09-09 UPDATE: xgboost/lightgbm has improved in multi-core scaling / NUMA slow-down has been mitigated:

#29 (comment)

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

1 participant