Add paralellisation with `OpenMP` #18

Blunde1 · 2020-05-24T14:33:02Z

The node::split_information()should be easy to paralellize.

The text was updated successfully, but these errors were encountered:

szilard · 2020-09-04T09:38:14Z

Indeed, and up to 4-8 threads/CPU cores it can have a very good benefit, though based on experience with xgboost/lightgbm the scaling beyond 8 cores is difficult/with very much diminishing returns for dataset sizes commonly found in practice (100K-1M records):

(the panels are for various dataset sizes, 0.1M (million) rows, 1M and 10M)

szilard · 2020-09-04T09:46:38Z

Also there is an actual slow down on systems with multi-CPU sockets (even for super-large datasets) for example xgboost and lightgbm are not "NUMA optimized":

More details in this repo https://github.com/szilard/GBM-perf#multi-socket-cpus or in this talk https://www.youtube.com/watch?v=qjuizRba3ZQ&t=31m00s

Blunde1 added the enhancement New feature or request label May 24, 2020

Blunde1 mentioned this issue Sep 3, 2020

Quick look at performance #25

Open

szilard mentioned this issue Sep 4, 2020

aGTBoost szilard/GBM-perf#35

Open

Blunde1 added this to the Fast and scalable `agtboost` milestone Oct 12, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add paralellisation with `OpenMP` #18

Add paralellisation with `OpenMP` #18

Blunde1 commented May 24, 2020

szilard commented Sep 4, 2020 •

edited

Loading

szilard commented Sep 4, 2020 •

edited

Loading

Add paralellisation with OpenMP #18

Add paralellisation with OpenMP #18

Comments

Blunde1 commented May 24, 2020

szilard commented Sep 4, 2020 • edited Loading

szilard commented Sep 4, 2020 • edited Loading

Add paralellisation with `OpenMP` #18

Add paralellisation with `OpenMP` #18

szilard commented Sep 4, 2020 •

edited

Loading

szilard commented Sep 4, 2020 •

edited

Loading