A fast, distributed, high performance gradient boosting (GBDT, GBRT, GBM or MART) framework based on decision tree algorithms, used for ranking, classification and many other machine learning tasks. It is under the umbrella of the DMTK(http://github.com/microsoft/dmtk) project of Microsoft.
Clone or download
guolinke support to override some parameters in Dataset (#1876)
* add warnings for override parameters of Dataset

* fix pep8

* add feature_penalty

* refactor

* add R's code

* Update basic.py

* Update basic.py

* fix parameter bug

* Update lgb.Dataset.R

* fix a bug
Latest commit b37065d Jan 23, 2019
Permalink
Type Name Latest commit message Commit time
Failed to load latest commit information.
.ci [ci] updated CMake version in CI Docker image (#1947) Jan 15, 2019
.github [docs] ask to provide LightGBM version for issue (#1958) Jan 18, 2019
.nuget autoincrement year in NuGet description (#1948) Jan 15, 2019
R-package support to override some parameters in Dataset (#1876) Jan 23, 2019
compute @ 509ebe4 switched to develop branch of boost compute submodule (#1455) Jun 16, 2018
docker Updated wget for GPU docker (#1404) May 30, 2018
docs [docs] fixed minor typos in documentation (#1959) Jan 22, 2019
examples [python] made notebook example interactive (#1791) Oct 30, 2018
helpers [ci] check dynamic symbol versions at CI side (#1812) Nov 1, 2018
include/LightGBM support to override some parameters in Dataset (#1876) Jan 23, 2019
pmml [docs][python] made OS detection more reliable and little docs improv… Jun 3, 2018
python-package support to override some parameters in Dataset (#1876) Jan 23, 2019
src support to override some parameters in Dataset (#1876) Jan 23, 2019
swig update LightGBM SWIG wrapper (#1610) Aug 25, 2018
tests [python] added get_data() method to Dataset class (#1870) Dec 20, 2018
windows Refine config object (#1381) May 20, 2018
.appveyor.yml removed temp fix (#1871) Nov 25, 2018
.gitignore [docs] fixed minor typos in documentation (#1959) Jan 22, 2019
.gitmodules Initial GPU acceleration support for LightGBM (#368) Apr 9, 2017
.travis.yml [ci] removed temp brew hotfix and deprecated sudo option (#1951) Jan 17, 2019
.vsts-ci.yml [ci] removed temp brew hotfix and deprecated sudo option (#1951) Jan 17, 2019
CMakeLists.txt [docs] Unified references and fixed typo (#1695) Sep 24, 2018
CODE_OF_CONDUCT.md Create CODE_OF_CONDUCT.md (#803) Aug 18, 2017
LICENSE Add license. Oct 11, 2016
README.md [docs] fixed minor typos in documentation (#1959) Jan 22, 2019
VERSION.txt new version for master branch (#1824) Nov 7, 2018
build_r.R Update build_r.R (#1918) Dec 24, 2018

README.md

LightGBM, Light Gradient Boosting Machine

Azure Pipelines Build Status Appveyor Build Status Travis Build Status Documentation Status GitHub Issues License Python Versions PyPI Version Join the chat at https://gitter.im/Microsoft/LightGBM Slack

LightGBM is a gradient boosting framework that uses tree based learning algorithms. It is designed to be distributed and efficient with the following advantages:

  • Faster training speed and higher efficiency.
  • Lower memory usage.
  • Better accuracy.
  • Support of parallel and GPU learning.
  • Capable of handling large-scale data.

For further details, please refer to Features.

Benefitting from these advantages, LightGBM is being widely-used in many winning solutions of machine learning competitions.

Comparison experiments on public datasets show that LightGBM can outperform existing boosting frameworks on both efficiency and accuracy, with significantly lower memory consumption. What's more, parallel experiments show that LightGBM can achieve a linear speed-up by using multiple machines for training in specific settings.

News

08/15/2017 : Optimal split for categorical features.

07/13/2017 : Gitter is available.

06/20/2017 : Python-package is on PyPI now.

06/09/2017 : LightGBM Slack team is available.

05/03/2017 : LightGBM v2 stable release.

04/10/2017 : LightGBM supports GPU-accelerated tree learning now. Please read our GPU Tutorial and Performance Comparison.

02/20/2017 : Update to LightGBM v2.

02/12/2017 : LightGBM v1 stable release.

01/08/2017 : Release R-package beta version, welcome to have a try and provide feedback.

12/05/2016 : Categorical Features as input directly (without one-hot coding).

12/02/2016 : Release Python-package beta version, welcome to have a try and provide feedback.

More detailed update logs : Key Events.

External (Unofficial) Repositories

Julia-package: https://github.com/Allardvm/LightGBM.jl

JPMML (Java PMML converter): https://github.com/jpmml/jpmml-lightgbm

Treelite (model compiler for efficient deployment): https://github.com/dmlc/treelite

ONNXMLTools (ONNX converter): https://github.com/onnx/onnxmltools

SHAP (model output explainer): https://github.com/slundberg/shap

MMLSpark (Spark-package): https://github.com/Azure/mmlspark

ML.NET (.NET/C#-package): https://github.com/dotnet/machinelearning

Dask-LightGBM (distributed and parallel Python-package): https://github.com/dask/dask-lightgbm

Get Started and Documentation

Install by following guide for the command line program, Python-package or R-package. Then please see the Quick Start guide.

Our primary documentation is at https://lightgbm.readthedocs.io/ and is generated from this repository.

Next you may want to read:

Documentation for contributors:

Support

How to Contribute

LightGBM has been developed and used by many active community members. Your help is very valuable to make it better for everyone.

  • Check out call for contributions to see what can be improved, or open an issue if you want something.
  • Contribute to the tests to make it more reliable.
  • Contribute to the documents to make it clearer for everyone.
  • Contribute to the examples to share your experience with other users.
  • Add your stories and experience to Awesome LightGBM.
  • Open issue if you met problems during development.

Microsoft Open Source Code of Conduct

This project has adopted the Microsoft Open Source Code of Conduct. For more information see the Code of Conduct FAQ or contact opencode@microsoft.com with any additional questions or comments.

Reference Papers

Guolin Ke, Qi Meng, Thomas Finley, Taifeng Wang, Wei Chen, Weidong Ma, Qiwei Ye, Tie-Yan Liu. "LightGBM: A Highly Efficient Gradient Boosting Decision Tree". Advances in Neural Information Processing Systems 30 (NIPS 2017), pp. 3149-3157.

Qi Meng, Guolin Ke, Taifeng Wang, Wei Chen, Qiwei Ye, Zhi-Ming Ma, Tie-Yan Liu. "A Communication-Efficient Parallel Algorithm for Decision Tree". Advances in Neural Information Processing Systems 29 (NIPS 2016), pp. 1279-1287.

Huan Zhang, Si Si and Cho-Jui Hsieh. "GPU Acceleration for Large-scale Tree Boosting". SysML Conference, 2018.

License

This project is licensed under the terms of the MIT license. See LICENSE for additional details.