Sign up for your own profile on GitHub, the best place to host code, manage projects, and build software alongside 31 million developers.
Hide content and notifications from this user.
Learn more about blocking users
Contact Support about this user’s behavior.
Learn more about reporting abuse
Large-scale and Distributed Gradient Boosting (GBDT, GBRT or GBM) Library, on single node, hadoop yarn and more.
a benchmark to test scalability of xgboost4j-spark and relevant projects
A common bricks library for building scalable and portable distributed machine learning.
Flexible Intermediate Representation for RTL
Mirror of Apache Spark
repo containing XGBoost-based ML project for various purposes
A realtime distributed OLAP datastore
TransmogrifAI (pronounced trăns-mŏgˈrə-fī) is an AutoML library for building modular, reusable, strongly typed machine learning workflows on Spark with minimal hand tuning
debugging performance issues for Spark applications
Open source platform for the machine learning lifecycle
A simple optimizing Brainfuck compiler (used as demo for my QCon Beijing 2015 talk)
Efficient and Flexible Distributed Deep Learning Framework, for python, R, Julia and more
Mirror of Apache Arrow
The repo to host all the web data including images for documents in dmlc projects.
A lightweight parameter server interface
bring deep learning workloads to bare metal
Mirror of Apache Parquet
Achieving Real-time Data Analytics with Spark and EventHubs
Microsoft Azure Storage Library for Java
fast tree inference
Mirror of Apache Hive
Mirror of Apache livy (Incubating)
Deep Learning Pipelines for Apache Spark
Distributed. Columnar. Versioned. Streaming. SQL.
Example showing how events can be generated and pushed to Microsoft Azure Sevicebus Eventhubs