Distributed TensorFlow, Keras and BigDL on Apache Spark
Avro2TF is designed to fill the gap of making users' training data ready to be consumed by deep learning training frameworks like TensorFlow.
Vowpal Wabbit is a machine learning system which pushes the frontier of machine learning with techniques such as online, hashing, allreduce, reductions, learning2search, active, and interactive lea…
Example notebooks that show how to apply machine learning and deep learning in Amazon SageMaker
cuDF - GPU DataFrame Library
A curated list of automated machine learning papers, articles, tutorials, slides and projects
A Time Series Library for Apache Spark
Apache Pinot (Incubating) - A realtime distributed OLAP datastore
An open source python framework for automated feature engineering
A unified approach to explain the output of any machine learning model.
Open source platform for the machine learning lifecycle
Iceberg is a table format for large, slow-moving tabular data
Vectorized processing for Apache Arrow
Simple reinforcement learning tutorials
Distributed training framework for TensorFlow, Keras, PyTorch, and Apache MXNet.
MLeap: Deploy Spark Pipelines to Production
GraalVM: Run Programs Faster Anywhere 🚀
A place in which we publish scripts for reproducible benchmarks.
Open deep learning compiler stack for CPU, GPU and specialized accelerators
A tool for scale and performance testing of HDFS with a specific focus on the NameNode.
JVM Profiler Sending Metrics to Kafka, Console Output or Custom Reporter
A benchmark to test the scalability of xgboost4j-spark and related projects
Machine Learning Toolkit for Kubernetes
GeoMesa is a suite of tools for working with big geo-spatial data in a distributed fashion.
Sampling CPU and heap profiler for Java featuring AsyncGetCallTrace + perf_events
Running XGBoost on HDInsight Spark