[RFC] Introducing Ray AI Runtime #22488
In my opinion this is a great development, and one that has been needed for some time as the possibilities with Ray keep growing. I mostly use RLlib, which I think is a great workhorse in RL with nothing comparable on the market so far. There, it is often unclear what the standard (or intended) way to solve things in code is. Combining the APIs in a standardized manner would bring clarity to developers who have to think about the architecture of their solutions on top of Ray. Furthermore, I think it is a good idea to move much functionality into dedicated parts of Ray (e.g., Train/Tune for optimization) so that each module can focus on its own core concerns. RLlib can then focus on the RL parts, collecting experiences and building modular RL algorithms, while model training is maintained in Tune/Train.
cc @gjoliver @sven1977 @avnishn on @simonsays1980's feedback!
Update: Initial documentation can be found here.
Hi all!
I'd like to gather some feedback on a proposal to create the "Ray AI Runtime."
Ray AI Runtime (Ray AIR) features a scalable and unified toolkit for building end-to-end ML applications. By leveraging Ray and its library ecosystem, it brings scalability and programmability to ML platforms.
Our long-term vision with AIR is to own the compute story for ML and AI applications: to be the one-stop shop for AI compute. AIR is designed to interoperate with other systems for storage and metadata needs, and to provide standard integration points for third-party libraries, so that libraries integrating with Ray benefit from a network effect.
Overview
Ray AIR consists of 5 key components that already exist in Ray today -- data processing (Ray Data), model training (Ray Train), reinforcement learning (Ray RLlib), hyperparameter tuning (Ray Tune), and model serving (Ray Serve). Users can compose these libraries to scale different parts of their ML workflows.
Ray AIR introduces a unified API for seamless integration across the ecosystem of Ray libraries -- enabling you to pass data and models between data processing, training, tuning, and inference (online and offline). If you are already using Ray, this will not break backwards compatibility.
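To make the "pass data and models between stages" idea concrete, here is a toy sketch in plain Python of the kind of flow the unified API aims to enable: a preprocessor is fit during training, captured in a checkpoint alongside the model, and reused for offline inference. All names here (Preprocessor, Trainer, BatchPredictor) are illustrative stand-ins, not confirmed Ray AIR classes, and the "model" is deliberately trivial.

```python
# Toy illustration of the AIR-style pipeline flow (NOT the real Ray API):
# preprocess -> train -> checkpoint -> batch inference.

class Preprocessor:
    """Fits a statistic on a dataset, then transforms it (e.g., centering)."""
    def fit(self, rows):
        values = [r["x"] for r in rows]
        self.mean = sum(values) / len(values)
        return self

    def transform(self, rows):
        return [{"x": r["x"] - self.mean, "y": r["y"]} for r in rows]


class Trainer:
    """Trains a trivial model (predict the mean label) on preprocessed data."""
    def __init__(self, preprocessor):
        self.preprocessor = preprocessor

    def fit(self, rows):
        rows = self.preprocessor.fit(rows).transform(rows)
        labels = [r["y"] for r in rows]
        # The returned dict stands in for a checkpoint: it bundles the
        # model together with the fitted preprocessor, so downstream
        # stages apply the same transformations.
        return {"model": sum(labels) / len(labels),
                "preprocessor": self.preprocessor}


class BatchPredictor:
    """Applies a checkpointed model to new data (offline inference)."""
    def __init__(self, checkpoint):
        self.checkpoint = checkpoint

    def predict(self, rows):
        rows = self.checkpoint["preprocessor"].transform(rows)
        return [self.checkpoint["model"] for _ in rows]


train_data = [{"x": 1.0, "y": 2.0}, {"x": 3.0, "y": 4.0}]
checkpoint = Trainer(Preprocessor()).fit(train_data)    # training stage
preds = BatchPredictor(checkpoint).predict(train_data)  # inference stage
print(preds)  # [3.0, 3.0]
```

The point of the sketch is the hand-off pattern: each stage consumes the previous stage's output directly (dataset in, checkpoint out, predictions out), rather than each library defining its own incompatible formats.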
You can run applications that use these components on your laptop, and scale out to K8s/AWS/GCP/Azure without any changes to your code.
Note that we continue to invest in making each of Ray's libraries best in class on their own (e.g., Serve, RLlib, Tune). Ray AIR improves API compatibility between Ray's existing libraries and provides a reference architecture for ML platform use cases.
What is Ray AI Runtime not? As its name implies, the focus is on the compute-intensive portions of the stack, not on storage and metadata services. However, we will provide integrations with data sources and metadata registries such as MLflow and WandB.
Please provide feedback below or on the linked proposal!
Proposal Link