Fluid incubation proposal #1337

RongGu · 2024-05-19T15:45:53Z

On behalf of the Fluid Steering Committee, we propose to move the Fluid project to the CNCF Incubation stage.

Fluid is an open source, Kubernetes-native Distributed Dataset Orchestrator and Accelerator for data-intensive applications, such as big data and AI applications. By supporting dataset operations, Fluid can convert distributed caching systems (such as Alluxio and JuiceFS) into observable caching services with self-management, elastic scaling, and self-healing capabilities. In addition, Fluid uses information about where data is cached to provide data-affinity scheduling for applications that use datasets.

In summary, to address Kubernetes' lack of awareness of and optimization for application data, Fluid puts forward a series of innovative methods, such as co-orchestration, intelligent awareness, and joint optimization, to form an efficient supporting platform for data-intensive applications in cloud native environments.

Key Features of Fluid

Application-oriented DataSet Unified Abstraction: A DataSet not only consolidates data from multiple storage sources, but also describes the data's portability and features and provides observability, such as the total data volume of the DataSet, the current cache space size, and the cache hit rate. Users can use this information to evaluate whether a cache system needs to be scaled up or down.
Lightweight but highly extensible Runtime Plugins: Dataset is an abstract concept, and its data operations need to be implemented by a Runtime. Different storage systems have different Runtime interfaces. Fluid's Runtimes are divided into two categories: CacheRuntime, which accelerates data access (for example, AlluxioRuntime for S3 and HDFS, and JuiceFSRuntime for JuiceFS), and ThinRuntime, which provides a unified access interface to facilitate access to third-party storage.
Automated data operation: Data prefetch, migration, backup, and other operations are provided via CRDs and support various trigger modes, such as one-time, scheduled, and event-driven, so that users can integrate them into automated operation and maintenance systems.
Data elasticity and scheduling: By combining distributed data caching with autoscaling, portability, observability, and affinity scheduling, Fluid improves data access performance through observable, elastically scalable caches and data-affinity scheduling.
Runtime platform Agnostic: Fluid supports diverse environments, such as native Kubernetes, edge, Serverless Kubernetes, and Kubernetes multi-cluster, as well as cloud platforms. Depending on the environment, it can run the storage client in different modes by choosing between the CSI Plugin and a sidecar.

Signed-off-by: RongGu <gurongwalker@gmail.com>

TheFoxAtWork · 2024-06-04T17:28:54Z

#1317

angellk · 2024-06-27T05:53:51Z

@RongGu please finish filling out the information in #1317 and then close this issue - #1317 is the correct template. Thank you!

RongGu · 2024-06-30T07:07:35Z

> @RongGu please finish filling out the information in #1317 and then close this issue - #1317 is the correct template. Thank you!

OK, got it. We will work on #1317 and then close this PR. Thank you, @angellk!

propose fluid proecjt to incubation level

badfc76

Signed-off-by: RongGu <gurongwalker@gmail.com>

angellk added incubation tag-storage labels May 30, 2024
