Replies: 8 comments 10 replies
-
Here are some thoughts: VisionIn 5 years time: DataFusion underpins several of the most commonly used analytics systems in the world and is widely recognized as an exemplar of a high performance, vectorized execution engine created by a powerful and broad open source community. In addition to being widely used by itself, it has spawned several similar projects in other languages, affirming the trend of a deconstructed database across the industry. What is DataFusion good for?Building new, fast data centric systems (databases, dataframe libraries, streaming systems, etc) in Rust ** Examples: Who is Datafusion targeted at?Developers (e.g. creators who who make analytic systems) |
Beta Was this translation helpful? Give feedback.
-
Agree with everything @alamb said but want to add I'd love to see is a high-performance runtime tuned to database workloads or the ability to use datafusion with a custom runtime. The work that @tustvold has done on the |
Beta Was this translation helpful? Give feedback.
-
I'd love it if it was also the query engine for a mutable OLTP/OLAP database in Rust. Of course this is an external concern to the core project, but having things like index-awareness and mutability in mind will allow for that to happen. |
Beta Was this translation helpful? Give feedback.
-
I would say datafusion is more like velox, e.g. single host runtime for olap engine. But it also dataframe api/optimizer for easy use. |
Beta Was this translation helpful? Give feedback.
-
Cool question! Some thoughts:
|
Beta Was this translation helpful? Give feedback.
-
I think a large value Datafusion can bring is being the disaggregated query processing engine. If it can be more around enabling developers to mix and extend any part of it they need, that could bring the most value. The parts as I see them: Sql parser, logical plan, physical plan, logical optimizer, physical optimizer, storage layer. If, as discussed here, Datafusion could be used to build a transactional database, that would be awesome. |
Beta Was this translation helpful? Give feedback.
-
@alamb I was wondering whether the edit: added some formatting |
Beta Was this translation helpful? Give feedback.
-
Here is an attempt to update the documentation to reflect the comments on this discussion: #6639 |
Beta Was this translation helpful? Give feedback.
-
For a variety of reasons, I have been thinking about how to articulate the "Vision" for DataFusion.
While the current website does a reasonable job explaining what DataFusion is https://arrow.apache.org/datafusion/, I want to frame the longer term discussion and wanted to hear what the community thinks.
Beta Was this translation helpful? Give feedback.
All reactions