Skip to content

crepererum/datafusion-rs

 
 

Repository files navigation

DataFusion: Big Data Platform for Rust

License Version Docs Gitter chat

DataFusion is a distributed data processing platform implemented in Rust. It is very much inspired by Apache Spark and has a similar programming style through the use of DataFrames and SQL.

DataFusion can also be used as a crate dependency in your project if you want the ability to perform SQL queries and DataFrame style data manipulation in-process against your own data sources. In that respect, DataFusion is inspired by Apache Calcite in the Java world.

Project Home Page

The project home page is now at https://datafusion.rs and contains the roadmap as well as documentation for using this crate or running DataFusion as a distributed cluster. I am using GitHub issues to track development tasks and feedback.

Gitter

There is a Gitter channel where you can ask questions about the project or make feature suggestions too.

Building DataFusion

See BUILDING.md.

Contributing

Contributors are welcome! Please see CONTRIBUTING.md for details.

About

Distributed query processing implemented in Rust

Resources

License

Stars

Watchers

Forks

Packages

No packages published

Languages

  • Rust 96.9%
  • Shell 2.2%
  • Other 0.9%