This repository has been archived by the owner. It is now read-only.
  • No due date · Last updated about 1 month ago

    Implement distributed processing:
      - Implement serialization for RecordBatch and Schema (using Arrow IPC) so that data can be persisted to disk and streamed between nodes
      - Implement basic distributed query planner
      - Implement serialization for query plans
      - Docker packaging for worker nodes
      - Kubernetes to orchestrate cluster

     
    100% complete
  • No due date · Last updated about 1 month ago

    Implement JOIN, ORDER BY, UNION, SUBQUERY

     
    100% complete
  • Past due by 3 months · Last updated about 1 month ago

    The goal of this milestone is to have DataFusion working well enough to run single-threaded SQL queries against CSV and Parquet data sources, supporting projection, selection, cast, type coercion, sort (in memory) and simple aggregates (in memory).

     
    100% complete