Skip to content

saeta/penguin

main
Switch branches/tags
Code

Latest commit

 

Git stats

Files

Permalink
Failed to load latest commit information.

Penguin

Explore the ideas of data frames, accelerated compute, tables, data structures, and distributed data in Swift.

Documentation:

Roadmap

Below is the aspirational roadmap (at an extremely high level) for the high-level goals:

  • Parse from CSV.
  • Finish up base API & document it. (Incomplete list of needs: appending rows, support in-place mutations everywhere, more powerful sorting, remove unnecessary APIs)
  • Optimize PIndexSet representations.
  • Refactor internals around a parallelism model from PenguinParallel.
  • Optimize backing store for PTypedColumn, including adding support for larger-than-RAM & file formats like Arrow.
  • Add querying / group-by support (including support for a PTableGroup --- alternate names wanted).
  • Parallelize the implementation of the operators.
  • Investigate hardware acceleration & JIT code-gen.
  • Distributed orchestration of computation.

Extensions (help wanted):

  • Connect to databases.
  • Parse additional file formats.

This is not an officially supported Google product.

About

A suite of libraries for data science & high performance computation in Swift

Resources

License

Code of conduct

Stars

Watchers

Forks

Packages

No packages published

Languages