Skip to content

v1.0.0

Latest
Compare
Choose a tag to compare
@bobbai00 bobbai00 released this 04 Apr 15:06
· 35 commits to master since this release
59c2d0b

Since 2016, our team has been developing the Texera system (https://texera.io/) to support cloud-based data science, AI, and ML using GUI-based workflows. Today (April 7, 2025), we are glad to make an official release of its first major version, v1.0.0!

Major Features

  • Supporting low/no coding data science using workflows
  • Parallel data-processing engine running on computing clusters
  • Using the Apache Pekko actor-model system
  • Supporting UDFs in Python, R, and Java
  • Supporting ML training and inference
  • Including a rich collection of ML operators
  • Interactive workflow execution model that supports pausing and resuming
  • Supporting collaborations with shared editing, shared execution, and version control
  • Supporting debugging, including line-by-line debugging in Python UDFs
  • Supporting reproducibility of data analysis
  • Region-by-region execution with full pipelining in each region
  • Storing execution results using Apache Iceberg
  • Supporting version-controlled file collections on S3-compatible storage managed by LakeFS
  • Adopting a microservice-based architecture using Kubernetes and Docker
  • Supporting computing isolation and storage isolation of multiple tenants

For users and developers

  • For users who want to install Texera on a single machine using Docker, follow the instructions on this page.
  • For developers who want to contribute to Texera, follow the instructions on this page.