Skip to content
Switch branches/tags
Go to file
Cannot retrieve contributors at this time

Open Datastudio Open Data Studio

Open data studio is a fully managed computing service on Staroid_ cloud, built with open source development model.

That means you can enjoy all the benefits of software as a service, without giving up ability to understand the code, contribute and improve like any other open source software.

Use cases

Spark use case

  • From Python shell/ide/notebook on your laptop, interactively process massive data on your data lake with :ref:`Spark serverless`.
  • Connect your BI tools via JDBC using :ref:`Spark thriftserver`. On-demand Spark cluster is automatically configured for you.
  • Visualize your data on interactive notebook using :ref:`Apache Zeppelin`. On-demand Spark cluster is automatically configured for you.

Ray use case

  • Use ray up command to launch fully managed :ref:`Ray cluster` on the cloud.
  • Deploy your model using Ray serve with authenticated REST API endpoint.
  • Launch GPU accelerated :ref:`Jupyter` instance on the cloud.


Use all the latest machine learning technology in a single place. Open data studio continues to integrate the best technologies for machine learning.

Apache spark Ray Delta lake Nvidia CUDA Jupyter notebook Zeppelin notebook

Easy of use

Access to the latest machine learning technology shouldn't be more than a few clicks or a few lines of code away.

Fully managed

Save time and reduce risk. Open data studio is maintained by the committers of the open source project and industry experts on top of secure, reliable, and high performance cloud platform Staroid_.

Open source

Open data studio is an open source project. You can easily see source code, understand how it works, and get involved. When you need, fork and get your own version of managed service!

Also, every time you launch projects, developers of the projects get funded via StarRank_.


.. toctree::
   :maxdepth: 2