🌊Stream Processing Redefined.
RisingWave is a distributed SQL streaming database that enables simple, efficient, and reliable processing of streaming data.
Try it out in 5 minutes
docker run -it --pull=always -p 4566:4566 -p 5691:5691 risingwavelabs/risingwave:latest playground
brew tap risingwavelabs/risingwave brew install risingwave risingwave playground
wget https://github.com/risingwavelabs/risingwave/releases/download/v1.4.0/risingwave-v1.4.0-x86_64-unknown-linux-all-in-one.tar.gz tar xvf risingwave-v1.4.0-x86_64-unknown-linux-all-in-one.tar.gz ./risingwave playground
Now connect to RisingWave using
psql -h localhost -p 4566 -d dev -U root
Learn more at Quick Start.
For single-node Docker deployments, please refer to Docker Compose.
RisingWave Cloud the easiest way to run a fully-fledged RisingWave cluster. Try it out for free at: cloud.risingwave.com.
Why RisingWave for stream processing?
RisingWave specializes in providing incrementally updated, consistent materialized views — a persistent data structure that represents the results of stream processing. RisingWave significantly reduces the complexity of building stream processing applications by allowing developers to express intricate stream processing logic through cascaded materialized views. Furthermore, it allows users to persist data directly within the system, eliminating the need to deliver results to external databases for storage and query serving.
Compared to existing stream processing systems like Apache Flink, Apache Spark Streaming, and ksqlDB, RisingWave stands out in two primary dimensions: Ease-of-use and cost efficiency, thanks to its PostgreSQL-style interaction experience and Snowflake-like architectural design (i.e., decoupled storage and compute).
- Simple to learn
- RisingWave speaks PostgreSQL-style SQL, enabling users to dive into stream processing in much the same way as operating a PostgreSQL database.
- Simple to develop
- RisingWave operates as a relational database, allowing users to decompose stream processing logic into smaller, manageable, stacked materialized views, rather than dealing with extensive computational programs.
- Simple to integrate
- With integrations to a diverse range of cloud systems and the PostgreSQL ecosystem, RisingWave boasts a rich and expansive ecosystem, making it straightforward to incorporate into existing infrastructures.
- Highly efficient in complex queries
- RisingWave persists internal states in remote storage systems such as S3, and users can confidently and efficiently perform complex streaming queries (for example, joining dozens of data streams) in a production environment, without worrying about state size.
- Transparent dynamic scaling
- RisingWave's state management mechanism enables near-instantaneous dynamic scaling without any service interruptions.
- Instant failure recovery
- RisingWave's state management mechanism also allows it to recover from failure in seconds, not minutes or hours.
RisingWave isn’t a panacea for all data engineering hurdles. It has its own set of limitations:
- No programmable interfaces
- RisingWave does not provide low-level APIs in languages like Java and Scala, and does not allow users to manage internal states manually (unless you want to hack!). For coding in Java, Scala, and other languages, please consider using RisingWave's User-Defined Functions (UDF).
- No support for transaction processing
- RisingWave isn’t cut out for transactional workloads, thus it’s not a viable substitute for operational databases dedicated to transaction processing. However, it supports read-only transactions, ensuring data freshness and consistency. It also comprehends the transactional semantics of upstream database Change Data Capture (CDC).
- Not tailored for ad-hoc analytical queries
- RisingWave's row store design is tailored for optimal stream processing performance rather than interactive analytical workloads. Hence, it's not a suitable replacement for OLAP databases. Yet, a reliable integration with many OLAP databases exists, and a collaborative use of RisingWave and OLAP databases is a common practice among many users.
In-production use cases
Like other stream processing systems, the primary use cases of RisingWave include monitoring, alerting, real-time dashboard reporting, streaming ETL (Extract, Transform, Load), machine learning feature engineering, and more. It has already been adopted in fields such as financial trading, manufacturing, new media, logistics, gaming, and more. Check out customer stories.
Looking for help, discussions, collaboration opportunities, or a casual afternoon chat with our fellow engineers and community members? Join our Slack workspace!
Notes on telemetry
RisingWave collects anonymous usage statistics to better understand how the community is using RisingWave. The sole intention of this exercise is to help improve the product. Users may opt out easily at any time. Please refer to the user documentation for more details.
RisingWave is distributed under the Apache License (Version 2.0). Please refer to LICENSE for more information.
Thanks for your interest in contributing to the project! Please refer to contribution guidelines for more information.