
RisingWave is designed from the ground up as a cloud-native system with decoupled compute and storage.

Read More
| RisingWave | Spark Structured Streaming | |
|---|---|---|
| License | Apache License 2.0 | Apache License 2.0 |
| System category | Streaming database | Micro-batch stream processing engine |
| Architecture | Cloud-native, decoupled compute-storage | Batch-first architecture with micro-batch execution |
| Programming API | SQL + UDF (Python, Java, more) | DataFrame API (Scala, Java, Python), limited SQL |
| Client libraries | Java, Python, Node.js, more | Spark client bindings only |
| State management | Native state persisted in S3 or compatible object storage | In-memory state with checkpointing to HDFS/S3 |
| Query serving | Supports concurrent ad-hoc SQL query serving | Not designed for interactive queries; job-based execution |
| Correctness | Exactly-once semantics, out-of-order support, snapshot read | Exactly-once semantics, but no built-in snapshot isolation |
| Integrations and tooling | PostgreSQL ecosystem, cloud-native tools, Apache Iceberg™ | Hadoop ecosystem, Spark ecosystem |
| Learning curve | Shallow (PostgreSQL-style SQL) | Moderate to steep (requires Spark + streaming concepts) |
| Failure recovery | Instant via S3-backed storage | May require reprocessing; checkpoint restore time varies |
| Dynamic scaling | Transparent and online | Requires job restarts or auto-scaling scripts |
| Performance cost | Low — decoupled storage reduces pressure on compute | High — shuffle-intensive, micro-batch overhead |
| Typical use cases | Streaming ETL, online serving, real-time metrics | Streaming ETL, incremental batch, log pipelines |
| License | Apache License 2.0 |
| System category | Streaming database |
| Architecture | Cloud-native, decoupled compute-storage |
| Programming API | SQL + UDF (Python, Java, more) |
| Client libraries | Java, Python, Node.js, more |
| State management | Native state persisted in S3 or compatible object storage |
| Query serving | Supports concurrent ad-hoc SQL query serving |
| Correctness | Exactly-once semantics, out-of-order support, snapshot read |
| Integrations and tooling | PostgreSQL ecosystem, cloud-native tools, Apache Iceberg™ |
| Learning curve | Shallow (PostgreSQL-style SQL) |
| Failure recovery | Instant via S3-backed storage |
| Dynamic scaling | Transparent and online |
| Performance cost | Low — decoupled storage reduces pressure on compute |
| Typical use cases | Streaming ETL, online serving, real-time metrics |