RisingWave is designed from the ground up as a cloud-native system with decoupled compute and storage.
Read More
RisingWave | Spark Structured Streaming | |
---|---|---|
License | Apache License 2.0 | Apache License 2.0 |
System category | Streaming database | Micro-batch stream processing engine |
Architecture | Cloud-native, decoupled compute-storage | Batch-first architecture with micro-batch execution |
Programming API | SQL + UDF (Python, Java, more) | DataFrame API (Scala, Java, Python), limited SQL |
Client libraries | Java, Python, Node.js, more | Spark client bindings only |
State management | Native state persisted in S3 or compatible object storage | In-memory state with checkpointing to HDFS/S3 |
Query serving | Supports concurrent ad-hoc SQL query serving | Not designed for interactive queries; job-based execution |
Correctness | Exactly-once semantics, out-of-order support, snapshot read | Exactly-once semantics, but no built-in snapshot isolation |
Integrations and tooling | PostgreSQL ecosystem, cloud-native tools, Apache Iceberg™ | Hadoop ecosystem, Spark ecosystem |
Learning curve | Shallow (PostgreSQL-style SQL) | Moderate to steep (requires Spark + streaming concepts) |
Failure recovery | Instant via S3-backed storage | May require reprocessing; checkpoint restore time varies |
Dynamic scaling | Transparent and online | Requires job restarts or auto-scaling scripts |
Performance cost | Low — decoupled storage reduces pressure on compute | High — shuffle-intensive, micro-batch overhead |
Typical use cases | Streaming ETL, online serving, real-time metrics | Streaming ETL, incremental batch, log pipelines |
License | Apache License 2.0 |
System category | Streaming database |
Architecture | Cloud-native, decoupled compute-storage |
Programming API | SQL + UDF (Python, Java, more) |
Client libraries | Java, Python, Node.js, more |
State management | Native state persisted in S3 or compatible object storage |
Query serving | Supports concurrent ad-hoc SQL query serving |
Correctness | Exactly-once semantics, out-of-order support, snapshot read |
Integrations and tooling | PostgreSQL ecosystem, cloud-native tools, Apache Iceberg™ |
Learning curve | Shallow (PostgreSQL-style SQL) |
Failure recovery | Instant via S3-backed storage |
Dynamic scaling | Transparent and online |
Performance cost | Low — decoupled storage reduces pressure on compute |
Typical use cases | Streaming ETL, online serving, real-time metrics |