Deliver data to Delta Lake
Delta Lake is an open-source project that enables building a lakehouse architecture on top of data lakes.
Quick Start
Connect in minutes with SQL
Use CREATE SINK to deliver processed data from RisingWave to Delta Lake. Define your transformation logic in SQL and let RisingWave handle delivery, retries, and exactly-once semantics.
CREATE SINK dl_sink AS
SELECT
    order_status,
    COUNT(*) AS order_count,
    SUM(total_amount) AS total_revenue,
    AVG(total_amount) AS avg_order_value,
    MIN(last_updated) AS first_order_time,
    MAX(last_updated) AS last_order_time
FROM orders_rw
GROUP BY order_status
WITH (
    connector = 'deltalake',
    type = 'append-only',
    -- The aggregation emits updates, so the append-only sink must be forced.
    force_append_only = 'true',
    location = 's3a://my-delta-lake-bucket/path/to/table',
    s3.endpoint = 'https://s3.ap-southeast-1.amazonaws.com',
    s3.access.key = 'access_key',
    s3.secret.key = 'secret_key'
);
For comprehensive configuration details, refer to the Delta Lake connector documentation.
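The sink above reads from orders_rw, which the page does not define. As a minimal sketch, assuming a plain RisingWave table whose column names are taken from the query and whose types and the order_id key are hypothetical, the upstream side might look like this. Check the connector documentation for prerequisites on the Delta Lake side, such as the target table and its schema.

```sql
-- Hypothetical upstream table; column types and order_id are assumptions
-- inferred from the columns the sink query references.
CREATE TABLE orders_rw (
    order_id      INT PRIMARY KEY,
    order_status  VARCHAR,
    total_amount  DECIMAL,
    last_updated  TIMESTAMP
);

-- Sample rows so the sink query has something to aggregate.
INSERT INTO orders_rw VALUES
    (1, 'shipped',   120.50, '2024-01-01 10:00:00'),
    (2, 'shipped',    80.00, '2024-01-01 11:30:00'),
    (3, 'cancelled',  45.25, '2024-01-02 09:15:00');

-- Make the inserted rows visible to downstream jobs right away.
FLUSH;
```

In practice orders_rw would more likely be fed by a source connector or CDC stream than by manual inserts; the inserts here only make the sketch self-contained.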
Capabilities
What you can do with RisingWave + Delta Lake
Continuous Delivery
Automatically sink processed results from RisingWave to Delta Lake as new data arrives. No batch jobs needed.
Exactly-once Semantics
Barrier-based checkpointing guarantees data correctness when delivering to Delta Lake.
Upsert & Append
Choose between append-only and upsert modes when sinking to Delta Lake, depending on your use case.
Sink Decoupling
Built-in buffering ensures RisingWave stays stable even if Delta Lake experiences temporary slowdowns.
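As a rough illustration of the delivery options above, the sketch below turns on sink decoupling for the session and then creates an append-only sink that forwards rows without aggregation. The sink name orders_raw_sink, the bucket path, and the credentials are placeholders, not values from this page, and the default for sink_decouple may differ between RisingWave versions.

```sql
-- Buffer sink output so a slow Delta Lake target does not stall
-- upstream processing. Applies to sinks created later in this session.
SET sink_decouple = true;

-- Hypothetical sink that forwards every row of orders_rw as-is.
CREATE SINK orders_raw_sink FROM orders_rw
WITH (
    connector = 'deltalake',
    type = 'append-only',
    -- Needed unless orders_rw is itself declared append-only.
    force_append_only = 'true',
    location = 's3a://my-delta-lake-bucket/path/to/raw_table',  -- placeholder path
    s3.endpoint = 'https://s3.ap-southeast-1.amazonaws.com',
    s3.access.key = 'access_key',
    s3.secret.key = 'secret_key'
);
```

With force_append_only set, updates and deletes on orders_rw are not propagated as changes to existing Delta Lake rows, so this form suits event-style data better than frequently updated records.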
Start streaming in minutes
Connect to Delta Lake with just a few lines of SQL. No infrastructure to manage, no code to write.