Delta Lake
DestinationOpen-Source

Deliver data to Delta Lake

Delta Lake is an open source project that enables building a Lakehouse architecture on top of data lakes.

Quick Start

Connect in minutes with SQL

Use CREATE SINK to deliver processed data from RisingWave to Delta Lake. Define your transformation logic in SQL and let RisingWave handle delivery, retries, and exactly-once semantics.

SQL
CREATE SINK dl_sink AS
SELECT
    order_status,
    COUNT(*) as order_count,
    SUM(total_amount) as total_revenue,
    AVG(total_amount) as avg_order_value,
    MIN(last_updated) as first_order_time,
    MAX(last_updated) as last_order_time
FROM orders_rw
WITH (
    connector = 'deltalake',
    type = 'append-only',
    location = 's3a://my-delta-lake-bucket/path/to/table',
    s3.endpoint = 'https://s3.ap-southeast-1.amazonaws.com',
    s3.access.key = 'access_key',
    s3.secret.key = 'secret_key'
);

For comprehensive configuration details, please refer to the Delta Lake connector documentation.

Capabilities

What you can do with RisingWave + Delta Lake

Continuous Delivery

Automatically sink processed results from RisingWave to Delta Lake as new data arrives. No batch jobs needed.

Exactly-once Semantics

Guaranteed data correctness when delivering to Delta Lake with barrier-based checkpointing.

Upsert & Append

Support both append-only and upsert modes when sinking to Delta Lake, depending on your use case.

Sink Decoupling

Built-in buffering ensures RisingWave stays stable even if Delta Lake experiences temporary slowdowns.

Start streaming in minutes

Connect to Delta Lake with just a few lines of SQL. No infrastructure to manage, no code to write.

Best-in-Class Event Streaming
for Agents, Apps, and Analytics
GitHubXLinkedInSlackYouTube
Sign up for our to stay updated.