Move your data from Google Cloud Storage to Delta Lake, continuously

Continuously ingest data from different sources, transform it on the fly, and deliver it to any destination using RisingWave’s connectors.
Google Cloud Storage → RisingWave → Delta Lake
Google Cloud Storage
CREATE TABLE orders_rw (
    order_id INTEGER PRIMARY KEY,
    customer_id INTEGER,
    order_status VARCHAR,
    total_amount DECIMAL,
    last_updated TIMESTAMP)
INCLUDE file as file_name
INCLUDE offset -- default column name is `_rw_gcs_offset`
WITH (
    connector = 'gcs',
    gcs.bucket_name = 'bucket',
    gcs.credential = 'gcs_credential'
) FORMAT PLAIN ENCODE CSV ( -- without_header and delimiter are CSV options
    without_header = 'true',
    delimiter = ',' -- set delimiter = E'\t' for tab-separated files
);
For comprehensive configuration details, please refer to the Google Cloud Storage connector documentation.
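Once the table exists, a quick query can confirm that files from the bucket are being ingested. The following is a minimal sketch, assuming the `orders_rw` table above has been created and the bucket already holds matching files; the `rows_ingested` alias and the `LIMIT` value are arbitrary choices for illustration.

```sql
-- Count the rows ingested so far, per source file.
-- file_name is the column added by the INCLUDE file clause above.
SELECT file_name, COUNT(*) AS rows_ingested
FROM orders_rw
GROUP BY file_name;

-- Peek at a few of the ingested orders.
SELECT order_id, customer_id, order_status, total_amount, last_updated
FROM orders_rw
LIMIT 10;
```

If the first query returns no rows, the bucket name, credential, or file format are the first things to check.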
RisingWave
CREATE SINK dl_sink AS
SELECT
    order_status,
    COUNT(*) as order_count,
    SUM(total_amount) as total_revenue,
    AVG(total_amount) as avg_order_value,
    MIN(last_updated) as first_order_time,
    MAX(last_updated) as last_order_time
FROM orders_rw
GROUP BY order_status
WITH (
    connector = 'deltalake',
    type = 'append-only',
    force_append_only = 'true', -- the aggregate query emits updates, not only inserts
    location = 's3a://my-delta-lake-bucket/path/to/table',
    s3.endpoint = 'https://s3.ap-southeast-1.amazonaws.com',
    s3.access.key = 'access_key',
    s3.secret.key = 'secret_key'
);
For comprehensive configuration details, please refer to the Delta Lake connector documentation.
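To inspect the aggregated result inside RisingWave before, or alongside, writing it to Delta Lake, the same kind of query can be defined as a materialized view. This is a sketch under assumptions: the view name `order_stats_mv` is hypothetical and not part of the original example, and it relies on the `orders_rw` table defined earlier.

```sql
-- Hypothetical view name. Grouping by order_status gives one
-- continuously maintained row per status.
CREATE MATERIALIZED VIEW order_stats_mv AS
SELECT
    order_status,
    COUNT(*) AS order_count,
    SUM(total_amount) AS total_revenue,
    AVG(total_amount) AS avg_order_value
FROM orders_rw
GROUP BY order_status;

-- Query the view like an ordinary table to check the current aggregates.
SELECT * FROM order_stats_mv ORDER BY order_status;
```

Checking the view first makes it easier to separate query problems from sink configuration problems.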
Delta Lake