Google Cloud Storage
SourceOpen-Source

Ingest data from Google Cloud Storage

Google Cloud Storage is a RESTful online file storage web service for storing and accessing data on Google Cloud Platform infrastructure.

Quick Start

Connect in minutes with SQL

Use CREATE SOURCE or CREATE TABLE to ingest data from Google Cloud Storage into RisingWave. No plugins, no middleware — just PostgreSQL-compatible SQL.

SQL
CREATE TABLE orders_rw (
    order_id INTEGER PRIMARY KEY,
    customer_id INTEGER,
    order_status VARCHAR,
    total_amount DECIMAL,
    last_updated TIMESTAMP)
INCLUDE file as file_name
INCLUDE offset -- default column name is `_rw_gcs_offset`
WITH (
    connector = 'gcs',
    gcs.bucket_name = 'bucket',
    gcs.credential = 'gcs_credential'
) FORMAT PLAIN ENCODE JSON (
    without_header = 'true',
    delimiter = ',' -- set delimiter = E'	' for tab-separated files
);

For comprehensive configuration details, please refer to the Google Cloud Storage connector documentation.

Capabilities

What you can do with RisingWave + Google Cloud Storage

Real-time Ingestion

Continuously stream data from Google Cloud Storage into RisingWave with sub-second latency. Process millions of events per second.

SQL Transformations

Join Google Cloud Storage data with other sources, apply windowing, aggregation, and filtering — all in standard SQL.

Materialized Views

Create incrementally maintained materialized views over Google Cloud Storage data. Always fresh, always queryable.

Multi-format Support

Supports Avro, JSON, Protobuf, CSV, and more. Compatible with Schema Registry for schema evolution.

Start streaming in minutes

Connect to Google Cloud Storage with just a few lines of SQL. No infrastructure to manage, no code to write.

Best-in-Class Event Streaming
for Agents, Apps, and Analytics
GitHubXLinkedInSlackYouTube
Sign up for our to stay updated.