Skip to main content

Google Cloud Storage

Description

Continuously and incrementally stream data directly from any S3 bucket into your Onehouse-managed lakehouse. Each file is processed exactly once. If the content of an object key is modified by overwriting it, Onehouse may or may not process the updated content, depending on when the object is consumed. From correctness and data completeness perspective, it is recommended to create a new file instead of modifying the content of an existing object.

Follow the setup guide within the Onehouse console to get started. Click Sources > Add New Source > Google Cloud Storage.

Schema Information

Onehouse will infer the source schema by reading a sample of the files to be ingested.