Sources
Overview
Onehouse allows you to ingest data from external sources into your data lakehouse with Stream Captures. You can set up Stream Captures with the following external data sources (and more coming soon):
External source | AWS projects | GCP projects |
---|---|---|
AWS S3 | ✅ Supported | Not supported |
Google Cloud Storage | Not supported | ✅ Supported |
Apache Kafka | ✅ Supported | ✅ Supported |
Confluent Kafka | ✅ Supported | ✅ Supported |
Amazon MSK Kafka | ✅ Supported | Not supported |
Confluent CDC | ✅ Supported | ✅ Supported |
In addition to external sources, any Stream Capture can use an existing Onehouse table as the source.
Add external sources
Under the Connections section in the Onehouse nav bar, open the Sources page, then click Add New Source. From here, you can add external sources.
Select an external source to add, then follow the instructions within Onehouse.