Catalogs
Overview
A catalog is a searchable inventory of your data assets and their associated metadata, such as information about tables, partitions, and indexes.
Onehouse lets you connect your data catalogs and later sync them with the Stream Captures you create. You can connect the following data catalogs, with more coming soon:
Catalog | AWS projects | GCP projects |
---|---|---|
AWS Glue Metastore | ✅ Supported | Not supported |
Hive Metastore | ✅ Supported | ✅ Supported |
DataProc Metastore | Not supported | ✅ Supported |
BigQuery + BigLake | ✅ Supported | ✅ Supported |
DataHub | ✅ Supported | ✅ Supported |
Onetable (aka XTable) | ✅ Supported | ✅ Supported |
Databricks Unity Catalog | ✅ Supported | ✅ Supported |
Snowflake | ✅ Supported | ✅ Supported |
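
If you script checks before connecting a catalog, the support matrix above can be expressed as a small lookup. The Python sketch below only transcribes the table. The names `SUPPORTED_CLOUDS`, `is_supported`, and `catalogs_for` are hypothetical and are not part of any Onehouse API.

```python
# Illustrative only: a transcription of the support table above.
# None of these names come from Onehouse; they are assumptions for this sketch.
SUPPORTED_CLOUDS = {
    "AWS Glue Metastore": {"AWS"},
    "Hive Metastore": {"AWS", "GCP"},
    "DataProc Metastore": {"GCP"},
    "BigQuery + BigLake": {"AWS", "GCP"},
    "DataHub": {"AWS", "GCP"},
    "Onetable (aka XTable)": {"AWS", "GCP"},
    "Databricks Unity Catalog": {"AWS", "GCP"},
    "Snowflake": {"AWS", "GCP"},
}


def is_supported(catalog: str, cloud: str) -> bool:
    """Return True if the table lists `catalog` as supported for `cloud` projects."""
    if catalog not in SUPPORTED_CLOUDS:
        raise KeyError(f"Unknown catalog: {catalog!r}")
    return cloud.upper() in SUPPORTED_CLOUDS[catalog]


def catalogs_for(cloud: str) -> list[str]:
    """Return the catalogs the table lists for a cloud, sorted by name."""
    wanted = cloud.upper()
    return sorted(name for name, clouds in SUPPORTED_CLOUDS.items() if wanted in clouds)
```

For example, `is_supported("AWS Glue Metastore", "GCP")` returns `False`, matching the "Not supported" cell in the table.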
Add data catalogs
Under the Connections section in the Onehouse nav bar, open the Catalogs page, then click Add New Catalog to add a catalog.