BigQuery and BigLake
Description
Google Cloud's Data Catalog serves as a centralized metadata management system for both BigQuery and BigLake storage systems on Google Cloud. It enables users to store, manage, and discover metadata about tables, schemas, and partitions across the two services. The catalog supports search and discovery of datasets. With a unified view of all data assets, the BigQuery + BigLake catalog simplifies data governance, lineage tracking, and access control.
Cloud Provider Support
- AWS: Not supported
- GCP: ✅ Supported
Setup guide
- Enter a Name to identify the data catalog in Onehouse
- Select BigQuery as the Type
- Enter the Project Name
- Optional: Add BigLake Connection ID (for BigLake tables i.e.
projects/<project-id>/locations/<location>/connections/<connection-id>) - Optional: You may choose to require a partition filter
- Optional: You may choose to add Iceberg support by checking the "Use Iceberg" checkbox
Note that as of now, BigQuery Catalog can only be created in GCP projects.
BigQuery + BigLake Catalog example
