DataProc Metastore
Description
DataProc Metastore is a fully managed, highly available metadata management service for Apache Hive and other compatible data processing engines on Google Cloud. It provides a centralized repository for storing and managing metadata about tables, partitions, and schemas, making it easier to process and analyze data using engines like Apache Spark, Presto, and others on Google Cloud Dataproc.
Setup guide
- Enter a Name to identify the data catalog in Onehouse
- Select DataProc Metastore as the Type
- Enter the Servers
Note that DataProc metastores can only be created in GCP projects.