Hive Metastore
Description
Hive Metastore (HMS) is a central repository for storing metadata about the structure and location of data in Apache Hive, a data warehouse system built on top of Hadoop. The metastore allows users to store and manage table schema, partition information, and other metadata, supporting data processing and querying with several processing engines.
Setup guide
- Enter a Name to identify the data catalog in Onehouse
- Select Hive Metastore as the Type
- Enter the Servers
Multi-format Catalog Sync
Onehouse natively supports syncing to your Hive Metastore in multiple Open Table Formats(Apache Hudi and Apache Iceberg using Apache XTable). This means that a single copy of your data will now be synced to HMS as Hudi format and Iceberg format metadata, enabling table format interoperability - allowing you to use the best format for your use-case.
In order to set this up, select the formats that you would like to sync as and define the format suffix for the table name (Iceberg format will default to _iceberg). Thus any Iceberg format tables will be registered as tableName_iceberg in your HMS. An example of this can be found in the table below.
Format | Table Name (in catalog) |
---|---|
Hudi | tableName_ro (read optimized view) tableName_rt (real time or snapshot view) |
Iceberg | tableName_iceberg |
Onehouse managed Iceberg tables should not be written to via external writers - this could corrupt the data in the table