📄️ AWS Lake Formation Integration with Onehouse
This guide shows how AWS Lake Formation integrates with the Onehouse managed lakehouse.
📄️ Build a SageMaker ML model on a Onehouse lakehouse
This guide provides a pattern for bringing machine learning capabilities to data stored in Onehouse using Amazon SageMaker.
📄️ Examine Onehouse quarantine tables with AWS Glue and Athena
Problem: On AWS, Onehouse quarantine tables don't have Glue metadata, so they cannot be queried using Athena.
📄️ Cross-region Hudi Disaster Recovery using Savepoints
This guide provides a pattern for creating a cross-region disaster recovery solution for Hudi using savepoints.
📄️ Onehouse Kafka Development Recommendations
When designing messages for Apache Kafka, consider these best practices to ensure efficient, reliable, and scalable data pipelines:
📄️ Table Types: Merge on Read and Copy on Write
Merge on Read (MoR) and Copy on Write (CoW) are the two table types available in Apache Hudi. The table type determines how data updates are managed.
📄️ Onehouse Cross Account Data Access
At times, Onehouse customers may want to read data from sources in other AWS
📄️ Onehouse Table Format Conversion and Catalog Sync
This guide outlines the steps to convert an existing Apache Hudi table to Apache Iceberg, and synchronize the data to the Snowflake Catalog. This process includes guaranteeing file visibility for Onehouse and performing a metadata synchronization job utilizing both the Onetable (Xtable) catalog and the Databricks Unity Catalog.