📄️ Apache Hudi Quickstart with SQL
Below is a subset of the Apache Hudi Spark SQL quickstart documentation, which will help you get up and running with SQL Clusters. Also see the Apache Hudi docs for full quickstarts on SQL DDL and SQL DML.
📄️ Bronze to Silver ETL Pipeline with SQL
This guide demonstrates how to implement a silver pipeline consuming from a bronze table created by Onehouse stream capture using Hudi and SQL.
📄️ Connect to a SQL Cluster via Bastion Host
By default, SQL Clusters require that you connect from within your VPC. You can achieve this with a VPN or by connecting through a bastion host.
📄️ Orchestrate Onehouse SQL with MWAA (Amazon Managed Airflow)
This guide walks through setting up MWAA to orchestrate queries via the Onehouse SQL Endpoint.
📄️ Connect dbt Cloud
This guide outlines the steps to connect dbt Cloud to a Onehouse SQL Cluster and run the Jaffle Shop example project.
📄️ Connect dbt Core
dbt is a framework that makes it easy to transform data, with official support for Onehouse documented at https://docs.getdbt.com/docs/cloud/connect-data-platform/connect-onehouse.
📄️ Connect DbVisualizer
You can query tables with DbVisualizer by connecting to the SQL Cluster from within your VPC. Follow these steps within DbVisualizer:
📄️ Connect beeline
beeline is a client for running Spark SQL queries on a Thrift Server via the command line with a JDBC connection.