Integrate Amazon Athena with your Onehouse Lakehouse
This guide shows how to seamlessly integrate Amazon Athena with your Onehouse Managed Lakehouse. This will allow you to power serverless analytics at scale on top of the data in your Lakehouse.
Querying tables in the Data Lakehouse with AWS Athena
AWS Athena automatically queries AWS Glue to locate tables registered by Onehouse. If a table isn't found, ensure your Onehouse Capture Stream is configured to use the AWS Glue catalog. Once the Capture Stream runs, the corresponding Glue catalog entry should be created or updated.
Using Preset.io with Amazon Athena and Onehouse
Preset.io provides documentation at https://docs.preset.io/docs/aws-athena on how to setup Athena and Preset. Below is an example of a URI connection string.
awsathena+pandas://XXXXXX:YYYYYYY@athena.us-west-2.amazonaws.com/onehouse_demo?s3_staging_dir=s3://athena-query-results/&work_group=primary
Change the AWS access key, AWS secret key, AWS region, "onehouse_demo" is the Onehouse database name (same as AWS Glue database name), s3_staging_dir is the location of where Athena stores temporary query results and work_group is "primary" by default.