Data Engineering

Deploying Walden on AKS

3 minute read Published: 2022-06-14

Walden is our reference implementation of a data warehouse. After adding instructions for its deployment on Amazon's EKS last month, we are now also supporting it on Microsoft's Azure Kubernetes Service (AKS).

Deploying Walden on EKS

3 minute read Published: 2022-05-17

Walden is our reference implementation of a data warehouse. We are now supporting it on Amazon's Elastic Kubernetes Service. Follow deployment instructions here, or read more information about our experience deploying a data warehouse on AWS below.

Adding Alluxio to Walden

4 minute read Published: 2022-04-18

We have added Alluxio to Walden, our reference implementation of a small data lake. Alluxio provides a unified view into one or more underlying storage sources, adding caching and translation on top of them. This can greatly improve overall Trino performance across queries, while also enabling support for external storage types like NFS that are not supported natively by Trino.

Introducing Walden

5 minute read Published: 2021-02-15

We have built Walden, a small data lake for (mostly) solitary use, consisting of a set of configurations and images for deployment into a Kubernetes cluster. We are releasing the code as free and open source software, hoping to lower some of the barriers to entry to the world of big data and AI. Check it out on our github, or read below for more info!