Adnan Rahic

Making Data Warehouses fast. A developer's story.

A developer went from 1.5-second BigQuery responses to under 500ms with 100 concurrent users. Here's the open-source tool they used to do it.

Making Data Warehouses fast. A developer's story.
#1about 3 minutes

The developer's struggle with data warehouse latency

High latency in applications built on data warehouses creates a poor user experience and presents a significant challenge for developers.

#2about 5 minutes

Differentiating between OLAP and OLTP database workloads

Data warehouses use OLAP for complex, low-volume queries on large datasets, contrasting with OLTP's high-volume, simple transactions.

#3about 3 minutes

Understanding the key factors of query latency

User-perceived performance is impacted by network delays and data scan times, making sub-second responses a critical goal.

#4about 7 minutes

Exploring BigQuery's caching and concurrency limitations

BigQuery's cache only works for identical queries and its concurrency is capped per project, impacting real-world application performance.

#5about 4 minutes

Benchmarking BigQuery's performance under concurrent load

Load testing reveals that BigQuery maintains a consistent query latency of around two seconds regardless of user concurrency up to its hard limit.

#6about 2 minutes

Introducing Cube as a semantic analytics API layer

Cube provides a semantic layer over data warehouses, enabling caching, pre-aggregations, and access control to build fast data apps.

#7about 3 minutes

Setting up a local Cube development environment

A local Cube instance can be configured using Docker Compose to connect to BigQuery and automatically generate data schemas.

#8about 5 minutes

How pre-aggregations dramatically improve query speed

Pre-aggregations act as materialized views that store condensed query results, reducing a query's response time from seconds to milliseconds.

#9about 3 minutes

Comparing benchmark results of Cube vs direct BigQuery

Benchmarks show that using Cube's pre-aggregation layer results in a nearly five-fold performance increase over querying BigQuery directly.

#10about 8 minutes

Answering questions on Cube's architecture and use cases

The discussion covers when to implement a caching layer, how Cube improves performance, and its utility for medium-sized databases.

Related jobs
Jobs that call for the skills explored in this talk.

job ad

Saby Company
Delebio, Italy

Intermediate

d

Saby Company
Delebio, Italy

Junior

Featured Partners

Related Articles

View all articles
BB
Benedikt Bischof
Making Data Warehouses Fast: A Developer’s Story
Welcome to this issue of the WeAreDevelopers Live Talk series. This article recaps an interesting talk by Adnan Rahic who teaches the audience how to make data warehouses.About the Speaker: Adnan is senior developers advocate at Cube. His passion lie...
Making Data Warehouses Fast: A Developer’s Story
CH
Chris Heilmann
Dev Digest 139 - Soft and hard queries
News and ArticlesLet's start with Amelia Wattenberger's excellent essay Bridging the hard and the soft talking about humans and data. Do you sometimes also miss aspects of the early web like a lack of big data and freedom to create without numeric go...
Dev Digest 139 - Soft and hard queries

From learning to earning

Jobs that call for the skills explored in this talk.