Adnan Rahic

Nov 24, 2021 • JavaScript Congress

Making Data Warehouses fast. A developer's story.

A developer went from 1.5-second BigQuery responses to under 500ms with 100 concurrent users. Here's the open-source tool they used to do it.

#1 about 3 minutes

The developer's struggle with data warehouse latency

High latency in applications built on data warehouses creates a poor user experience and presents a significant challenge for developers.

#2 about 5 minutes

Differentiating between OLAP and OLTP database workloads

Data warehouses serve OLAP workloads: complex, low-volume queries over large datasets. OLTP databases, by contrast, handle high volumes of simple transactions.
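To make the contrast concrete, here is a minimal sketch using Python's built-in `sqlite3` module. The `orders` table, its columns, and the sample rows are hypothetical and chosen only for illustration; a real warehouse would run the analytical query over far more data.

```python
import sqlite3

# Hypothetical in-memory table, used only to contrast the two query shapes.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE orders (id INTEGER PRIMARY KEY, status TEXT, amount REAL)")
conn.executemany(
    "INSERT INTO orders (id, status, amount) VALUES (?, ?, ?)",
    [(1, "completed", 20.0), (2, "completed", 35.0), (3, "shipped", 15.0)],
)

# OLTP-style: fetch one row by primary key. Simple, and typically issued very often.
oltp_row = conn.execute(
    "SELECT id, status, amount FROM orders WHERE id = ?", (2,)
).fetchone()

# OLAP-style: scan the whole table and aggregate. Complex, and typically issued less often.
olap_rows = conn.execute(
    "SELECT status, COUNT(*), SUM(amount) FROM orders GROUP BY status ORDER BY status"
).fetchall()

print(oltp_row)
print(olap_rows)
```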

#3 about 3 minutes

Understanding the key factors of query latency

User-perceived performance is impacted by network delays and data scan times, making sub-second responses a critical goal.

#4 about 7 minutes

Exploring BigQuery's caching and concurrency limitations

BigQuery's cache only serves identical queries, and its concurrency is capped per project. Both constraints limit real-world application performance.
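The "identical queries only" limitation can be illustrated with a toy cache keyed on the exact query text. This is a sketch under stated assumptions, not BigQuery's actual implementation: `ExactMatchCache`, `fake_execute`, and the example SQL are all hypothetical names made up for this example.

```python
import hashlib


class ExactMatchCache:
    """Toy result cache keyed on the exact query text (hypothetical)."""

    def __init__(self):
        self._store = {}
        self.hits = 0
        self.misses = 0

    def run(self, sql, execute):
        # Any difference in the text, even a changed filter value, yields a new key.
        key = hashlib.sha256(sql.encode("utf-8")).hexdigest()
        if key in self._store:
            self.hits += 1
            return self._store[key]
        self.misses += 1
        result = execute(sql)
        self._store[key] = result
        return result


def fake_execute(sql):
    # Stand-in for a slow warehouse call.
    return f"rows for: {sql}"


cache = ExactMatchCache()
cache.run("SELECT COUNT(*) FROM orders WHERE day = '2021-11-01'", fake_execute)  # miss
cache.run("SELECT COUNT(*) FROM orders WHERE day = '2021-11-01'", fake_execute)  # hit
cache.run("SELECT COUNT(*) FROM orders WHERE day = '2021-11-02'", fake_execute)  # miss
print(cache.hits, cache.misses)
```

In a dashboard where each user picks their own filters, most queries differ slightly in text, so a cache of this kind is rarely hit.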

#5 about 4 minutes

Benchmarking BigQuery's performance under concurrent load

Load testing shows that BigQuery holds a consistent query latency of around two seconds regardless of the number of concurrent users, up to its hard concurrency limit.
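A load test of this kind can be sketched with Python's standard library. The harness below is an assumption about the general approach, not the tool used in the talk: `run_load_test` and `stub_query` are hypothetical names, and `stub_query` merely sleeps instead of calling a warehouse.

```python
import statistics
import time
from concurrent.futures import ThreadPoolExecutor


def timed_call(query_fn):
    """Run one query and return its wall-clock latency in seconds."""
    start = time.perf_counter()
    query_fn()
    return time.perf_counter() - start


def run_load_test(query_fn, concurrent_users, requests_per_user):
    """Issue requests from a pool of worker threads and collect every latency."""
    total = concurrent_users * requests_per_user
    with ThreadPoolExecutor(max_workers=concurrent_users) as pool:
        futures = [pool.submit(timed_call, query_fn) for _ in range(total)]
        return [future.result() for future in futures]


def stub_query():
    # Stand-in for a real warehouse request; replace with an actual client call.
    time.sleep(0.01)


latencies = run_load_test(stub_query, concurrent_users=10, requests_per_user=3)
summary = {
    "requests": len(latencies),
    "median_s": statistics.median(latencies),
    "max_s": max(latencies),
}
print(summary)
```

Repeating the run at increasing values of `concurrent_users` and comparing the medians is one way to see whether latency stays flat as concurrency grows.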

#6 about 2 minutes

Introducing Cube as a semantic analytics API layer

Cube provides a semantic layer over data warehouses, enabling caching, pre-aggregations, and access control to build fast data apps.

#7 about 3 minutes

Setting up a local Cube development environment

A local Cube instance can be configured using Docker Compose to connect to BigQuery and automatically generate data schemas.

#8 about 5 minutes

How pre-aggregations dramatically improve query speed

Pre-aggregations act as materialized views that store condensed query results, reducing a query's response time from seconds to milliseconds.
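The idea behind a pre-aggregation can be shown in a few lines of plain Python. This is a conceptual sketch only and does not use Cube's API: the `raw_orders` rows, `build_rollup`, and `completed_orders_per_day` are hypothetical and exist just to show why answering from a condensed table is cheaper than rescanning raw rows.

```python
from collections import defaultdict

# Hypothetical raw fact rows; a real warehouse table would hold millions of these.
raw_orders = [
    {"day": "2021-11-01", "status": "completed", "amount": 20.0},
    {"day": "2021-11-01", "status": "completed", "amount": 35.0},
    {"day": "2021-11-01", "status": "shipped", "amount": 15.0},
    {"day": "2021-11-02", "status": "completed", "amount": 10.0},
    {"day": "2021-11-02", "status": "processing", "amount": 50.0},
]


def build_rollup(rows):
    """Condense raw rows into one cell per (day, status), built once ahead of time."""
    rollup = defaultdict(lambda: {"count": 0, "total_amount": 0.0})
    for row in rows:
        cell = rollup[(row["day"], row["status"])]
        cell["count"] += 1
        cell["total_amount"] += row["amount"]
    return dict(rollup)


def completed_orders_per_day(rollup):
    """Answer a dashboard query from the rollup without touching the raw rows."""
    return {
        day: cell["count"]
        for (day, status), cell in sorted(rollup.items())
        if status == "completed"
    }


rollup = build_rollup(raw_orders)
print(len(raw_orders), "raw rows condensed into", len(rollup), "rollup cells")
print(completed_orders_per_day(rollup))
```

The rollup grows with the number of distinct (day, status) combinations rather than with the number of orders, which is why queries served from it can return in milliseconds.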

#9 about 3 minutes

Comparing benchmark results of Cube vs direct BigQuery

Benchmarks show that using Cube's pre-aggregation layer results in a nearly five-fold performance increase over querying BigQuery directly.
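As a quick sanity check on the figures quoted on this page, the arithmetic below asks what latency a five-fold speedup would imply. It uses only numbers from the summaries (baselines of 1.5 s and about 2 s, and a target of under 500 ms), rounds "nearly five-fold" to exactly 5 as a simplifying assumption, and is not a reproduction of the talk's benchmark.

```python
def implied_latency(baseline_s, speedup_factor):
    """Latency you would expect after dividing the baseline by the speedup factor."""
    return baseline_s / speedup_factor


# Baselines quoted in the summaries; the factor of 5 approximates "nearly five-fold".
baselines_s = {"intro summary": 1.5, "load test chapter": 2.0}
target_s = 0.5

for label, baseline in baselines_s.items():
    latency = implied_latency(baseline, 5)
    print(f"{label}: {baseline} s / 5 = {latency:.2f} s, under target: {latency < target_s}")
```

Under either baseline, a five-fold improvement lands below the 500 ms mark mentioned in the introduction.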

#10 about 8 minutes

Answering questions on Cube's architecture and use cases

The discussion covers when to implement a caching layer, how Cube improves performance, and its utility for medium-sized databases.
