Álvaro Martín Lozano
Implementing continuous delivery in a data processing pipeline
#1about 4 minutes
From research concepts to production-ready data products
The Volkswagen Data Lab shifted its focus from demonstrating proof-of-concepts to building and deploying real-world data solutions for its clients.
#2about 7 minutes
Core concepts of continuous delivery for data
Continuous delivery for data pipelines requires adapting standard CI/CD principles, where data is the deliverable, by progressing through version control, integration, and deployment stages.
#3about 11 minutes
Implementing a pipeline with immutable, versioned data
The five-step pipeline relies on treating data as immutable, creating a new versioned output for each run to enable simple rollbacks and reproducibility.
#4about 6 minutes
The challenge of orchestrating chained data jobs
Managing dependencies between jobs becomes complex when each job consumes versioned, immutable data inputs from upstream processes.
#5about 5 minutes
Pros and cons of the immutable data approach
While this method offers powerful benefits like reproducibility and instant rollbacks, it introduces challenges in orchestration complexity and increased storage costs.
Related jobs
Jobs that call for the skills explored in this talk.
Matching moments
28:19 MIN
The distinct roles of CI and CD pipelines
#90DaysOfDevOps - The DevOps Learning Journey
12:28 MIN
Using continuous delivery to enable business agility
The Affordances of Quality
14:52 MIN
Applying software engineering practices to data pipelines
Enjoying SQL data pipelines with dbt
02:53 MIN
Defining continuous integration, delivery, and deployment
CI/CD with Github Actions
01:16 MIN
Tracing the evolution of DevOps from silos to superhighways
Navigating the AI Wave in DevOps
11:32 MIN
Adopting trunk-based development and continuous delivery
100 times more frequent deployments: How did we create a high performance team?
23:19 MIN
Testing against production with continuous deployment
How to Destroy a Monolith?
18:08 MIN
Implementing an AI-in-the-loop continuous learning cycle
A solution to embed container technologies into automotive environments
Featured Partners
Related Videos
Python-Based Data Streaming Pipelines Within Minutes
Bobur Umurzokov
Charting the Journey to Continuous Deployment with a Value Stream Map
Josh Armitage
Enabling automated 1-click customer deployments with built-in quality and security
Christoph Ruggenthaler
CI/CD Patterns and Antipatterns - Things your Pipeline Should (Not) Do
Daniel Raniz Raneland
Industrializing your Data Science capabilities
Dubravko Dolic & Hüdaverdi Cakir
Progressive Delivery in Kubernetes
Carlos Sanchez
Practical tips and tricks for CI/CD success
Zan Markan
Plan CI/CD on the Enterprise level!
Pawel Piwosz
Related Articles
View all articles



From learning to earning
Jobs that call for the skills explored in this talk.






Data Engineer - AWS Cloud & Data Pipelines 98% remote ID2398S
mund consulting AG
Intermediate
Gitlab
Confluence
Machine Learning


Senior Data Engineer - AWS Cloud & Big Data Pipelines 98% remote ID2394S
mund consulting AG
Senior
GIT
Docker
Data analysis
Continuous Integration

DevOps Architect Pipeline / Dev Container / OpenShift
Siemens AG
Berlin, Germany
GIT
CMake
Linux
DevOps
Gitlab
+5