Dora Petrella

Oct 6, 2023 • World Congress 2023

How We Built a Machine Learning-Based Recommendation System (And Survived to Tell the Tale)

How do you find the perfect substitute for an out-of-stock item? Learn how we adapted a natural language model to solve this critical e-commerce challenge.

#1 about 5 minutes

Defining the business need for product recommendations

A recommendation system for substitute products is needed across multiple touchpoints to prevent lost sales from out-of-stock items.

#2 about 2 minutes

Analyzing the limitations of the existing recommender

The previous system, based on the Jaccard coefficient, produced low-quality recommendations, particularly for new or unpopular items.
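
The cold-start weakness can be seen in a minimal sketch. It assumes the coefficient is computed over the sets of sessions in which each product appears; the talk summary does not say which sets the previous system compared, and the sessions and product IDs below are hypothetical.

```python
from collections import defaultdict


def jaccard(a: set, b: set) -> float:
    """Jaccard coefficient |A ∩ B| / |A ∪ B|; 0.0 when both sets are empty."""
    union = a | b
    return len(a & b) / len(union) if union else 0.0


def sessions_by_product(sessions):
    """Map each product ID to the set of session indices it appears in."""
    index = defaultdict(set)
    for session_id, products in enumerate(sessions):
        for product in products:
            index[product].add(session_id)
    return index


# Hypothetical toy sessions; product IDs are illustrative only.
sessions = [
    ["milk", "oat_milk", "bread"],
    ["milk", "oat_milk"],
    ["milk", "bread"],
    ["new_soy_milk"],  # a new item that has been seen once, on its own
]
index = sessions_by_product(sessions)

# Two shared sessions out of three in the union.
print(jaccard(index["milk"], index["oat_milk"]))
# No shared sessions, so the new item can never be recommended for "milk".
print(jaccard(index["milk"], index["new_soy_milk"]))
```

Because the score depends only on observed co-occurrence, an item with little or no history scores zero against everything, regardless of how similar it actually is.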

#3 about 5 minutes

Using the Prod2Vec algorithm for recommendations

The Prod2Vec algorithm, adapted from Word2Vec, learns product relationships by analyzing co-occurrence within user session context windows.
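
A rough illustration of the context-window idea follows. It assumes the skip-gram formulation of Word2Vec, in which each product in a session is paired with its neighbors inside a fixed window; the function name, window size, and session are hypothetical, and the talk summary does not state which variant or library was used.

```python
def skipgram_pairs(session, window=2):
    """Yield (target, context) product pairs at most `window` positions apart."""
    for i, target in enumerate(session):
        lo = max(0, i - window)
        hi = min(len(session), i + window + 1)
        for j in range(lo, hi):
            if j != i:
                yield target, session[j]


# Hypothetical user session, treated like a sentence whose words are product IDs.
session = ["pasta", "tomato_sauce", "parmesan", "basil"]
pairs = list(skipgram_pairs(session, window=1))

for target, context in pairs:
    print(target, "->", context)
```

A Word2Vec-style model is then trained on these pairs, so that products appearing in similar session contexts end up with nearby embedding vectors.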

#4 about 2 minutes

Improving predictions with Meta-Prod2Vec and metadata

Incorporating product metadata like category and brand into the model (Meta-Prod2Vec) significantly improves recommendation quality for long-tail items.
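
One way to picture the metadata extension is as extra training pairs. The sketch below assumes a single metadata token per product (here a brand) and emits pairs linking items to metadata and metadata to metadata in addition to the plain item-to-item pairs. The token format, function name, and data are hypothetical, and the production model may weight or select these pair types differently.

```python
def meta_pairs(session, metadata, window=1):
    """Yield training pairs over items and their metadata tokens.

    `metadata` maps each product ID to one metadata token (an assumption
    made to keep the sketch small).
    """
    for i, item in enumerate(session):
        item_meta = metadata[item]
        yield item, item_meta  # item -> its own metadata
        lo = max(0, i - window)
        hi = min(len(session), i + window + 1)
        for j in range(lo, hi):
            if j == i:
                continue
            ctx = session[j]
            ctx_meta = metadata[ctx]
            yield item, ctx            # item -> context item (plain Prod2Vec)
            yield item, ctx_meta       # item -> context item's metadata
            yield item_meta, ctx       # metadata -> context item
            yield item_meta, ctx_meta  # metadata -> context item's metadata


# Hypothetical session and metadata; "penne_new" stands in for a long-tail item.
session = ["spaghetti_a", "penne_new"]
metadata = {"spaghetti_a": "brand:acme", "penne_new": "brand:bello"}
pairs = list(meta_pairs(session, metadata, window=1))
print(len(pairs), "training pairs")
```

A rarely purchased item still contributes through its metadata token, which is shared with better-observed products, and this is one plausible reason the long tail benefits.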

#5 about 2 minutes

Implementing the end-to-end MLOps pipeline

The production system uses dbt for data transformation, a Vertex AI pipeline for model training, and Elasticsearch for efficient vector similarity search.
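
The serving step amounts to a nearest-neighbor lookup over product embeddings. The sketch below shows the idea as a brute-force cosine-similarity search in plain Python; it is not the Elasticsearch configuration used in the talk, which the summary does not describe, and the vectors and product IDs are made up.

```python
import math


def cosine(u, v):
    """Cosine similarity of two equal-length vectors; 0.0 if either is all zeros."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0


def top_k_similar(query_id, embeddings, k=2):
    """Return the IDs of the k products most similar to `query_id`."""
    query = embeddings[query_id]
    scored = [
        (cosine(query, vector), product_id)
        for product_id, vector in embeddings.items()
        if product_id != query_id
    ]
    scored.sort(reverse=True)
    return [product_id for _, product_id in scored[:k]]


# Hypothetical 3-dimensional embeddings; real models use far more dimensions.
embeddings = {
    "oat_milk": [0.9, 0.1, 0.0],
    "soy_milk": [0.8, 0.2, 0.1],
    "almond_milk": [0.7, 0.3, 0.0],
    "dish_soap": [0.0, 0.1, 0.9],
}
print(top_k_similar("oat_milk", embeddings, k=2))
```

A dedicated vector index avoids scanning every product per request, which is presumably why a search engine was chosen over a lookup like this one.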

#6 about 3 minutes

Evaluating model performance with offline and online metrics

Offline metrics like NDCG confirmed model quality, while mirror traffic analysis showed a 45% increase in product recommendation coverage.
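
For readers unfamiliar with the two metrics, here is a minimal sketch. NDCG is computed with the common log2 discount, taking the ideal ordering from the same list of relevance labels. Coverage is assumed to mean the share of catalog products that receive at least one recommendation; the talk summary does not define it, and all data below is hypothetical.

```python
import math


def dcg(relevances):
    """Discounted cumulative gain: relevance at rank r is divided by log2(r + 1)."""
    return sum(rel / math.log2(rank + 2) for rank, rel in enumerate(relevances))


def ndcg_at_k(relevances, k):
    """NDCG@k for one ranked list of relevance labels (higher is better)."""
    ideal_dcg = dcg(sorted(relevances, reverse=True)[:k])
    return dcg(relevances[:k]) / ideal_dcg if ideal_dcg else 0.0


def coverage(recommendations, catalog):
    """Share of catalog products with at least one recommendation (assumed definition)."""
    covered = sum(1 for product in catalog if recommendations.get(product))
    return covered / len(catalog) if catalog else 0.0


# The only relevant substitute is ranked first versus third.
print(ndcg_at_k([1, 0, 0], k=3))
print(ndcg_at_k([0, 0, 1], k=3))

# Hypothetical catalog where one product has no recommendations.
catalog = ["a", "b", "c", "d"]
recommendations = {"a": ["b"], "b": ["a", "c"], "c": ["b"], "d": []}
print(coverage(recommendations, catalog))
```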

#7 about 3 minutes

Visualizing product relationships with the Embedding Projector

Using TensorFlow's Embedding Projector tool reveals how the model groups similar products into distinct clusters in a high-dimensional space.
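
The Embedding Projector can load embeddings from two tab-separated files: one with a vector per line and one with the matching metadata. The sketch below builds both as strings; the column names, helper name, and data are hypothetical, and how the team actually exported its embeddings is not stated.

```python
import csv
import io


def to_projector_tsv(embeddings, categories):
    """Return (vectors_tsv, metadata_tsv) strings with rows in the same order."""
    vectors = io.StringIO()
    metadata = io.StringIO()
    vector_writer = csv.writer(vectors, delimiter="\t", lineterminator="\n")
    metadata_writer = csv.writer(metadata, delimiter="\t", lineterminator="\n")
    # A metadata file with more than one column needs a header row;
    # the vectors file has no header.
    metadata_writer.writerow(["product_id", "category"])
    for product_id, vector in embeddings.items():
        vector_writer.writerow(vector)
        metadata_writer.writerow([product_id, categories[product_id]])
    return vectors.getvalue(), metadata.getvalue()


# Hypothetical 2-dimensional embeddings and category labels.
embeddings = {"oat_milk": [0.9, 0.1], "dish_soap": [0.0, 1.0]}
categories = {"oat_milk": "milk", "dish_soap": "cleaning"}
vectors_tsv, metadata_tsv = to_projector_tsv(embeddings, categories)
print(vectors_tsv)
print(metadata_tsv)
```

Coloring points by the category column in the projector makes it easy to check whether products from the same category cluster together.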

#8 about 3 minutes

Adopting pragmatic baselines and automated data analysis

Key project takeaways include using simple business-logic baselines for benchmarking and automating exploratory data analysis within the ML pipeline itself.
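
As an illustration of what a business-logic baseline might look like, the sketch below recommends the most frequently seen products from the same category. This specific rule is an assumption chosen for illustration; the talk summary does not say which baseline the team used, and the data is hypothetical.

```python
from collections import Counter


def category_popularity_baseline(product_id, categories, sessions, k=2):
    """Recommend the k most frequently seen products from the same category."""
    counts = Counter(product for session in sessions for product in session)
    same_category = [
        other
        for other in categories
        if other != product_id and categories[other] == categories[product_id]
    ]
    # Most popular first; ties are broken alphabetically for determinism.
    return sorted(same_category, key=lambda p: (-counts[p], p))[:k]


# Hypothetical catalog and sessions.
categories = {
    "oat_milk": "milk",
    "soy_milk": "milk",
    "almond_milk": "milk",
    "dish_soap": "cleaning",
}
sessions = [["oat_milk", "soy_milk"], ["soy_milk", "dish_soap"], ["almond_milk"]]
print(category_popularity_baseline("oat_milk", categories, sessions))
```

A rule like this takes minutes to implement and gives the learned model a concrete number to beat on the same offline metrics.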

#9 about 1 minute

Understanding the project team and final timeline

The project was completed in nine months by a cross-functional team of data engineers, data scientists, and software developers.




