Iulia Feroli

Harry Potter and the Elastic Semantic Search

How can you find text about bravery with a negative sentiment? Learn to build a semantic search engine using Elasticsearch and the world of Harry Potter.

Harry Potter and the Elastic Semantic Search
#1about 3 minutes

The evolution of NLP from early models to modern LLMs

Tracing the rapid advancement of natural language processing from early models like Word2Vec to the powerful generative AI we see today.

#2about 5 minutes

How vector embeddings represent language as numbers

Vector embeddings turn words and sentences into numerical arrays, allowing computers to understand semantic relationships through mathematical operations.

#3about 7 minutes

Using vector similarity and LLMs for semantic operations

The distance between vectors in an embedding space represents semantic similarity, enabling operations like finding related concepts or answering questions.

#4about 4 minutes

Using Elasticsearch as a vector database for search

Elasticsearch serves as a vector database to store document embeddings and integrates with models from sources like Hugging Face for inference.

#5about 7 minutes

Demonstrating advanced keyword search with the Python client

The Elasticsearch Python client enables complex, multi-field queries with boolean logic to filter data based on precise criteria before adding semantic layers.

#6about 4 minutes

Enriching data with sentiment analysis pipelines

An inference pipeline can automatically apply a sentiment analysis model to all documents, adding a new field to enable filtering by positive or negative tone.

#7about 4 minutes

Implementing semantic search with embedding models

By converting all text into vectors using an embedding model, you can perform a k-NN search to find the most semantically relevant results for a query.

#8about 5 minutes

Refining results with hybrid search techniques

Hybrid search combines the power of semantic vector search with traditional keyword filters and exclusions to create highly relevant and precise results.

#9about 19 minutes

Audience Q&A on models and implementation

The speaker answers audience questions about ensuring relevance, handling out-of-vocabulary terms, updating data sources, and debugging model outputs.

Related jobs
Jobs that call for the skills explored in this talk.

test

Milly
Vienna, Austria

Intermediate

test

Milly
Vienna, Austria

Intermediate

Featured Partners

Related Articles

View all articles
EM
Eli McGarvie
13 AI Tools You Have to Try
First, it was NFTs, then it was Web3, and now it’s generative AI… it’s probably time to stop collecting pictures of monkeys and kitties. Chatbots and generative AI are the next big thing. This time we’ve jumped on a trend that has real-world applicat...
13 AI Tools You Have to Try
LM
Luis Minvielle
What Are Large Language Models?
Developers and writers can finally agree on one thing: Large Language Models, the subset of AIs that drive ChatGPT and its competitors, are stunning tech creations. Developers enjoying the likes of GitHub Copilot know the feeling: this new kind of te...
What Are Large Language Models?
CH
Chris Heilmann
All the videos of Halfstack London 2024!
Last month was Halfstack London, a conference about the web, JavaScript and half a dozen other things. We were there to deliver a talk, but also to record all the sessions and we're happy to share them with you. It took a bit as we had to wait for th...
All the videos of Halfstack London 2024!

From learning to earning

Jobs that call for the skills explored in this talk.

AI Engineer, London

AI Engineer, London

Eloquent AI

52K
Intermediate
Node.js
GraphQL
TypeScript
Microservices