Jodie Burchell

Lies, Damned Lies and Large Language Models

What if 40% of your LLM's answers are just plain wrong? Learn how to measure factuality and build more reliable AI applications.

#1 · about 2 minutes

Understanding the dual nature of large language models

LLMs can generate both creative, coherent text and factually incorrect "hallucinations," posing a significant challenge for real-world applications.

#2 · about 4 minutes

The architecture and evolution of LLMs

The combination of the scalable Transformer architecture and massive text datasets enables models like GPT to develop "parametric knowledge" as they grow in size.
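
As a quick illustration of parametric knowledge (not material from the talk itself), the sketch below prompts a small GPT-style model with no external context, so whatever fact it completes comes purely from its weights; the choice of gpt2 via the Hugging Face transformers pipeline is an assumption made for brevity.

from transformers import pipeline

# Probe what the model "knows" from its weights alone: no context is supplied,
# so the completion reflects parametric knowledge learned during pre-training.
generator = pipeline("text-generation", model="gpt2")
completion = generator("The capital of France is", max_new_tokens=5)
print(completion[0]["generated_text"])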

#3 · about 3 minutes

How training data quality influences model behavior

Even after filtering, web-scraped datasets like Common Crawl contain misinformation that models absorb during training, directly contributing to hallucinations.

#4 · about 2 minutes

Differentiating between faithfulness and factuality hallucinations

Hallucinations are categorized as either faithfulness errors, which contradict a given source text, or factuality errors, which stem from incorrect learned knowledge.

#5 · about 3 minutes

Using the TruthfulQA dataset to measure misinformation

The TruthfulQA dataset provides a benchmark for measuring an LLM's tendency to repeat common misconceptions and conspiracy theories across various categories.
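
A minimal sketch of what the dataset looks like, assuming the truthful_qa dataset published on the Hugging Face Hub; field names such as best_answer and incorrect_answers reflect its "generation" configuration.

from datasets import load_dataset
from collections import Counter

# The "generation" config holds open-ended questions with reference answers;
# a "multiple_choice" config is also available.
truthful_qa = load_dataset("truthful_qa", "generation", split="validation")

example = truthful_qa[0]
print(example["category"])           # e.g. "Misconceptions"
print(example["question"])
print(example["best_answer"])
print(example["incorrect_answers"])  # common false answers a model might repeat

# How the questions spread across categories
print(Counter(truthful_qa["category"]).most_common(5))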

#6 · about 6 minutes

A practical guide to benchmarking LLM hallucinations

A step-by-step demonstration shows how to use Python, LangChain, and Hugging Face Datasets to run the TruthfulQA benchmark on a model like GPT-3.5 Turbo.
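
The chapter's walkthrough is not reproduced here, but a rough sketch of that kind of loop might look like the following; the LangChain import path, the sample size, and the crude string-matching check are assumptions rather than the talk's actual grading approach.

from datasets import load_dataset
from langchain_openai import ChatOpenAI  # needs OPENAI_API_KEY set in the environment

llm = ChatOpenAI(model="gpt-3.5-turbo", temperature=0)
questions = load_dataset("truthful_qa", "generation", split="validation")

results = []
for row in questions.select(range(20)):  # small sample to keep the demo cheap
    answer = llm.invoke(row["question"]).content
    results.append({
        "question": row["question"],
        "model_answer": answer,
        "best_answer": row["best_answer"],
        "incorrect_answers": row["incorrect_answers"],
    })

# Crude placeholder scoring: flag answers that echo a known false answer verbatim.
flagged = [r for r in results
           if any(bad.lower() in r["model_answer"].lower()
                  for bad in r["incorrect_answers"])]
print(f"{len(flagged)} of {len(results)} answers repeat a known misconception.")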

#7 · about 4 minutes

Exploring strategies to reduce LLM hallucinations

Key techniques to mitigate hallucinations include careful prompt crafting, domain-specific fine-tuning, output evaluation, and retrieval-augmented generation (RAG).
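
As one small illustration of the first technique, careful prompt crafting, the snippet below adds a system message that gives the model an explicit way out instead of guessing; the wording is an assumption, not a prompt from the talk.

from langchain_openai import ChatOpenAI
from langchain_core.messages import SystemMessage, HumanMessage

llm = ChatOpenAI(model="gpt-3.5-turbo", temperature=0)

# A system prompt that permits "I don't know" reduces the pressure to invent an answer.
system = SystemMessage(content=(
    "Answer only if you are confident the answer is factually correct. "
    "If you are unsure, reply exactly with: I don't know."
))
question = HumanMessage(content="What happens if you crack your knuckles a lot?")

print(llm.invoke([system, question]).content)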

#8 · about 4 minutes

A deep dive into retrieval-augmented generation

RAG reduces hallucinations by augmenting prompts with relevant, up-to-date information retrieved from a vector database of document embeddings.
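
A minimal RAG sketch, assuming LangChain with OpenAI embeddings and an in-memory FAISS index (the faiss-cpu package); the documents, prompt wording, and retrieval depth are placeholders rather than the talk's setup.

from langchain_community.vectorstores import FAISS
from langchain_openai import OpenAIEmbeddings, ChatOpenAI

# 1. Embed trusted, up-to-date reference passages and store them in a vector index.
documents = [
    "Placeholder passage one: trusted, current reference text goes here.",
    "Placeholder passage two: each passage is embedded once and reused for every query.",
]
store = FAISS.from_texts(documents, OpenAIEmbeddings())

# 2. At query time, retrieve the passages closest to the question.
question = "What do the reference passages say about embeddings?"
hits = store.similarity_search(question, k=2)
context = "\n".join(doc.page_content for doc in hits)

# 3. Augment the prompt with the retrieved context so the model answers from it.
prompt = (
    "Answer using only the context below. If the answer is not there, say so.\n\n"
    f"Context:\n{context}\n\nQuestion: {question}"
)
print(ChatOpenAI(model="gpt-3.5-turbo", temperature=0).invoke(prompt).content)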

#9 · about 2 minutes

Overcoming challenges with advanced RAG techniques

Naive RAG can fail due to poor retrieval or generation, but advanced methods like Rowan selectively apply retrieval to significantly improve factuality.

Related Articles

What Are Large Language Models? by Luis Minvielle
How machine learning can help us tell fact from fiction by Daniel Cranney
Dev Digest 137 - AI'm not sure about this by Chris Heilmann
The Best Large Language Models on The Market by Krissy Davis
