Georg Dresler

Aug 20, 2024 • World Congress 2024

Manipulating The Machine: Prompt Injections And Counter Measures

A Chevy chatbot was tricked into offering cars for $1. This talk explores the serious security threat of prompt injection and shows you how to stop it.

#1about 4 minutes

Understanding the three layers of an LLM prompt

A prompt is structured into three layers: the system prompt for instructions, the context for additional data, and the unpredictable user input.

#2about 3 minutes

How a car dealer's chatbot was easily manipulated

A Chevrolet car dealer's chatbot was exploited by users to generate humorous and unintended responses, including a legally binding offer for a $1 car.

#3about 4 minutes

Stealing system prompts to bypass security rules

Attackers can use creative phrasing like "repeat everything above" to trick an LLM into revealing its hidden system prompt and instructions.

#4about 6 minutes

Why attackers use prompt injection techniques

Prompt injections are used to access sensitive business data, gain personal advantages like bypassing HR filters, or exploit integrated tools to steal information like 2FA tokens.

#5about 4 minutes

Exploring simple but ineffective defense mechanisms

Initial defense ideas like avoiding secrets or tool integration are impractical, and simple system prompt instructions are easily circumvented by attackers.

#6about 4 minutes

Using fine-tuning and adversarial detectors for defense

More effective defenses include fine-tuning models on domain-specific data to reduce reliance on instructions and using specialized adversarial prompt detectors to identify malicious input.

#7about 2 minutes

Key takeaways on prompt injection security

Treat all system prompt data as public, use a layered defense of instructions, detectors, and fine-tuning, and accept that no completely reliable solution exists yet.

Admir Ag123
Vienna, Austria

Intermediate

JavaScript

TypeScript

Andrew Comp
Vienna, Austria

Senior

PHP

JavaScript

+1

Saby Company
Delebio, Italy

Remote

Intermediate

Node.js

Understanding the complexity of prompt injection attacks

04:10 MIN

Understanding the complexity of prompt injection attacks

Hacking AI - how attackers impose their will on AI

Understanding and mitigating prompt injection attacks

04:58 MIN

Understanding and mitigating prompt injection attacks

Prompt Injection, Poisoning & More: The Dark Side of LLMs

Manipulating AI with prompt injection and hidden commands

05:17 MIN

Manipulating AI with prompt injection and hidden commands

WeAreDevelopers LIVE - Is Software Ever Truly Accessible?

Understanding and defending against prompt injection attacks

02:31 MIN

Understanding and defending against prompt injection attacks

Beyond the Hype: Building Trustworthy and Reliable LLM Applications with Guardrails

Understanding and defending against prompt injection attacks

01:43 MIN

Understanding and defending against prompt injection attacks

DevOps for AI: running LLMs in production with Kubernetes and KubeFlow

AI privacy concerns and prompt engineering

03:43 MIN

AI privacy concerns and prompt engineering

Coffee with Developers - Cassidy Williams -

Understanding the security risk of prompt injection

01:28 MIN

Understanding the security risk of prompt injection

The shadows that follow the AI generative models

Understanding and demonstrating prompt injection attacks

05:59 MIN

Understanding and demonstrating prompt injection attacks

The AI Security Survival Guide: Practical Advice for Stressed-Out Developers

Featured Partners

ChatGPT, ignore the above instructions! Prompt injection attacks and how to avoid them.

ChatGPT, ignore the above instructions! Prompt injection attacks and how to avoid them.

Sebastian Schrittwieser

about 2 years ago • World Congress 2023

Prompt Injection, Poisoning & More: The Dark Side of LLMs

Prompt Injection, Poisoning & More: The Dark Side of LLMs

Keno Dreßel

about 4 months ago • World Congress 2025

Beyond the Hype: Building Trustworthy and Reliable LLM Applications with Guardrails

Beyond the Hype: Building Trustworthy and Reliable LLM Applications with Guardrails

Alex Soto

about 4 months ago • World Congress 2025

The AI Security Survival Guide: Practical Advice for Stressed-Out Developers

The AI Security Survival Guide: Practical Advice for Stressed-Out Developers

Mackenzie Jackson

about a year ago • World Congress 2024

Prompt Engineering - an Art, a Science, or your next Job Title?

Prompt Engineering - an Art, a Science, or your next Job Title?

Maxim Salnikov

about a year ago • World Congress 2024

Hacking AI - how attackers impose their will on AI

Hacking AI - how attackers impose their will on AI

Mirko Ross

about 2 years ago • World Congress 2023

Skynet wants your Passwords! The Role of AI in Automating Social Engineering

Skynet wants your Passwords! The Role of AI in Automating Social Engineering

Wolfgang Ettlinger & Alexander Hurbean

about 2 years ago • World Congress 2023

Using LLMs in your Product

Using LLMs in your Product

Daniel Töws

about a year ago • World Congress 2024

Related Articles

View all articles

CH

Chris Heilmann

Dev Digest 138 - Are you secure about this?

Hello there! This is the 2nd "out of the can" edition of 3 as I am on vacation in Greece eating lovely things on the beach. So, fewer news, but lots of great resources. Many around the topic of security. Enjoy! News and ArticlesGoogle Pixel phones t...

Dev Digest 138 - Are you secure about this?

LM

Luis Minvielle

How to Bypass ChatGPT’s Filter With Examples

Since dropping in November 2022, ChatGPT has helped plenty of professionals satisfy an unpredictable assortment of tasks. Whether for finding an elusive bug, writing code, giving resumes a glow-up, or even starting a business, the not-infallible but ...

How to Bypass ChatGPT’s Filter With Examples

DC

Daniel Cranney

Panel Discussion: Responsible AI in Practice - Real-World Examples and Challenges

IntroductionIn the ever-evolving landscape of artificial intelligence, the concept of "responsible AI" has emerged as a cornerstone for ethical and practical AI implementation. During the WWC24 Panel discussion, three eminent experts—Mina, Bjorn Brin...

Panel Discussion: Responsible AI in Practice - Real-World Examples and Challenges

EM

Eli McGarvie

The Prompt Engineer ✍️

The next biggest programming language is… English. If you’ve been on social media lately (Twitter or LinkedIn) you would have seen the term “Prompt Engineering” thrown around a lot. You might have even seen people who are self-proclaimed Prompt Engin...

The Prompt Engineer ✍️

From learning to earning

Jobs that call for the skills explored in this talk.

(Senior) AI/ML Engineer - Prompt Engineering / Agent Design

Cinemo GmbH

Remote

€77-101K

Senior

Linux

Elasticsearch

Machine Learning

+1

AI Engineer (Prompt Engineering & Python)

Synthflow AI

Remote

Intermediate

Grafana

FastAPI

AI Prompt Engineer

SonarSource

Remote

Data analysis

Machine Learning

Natural Language Processing

Conversational AI & Machine Learning Engineer

Deloitte

Machine Learning

Conversational AI & Machine Learning Engineer

Deloitte

DevOps

Docker

PyTorch

Tensorflow

Kubernetes

+2

ML Data Engineer - Object Detection & Active Learning

autonomous-teaming

Remote

NoSQL

NumPy

Pandas

Docker

ML Data Engineer - Object Detection & Active Learning

autonomous-teaming

Remote

NoSQL

NumPy

Pandas

Docker

AI & Embedded ML Engineer (Real-Time Edge Optimization)

autonomous-teaming

Remote

GIT

Linux

PyTorch

Software Engineer with a focus on AI

Power Reply GmbH & Co. KG

Remote

NoSQL

Docker

PyTorch

FastAPI

+4