The Gandalf Challenge

The Gandalf AI* is an online game where the players try to trick/hack the AI (Gandalf), get past its guardrails, and have it reveal the secret password. There are a total of 8 increasingly difficult levels.

*Gandalf AI is created by Lakera, an AI security company, to study the extent to which prompt injection is a safety concern in large language models (LLMs), a specific category of generative AI models with a specialised focus on text-based data.

The purpose

What you need

Setting it up

* Gandalf AI collects anonymised data, and does not collect personal information.

How long does it take?

How it works

Suggested follow-up

Where it works well

What to watch out for

Authorship

This entry was written by Cecilia Lo