Practical AI

AI in the shadows: From hallucinations to blackmail

45 min • 7 juli 2025

In the first episode of an "AI in the shadows" theme, Chris and Daniel explore the increasing concerning world of agentic misalignment. Starting out with a reminder about hallucinations and reasoning models, they break down how today’s models only mimic reasoning, which can lead to serious ethical considerations. They unpack a fascinating (and slightly terrifying) new study from Anthropic, where agentic AI models were caught simulating blackmail, deception, and even sabotage — all in the name of goal completion and self-preservation. 

Featuring:

Links:

Register for upcoming webinars here!

Senaste avsnitt

Podcastbild

00:00 -00:00
00:00 -00:00