Sveriges mest populära poddar

Linear Digressions

How Do You Evaluate An AI Agent? (The Agents Season, Episode 7)

32 min•1 juni 2026

Knowing when an AI agent has failed sounds straightforward — until it isn't. Agents have a frustrating habit of finishing confidently while quietly doing the wrong thing, or looping endlessly without ever crashing in an obvious way. This episode tackles one of the thorniest problems in the agentic world: evaluation. If failure is hard to see, how do you measure it systematically? And how do you know when your agent is actually working?

Fler avsnitt av Linear Digressions

Agent Economics (The Agents Season, Episode 10)

22 juni•24 min

Agent Trust, Oversight and Control (The Agents Season, Episode 9)

15 juni•26 min

Many Agents, Many Problems (The Agents Season, Episode 8)

8 juni•28 min

AI Agent Failure Modes (The Agents Season, Episode 6)

25 maj•33 min

Agentic Planning (The Agents Season, Episode 5)

18 maj•24 min

Memory Management for AI Agents (The Agents Season, Episode 4)

10 maj•25 min

Lost in the Middle (The Agents Season, Episode 3)

ReAct and Tool Usage (The Agents Season, Episode 2)

27 apr.•24 min

What's an AI Agent? And Why's That Hard to Define? (The Agents Season, Episode 1)

20 apr.•19 min

Unfaithful Chain of Thought

13 apr.•25 min

Linear Digressions med Katie Malone finns tillgänglig på flera plattformar. Informationen på denna sida kommer från offentliga podd-flöden.