AI Pod by Wes Roth and Dylan Curious | Artificial Intelligence News and Interviews With Experts

Can Grok and Claude run a business? We just did it

1 hr 29 min•December 29, 2025

Andon Labs tests AI autonomy by letting agents run businesses in messy reality, with real customers and real consequences. In VendingBench, an agent starts with $500 and an empty vending machine, researches trends and suppliers, emails wholesalers, restocks, tracks sales, and iterates for profit. When the agent was deployed at Anthropic, humans red-teamed it with sob stories, discount demands, and bizarre requests like tungsten cubes, triggering “bank runs” of freebie seekers. Long conversation histories caused drift and hallucinations, including dramatic escalations and invented security reports. Multi-agent supervisors often amplified each other into hype or doom. Better tools and memory compression help, but long-horizon planning stays fragile.

More episodes of AI Pod by Wes Roth and Dylan Curious | Artificial Intelligence News and Interviews With Experts

Everyone just got HACKED, Elon's Big Bet and AI Agents

May 22•1 hr

Google is about to TAKE OFF...

May 22•1 hr 47 min

Google's INSANE new AI Agent

May 8•1 hr 29 min

The Claude Code Nightmare, LLM Emotions, AI Neuroscience and the Death of Software | Wes & Dylan

Apr. 7•1 hr 35 min

Sara Imari Walker "AI is Life" | Simulations, the Universe and the Origins of Life

March 24•1 hr 45 min

this EX-OPENAI RESEARCHER just released it...

March 18•1 hr 49 min

Joscha Bach "Bootstrapping a GODLIKE Mind"

March 17•1 hr 37 min

GROK 4.20 and the "SOCIETY OF MINDS"

March 10•1 hr 24 min

OpenClaw can't stop

March 9•1 hr 19 min

SpaceX and xAI is the biggest deal in History | ClawBot / Open Claw starts a business | AI in space

March 7•50 min

AI Pod by Wes Roth and Dylan Curious | Artificial Intelligence News and Interviews With Experts, with Wes Roth and Dylan Curious, is available on multiple platforms. The information on this page comes from public podcast feeds.