The Daily AI Show

Are Reasoning LLMs Changing The Game? (Ep. 506)

53 min • July 14, 2025

Want to keep the conversation going?

Join our Slack community at thedailyaishowcommunity.com


The team explores whether today's AI models are just simulating thought or actually beginning to "think." They break down advances in reasoning models, reinforcement learning, and world modeling, debating whether AI's step-by-step problem-solving can fairly be called thinking. The discussion dives into philosophy, practical use cases, and why the definition of "thinking" itself might need rethinking.


Key Points Discussed


Early chain-of-thought prompting looked like reasoning but amounted to simulated checklists, exposing AI's explainability problem.


Modern LLMs now demonstrate intrinsic deliberation, spending compute to weigh alternatives before responding.


Reinforcement learning trains models to value structured thinking, not just the right answer, helping them plan steps and self-correct.


Deduction, induction, abduction, and analogical reasoning methods are now modeled explicitly in advanced systems.


The group debates whether this step-by-step reasoning counts as “thinking” or is merely sophisticated processing.


Beth notes that models lack personal perspective or sensory grounding, limiting comparisons to human thought.


Karl stresses client perception—many non-technical users interpret these models’ behavior as thinking.


Brian draws a line at novel output—until models produce ideas outside their training data, it remains prediction.


Andy argues that if we call human reasoning “thinking,” then machine reasoning using similar steps deserves the label too.


Symbolic reasoning, code execution, and causality representation are key to closing the reasoning gap.


Memory, world models, and external tool access push models toward human-like problem solving.


Yann LeCun's view that embodied AI will be required for human-level reasoning figures prominently in the discussion.


The debate surfaces differing views: practical usefulness vs. philosophical accuracy in labeling AI behavior.


Conclusion: AI as a “process engine” may satisfy both camps, but the line between reasoning and thinking is getting blurry.


Timestamps & Topics

00:00:00 🧠 Reasoning models vs. chain-of-thought prompts

00:02:05 💡 Native deliberation as a breakthrough

00:03:15 🏛️ Thinking Fast and Slow analogy

00:05:14 🔍 Deduction, induction, abduction, analogy

00:07:03 🤔 Does problem-solving = thinking?

00:09:00 📜 Legal hallucination as reasoning failure

00:12:41 ⚙️ Symbolic logic and code interpreter role

00:16:36 🛠️ Deterministic vs. generative outcomes

00:20:05 📊 Real-world use case: invoice validation

00:23:06 💬 Why non-experts believe AI “thinks”

00:26:08 🛤️ Reasoning as multi-step prediction

00:29:47 🎲 AlphaGo’s strange but optimal moves

00:32:14 🧮 Longer processing vs. actual thought

00:35:10 🌐 World models and sensory grounding gap

00:38:57 🎨 Human taste and preference vs. AI outputs

00:41:47 🧬 Creativity as human advantage—for now

00:44:30 📈 Karl’s business growth powered by o3 reasoning

00:47:01 ⚡ Future: lightning-speed multi-agent parallelism

00:51:15 🧠 Memory + prediction defines thinking engines

00:53:16 📅 Upcoming shows preview and community CTA


#ThinkingMachines #LLMReasoning #ChainOfThought #ReinforcementLearning #WorldModeling #SymbolicAI #AIphilosophy #AIDebate #AgenticAI #DailyAIShow


The Daily AI Show Co-Hosts:

Andy Halliday, Beth Lyons, Brian Maucere, Jyunmi Hatcher, and Karl Yeh
