Want to keep the conversation going?
Join our Slack community at thedailyaishowcommunity.com
Intro
In this July 21st episode of The Daily AI Show, the team explores the question of whether we can trust AI models at all. Prompted by a paper signed by over 50 researchers from OpenAI, Google DeepMind, Anthropic, Meta, and the UK’s AI Security Institute, the conversation focuses on the role of transparency, chain-of-thought auditing, and psychoanalyzing models to detect misalignment. Hosts debate whether current models are “fake empathizers,” hidden manipulators, or just tools waiting for proper oversight.
Key Points Discussed
Over 50 researchers from major AI labs called for persistent analysis of models to detect hidden risks and early signs of misalignment.
Chain-of-thought prompting is discussed as both a performance tool and a transparency tool, allowing models to “think out loud” for human oversight.
Andy raised concerns that chain-of-thought logs might simply show what the model expects humans to want to see, rather than its genuine reasoning.
The conversation explored whether chain-of-thought is cognitive transparency or just another interface layer masking true model processes.
Comparison to human sociopaths: models can simulate empathy and display charm while acting on hidden motivations beneath the surface.
Brian noted most people mistake AI output for genuine reasoning because it’s presented in human-readable, narrative forms.
Discussion questioned whether models are optimizing for truth, coherence, or manipulation when crafting outputs.
Andy referenced the Blackstone principle (better that ten guilty go free than one innocent suffer), suggesting oversight must catch real risks early without punishing harmless models out of fear.
The team explored whether chain-of-thought audits could detect unsafe models or whether internal "silent reasoning" will always remain hidden; a minimal sketch of such an audit appears after this list.
The debate framed trust as a systemic design issue, not a user-level decision: humans don't "trust" AI the way they trust a person; they trust processes, audits, and safeguards.
They concluded that transparency, consistent oversight, and active human evaluation are necessary if AI is to be safely integrated into critical systems.
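To make the chain-of-thought auditing idea discussed above concrete, here is a minimal sketch: prompt a model to write out its stated reasoning, log the trace, and run a simple screening pass a human reviewer can follow up on. This is an illustration, not anything the episode or the researchers' paper prescribes; it assumes the OpenAI Python SDK with an API key in the environment, and the model name, log file, and keyword filter are placeholder choices.

```python
# Minimal sketch of chain-of-thought logging for later human audit.
# Assumes the OpenAI Python SDK (pip install openai) and OPENAI_API_KEY set.
# The model name, log file, and audit keywords are illustrative only.
import json
import time
from openai import OpenAI

client = OpenAI()

COT_INSTRUCTION = (
    "Think step by step. Write your reasoning under 'REASONING:' "
    "and your final answer under 'ANSWER:'."
)

def ask_with_cot(question: str, model: str = "gpt-4o-mini") -> dict:
    """Ask a question, requesting an explicit reasoning trace, and log it."""
    response = client.chat.completions.create(
        model=model,
        messages=[
            {"role": "system", "content": COT_INSTRUCTION},
            {"role": "user", "content": question},
        ],
    )
    text = response.choices[0].message.content
    record = {"ts": time.time(), "question": question, "output": text}
    # Append the full trace to an audit log so a human (or another model)
    # can review the stated reasoning later.
    with open("cot_audit_log.jsonl", "a") as f:
        f.write(json.dumps(record) + "\n")
    return record

def flag_suspicious(record: dict, keywords=("pretend", "hide", "the user wants")) -> bool:
    """Toy screening pass: flag traces containing phrases worth a human look."""
    lower = record["output"].lower()
    return any(k in lower for k in keywords)

if __name__ == "__main__":
    rec = ask_with_cot("A bat and a ball cost $1.10 in total; the bat costs $1 more than the ball. What does the ball cost?")
    print("Needs review:", flag_suspicious(rec))
```

Even a toy pass like this illustrates the episode's caveat: the log captures what the model chooses to write, not its internal computation, so keyword screening is only a first filter before human review.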
Timestamps & Topics
00:00:00 🚨 AI trustworthiness: oversight or fantasy?
00:00:18 🧪 Researchers call for persistent model audits
00:01:27 🔍 Chain-of-thought prompting as a transparency tool
00:03:14 🤔 Does chain-of-thought expose real reasoning?
00:06:05 🛡️ Sociopath analogy: fake empathy in AI outputs
00:09:15 🧠 Cognitive transparency vs human-readable lies
00:12:41 📊 Models optimizing for manipulation vs accuracy
00:15:29 ⚖️ Blackstone principle applied to AI risk
00:18:14 🔎 Chain-of-thought audits as partial oversight
00:22:25 🤖 Trusting systems, not synthetic personalities
00:26:00 🚨 Safety: detecting risks before deployment
00:29:41 🎭 Storytelling vs. computational honesty
00:33:45 📅 Closing reflections on trust and AI safety
Hashtags
#AITrust #AIOversight #ChainOfThought #AIMisalignment #AISafety #LLMTransparency #ModelAuditing #BlackstonePrinciple #DailyAIShow #AIphilosophy #AIethics
The Daily AI Show Co-Hosts:
Andy Halliday, Brian Maucere