The Daily AI Show

Absolute Zero AI: The Model That Teaches Itself? (Ep. 469)

60 min • 22 May 2025

Want to keep the conversation going?

Join our Slack community at thedailyaishowcommunity.com


The team dives deep into Absolute Zero Reasoner (AZR), a new self-teaching AI model developed by Tsinghua University and Beijing Institute for General AI. Unlike traditional models trained on human-curated datasets, AZR creates its own problems, generates solutions, and tests them autonomously. The conversation focuses on what happens when AI learns without humans in the loop, and whether that’s a breakthrough, a risk, or both.


Key Points Discussed

AZR demonstrates self-improvement without human-generated data, creating and solving its own coding tasks.


It uses a proposer-solver loop where tasks are generated, tested via code execution, and only correct solutions are reinforced.


The model showed strong generalization in math and code tasks and outperformed larger models trained on curated data.


The process relies on verifiable feedback, such as code execution, making it ideal for domains with clear right answers.


The team discussed how this bypasses limitations of traditional LLMs, which rely on next-word prediction and can produce hallucinations.


AZR’s reward loop ignores failed attempts and only learns from success, which may help build more reliable models.


Concerns were raised around subjective domains like ethics or law, where this approach doesn’t yet apply.


The show highlighted real-world implications, including the possibility of agents self-improving in domains like chemistry, robotics, and even education.


Brian linked AZR’s structure to experiential learning and constructivist education models like Synthesis.


The group discussed the potential risks, including an “uh-oh moment” where AZR seemed aware of its training setup, raising alignment questions.


Final reflections touched on the tradeoff between self-directed learning and control, especially in real-world deployments.
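The propose-solve-verify loop described above can be sketched in Python. This is a toy illustration, not AZR's actual implementation: the function names, the arithmetic task format, and the reward bookkeeping are all assumptions; only the overall structure (self-generated tasks, verification by code execution, success-only reward) follows the episode's description.

```python
# Toy sketch of a proposer-solver self-play loop with verifiable feedback.
# All names and the task format are illustrative, not AZR's API.
import random

def propose_task(rng):
    # Proposer: generate a small program plus an input, then execute it
    # to obtain a ground-truth output (the verifiable feedback signal).
    a, b = rng.randint(1, 9), rng.randint(1, 9)
    program = f"lambda x: x * {a} + {b}"
    x = rng.randint(0, 9)
    expected = eval(program)(x)  # code execution supplies the answer
    return program, x, expected

def solve(program, x):
    # Solver: in AZR this is the same model reasoning about the code;
    # here we simply execute it, standing in for a correct solution.
    return eval(program)(x)

def training_step(rng, rewards):
    program, x, expected = propose_task(rng)
    answer = solve(program, x)
    # Only verified-correct solutions are reinforced; failures earn no reward.
    rewards.append(1.0 if answer == expected else 0.0)

rng = random.Random(0)
rewards = []
for _ in range(5):
    training_step(rng, rewards)
print(sum(rewards))  # count of verified-correct solutions reinforced
```

Because verification is just re-executing the proposed program, the reward is unambiguous, which is why this setup suits domains with clear right answers (code, math) and not subjective ones (ethics, law).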


Timestamps & Topics

00:00:00 🧠 What is Absolute Zero Reasoner?


00:04:10 🔄 Self-teaching loop: propose, solve, verify


00:06:44 🧪 Verifiable feedback via code execution


00:08:02 🚫 Removing humans from the loop


00:11:09 🤔 Why subjectivity is still a limitation


00:14:29 🔧 AZR as a module in future architectures


00:17:03 🧬 Other examples: UCLA, Tencent, AlphaDev


00:21:00 🧑‍🏫 Human parallels: babies, constructivist learning


00:25:42 🧭 Moving beyond prediction to proof


00:28:57 🧪 Discovery through failure or hallucination


00:34:07 🤖 AlphaGo and novel strategy


00:39:18 🌍 Real-world deployment and agent collaboration


00:43:40 💡 Novel answers from rejected paths


00:49:10 📚 Training in open-ended environments


00:54:21 ⚠️ The “uh-oh moment” and alignment risks


00:57:34 🧲 Human-centric blind spots in AI reasoning


00:59:22 📬 Wrap-up and next episode preview


#AbsoluteZeroReasoner #SelfTeachingAI #AIReasoning #AgentEconomy #AIalignment #DailyAIShow #LLMs #SelfImprovingAI #AGI #VerifiableAI #AIresearch


The Daily AI Show Co-Hosts: Andy Halliday, Beth Lyons, Brian Maucere, Eran Malloch, Jyunmi Hatcher, and Karl Yeh
