Sveriges mest populära poddar

Super Data Science: ML & AI Podcast with Jon Krohn

978: A Post-Transformer Architecture Crushes Sudoku (Transformers Solve ~0%)

11 min•27 mars 2026

A game millions of people solve over morning coffee is exposing a fundamental weakness in today’s most powerful AI models. In this Five-Minute Friday, Jon Krohn breaks down Pathway’s new Sudoku Extreme benchmark, roughly 250,000 of the hardest Sudoku puzzles available and why leading LLMs like o3-mini, DeepSeek-R1, and Claude 3.7 Sonnet scored effectively zero percent, while Pathway’s post-transformer BDH architecture achieved 97.4% accuracy at a fraction of the cost. Listen to the episode to find out what BDH is doing differently, why Sudoku performance matters far beyond puzzles, and what this means for the future of AI reasoning.

Additional materials: ⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠www.superdatascience.com/978⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠

Interested in sponsoring a SuperDataScience Podcast episode? Email [email protected] for sponsorship information.

Fler avsnitt av Super Data Science: ML & AI Podcast with Jon Krohn

1004: Recursive Self-Improvement

26 juni•10 min

1003: Building an AI Data Center End to End, with Lightning AI’s Frank Basso

23 juni•1 tim 12 min

1002: Fable 5: The Full Story from Capabilities to Drama

19 juni•16 min

1001: How AI Erased My Career Moat, an Episode #1001 Special: Jon Krohn interviewed by Kirill Eremenko

16 juni•1 tim 56 min

1000: Ten Years of the Super Data Science Podcast, with Jon, Kirill and Special Guests

12 juni•1 tim

999: What's Left to Build When Software Is Free, with Chip Huyen

9 juni•1 tim 16 min

998: In Case You Missed It in May 2026

5 juni•28 min

997: How This Text-to-Video-Game AI Startup Hit 20M Users

2 juni•1 tim 10 min

996: TrueFoundry’s Nikunj Bajaj on How to Get $100M Returns on AI Agent Deployments

29 maj•30 min

995: End-to-End Foundation Models for the Energy Industry, with Jazmia Henry

26 maj•1 tim 9 min

Super Data Science: ML & AI Podcast with Jon Krohn med Jon Krohn finns tillgänglig på flera plattformar. Informationen på denna sida kommer från offentliga podd-flöden.