Grok 4, which has excellent benchmarks and which xAI claims is ‘the world's smartest artificial intelligence,’ is the big news.
If you set aside the constant need to say ‘No, Grok, No,’ is it a good model, sir?
My take in terms of its capabilities, which I will expand upon at great length later this week: It is a good model. Not a great model. Not the best model. Not ‘the world's smartest artificial intelligence.’ There do not seem to be any great use cases to choose it over alternatives, unless you are searching Twitter. But it is a good model.
There is a catch. There are many reasons one might not want to trust it, on a different level than the reasons not to trust models from other labs. There has been a series of epic failures and poor choices, which will be difficult to [...]
---
Outline:
(01:33) The System Prompt
(08:07) MechaHitler
(10:17) The Official Explanation of MechaHitler
(17:53) Worse Than MechaHitler
(22:22) Unintended Behavior
(26:22) Off Based
(28:37) Your Face Will Be Stuck That Way
(30:24) I Couldn't Do Solve Problem In Several Hours So It Must Be Very Hard
(38:57) Safety Third
(44:25) How Bad Are Things?
---
First published:
July 14th, 2025
Source:
https://www.lesswrong.com/posts/YmdCN5GBwkud5ZzYx/worse-than-mechahitler
---
Narrated by TYPE III AUDIO.
---
Images from the article:
Apple Podcasts and Spotify do not show images in the episode description. Try Pocket Casts, or another podcast app.
En liten tjänst av I'm With Friends. Finns även på engelska.