LessWrong (30+ Karma)

“Worse Than MechaHitler” by Zvi

46 min • 14 July 2025

Grok 4, which has excellent benchmarks and which xAI claims is ‘the world's smartest artificial intelligence,’ is the big news.

If you set aside the constant need to say ‘No, Grok, No,’ is it a good model, sir?

My take in terms of its capabilities, which I will expand upon at great length later this week: It is a good model. Not a great model. Not the best model. Not ‘the world's smartest artificial intelligence.’ There do not seem to be any great use cases to choose it over alternatives, unless you are searching Twitter. But it is a good model.

There is a catch. There are many reasons one might not want to trust it, on a different level than the reasons not to trust models from other labs. There has been a series of epic failures and poor choices, which will be difficult to [...]

---

Outline:

(01:33) The System Prompt

(08:07) MechaHitler

(10:17) The Official Explanation of MechaHitler

(17:53) Worse Than MechaHitler

(22:22) Unintended Behavior

(26:22) Off Based

(28:37) Your Face Will Be Stuck That Way

(30:24) I Couldn't Do Solve Problem In Several Hours So It Must Be Very Hard

(38:57) Safety Third

(44:25) How Bad Are Things?

---

First published:
July 14th, 2025

Source:
https://www.lesswrong.com/posts/YmdCN5GBwkud5ZzYx/worse-than-mechahitler

---

Narrated by TYPE III AUDIO.

---

Images from the article:

Black text on dark gray background discussing AI capabilities and benchmarks.
Screenshot showing an AI search query for Israel-Palestine conflict stances.
A stick figure comic about AI computing requirements and hardware demands.
Screenshot of an AI assistant discussing its approach to the pineapple on pizza debate.
A comic about contrasting technical tasks: GIS lookup versus bird recognition.
Screenshot of Ring chat conversation.
Dark mode interface searching and analyzing Israel-Palestine conflict information and perspectives.
Screenshot of Grok AI interface showing search queries about AI regulation.
A meme showing someone asking about robots writing symphonies, with a humanoid robot searching for answers about Elon Musk's stance below.
A Twitter notification message.
A meme about AI alignment featuring a scene from what appears to be a TV show or movie, with overlaid text.
Grok tweets.
A cartoon emoji showing a person facepalming in blue clothing.

Apple Podcasts and Spotify do not show images in the episode description. Try Pocket Casts, or another podcast app.
