LessWrong (Curated & Popular)

“o3” by Zach Stein-Perlman

1 min · 21 December 2024
I'm editing this post.

OpenAI announced (but hasn't released) o3 (skipping o2 for trademark reasons).

It gets 25% on FrontierMath, smashing the previous SoTA of 2%. (These are really hard math problems.) Wow.

72% on SWE-bench Verified, beating o1's 49%.

Also 88% on ARC-AGI.

---

First published:
December 20th, 2024

Source:
https://www.lesswrong.com/posts/Ao4enANjWNsYiSFqc/o3

---

Narrated by TYPE III AUDIO.


LessWrong (Curated & Popular) by LessWrong is available on several platforms. The information on this page comes from public podcast feeds.