Sveriges mest populära poddar

LessWrong (Curated & Popular)

Samhälle och kultur Teknologi

“o3” by Zach Stein-Perlman

1 min•21 december 2024

I'm editing this post.

OpenAI announced (but hasn't released) o3 (skipping o2 for trademark reasons).

It gets 25% on FrontierMath, smashing the previous SoTA of 2%. (These are really hard math problems.) Wow.

72% on SWE-bench Verified, beating o1's 49%.

Also 88% on ARC-AGI.

---

First published:
December 20th, 2024

Source:
https://www.lesswrong.com/posts/Ao4enANjWNsYiSFqc/o3

---

Narrated by TYPE III AUDIO.

Fler avsnitt av LessWrong (Curated & Popular)

"Who Got Breasts First and How We Got Them" by rba

30 juni•21 min

"The worthlessness of vitamin D is mildly exaggerated" by dynomight

30 juni•36 min

"What is up with e/acc?" by KatjaGrace

27 juni•4 min

"Existential AI safety needs an effective social movement. PauseAI is building it" by Maxime Fournes, Espedair Street

27 juni•1 tim 3 min

"Surprising facts about the slave trade" by Joseph Miller

26 juni•13 min

"AI catastrophe: more like a genocide than a thought experiment" by KatjaGrace

26 juni•2 min

"AI pause: the case for ASAP" by KatjaGrace

25 juni•2 min

"The Invisible Side of AI Governance" by Charbel-Raphaël

23 juni•28 min

"A Theory of Prompt Injection (and why you should study roles)" by Charles Ye, softboiledheart

23 juni•32 min

"Machinic Psychopharmacology: Do LLMs Self-Medicate?" by Sid Black, Joseph Bloom

22 juni•53 min

LessWrong (Curated & Popular) med LessWrong finns tillgänglig på flera plattformar. Informationen på denna sida kommer från offentliga podd-flöden.