I'm back from holiday, and this week Tania and I talk about a project she completed as part of the ARENA AI safety curriculum to replicate the findings of evaluations on frontier AI capabilities.
Link to reasoning paper: https://arxiv.org/abs/2502.09696
Link to the Inspect dashboard: https://inspect-evals-dashboard.streamlit.app/
ARENA AI Safety course: https://www.arena.education/
Fler avsnitt av The AI Security Podcast
Visa alla avsnitt av The AI Security PodcastThe AI Security Podcast med Harriet Farlow (HarrietHacks) finns tillgänglig på flera plattformar. Informationen på denna sida kommer från offentliga podd-flöden.
