LessWrong (Curated & Popular)

Society & Culture, Technology

"ARC Evals new report: Evaluating Language-Model Agents on Realistic Autonomous Tasks" by Beth Barnes

8 min • 4 August 2023

Blogpost version

Paper

We have just released our first public report. It introduces methodology for assessing the capacity of LLM agents to acquire resources, create copies of themselves, and adapt to novel challenges they encounter in the wild.

Background

ARC Evals develops methods for evaluating the safety of large language models (LLMs) in order to provide early warnings of models with dangerous capabilities. We have public partnerships with Anthropic and OpenAI to evaluate their AI systems, and are exploring other partnerships as well.

Source:
https://www.lesswrong.com/posts/EPLk8QxETC5FEhoxK/arc-evals-new-report-evaluating-language-model-agents-on

Narrated for LessWrong by TYPE III AUDIO.

[125+ Karma Post] ✓

More episodes of LessWrong (Curated & Popular)

"What is up with e/acc?" by KatjaGrace

27 June • 4 min

"Existential AI safety needs an effective social movement. PauseAI is building it" by Maxime Fournes, Espedair Street

27 June • 1 hr 3 min

"Surprising facts about the slave trade" by Joseph Miller

26 June • 13 min

"AI catastrophe: more like a genocide than a thought experiment" by KatjaGrace

26 June • 2 min

"AI pause: the case for ASAP" by KatjaGrace

25 June • 2 min

"The Invisible Side of AI Governance" by Charbel-Raphaël

23 June • 28 min

"A Theory of Prompt Injection (and why you should study roles)" by Charles Ye, softboiledheart

23 June • 32 min

"Machinic Psychopharmacology: Do LLMs Self-Medicate?" by Sid Black, Joseph Bloom

22 June • 53 min

"Can activation verbalizers surface an internal chain of thought?" by oakhu, ryan_greenblatt

22 June • 1 hr 20 min

"The LLM shoggoth meme is weirder than you think" by HedonicEscalator

21 June • 14 min

LessWrong (Curated & Popular) by LessWrong is available on several platforms. The information on this page comes from public podcast feeds.