Sveriges mest populära poddar

Module 3: Reinforcement Learning from Human Feedback

10 min•20 februari 2026

This episode addresses how Reinforcement Learning from Human Feedback (RLHF) adds the final layer of alignment after supervised fine-tuning, shifting the training signal from “right vs wrong” to “better vs worse.” We explore how preference rankings create a reward signal (reward models plus PPO) and the newer shortcut (DPO) that learns preferences directly, then connect RLHF to safety through the Helpful, Honest, Harmless goal. We also unpack the “alignment tax,” the trade-off between being safe and being genuinely useful, and close by setting up the next module on running models at scale, starting with GPU memory limits, plus a personal reflection on starting later without being behind.

Fler avsnitt av The AI Concepts Podcast

Module 6: RAG | Long Context vs RAG - Do You Still Need Retrieval at All

12 juni•9 min

Module 6: RAG | GraphRAG - When Relationships Matter More Than Text

10 juni•8 min

Module 6: RAG | Query Transformation - When the Question Is the Bottleneck

10 juni•7 min

Module 6: RAG | Parent-Child Indexing - Search Small, Retrieve Big

10 juni•7 min

Module 6: RAG | Reranking - The Second Stage That Gets Retrieval Right

10 juni•10 min

Module 6: RAG | Dense and Sparse Search - Why Vector Search Alone Is Not Enough

10 juni•11 min

Module 6: RAG | Chunking - Where You Cut Decides What Gets Found

29 apr.•11 min

Module 6: RAG | Data Ingestion - Before Your Documents Can Be Found

27 apr.•12 min

Module 6: RAG | Vector Databases - Where That Meaning Gets Stored

27 apr.•10 min

Module 6: RAG | Embeddings - Teaching Machines to Understand Meaning

27 apr.•8 min

The AI Concepts Podcast med Sheetal ’Shay’ Dhar finns tillgänglig på flera plattformar. Informationen på denna sida kommer från offentliga podd-flöden.