Post-training gets your model to behave the way you want it to. As AMD VP of AI Sharon Zhou explains to Ben on this episode, the frontier labs are convinced, but the average developer is still figuring out how post-training works under the hood and why they should care. In their focused discussion, Sharon and Ben get into the process and trade-offs, techniques like supervised fine-tuning, reinforcement learning, in-context learning, and RAG, and why we still need post-training in the age of agents. (It’s how to get the agent to actually work.) Check it out.
Fler avsnitt av Generative AI in the Real World
Visa alla avsnitt av Generative AI in the Real WorldGenerative AI in the Real World med O'Reilly finns tillgänglig på flera plattformar. Informationen på denna sida kommer från offentliga podd-flöden.
