A deep dive on Dwarkesh Patel interview with Eric Jang into how AlphaGo conquered Go by combining a value network, a policy network, and Monte Carlo tree search. We unpack how these two nets shrink the game’s vast space, how self-play trains better strategies, and what this implies for solving hard real‑world problems in science and education—while noting the limits when moving from closed games to open-ended tasks like language models.
Note: This podcast was AI-generated, and sometimes AI can make mistakes. Please double-check any critical information.
Sponsored by Embersilk LLC
Fler avsnitt av Intellectually Curious
Visa alla avsnitt av Intellectually CuriousIntellectually Curious med Mike Breault finns tillgänglig på flera plattformar. Informationen på denna sida kommer från offentliga podd-flöden.
