I have a much better understanding of Sutton’s perspective now. I wanted to reflect on it a bit.
(00:00:00) - The steelman
(00:02:42) - TLDR of my current thoughts
(00:03:22) - Imitation learning is continuous with and complementary to RL
(00:08:26) - Continual learning
(00:10:31) - Concluding thoughts
Get full access to Dwarkesh Podcast at www.dwarkesh.com/subscribe
Fler avsnitt av Dwarkesh Podcast
Visa alla avsnitt av Dwarkesh PodcastDwarkesh Podcast med Dwarkesh Patel finns tillgänglig på flera plattformar. Informationen på denna sida kommer från offentliga podd-flöden.
