
Data Skeptic

Science, Technology

[MINI] Reinforcement Learning

23 min • February 9, 2018

In many real-world situations, a person or agent doesn't necessarily know their own objectives or the mechanics of the world they're interacting with. However, if the agent receives rewards that are correlated with both its actions and the state of the world, then reinforcement learning can be used to discover behaviors that maximize the reward earned.
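As a rough illustration of the idea in the description, here is a minimal sketch of tabular Q-learning, one common reinforcement learning method (the episode description does not name a specific algorithm). The five-cell corridor environment, the `step`, `train`, and `greedy_policy` helpers, and all parameter values are assumptions made up for this example. The agent is never told the rules of the corridor; it only observes states and rewards.

```python
import random

N_STATES = 5          # hypothetical corridor with cells 0..4; cell 4 is the goal
ACTIONS = (-1, +1)    # action 0 steps left, action 1 steps right


def step(state, action_index):
    """Hypothetical environment dynamics, hidden from the learning agent."""
    next_state = min(max(state + ACTIONS[action_index], 0), N_STATES - 1)
    done = next_state == N_STATES - 1
    reward = 1.0 if done else 0.0
    return next_state, reward, done


def train(episodes=500, alpha=0.5, gamma=0.9, epsilon=0.1, seed=0):
    """Learn action values from observed rewards alone."""
    rng = random.Random(seed)
    q = [[0.0, 0.0] for _ in range(N_STATES)]  # q[state][action]
    for _ in range(episodes):
        state, done = 0, False
        while not done:
            # Explore with probability epsilon, or when the values are tied.
            if rng.random() < epsilon or q[state][0] == q[state][1]:
                action = rng.randrange(2)
            else:
                action = 0 if q[state][0] > q[state][1] else 1
            next_state, reward, done = step(state, action)
            # Q-learning update: move toward reward plus discounted best future value.
            future = 0.0 if done else gamma * max(q[next_state])
            q[state][action] += alpha * (reward + future - q[state][action])
            state = next_state
    return q


def greedy_policy(q):
    """Pick the higher-valued action in each state (ties go to 'right')."""
    return [0 if left > right else 1 for left, right in q]


if __name__ == "__main__":
    print(greedy_policy(train()))
```

Under these assumptions, the learned greedy policy steps right in every non-goal cell, even though the agent was only ever given rewards and never the location of the goal.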

More episodes of Data Skeptic

Give Users the Wheel

June 23 • 35 min

June 17 • 35 min

Student Spotlight: Aaron Payne, Data Analyst

The Future is Agentic in Recommender Systems

Apr. 25 • 49 min

Book Ratings and Recommendations

March 27 • 39 min

Disentanglement and Interpretability in Recommender Systems

March 10 • 31 min

Collective Altruism in Recommender Systems

Feb. 27 • 55 min

Niche vs Mainstream

Feb. 18 • 34 min

Healthy Friction in Job Recommender Systems

Feb. 2 • 27 min

Fairness in PCA-Based Recommenders

Jan. 26 • 50 min

Data Skeptic with Kyle Polich is available on several platforms. The information on this page comes from public podcast feeds.