Video annotation is an expensive and time-consuming process. As a consequence, the available video datasets are useful but small. The availability of machine-transcribed explainer videos offers a unique opportunity to rapidly develop a useful, if noisy, corpus of videos that are "self-annotating", as hosts explain the actions they are taking on screen.
This episode is a discussion of the HowTo100M dataset - a project which has assembled a corpus of 136 million video clips with captions, covering 23k activities.
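To make the "self-annotating" idea concrete, here is a minimal sketch (not code from the paper; the field names and timing format are assumptions) of how machine-transcribed narration segments can be turned into weakly labeled clip-caption pairs:

```python
from dataclasses import dataclass

@dataclass
class CaptionSegment:
    start: float  # seconds into the video
    end: float
    text: str     # machine-transcribed narration for this interval

@dataclass
class ClipCaptionPair:
    video_id: str
    start: float
    end: float
    caption: str

def segments_to_pairs(video_id: str, segments: list[CaptionSegment]) -> list[ClipCaptionPair]:
    """Treat each transcript segment as a weak label for the co-occurring clip."""
    return [
        ClipCaptionPair(video_id, s.start, s.end, s.text)
        for s in segments
        if s.text.strip()  # skip empty transcript segments
    ]

# Example: one narrated step becomes one (clip, caption) training pair.
pairs = segments_to_pairs(
    "abc123",  # hypothetical video id
    [CaptionSegment(12.0, 17.5, "now whisk the eggs until fluffy")],
)
print(pairs[0].caption)
```

Because the narration and the on-screen action only roughly co-occur, pairs built this way are noisy by design; the appeal is that they come for free at a scale no manual annotation effort could match.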
Related Links
The paper will be presented at ICCV 2019.
