MolmoBot: A Vision-Language Model for Zero-Shot Robot Manipulation

38 min • 24 March 2026

MolmoBot is a vision-language model (VLM) for zero-shot robot manipulation, trained entirely in simulation with no real-world data. It achieves a 79.2% success rate on real-world tabletop tasks, compared with 39.2% for the π₀.₅ baseline.

More episodes of Embodied AI 101

AI Model Collapse: The Danger of Training on AI-Generated Data

31 March • 32 min

High-Level Automated Reasoning with Qwen2.5-7B

31 March • 28 min

Co-Training Large Behavior Models: Multimodal Data for Robot Manipulation

31 March • 33 min

MolmoBot: Opening a New Era of Simulated Training in Robotics

30 March • 22 min

HyDRA: Hybrid Memory for Dynamic Video World Models

30 March • 36 min

DexWM: Leveraging Human Videos for Dexterous Robot World Models

30 March • 31 min

World Models in Robotics

29 March • 27 min

MolmoBot: Training Robot Manipulation Entirely in Simulation

28 March • 25 min

SIMART: Decomposing Monolithic Meshes into Sim-Ready Articulated Assets

28 March • 45 min

LeWorldModel: A Stable JEPA World Model from Pixels

28 March • 14 min

Embodied AI 101 with Shaoqing Tan is available on multiple platforms. The information on this page comes from public podcast feeds.