Sign in to confirm you’re not a bot
This helps protect our community. Learn more
Comments are turned off. Learn more
AgiBot GO-1: The Evolution of Generalist Embodied Foundation Model from VLA to ViLLA
65Likes
8,866Views
Mar 102025
Today, AgiBot launches Genie Operator-1 (GO-1), an innovative generalist embodied foundation model. GO-1 introduces the novel Vision-Language-Latent-Action (ViLLA) framework, combining a Vision-Language Model (VLM) and Mixture of Experts (MoE). The VLM utilizes internet-scale heterogeneous data to establish a solid foundation for scene and object understanding. The MoE consists of two key components: the Latent Planner, which learns from cross-embodiment and human operation data to develop general action understanding, and the Action Expert, which uses over a million real robot demonstrations to achieve high-frequency and dexterous manipulation.
How this content was made
Altered or synthetic content
Sound or visuals were significantly edited or digitally generated. Learn more