Loading Events

Fall 2025 GRASP on Robotics: Jie Tan, Google DeepMind, “Gemini Robotics: Bringing AI into the Physical World”

November 21, 2025 at 10:30 AM - 11:45 AM
Details
Date: November 21, 2025
Time: 10:30 AM - 11:45 AM
Event Category: Seminar
Organizer
General Robotics, Automation, Sensing and Perception (GRASP) Lab
Venue
Wu and Chen Auditorium (Room 101), Levine Hall 3330 Walnut Street
Philadelphia
PA 19104
Google Map

This event will be in-person ONLY in Wu and Chen Auditorium.

ABSTRACT

Recent advancements in large multimodal models have led to the emergence of remarkable generalist capabilities in digital domains, yet their translation to physical agents such as robots remains a significant challenge. In this talk, I will present Gemini Robotics, an advanced Vision-Language-Action (VLA) generalist model capable of directly controlling robots. Gemini Robotics executes smooth movements to tackle a wide range of complex manipulation tasks while also being robust to variations in object types and positions, handling unseen environments as well as following diverse, open vocabulary instructions. With additional fine-tuning, Gemini Robotics can be specialized to new capabilities including solving long-horizon, highly dexterous tasks, learning new short-horizon tasks from as few as 100 demonstrations and adapting to completely novel robot embodiments. Furthermore, I will discuss the challenges, learnings and future research directions on robot foundation models.