BEGIN:VCALENDAR
VERSION:2.0
PRODID:-//Penn Engineering Events - ECPv6.17.1//NONSGML v1.0//EN
CALSCALE:GREGORIAN
METHOD:PUBLISH
X-WR-CALNAME:Penn Engineering Events
X-ORIGINAL-URL:https://seasevents.nmsdev7.com
X-WR-CALDESC:Events for Penn Engineering Events
REFRESH-INTERVAL;VALUE=DURATION:PT1H
X-Robots-Tag:noindex
X-PUBLISHED-TTL:PT1H
BEGIN:VTIMEZONE
TZID:America/New_York
BEGIN:DAYLIGHT
TZOFFSETFROM:-0500
TZOFFSETTO:-0400
TZNAME:EDT
DTSTART:20230312T070000
END:DAYLIGHT
BEGIN:STANDARD
TZOFFSETFROM:-0400
TZOFFSETTO:-0500
TZNAME:EST
DTSTART:20231105T060000
END:STANDARD
BEGIN:DAYLIGHT
TZOFFSETFROM:-0500
TZOFFSETTO:-0400
TZNAME:EDT
DTSTART:20240310T070000
END:DAYLIGHT
BEGIN:STANDARD
TZOFFSETFROM:-0400
TZOFFSETTO:-0500
TZNAME:EST
DTSTART:20241103T060000
END:STANDARD
BEGIN:DAYLIGHT
TZOFFSETFROM:-0500
TZOFFSETTO:-0400
TZNAME:EDT
DTSTART:20250309T070000
END:DAYLIGHT
BEGIN:STANDARD
TZOFFSETFROM:-0400
TZOFFSETTO:-0500
TZNAME:EST
DTSTART:20251102T060000
END:STANDARD
END:VTIMEZONE
BEGIN:VEVENT
DTSTART;TZID=America/New_York:20241025T103000
DTEND;TZID=America/New_York:20241025T114500
DTSTAMP:20241011T191325Z
CREATED:20241011T191325Z
LAST-MODIFIED:20241011T191325Z
UID:12384-1729852200-1729856700@seasevents.nmsdev7.com
SUMMARY:Fall 2024 GRASP on Robotics: Ruslan Salakhutdinov\, Carnegie Mellon University\, "Multimodal AI Agents"
DESCRIPTION:This will be a hybrid event with in-person attendance in Wu and Chen Auditorium and virtual attendance on Zoom.\nABSTRACT\nIn recent years\, the rise of Large Language Models (LLMs) with advanced general capabilities has paved the way towards building language-guided agents that can perform complex\, multi-step tasks on behalf of users\, much like human assistants. Building agents that can perceive\, plan\, and act autonomously has long been a central goal of artificial intelligence research. In this talk\, I will introduce multimodal AI agents capable of planning\, reasoning\, and executing actions on the web\, which can not only comprehend textual information but also effectively navigate and interact with visual settings. Next\, I will present an inference-time search algorithm for agents to explicitly perform exploration and multi-step planning in interactive web environments. Our approach is a form of best-first tree search that operates within the actual environment space\, and is complementary to most existing state-of-the-art agents. Finally\, I will introduce VisualWebArena\, a novel framework for evaluating multimodal autonomous language agents\, and offer insights towards building stronger autonomous agents for both digital and physical environments.
URL:https://seasevents.nmsdev7.com/event/fall-2024-grasp-on-robotics-ruslan-salakhutdinov-carnegie-mellon-university-multimodal-ai-agents/
LOCATION:Wu and Chen Auditorium (Room 101)\, Levine Hall\, 3330 Walnut Street\, Philadelphia\, PA\, 19104\, United States
CATEGORIES:Seminar
ORGANIZER;CN="General Robotics, Automation, Sensing and Perception (GRASP) Lab":MAILTO:grasplab@seas.upenn.edu
END:VEVENT
END:VCALENDAR