BEGIN:VCALENDAR
VERSION:2.0
PRODID:-//Penn Engineering Events - ECPv6.17.1//NONSGML v1.0//EN
CALSCALE:GREGORIAN
METHOD:PUBLISH
X-WR-CALNAME:Penn Engineering Events
X-ORIGINAL-URL:https://seasevents.nmsdev7.com
X-WR-CALDESC:Events for Penn Engineering Events
REFRESH-INTERVAL;VALUE=DURATION:PT1H
X-Robots-Tag:noindex
X-PUBLISHED-TTL:PT1H
BEGIN:VTIMEZONE
TZID:America/New_York
BEGIN:DAYLIGHT
TZOFFSETFROM:-0500
TZOFFSETTO:-0400
TZNAME:EDT
DTSTART:20240310T070000
END:DAYLIGHT
BEGIN:STANDARD
TZOFFSETFROM:-0400
TZOFFSETTO:-0500
TZNAME:EST
DTSTART:20241103T060000
END:STANDARD
BEGIN:DAYLIGHT
TZOFFSETFROM:-0500
TZOFFSETTO:-0400
TZNAME:EDT
DTSTART:20250309T070000
END:DAYLIGHT
BEGIN:STANDARD
TZOFFSETFROM:-0400
TZOFFSETTO:-0500
TZNAME:EST
DTSTART:20251102T060000
END:STANDARD
BEGIN:DAYLIGHT
TZOFFSETFROM:-0500
TZOFFSETTO:-0400
TZNAME:EDT
DTSTART:20260308T070000
END:DAYLIGHT
BEGIN:STANDARD
TZOFFSETFROM:-0400
TZOFFSETTO:-0500
TZNAME:EST
DTSTART:20261101T060000
END:STANDARD
END:VTIMEZONE
BEGIN:VEVENT
DTSTART;TZID=America/New_York:20250416T143000
DTEND;TZID=America/New_York:20250416T143000
DTSTAMP:20250411T143317Z
CREATED:20250411T143317Z
LAST-MODIFIED:20250411T143317Z
UID:13906-1744813800-1744813800@seasevents.nmsdev7.com
SUMMARY:ESE Ph.D. Thesis Defense: "Training Adaptive and Sample-Efficient Autonomous Agents"
DESCRIPTION:AI agents\, both in the physical and digital worlds\, should generalize from their training data to three increasingly difficult levels of deployment: training tasks and environments\, training tasks and environments with variations\, and completely new tasks and environments. Moreover\, like humans\, they are expected to learn from as little training data as possible\, especially in the physical world\, and adapt with as little adaptation data as possible. This thesis is founded around and describes work that tackles these levels of generalization with an additional emphasis on sample-efficiency. \nWe start with a focus on training data efficiency and the simplest level of generalization from training data to training tasks and environments (a.k.a.\, level 1). AI agents\, especially in the physical world\, are usually trained via one of two paradigms: imitation learning or reinforcement learning. First\, we propose a plug-in model class to improve behavior cloning with any deep neural network (DNN) backbone that is particularly effective in the low-data regime. Second\, we leverage our proposed model class to guarantee the conformance of any DNN world model to physics and medical constraints\, in a highly data-efficient manner. Third\, we improve the sample-efficiency of reinforcement learning agents\, by an order of magnitude\, by leveraging expert interventions. \nNext\, we tackle the challenge of generalization to training tasks and environments with variations as well as completely new tasks and environments (a.k.a.\, levels 2 and 3)\, keeping both training and adaptation sample-efficiency in mind. Here\, we pre-train REGENT\, a retrieval-augmented generalist agent that can adapt to unseen robotics and game-playing environments via in-context learning\, without any finetuning. REGENT outperforms state-of-the-art generalist agents after pre-training on an order-of-magnitude fewer datapoints and with up to 3x fewer parameters. We also propose a strategy\, inspired by adaptive control\, to improve the robustness of the image encoder of REGENT\, an essential component for handling environment variations. \nFinally\, we bring REGENT to the real world by converting a Vision Language Action model (VLA) to a REGENTic VLA capable of generalizing to unseen objects and tasks through retrieval-augmentation and in-context learning. Further task-specific REGENTic-tuning substantially improves reliability\, surpassing a VLA directly fine-tuned on the same data. \nWe conclude by outlining future directions to expand the envelope of tasks and environments to which a general AI agent can adapt.
URL:https://seasevents.nmsdev7.com/event/ese-ph-d-thesis-defense-training-adaptive-and-sample-efficient-autonomous-agents/
LOCATION:Room 512\, Levine Hall\, 3330 Walnut Street\, Philadelphia\, PA\, 19104\, United States
CATEGORIES:Dissertation or Thesis Defense
ORGANIZER;CN="Electrical and Systems Engineering":MAILTO:eseevents@seas.upenn.edu
END:VEVENT
END:VCALENDAR