BEGIN:VCALENDAR
VERSION:2.0
PRODID:-//Penn Engineering Events - ECPv6.16.4//NONSGML v1.0//EN
CALSCALE:GREGORIAN
METHOD:PUBLISH
X-WR-CALNAME:Penn Engineering Events
X-ORIGINAL-URL:https://seasevents.nmsdev7.com
X-WR-CALDESC:Events for Penn Engineering Events
REFRESH-INTERVAL;VALUE=DURATION:PT1H
X-Robots-Tag:noindex
X-PUBLISHED-TTL:PT1H
BEGIN:VTIMEZONE
TZID:America/New_York
BEGIN:DAYLIGHT
TZOFFSETFROM:-0500
TZOFFSETTO:-0400
TZNAME:EDT
DTSTART:20220313T070000
END:DAYLIGHT
BEGIN:STANDARD
TZOFFSETFROM:-0400
TZOFFSETTO:-0500
TZNAME:EST
DTSTART:20221106T060000
END:STANDARD
BEGIN:DAYLIGHT
TZOFFSETFROM:-0500
TZOFFSETTO:-0400
TZNAME:EDT
DTSTART:20230312T070000
END:DAYLIGHT
BEGIN:STANDARD
TZOFFSETFROM:-0400
TZOFFSETTO:-0500
TZNAME:EST
DTSTART:20231105T060000
END:STANDARD
BEGIN:DAYLIGHT
TZOFFSETFROM:-0500
TZOFFSETTO:-0400
TZNAME:EDT
DTSTART:20240310T070000
END:DAYLIGHT
BEGIN:STANDARD
TZOFFSETFROM:-0400
TZOFFSETTO:-0500
TZNAME:EST
DTSTART:20241103T060000
END:STANDARD
END:VTIMEZONE
BEGIN:VEVENT
DTSTART;TZID=America/New_York:20230209T123000
DTEND;TZID=America/New_York:20230209T133000
DTSTAMP:20260618T025219
CREATED:20230203T164528Z
LAST-MODIFIED:20230203T164528Z
UID:8366-1675945800-1675949400@seasevents.nmsdev7.com
SUMMARY:ESE Spring Seminar - "Reliable Data-Driven Decision-Making Systems"
DESCRIPTION:Despite impressive success in domains such as vision and language\, machine learning is still far from reliable integration into many challenging real-world scenarios\, such as healthcare\, where the coverage of existing data and the ability to collect new\, diverse data are limited. This talk focuses on mathematically formulating and addressing some of the challenges in data-driven decision-making systems\, studied in the reinforcement learning (RL) framework. I will discuss decision-making based on two sources of data: historical (offline) data and actively-collected data. In learning from offline data\, I first mathematically formulate the challenge of partial data coverage. I show that this formulation combined with pessimistic offline RL unifies the major offline learning paradigms: imitation learning and conventional offline RL. I then present statistically-optimal and practical offline RL algorithms that simultaneously exploit expressive models\, such as deep neural networks\, and historical datasets with any coverage\, to learn good decision-making policies. In learning from interactive data\, I present general formulations and theoretically-guaranteed algorithms that exploit problem structure and expressive models to collect data for learning good policies\, with efficacy demonstrated in a variety of navigation and locomotion tasks.
URL:https://seasevents.nmsdev7.com/event/ese-spring-seminar-title-tbd/
LOCATION:Raisler Lounge (Room 225)\, Towne Building\, 220 South 33rd Street\, Philadelphia\, PA\, 19104\, United States
CATEGORIES:Colloquium
ORGANIZER;CN="Electrical and Systems Engineering":MAILTO:eseevents@seas.upenn.edu
END:VEVENT
END:VCALENDAR