ESE Guest Seminar – “Safe Offline RL for Constrained Markov Decision Process: Theory and Practice”
/
Greenberg Lounge (Room 114), Skirkanich Hall
210 South 33rd Street, Philadelphia, PA, United States
Many constrained sequential decision-making processes such as safe AV navigation, wireless network control, caching, cloud computing, etc., can be cast as Constrained Markov Decision Processes (CMDP). Reinforcement Learning (RL) algorithms have been used to learn optimal policies for unknown unconstrained MDP. Extending these RL algorithms to unknown CMDP, brings the additional challenge of not only […]

