BEGIN:VCALENDAR
VERSION:2.0
PRODID:-//Penn Engineering Events - ECPv6.16.3//NONSGML v1.0//EN
CALSCALE:GREGORIAN
METHOD:PUBLISH
X-WR-CALNAME:Penn Engineering Events
X-ORIGINAL-URL:https://seasevents.nmsdev7.com
X-WR-CALDESC:Events for Penn Engineering Events
REFRESH-INTERVAL;VALUE=DURATION:PT1H
X-Robots-Tag:noindex
X-PUBLISHED-TTL:PT1H
BEGIN:VTIMEZONE
TZID:America/New_York
BEGIN:DAYLIGHT
TZOFFSETFROM:-0500
TZOFFSETTO:-0400
TZNAME:EDT
DTSTART:20240310T070000
END:DAYLIGHT
BEGIN:STANDARD
TZOFFSETFROM:-0400
TZOFFSETTO:-0500
TZNAME:EST
DTSTART:20241103T060000
END:STANDARD
BEGIN:DAYLIGHT
TZOFFSETFROM:-0500
TZOFFSETTO:-0400
TZNAME:EDT
DTSTART:20250309T070000
END:DAYLIGHT
BEGIN:STANDARD
TZOFFSETFROM:-0400
TZOFFSETTO:-0500
TZNAME:EST
DTSTART:20251102T060000
END:STANDARD
BEGIN:DAYLIGHT
TZOFFSETFROM:-0500
TZOFFSETTO:-0400
TZNAME:EDT
DTSTART:20260308T070000
END:DAYLIGHT
BEGIN:STANDARD
TZOFFSETFROM:-0400
TZOFFSETTO:-0500
TZNAME:EST
DTSTART:20261101T060000
END:STANDARD
END:VTIMEZONE
BEGIN:VEVENT
DTSTART;TZID=America/New_York:20250129T120000
DTEND;TZID=America/New_York:20250129T131500
DTSTAMP:20260602T190912
CREATED:20241119T143600Z
LAST-MODIFIED:20241119T143600Z
UID:12658-1738152000-1738156500@seasevents.nmsdev7.com
SUMMARY:ASSET Seminar: "Efficient Sharing of AI Infrastructures with Specialized Serverless Computing"
DESCRIPTION:Abstract: \nThe efficient sharing of AI infrastructures is becoming increasingly important in both public and private data centers. This demand is driven by two key factors: the proliferation of specialized AI models tailored for different users and applications\, and the highly dynamic nature of requests\, which are often on-demand. Dedicated GPU allocation in such scenarios results in prohibitively high costs and inefficient resource utilization. \nIn this talk\, I will introduce serverless computing as a promising paradigm for addressing these challenges by enabling efficient\, on-demand sharing of AI infrastructures. I will highlight its use cases and discuss key barriers to broader adoption. Following this\, I will present ServerlessLLM\, a state-of-the-art system designed to tackle key challenges in serverless large language model (LLM) inference\, particularly cold-start latency. Specifically\, I will cover ServerlessLLM’s novel contributions\, including its checkpoint format design\, locality-aware scheduling\, and inference request live migration. Finally\, I will outline open challenges beyond efficiency\, such as fairness\, privacy\, and sustainability\, which are critical for the future of serverless AI systems. \nZoom Link (if unable to attend in-person): https://upenn.zoom.us/j/95090162762
URL:https://seasevents.nmsdev7.com/event/asset-seminar-yao-fu-university-of-edinburgh/
LOCATION:Amy Gutmann Hall\, Room 414\, 3333 Chestnut Street\, Philadelphia\, 19104\, United States
END:VEVENT
END:VCALENDAR