BEGIN:VCALENDAR
VERSION:2.0
PRODID:-//Penn Engineering Events - ECPv6.16.3//NONSGML v1.0//EN
CALSCALE:GREGORIAN
METHOD:PUBLISH
X-WR-CALNAME:Penn Engineering Events
X-ORIGINAL-URL:https://seasevents.nmsdev7.com
X-WR-CALDESC:Events for Penn Engineering Events
REFRESH-INTERVAL;VALUE=DURATION:PT1H
X-Robots-Tag:noindex
X-PUBLISHED-TTL:PT1H
BEGIN:VTIMEZONE
TZID:America/New_York
BEGIN:DAYLIGHT
TZOFFSETFROM:-0500
TZOFFSETTO:-0400
TZNAME:EDT
DTSTART:20240310T070000
END:DAYLIGHT
BEGIN:STANDARD
TZOFFSETFROM:-0400
TZOFFSETTO:-0500
TZNAME:EST
DTSTART:20241103T060000
END:STANDARD
BEGIN:DAYLIGHT
TZOFFSETFROM:-0500
TZOFFSETTO:-0400
TZNAME:EDT
DTSTART:20250309T070000
END:DAYLIGHT
BEGIN:STANDARD
TZOFFSETFROM:-0400
TZOFFSETTO:-0500
TZNAME:EST
DTSTART:20251102T060000
END:STANDARD
BEGIN:DAYLIGHT
TZOFFSETFROM:-0500
TZOFFSETTO:-0400
TZNAME:EDT
DTSTART:20260308T070000
END:DAYLIGHT
BEGIN:STANDARD
TZOFFSETFROM:-0400
TZOFFSETTO:-0500
TZNAME:EST
DTSTART:20261101T060000
END:STANDARD
END:VTIMEZONE
BEGIN:VEVENT
DTSTART;TZID=America/New_York:20250204T153000
DTEND;TZID=America/New_York:20250204T163000
DTSTAMP:20260602T175830
CREATED:20250130T181928Z
LAST-MODIFIED:20250130T181928Z
UID:13060-1738683000-1738686600@seasevents.nmsdev7.com
SUMMARY:CIS Seminar: "Thinking Outside the GPU: Systems for Scalable Machine Learning Pipelines"
DESCRIPTION:Scalable and efficient machine learning (ML) systems have been instrumental in fueling recent advancements in ML capabilities. However\, further scaling these systems requires more than simply increasing the number and performance of accelerators. This is because modern ML deployments rely on complex pipelines composed of many diverse and interconnected systems.  \nIn this talk\, I will emphasize the importance of building scalable systems across the entire ML pipeline. In particular\, I will explore how large-scale ML training pipelines\, including those deployed at Meta\, require distributed data storage and ingestion systems to manage massive training datasets. Optimizing these data systems is essential as data demands continue to grow. To achieve this\, I will demonstrate how synergistic optimizations across the training data pipeline can unlock performance and efficiency gains beyond what isolated system optimizations can achieve. While these synergistic optimizations are critical\, deploying them requires navigating a large system design space. To address this challenge\, I will next introduce cedar\, a framework that automates the optimization and orchestration of ML data processing for diverse training workloads. Finally\, I will discuss further opportunities in advancing the scalability\, security\, and capabilities of the hardware and software systems that continue to drive increasingly sophisticated ML training and inference pipelines.
URL:https://seasevents.nmsdev7.com/event/cis-seminar-thinking-outside-the-gpu-systems-for-scalable-machine-learning-pipelines/
LOCATION:Wu & Chen Auditorium
ORGANIZER;CN="Computer and Information Science":MAILTO:cherylh@cis.upenn.edu
END:VEVENT
END:VCALENDAR