BEGIN:VCALENDAR
VERSION:2.0
PRODID:-//Penn Engineering Events - ECPv6.15.18//NONSGML v1.0//EN
CALSCALE:GREGORIAN
METHOD:PUBLISH
X-WR-CALNAME:Penn Engineering Events
X-ORIGINAL-URL:https://seasevents.nmsdev7.com
X-WR-CALDESC:Events for Penn Engineering Events
REFRESH-INTERVAL;VALUE=DURATION:PT1H
X-Robots-Tag:noindex
X-PUBLISHED-TTL:PT1H
BEGIN:VTIMEZONE
TZID:America/New_York
BEGIN:DAYLIGHT
TZOFFSETFROM:-0500
TZOFFSETTO:-0400
TZNAME:EDT
DTSTART:20220313T070000
END:DAYLIGHT
BEGIN:STANDARD
TZOFFSETFROM:-0400
TZOFFSETTO:-0500
TZNAME:EST
DTSTART:20221106T060000
END:STANDARD
BEGIN:DAYLIGHT
TZOFFSETFROM:-0500
TZOFFSETTO:-0400
TZNAME:EDT
DTSTART:20230312T070000
END:DAYLIGHT
BEGIN:STANDARD
TZOFFSETFROM:-0400
TZOFFSETTO:-0500
TZNAME:EST
DTSTART:20231105T060000
END:STANDARD
BEGIN:DAYLIGHT
TZOFFSETFROM:-0500
TZOFFSETTO:-0400
TZNAME:EDT
DTSTART:20240310T070000
END:DAYLIGHT
BEGIN:STANDARD
TZOFFSETFROM:-0400
TZOFFSETTO:-0500
TZNAME:EST
DTSTART:20241103T060000
END:STANDARD
END:VTIMEZONE
BEGIN:VEVENT
DTSTART;TZID=America/New_York:20231130T153000
DTEND;TZID=America/New_York:20231130T163000
DTSTAMP:20260404T001015
CREATED:20231120T182112Z
LAST-MODIFIED:20231120T182112Z
UID:10183-1701358200-1701361800@seasevents.nmsdev7.com
SUMMARY:CIS Seminar: "Diffusion Models in Computer Vision"
DESCRIPTION:Denoising diffusion models represent a recent emerging topic in computer vision\, demonstrating impressive results in generative modeling. A diffusion model is a deep generative model that is based on two stages\, a forward diffusion stage and a reverse diffusion stage. In the forward diffusion stage\, the input data is gradually perturbed over several steps by adding Gaussian noise. In the reverse stage\, a model is tasked at recovering the original input data by learning to gradually reverse the diffusion. Diffusion models are widely appreciated for the quality and diversity of the generated images. In this talk I will present our recent work on how diffusion models can be employed for solving computer vision problems. First\, I will discuss temporal action segmentation for comprehending human behaviors in complex videos\, which aims to process a long video and produce a sequence that delineates the action category for each frame. I will present a framework based on the denoising diffusion model that iteratively produces action predictions starting with random noise\, conditioned on the features of the input video. To effectively capture three key characteristics of human actions\, namely the position prior\, the boundary ambiguity\, and the relational dependency\, we propose a cohesive masking strategy for the conditioning features.  Next\, I will briefly discuss how diffusion models are employed to solve the problems of person image synthesis\, cloth-changing person re-identification\, and limited field of view cross-view geo-localization and present state of results. \nAlthough the use of diffusion models has yielded positive results in text-to-image generation\, there is a notable lack of research regarding the understanding of these models.  For example\, there is a rising need to understand how to design effective prompts that produce the desired outcome. Next\, I will briefly talk about our ongoing work on Reverse Stable Diffusion: What prompt was used to generate this image?  I will end this talk by briefly discussing our recent work that underscores the significance of incorporating symmetries into diffusion models\, by enforcing equivariance to a general set of transformations within DDPM’s reverse denoising learning process.
URL:https://seasevents.nmsdev7.com/event/cis-seminar-diffusion-models-in-computer-vision/
LOCATION:Wu and Chen Auditorium (Room 101)\, Levine Hall\, 3330 Walnut Street\, Philadelphia\, PA\, 19104\, United States
ORGANIZER;CN="Computer and Information Science":MAILTO:cherylh@cis.upenn.edu
END:VEVENT
END:VCALENDAR