Name: ASSET Seminar: “Controlling Language Models”
Start: 2025-03-26T12:00:00-04:00
End: 2025-03-26T13:15:00-04:00
Location: Amy Gutmann Hall, Room 414

ASSET Seminar: “Controlling Language Models”

March 26, 2025 at 12:00 PM - 1:15 PM

Share this event

Add to Calendar

Details

Date: March 26, 2025

Time: 12:00 PM - 1:15 PM

Event Tags:ASSET, CIS, AI

Venue

Amy Gutmann Hall, Room 414 3333 Chestnut Street
Philadelphia
19104 Google Map

Abstract:

Controlling language models is key to unlocking their full potential and making them useful for downstream tasks. Successfully deploying these models often requires both task-specific customization and rigorous auditing of their behavior. In this talk, I will begin by introducing a customization method called Prefix-Tuning, which adapts language models by updating only 0.1% of their parameters. Next, I will address the need for robust auditing by presenting a Frank-Wolfe-inspired algorithm for red-teaming language models, which provides a principled framework for discovering diverse failure modes. Finally, I will rethink the root cause of these control challenges, and propose a new generative model for text, called Diffusion-LM, which is controllable by design.

Zoom Link (if unable to attend in-person): https://upenn.zoom.us/j/93867005722

ASSET Seminar: “Controlling Language Models”

March 26, 2025 at 12:00 PM - 1:15 PM

Details

Venue

Read More