Name: ASSET Seminar: “Robustness in the Era of LLMs: Jailbreaking Attacks and Defenses”
Start: 2024-09-25T12:00:00-04:00
End: 2024-09-25T13:15:00-04:00
Location: Raisler Lounge (Room 225), Towne Building

ASSET Seminar: “Robustness in the Era of LLMs: Jailbreaking Attacks and Defenses”

September 25, 2024 at 12:00 PM - 1:15 PM

Share this event

Add to Calendar

Details

Date: September 25, 2024

Time: 12:00 PM - 1:15 PM

Event Tags:ASSET, CIS, AI

Venue

Raisler Lounge (Room 225), Towne Building 220 South 33rd Street
Philadelphia
PA 19104 Google Map

View Venue Website

Abstract:

Despite efforts to align large language models (LLMs) with human intentions, popular LLMs such as chatGPT, Llama, Claude, and Gemini are susceptible to jailbreaking attacks, wherein an adversary fools a targeted LLM into generating objectionable content. For this reason, interest has grown in improving the robustness of LLMs against such attacks. In this talk, we review the current state of the jailbreaking literature, including new questions about robust generalization, discussions of new black-box attacks on LLMs, defenses against jailbreaking attacks, and a new leaderboard to evaluate the robust generalization of production LLMs.

Zoom Link (if unable to attend in-person): https://upenn.zoom.us/j/93335180566

ASSET Seminar: “Robustness in the Era of LLMs: Jailbreaking Attacks and Defenses”

September 25, 2024 at 12:00 PM - 1:15 PM

Details

Venue

Read More