ASSET Seminar: “How do LLMs generalize on out-of-distribution tasks? Insights from models’ internal representations”
September 24, 2025 at 12:00 PM - 1:15 PM
A mystery of large language models (LLMs) is their ability to solve novel tasks, notably from just a few demonstrations in the prompt (in-context learning). Such tasks often require the model to generalize far beyond its training distribution, raising the question: how do LLMs achieve this form of out-of-distribution (OOD) generalization? For example, in symbolized language reasoning, names and labels are replaced by arbitrary symbols, yet the model can infer the correct name-label mapping without any finetuning.
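As an illustration of such a symbolized task, consider the minimal Python sketch below. It is a hypothetical prompt, not necessarily the exact format studied in the talk: the review/sentiment framing and the symbols “@#” and “%&” are invented stand-ins for real labels, and the model must infer the symbol-to-label mapping purely from the in-context demonstrations.

```python
# Hypothetical symbolized in-context-learning prompt (illustration only).
# The arbitrary symbols "@#" and "%&" replace the labels "positive"/"negative";
# the mapping is never stated and must be inferred from the demonstrations.
demonstrations = [
    ("The movie was wonderful and moving.", "@#"),      # stands in for "positive"
    ("The plot was dull and the acting flat.", "%&"),   # stands in for "negative"
    ("A delightful, heartwarming story.", "@#"),
]
query = "I regretted every minute of it."

prompt = "\n".join(f"Review: {text}\nLabel: {label}" for text, label in demonstrations)
prompt += f"\nReview: {query}\nLabel:"
print(prompt)  # a capable LLM is expected to continue with "%&"
```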
In this talk, I will open the black box of LLMs and reveal how three facets of LLM behavior are interconnected: emergent phenomena during training, OOD generalization, and a model’s representation of compositions. Focusing on induction heads, I will show that learning the right compositional structure is key to OOD generalization, and that this learning process exhibits sharp transitions in the training dynamics. Further, I will propose the “common bridge representation hypothesis”: a latent subspace of the embedding space acts as a bridge that aligns multiple attention heads across early and later layers, and this may be the key geometric structure underlying the success of transformers.
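For readers unfamiliar with induction heads, the sketch below illustrates the standard pattern-completion rule they are described as implementing in the mechanistic-interpretability literature ([A][B] ... [A] predicts [B]). It is a plain-Python illustration of that rule, not code or results from the talk.

```python
# Minimal sketch of the induction-head rule (illustration, not the speaker's code):
# given the current token, find its most recent earlier occurrence in the context
# and predict the token that followed it, i.e. [A][B] ... [A] -> [B].
def induction_prediction(tokens):
    """Return the token that followed the most recent earlier occurrence of the last token."""
    current = tokens[-1]
    for i in range(len(tokens) - 2, -1, -1):
        if tokens[i] == current:
            return tokens[i + 1]
    return None  # no earlier occurrence, so the rule makes no prediction

print(induction_prediction(["Mr", "Dur", "sley", "...", "Mr", "Dur"]))  # -> "sley"
```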

