Name: CIS Seminar: “Rethinking Data Use in Large Language Models”
Start: 2024-02-08T15:30:00-05:00
End: 2024-02-08T16:30:00-05:00
Location: Wu and Chen Auditorium (Room 101), Levine Hall

CIS Seminar: “Rethinking Data Use in Large Language Models”

February 8, 2024 at 3:30 PM - 4:30 PM

Details

Date: February 8, 2024

Time: 3:30 PM - 4:30 PM

Event Tags: CIS

Organizer

Computer and Information Science

Phone: 215-898-8560

Email: cherylh@cis.upenn.edu

Venue

Wu and Chen Auditorium (Room 101), Levine Hall, 3330 Walnut Street, Philadelphia, PA 19104

Large language models (LMs) such as ChatGPT have revolutionized natural language processing and artificial intelligence more broadly. In this talk, I will discuss my research on understanding and advancing these models, centered on how they use the very large text corpora they are trained on. First, I will describe our efforts to understand how these models learn to perform new tasks after training, demonstrating that their so-called in-context learning capabilities are almost entirely determined by what they learn from the training data. Next, I will introduce a new class of LMs, nonparametric LMs, that repurpose this training data as a data store from which they retrieve information for improved accuracy and updatability. I will describe my work on establishing the foundations of such models, including one of the first broadly used neural retrieval models and an approach that simplifies a traditional two-stage pipeline into a single stage. I will also discuss how nonparametric models open up new avenues for responsible data use, e.g., by segregating permissively licensed and copyrighted text and using them differently. Finally, I will envision the next generation of LMs we should build, focusing on efficient scaling, improved factuality, and decentralization.
