Loading Events

CIS Seminar: ” Empowering Large Language Models with Efficient and Automated Systems”

March 26, 2024 at 3:30 PM - 4:30 PM
Details
Date: March 26, 2024
Time: 3:30 PM - 4:30 PM
  • Event Tags:
  • Organizer
    Computer and Information Science
    Phone: 215-898-8560
    Venue
    Wu and Chen Auditorium (Room 101), Levine Hall 3330 Walnut Street
    Philadelphia
    PA 19104
    Google Map

    Large Language Models (LLMs) have brought remarkable advancements to the computing industry. However, a high barrier exists between the LLMs and the vast majority of researchers and practitioners, brought by the engineering challenges with the enormous model sizes and the substantial compute requirements. In this talk, I’ll discuss my research on system innovations to democratize LLMs, which includes (1) Alpa and AlpaServe, the first system to automate model-parallel training and accelerate serving with model parallelism, and (2) vLLM, a high-throughput and memory-efficient serving engine for large language models, accelerated with PagedAttention. I will conclude by presenting the short-term research challenges and long-term trends in LLM systems.