ASSET Seminar: “The coverage principle in language models: From pre-training to test-time scaling”
Amy Gutmann Hall, Room 414
3333 Chestnut Street, Philadelphia, United States
Test-time compute has emerged as a new axis for scaling language model capabilities, yet we lack a principled understanding of this paradigm. What are the right algorithms and trade-offs for test-time scaling? What properties of the pre-trained model enable it? And can we better align pre-training recipes for test-time success? This talk addresses these questions […]

