ASSET Seminar: “Rethinking Test-Time Thinking: From Token-Level Rewards to Robust Generative Agents”
Amy Gutmann Hall, Room 414
3333 Chestnut Street, Philadelphia, United States
We present a unified perspective on test-time thinking as a lens for improving generative AI agents through finer-grained reward modeling, data-centric reasoning, and robust alignment. Beginning with GenARM, we introduce an […]

