BEGIN:VCALENDAR
VERSION:2.0
PRODID:-//Penn Engineering Events - ECPv6.17.1//NONSGML v1.0//EN
CALSCALE:GREGORIAN
METHOD:PUBLISH
X-WR-CALNAME:Penn Engineering Events
X-ORIGINAL-URL:https://seasevents.nmsdev7.com
X-WR-CALDESC:Events for Penn Engineering Events
REFRESH-INTERVAL;VALUE=DURATION:PT1H
X-Robots-Tag:noindex
X-PUBLISHED-TTL:PT1H
BEGIN:VTIMEZONE
TZID:America/New_York
BEGIN:DAYLIGHT
TZOFFSETFROM:-0500
TZOFFSETTO:-0400
TZNAME:EDT
DTSTART:20240310T070000
END:DAYLIGHT
BEGIN:STANDARD
TZOFFSETFROM:-0400
TZOFFSETTO:-0500
TZNAME:EST
DTSTART:20241103T060000
END:STANDARD
BEGIN:DAYLIGHT
TZOFFSETFROM:-0500
TZOFFSETTO:-0400
TZNAME:EDT
DTSTART:20250309T070000
END:DAYLIGHT
BEGIN:STANDARD
TZOFFSETFROM:-0400
TZOFFSETTO:-0500
TZNAME:EST
DTSTART:20251102T060000
END:STANDARD
BEGIN:DAYLIGHT
TZOFFSETFROM:-0500
TZOFFSETTO:-0400
TZNAME:EDT
DTSTART:20260308T070000
END:DAYLIGHT
BEGIN:STANDARD
TZOFFSETFROM:-0400
TZOFFSETTO:-0500
TZNAME:EST
DTSTART:20261101T060000
END:STANDARD
END:VTIMEZONE
BEGIN:VEVENT
DTSTART;TZID=America/New_York:20250910T120000
DTEND;TZID=America/New_York:20250910T131500
DTSTAMP:20250821T202705Z
CREATED:20250821T202705Z
LAST-MODIFIED:20250821T202705Z
UID:20912-1757505600-1757510100@seasevents.nmsdev7.com
SUMMARY:ASSET Seminar: "Rethinking Test-Time Thinking: From Token-Level Rewards to Robust Generative Agents"
DESCRIPTION:We present a unified perspective on test-time thinking as a lens for improving generative AI agents through finer-grained reward modeling\, data-centric reasoning\, and robust alignment. Beginning with GenARM\, we introduce an inductive bias for denser\, token-level reward modeling that guides generation during decoding\, enabling token-level alignment without retraining. While GenARM targets reward design\, ThinkLite-VL focuses on the data side of reasoning. It proposes a self-improvement framework that selects the most informative samples via MCTS-guided search\, yielding stronger visual reasoning with fewer labels. Taking this a step further\, MORSE-500 moves beyond selection to creation: it programmatically generates targeted\, controllable multimodal data to systematically probe and stress-test models’ reasoning abilities. We then interrogate a central assumption in inference-time alignment: Does Thinking More Always Help? Our findings reveal that increased reasoning steps can degrade performance–not due to better or worse reasoning per se\, but due to rising variance in outputs\, challenging the naive scaling paradigm. Finally\, AegisLLM applies test-time thinking in the service of security\, using an agentic\, multi-perspective framework to defend against jailbreaks\, prompt injections\, and unlearning attacks–all at inference time. Together\, these works chart a path toward generative agents that are not only more capable\, but more data-efficient\, introspective\, and robust in real-world deployment. \n  \nSeminar Recording: https://drive.google.com/file/d/13jOKuou0QzqkMo9QHEdoHA1nCIxOPsbm/view?usp=drive_link
URL:https://seasevents.nmsdev7.com/event/asset-seminar-rethinking-test-time-thinking-from-token-level-rewards-to-robust-generative-agents/
LOCATION:Amy Gutmann Hall\, Room 414\, 3333 Chestnut Street\, Philadelphia\, 19104\, United States
CATEGORIES:Seminar
ORGANIZER;CN="AI-enabled Systems%3A Safe%2C Explainable%2C and Trustworthy (ASSET) Center":MAILTO:asset-info@seas.upenn.edu
END:VEVENT
END:VCALENDAR