CIS Seminar: “Reliable and Socially Aligned LLMs: Are We There Yet?”
/
Wu and Chen Auditorium (Room 101), Levine Hall
3330 Walnut Street, Philadelphia, PA, United States
Large language models (LLMs) are powerful but not yet reliable: they hallucinate, misalign with human values, and struggle with social reasoning. In this talk, I will trace a path from diagnosing failure modes such as hallucinations, to uncovering the pitfalls of aligning models with noisy human preferences and diverse values, and finally to emerging frontiers […]

