BEGIN:VCALENDAR
VERSION:2.0
PRODID:-//Penn Engineering Events - ECPv6.17.1//NONSGML v1.0//EN
CALSCALE:GREGORIAN
METHOD:PUBLISH
X-WR-CALNAME:Penn Engineering Events
X-ORIGINAL-URL:https://seasevents.nmsdev7.com
X-WR-CALDESC:Events for Penn Engineering Events
REFRESH-INTERVAL;VALUE=DURATION:PT1H
X-Robots-Tag:noindex
X-PUBLISHED-TTL:PT1H
BEGIN:VTIMEZONE
TZID:America/New_York
BEGIN:DAYLIGHT
TZOFFSETFROM:-0500
TZOFFSETTO:-0400
TZNAME:EDT
DTSTART:20240310T070000
END:DAYLIGHT
BEGIN:STANDARD
TZOFFSETFROM:-0400
TZOFFSETTO:-0500
TZNAME:EST
DTSTART:20241103T060000
END:STANDARD
BEGIN:DAYLIGHT
TZOFFSETFROM:-0500
TZOFFSETTO:-0400
TZNAME:EDT
DTSTART:20250309T070000
END:DAYLIGHT
BEGIN:STANDARD
TZOFFSETFROM:-0400
TZOFFSETTO:-0500
TZNAME:EST
DTSTART:20251102T060000
END:STANDARD
BEGIN:DAYLIGHT
TZOFFSETFROM:-0500
TZOFFSETTO:-0400
TZNAME:EDT
DTSTART:20260308T070000
END:DAYLIGHT
BEGIN:STANDARD
TZOFFSETFROM:-0400
TZOFFSETTO:-0500
TZNAME:EST
DTSTART:20261101T060000
END:STANDARD
END:VTIMEZONE
BEGIN:VEVENT
DTSTART;TZID=America/New_York:20251210T120000
DTEND;TZID=America/New_York:20251210T131500
DTSTAMP:20250821T204554Z
CREATED:20250821T204554Z
LAST-MODIFIED:20250821T204554Z
UID:20917-1765368000-1765372500@seasevents.nmsdev7.com
SUMMARY:ASSET Seminar: "Reality Checks"
DESCRIPTION:Despite its success\, leaderboard chasing has become something researchers dread and mock. When implemented properly and executed faithfully\, leaderboard chasing can lead to both faster and easily reproducible progress in science\, as evident from the amazing progress we have seen with machine learning\, or more broadly artificial intelligence\, in recent decades. It does not however mean that it is easy to implement and execute leaderboard chasing properly. In this talk\, I will go over four case studies demonstrating the issues that ultimately prevent leaderboard chasing from a valid scientific approach. The first case study is on the lack of proper hyperparameter tuning in continual learning\, the second on the lack of consensus on evaluation metrics in machine unlearning\, the third on the challenges of properly evaluating the evaluation metrics in free-form text generation\, and the final one on wishful thinking. By going over these cases\, I hope we can collectively acknowledge some of our own fallacies\, think of underlying causes behind these fallacies and come up with better ways to approach artificial intelligence research. \n  \nZoom: https://upenn.zoom.us/j/96405514259
URL:https://seasevents.nmsdev7.com/event/asset-seminar-title-tbd-9/
LOCATION:Amy Gutmann Hall\, Room 414\, 3333 Chestnut Street\, Philadelphia\, 19104\, United States
CATEGORIES:Seminar
ORGANIZER;CN="AI-enabled Systems%3A Safe%2C Explainable%2C and Trustworthy (ASSET) Center":MAILTO:asset-info@seas.upenn.edu
END:VEVENT
END:VCALENDAR