BEGIN:VCALENDAR
VERSION:2.0
PRODID:-//Penn Engineering Events - ECPv6.15.18//NONSGML v1.0//EN
CALSCALE:GREGORIAN
METHOD:PUBLISH
X-WR-CALNAME:Penn Engineering Events
X-ORIGINAL-URL:https://seasevents.nmsdev7.com
X-WR-CALDESC:Events for Penn Engineering Events
REFRESH-INTERVAL;VALUE=DURATION:PT1H
X-Robots-Tag:noindex
X-PUBLISHED-TTL:PT1H
BEGIN:VTIMEZONE
TZID:America/New_York
BEGIN:DAYLIGHT
TZOFFSETFROM:-0500
TZOFFSETTO:-0400
TZNAME:EDT
DTSTART:20210314T070000
END:DAYLIGHT
BEGIN:STANDARD
TZOFFSETFROM:-0400
TZOFFSETTO:-0500
TZNAME:EST
DTSTART:20211107T060000
END:STANDARD
BEGIN:DAYLIGHT
TZOFFSETFROM:-0500
TZOFFSETTO:-0400
TZNAME:EDT
DTSTART:20220313T070000
END:DAYLIGHT
BEGIN:STANDARD
TZOFFSETFROM:-0400
TZOFFSETTO:-0500
TZNAME:EST
DTSTART:20221106T060000
END:STANDARD
BEGIN:DAYLIGHT
TZOFFSETFROM:-0500
TZOFFSETTO:-0400
TZNAME:EDT
DTSTART:20230312T070000
END:DAYLIGHT
BEGIN:STANDARD
TZOFFSETFROM:-0400
TZOFFSETTO:-0500
TZNAME:EST
DTSTART:20231105T060000
END:STANDARD
END:VTIMEZONE
BEGIN:VEVENT
DTSTART;TZID=America/New_York:20220321T153000
DTEND;TZID=America/New_York:20220321T163000
DTSTAMP:20260406T031342
CREATED:20220311T134352Z
LAST-MODIFIED:20220311T134352Z
UID:6501-1647876600-1647880200@seasevents.nmsdev7.com
SUMMARY:CIS Seminar: "Fast and Effective Analytics for Big Multi-Dimensional Data"
DESCRIPTION:Today\, automated processes\, Internet‑of‑Things deployments\, and Web and mobile applications generate an overwhelming amount of high‑dimensional data. Meanwhile\, computational resources remain limited\, and advances in machine learning (ML) create a pressing need to support increasingly expensive and complex analytical tasks. Unfortunately\, traditional data management techniques offer limited support for high‑dimensional data\, ML tasks\, and adaptation to data properties\, often resulting in reduced performance. Similarly\, due to the difficulty of providing invariances to specific data distortions\, applications often resort to inadequate ML methods\, reducing their effectiveness. \nIn my work\, I ask how we can address the lack of task‑aware and data‑driven adaptations in data management and ML methods. Specifically\, I will discuss three solutions for (i) data representations and (ii) computational methods using techniques to exploit similarities\, shapes\, densities\, and distributions in data. Motivated by the ubiquity of high-dimensional time series\, I will first present a similarity-preserving representation to minimize storage footprint and accelerate specific ML analytics for time-series data. Then\, I will discuss a variance-aware quantization method for indexing high-dimensional data. Finally\, I will present a method for anomaly detection in streaming data to account for distribution drifts. In all three examples\, the proposed methods substantially improve performance and accuracy\, demonstrating the benefit of designing task-aware and data-driven solutions for large-scale data science applications.
URL:https://seasevents.nmsdev7.com/event/cis-seminar-fast-and-effective-analytics-for-big-multi-dimensional-data/
LOCATION:Zoom – Email CIS for link\, cherylh@cis.upenn.edu
ORGANIZER;CN="Computer and Information Science":MAILTO:cherylh@cis.upenn.edu
END:VEVENT
END:VCALENDAR