Seminar on Statistics and Data Science - Clustering via Uncoupled Regression (CURE)

May 8

9:30am - 11:00am

Seminar, Lecture, Talk

In this talk, we first consider a canonical clustering problem in which one receives unlabeled samples drawn from a balanced mixture of two elliptical distributions and seeks a classifier that estimates the labels. Many popular methods, including PCA and k-means, require the individual components of the mixture to be roughly spherical and perform poorly when the components are stretched. To overcome this issue, we propose a non-convex program that seeks an affine transform turning the data into a one-dimensional point cloud concentrated around -1 and 1, after which clustering becomes easy. Our theoretical contributions are twofold: (1) we show that the non-convex loss function exhibits desirable landscape properties as long as the sample size exceeds some constant multiple of the dimension, and (2) we leverage this to prove that an efficient first-order algorithm achieves near-optimal statistical precision even without good initialization. We also propose a general methodology for multi-class clustering tasks with flexible choices of feature transforms and loss objectives.
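As a rough illustration of the idea in the abstract, the Python sketch below runs plain gradient descent to find an affine map z = alpha + beta·x whose outputs concentrate near -1 and 1, then clusters by the sign of z. The abstract does not state the loss, step size, or initialization, so the quartic loss (z^2 - 1)^2 / 4, the balance penalty on the mean of z, the whitening step, the small deterministic starting point, and the name `fit_affine_to_pm1` are all assumptions made for this example, not the speaker's method.

```python
import numpy as np

def fit_affine_to_pm1(X, steps=2000, lr=0.1):
    """Sketch: learn z = alpha + beta @ x so that z concentrates near -1 and +1."""
    n, d = X.shape
    # Assumed preconditioning: center and whiten so plain gradient descent is stable.
    mu = X.mean(axis=0)
    evals, evecs = np.linalg.eigh(np.cov(X, rowvar=False))
    W = evecs @ np.diag(evals ** -0.5) @ evecs.T
    Xw = (X - mu) @ W
    # Small deterministic start near the origin (illustrative, not tuned).
    beta = np.full(d, 1e-2 / np.sqrt(d))
    alpha = 0.0
    for _ in range(steps):
        z = Xw @ beta + alpha
        g = z ** 3 - z   # derivative of the assumed loss (z^2 - 1)^2 / 4
        m = z.mean()     # gradient factor of the assumed balance penalty 0.5 * mean(z)^2
        beta = beta - lr * (Xw.T @ g / n + m * Xw.mean(axis=0))
        alpha = alpha - lr * (g.mean() + m)
    # The returned function applies the full affine map to raw (unwhitened) points.
    return lambda Xnew: ((np.asarray(Xnew) - mu) @ W) @ beta + alpha

# Synthetic stretched mixture: large variance along axis 0, clusters separated along axis 1.
rng = np.random.default_rng(0)
n = 2000
y = rng.choice([-1, 1], size=n)
X = np.column_stack([5.0 * rng.normal(size=n), 3.0 * y + rng.normal(size=n)])

score = fit_affine_to_pm1(X)
pred = np.sign(score(X))
# Cluster labels are only identified up to a global sign flip.
acc = max(np.mean(pred == y), np.mean(pred == -y))
```

The whitening here is itself an affine map, so the composed transform is still affine; it is used only so that a fixed step size behaves well on stretched data.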

Venue

https://hkust.zoom.us/j/5616960008

Speaker

Dr. Kaizheng WANG
Princeton University and Columbia University

Organizer

Department of Mathematics

Contact

mathseminar@ust.hk

Audience

Alumni, Faculty and Staff, PG Students, UG Students

Language

English
