Seminar on Data Science and Machine Learning - Global Convergence of Block Coordinate Descent in Deep Learning

7月12日

3:00pm - 4:00pm

研討會, 演講, 講座

Deep learning has aroused extensive attention due to its great empirical success. The efficiency of the block coordinate descent (BCD) methods has been recently demonstrated in deep neural network (DNN) training. However, theoretical studies on their convergence properties are limited due to the highly nonconvex nature of DNN training. In this paper, we aim at providing a general methodology for provable convergence guarantees for this type of methods. In particular, for most of the commonly used DNN training models involving both two- and three-splitting schemes, we establish the global convergence to a critical point at a rate of O(1/k), where k is the number of iterations. The results extend to general loss functions which have Lipschitz continuous gradients and deep residual networks (ResNets). Our key development adds several new elements to the Kurdyka-Lojasiewicz inequality framework that enables us to carry out the global convergence analysis of BCD in the general scenario of deep learning.

7月12日

3:00pm - 4:00pm

立即登記

地點

Room 2463, Academic Building (near Lifts 25 - 26)

講者/表演者

Mr. Tim Tsz-Kit Lau
Department of Statistics, Northwestern University, Illinois, USA

主辦單位

Department of Mathematics

聯絡方法

mathseminar@ust.hk

付款詳情

對象

Alumni, Faculty and Staff, PG Students, UG Students

語言

英語

其他活動

6月16日

研討會, 演講, 講座

IAS / School of Science Joint Lecture - Shaping Tumor Cell Plasticity and Therapy Resistance in Glioblastoma

Abstract Tumor heterogeneity fueled by plasticity and genetic diversification of cancer cells is key to therapy failure of malignant glioma. The speaker's team implemented spatial and genetic p...

5月11日

研討會, 演講, 講座

IAS / School of Science Joint Lecture - Regioselective Pyridine C-H-Functionalization and Skeletal Editing

Abstract Pyridines belong to the most abundant heteroarenes in medicinal chemistry and in agrochemical industry. In the lecture, highly regioselective pyridine C-H functionalization through a d...