Department of Mathematics - Seminar on Statistics - Online Estimation and Inference for Robust Policy Evaluation in Reinforcement Learning

7月18日

4:00pm - 5:00pm

研讨会, 演讲, 讲座

In this paper, we propose a robust policy evaluation algorithm in reinforcement learning, to feature outlier contamination and heavy-tailed reward distributions. We further develop a fully-online method to conduct statistical inference for the modeling parameters. Our method converges faster to the minimum asymptotic variance than the classical temporal difference (TD) learning and avoids the selection of the step sizes. Numerical experiments are provided on the effectiveness of the proposed algorithm in real-world reinforcement learning experiments, which highlight the efficiency and robustness of our approach when compared to the existing online bootstrap method. This work is joint with Jiyuan Tu (SUFE), Xi Chen (NYU), and Weidong Liu (SJTU).

7月18日

4:00pm - 5:00pm

立即登记

地点

Room 2302 (Lifts 17/18)

讲者/表演者

Prof. Yichen ZHANG
Purdue University

主办单位

Department of Mathematics

联系方法

付款详情

对象

Alumni, Faculty and staff, PG students, UG students

语言

英语

其他活动

6月16日

研讨会, 演讲, 讲座

IAS / School of Science Joint Lecture - Shaping Tumor Cell Plasticity and Therapy Resistance in Glioblastoma

Abstract Tumor heterogeneity fueled by plasticity and genetic diversification of cancer cells is key to therapy failure of malignant glioma. The speaker's team implemented spatial and genetic p...

5月11日

研讨会, 演讲, 讲座

IAS / School of Science Joint Lecture - Regioselective Pyridine C-H-Functionalization and Skeletal Editing

Abstract Pyridines belong to the most abundant heteroarenes in medicinal chemistry and in agrochemical industry. In the lecture, highly regioselective pyridine C-H functionalization through a d...